Cron Jobs are used for scheduling tasks to run on the server. They’re most commonly used for automating system maintenance or administration. However, they are also relevant to web application development. There are many situations when a web application may need certain tasks to run periodically. Today we are going to explore the fundamentals of Cron Jobs.

First let’s familiarize ourselves with the terms related to this subject.

“Cron” is a time-based job scheduler in Unix-like operating systems (Linux, FreeBSD, Mac OS etc…). And these jobs or tasks are referred to as “Cron Jobs”.

There is a cron “daemon” that runs on these systems. A daemon is a program that runs in the background all the time, usually initiated by the system. This cron daemon is responsible for launching these cron jobs on schedule.

The schedule resides in a configuration file named “crontab”. That’s where all the tasks and their timers are listed.

Server admins have been using cron jobs for a long time. But since the target audience of this article is web developers, let’s look at a few use cases of cron jobs that are relevant in this area:

Here is a simple cron job:

There are two main parts:

The command itself in this example has three parts:

This is the first part of the cron job string, as mentioned above. It determines how often and when the cron job is going to run.

It consists of five parts:

Here is an illustration:

Quite often, you will see an asterisk (*) instead of a number. This represents all possible numbers for that position. For example, asterisk in the minute position would make it run every minute.

We need to look at a few examples to fully understand this Syntax.

This cron job will run every minute, all the time:

This cron job will run at minute zero, every hour (ie. an hourly cron job):

This is also an hourly cron job but runs at 15 minutes after the hour instead (ie. 00:15, 01:15, 02:15, etc.):

This will run once a day, at 2:30am:

This will run once a month, on the second day of the month at midnight (eg. January 2nd 12:00 am or February 2nd 12:00 am):

This will run on Mondays, every hour (ie. 24 times in one day, but only on Mondays):

You can use multiple numbers separated by commas. This will run three times every hour, at minutes 0, 10 and 20:

Division operator is also used. This will run 12 times per hour—every 5 minutes:

Dash can be used to specify a range. This will run once every hour between 5:00am and 10:00am:

Also there is a special keyword that will let you run a cron job every time the server is rebooted:

There are a few different ways to create and manage your cron jobs.

Many web hosting companies provide control panels for their customers. If you are one of them, you might be able to find a section in your control panel to manage your cron jobs.

Running this command will launch vi (text editor) and let you edit the contents of the crontab. If you dont have any crontabs files on your system, the command will create one.

If you would just like to see the existing crontab without editing it, you can run this command:

To delete the contents of the crontab:

You can write all of your cron jobs into a file and then push it into the crontab:

Be careful, because this will overwrite all existing cron jobs with this files contents, without warning.

You can add comments followed by the # character.

As I mentioned earlier, by default the output from the crons get sent via email, unless you discard them or redirect them to a file. The MAILTO setting let’s you set or change which email address to send them to:

CGI scripts are executable by default, but PHP scripts are not. They need to run through the PHP parser. That’s why we need to put the path to the parser before the path of the script.

Sometimes it might be under another location like: /usr/local/bin/php. To find out, you can try running this in the command line:

If you do not handle the output of the cron script, it will send them as emails to your user account on the server.

If you put > /dev/null 2>&1 at the end of the cron job command (or any command), the output will be discarded.

The closing bracket (>) is used for redirecting output. /dev/null is like a black hole for output. Anything that goes there is ignored by the system.

This part 2>&1 causes the STDERR (error) output to be redirected to the STDOUT (normal) output. So that also ends up in the /dev/null.

To store the cron output in a file, use the closing bracket (>) again:

That will rewrite the output file every time. If you would like to append the output at the end of the file instead of a complete rewrite, use a double closing bracket (>>) instead:

Normally you need to specify the parser at the beginning of the command as we have been doing. But there is actually a way to make your PHP scripts executable from the command line like a CGI script.

You need is to add the path to the parser as the first line of the script:

Also make sure to set the proper permissions (like 755) to make the file executable.

When you have an executable script, the cron job can be shorter like this:

In some cases you may have frequent running cron jobs, and you may not want them to collide if they take longer to run than the frequency itself.

For example, you may have a cron job running every minute. Yet, every once in a while it may take longer than one minute to run. This can cause another instance of the same cron script to start running before the previous one finishes. You can create too many busy processes this way and possibly crash the server if they keep slowing down each other, and cause even more processes to be created over time.

You can add this code to the cron job script:

With regular file locks the flock() function call would block the script if there is an existing lock. And it would release once that lock is gone. However, with a non-blocking lock, such as in the code above, the function call does not stop the script, but it immediately returns FALSE if there is an existing lock. So in this case, we can immediately exit the script when we see there is an existing lock, which indicates that another cron job is currently running.

When you write a cron job in a web scripting language like PHP, you may want to make sure that nobody can execute it by just loading it from their browser. One easy option would be to store these scripts outside of your web folder. However this may not be practical or preferable for some developers, if they want to keep their cron job scripts right within their web application folders.

If you put all of the cron job scripts in a folder, you block access by putting this line in an .htaccess file:

Or you can also deny access to scripts on an individual basis by putting this line at the beginning:

This will ensure that, when the script is accessed from the web, it will abort immediately.

Setting cron jobs and assuming they will all work as expected is not a good practice. Wouldn’t it be cool to have a tool that monitors your running tasks and ensure that any failing, broken, or delayed cron jobs are discovered promptly?

It even allows you to integrate with popular messaging platforms like slack, Microsoft teams, SMS, and email to ensure that failure alerts are delivered on time.

Thank you for reading. Even though cron jobs just seem like a tool just for system admins, they are actually relevant to many kinds of web applications.

Please leave your comments and questions, and have a great day!

source