Difference: BatchSystem (11 vs. 12)

Revision 122016-04-22 - GordonStewart

Line: 1 to 1
 
META TOPICPARENT name="WebHome"

Batch System

Changed:
<
<
The PPE group has limited resources for batch computing. The ppepbs batch system is provided for running a small number of jobs. If a large number of jobs are needed then please use the Grid or try the Compute Cluster.
>
>
The PPE group maintains a PBS cluster for running small quantities of jobs. If you need to run large numbers of jobs, you should investigate the possibility of running on ScotGrid.
 
Changed:
<
<
The PPE batch system that is managed via the TORQUE Resource Manager (based on OpenPBS) and the Maui scheduler. The batch system can be accessed from any linux desktop using the pbs commands described below.
>
>
The batch system uses the TORQUE resource manager (based on OpenPBS) and the Maui scheduler. It can be accessed from any Linux desktop using the commands described below.
 
Changed:
<
<
The batch nodes are installed with mixture of 64 bit SL4 and SL5. There are 47 cpus for SL5 jobs and 40 cpus for SL4 jobs. Eight queues are available spilt into two groups: four queues for SL4 jobs and four queues for SL5 jobs (see the queues section below). Executables should be built on one of the PPE Linux desktops machines of the required flavour. The version of scientific linux install on a machine can be checked by examining the /etc/redhat-release file:
>
>
The current composition of the batch system is as follows:
 
Changed:
<
<
cat /etc/redhat-release
>
>
Nodes Operating System Total CPU Cores
node123 to node456 SL5 999
 
Changed:
<
<
and to check if the a machine is 32 or 64 bit:
>
>
The following queues are provided:
 
Changed:
<
<
uname -m
>
>
Name Operating System Maximum runtime
short5 SL5 1 hour
medium5 SL5 6 hours
long5 SL5 1 day
vlong5 SL5 5 days
short6 SL6 1 hour
medium6 SL6 6 hours
long6 SL6 1 day
vlong6 SL6 5 days
 
Changed:
<
<
a 64 bit machine will return x86_64 and 32 bit machine i686.
>
>

Using PBS

 
Changed:
<
<

Job submission

From any ppe linux desktop jobs can be submitted to a TORQUE queue via qsub, e.g.:
>
>

Create a submission script

 
Changed:
<
<
qsub test.job

where test.job might contain

>
>
Jobs are defined using a submission script, which is like a shell script with the addition of certain directives (indicated by the #PBS prefix) which tell PBS how the job should be handled. A simple submission script might look like the following:
 
#PBS -N TestJob

Line: 38 to 38
 echo "This is a test..."
Changed:
<
<
More documentation is given in the qsub man page.
>
>

Submit a job

Jobs are submitted using the qsub command:

$ qsub <FILENAME>

To submit a job defined by the submission script test.pbs:

$ qsub test.pbs

More details can be found in the qsub man page.

Show running jobs

 

Queues

There are currently eight queues on the batch system. The four queues ending in '4' will run jobs on SL4 machines and the four queues ending in '5' will run jobs on SL5 machines:
Line: 77 to 93
 

Killing a job

Jobs may be terminated by executing qdel JOBID where the JOBID is the numerical ID code returned in the qstat listing.

Deleted:
<
<
-- AndrewPickford - 12 Jan 2009
 \ No newline at end of file
 
This site is powered by the TWiki collaboration platform Powered by PerlCopyright © 2008-2022 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback