- Infos im HLRS Wiki sind nicht rechtsverbindlich und ohne Gewähr -
- Information contained in the HLRS Wiki is not legally binding and HLRS is not responsible for any damages that might result from its use -

CRAY XE6 Using the Batch System: Difference between revisions

From HLRS Platforms
Jump to navigationJump to search
No edit summary
No edit summary
Line 8: Line 8:
Detailed information about how to use this system and many examples can be found in [https://fs.hlrs.de/projects/craydoc/docs/books/S-2396-50/html-S-2396-50/chapter-djg9hyw1-brbethke.html Cray Application Developer's Environment User's Guide] and [https://fs.hlrs.de/projects/craydoc/docs/books/S-2496-31/html-S-2496-31/chapter-3vnhd83p-oswald-runningbatchjobs.html Workload Management and Application Placement for the Cray Linux Environment].
Detailed information about how to use this system and many examples can be found in [https://fs.hlrs.de/projects/craydoc/docs/books/S-2396-50/html-S-2396-50/chapter-djg9hyw1-brbethke.html Cray Application Developer's Environment User's Guide] and [https://fs.hlrs.de/projects/craydoc/docs/books/S-2496-31/html-S-2496-31/chapter-3vnhd83p-oswald-runningbatchjobs.html Workload Management and Application Placement for the Cray Linux Environment].


== Running Jobs ==
= Running Jobs =
Writing a submission script is typically the most convenient way to submit your job to the batch system.
Writing a submission script is typically the most convenient way to submit your job to the batch system.
You generally interact with the batch system in two ways: through options specified in job submission scripts (these are detailed below in the examples) and by using torque or moab commands on the login nodes. There are three key commands used to interact with torque:
You generally interact with the batch system in two ways: through options specified in job submission scripts (these are detailed below in the examples) and by using torque or moab commands on the login nodes. There are three key commands used to interact with torque:
Line 19: Line 19:
   man pbs  
   man pbs  


=== the qsub command ===
== The qsub command ==
To submit a job, type
To submit a job, type
   qsub my_batchjob_script.pbs
   qsub my_batchjob_script.pbs
This will submit your job script "my_batchjob_script.pbs" to the job-queues.
This will submit your job script "my_batchjob_script.pbs" to the job-queues.
A simple MPI job submission script for the XE6 would look like:
<pre>
#!/bin/bash --login
#PBS -N job_name
#PBS -A account
#PBS -l mppwidth=32
#PBS -l mppnppn=4
#PBS -l walltime=00:20:00           
 
# Change to the direcotry that the job was submitted from
cd $PBS_O_WORKDIR
# Launch the parallel job
aprun -n 32 -N 4 ./my_mpi_executable.x arg1 arg2
</pre>

Revision as of 15:48, 13 December 2010

The only way to start a parallel job on the compute nodes of this system is to use the batch system. The installed batch system is based on

  • the resource management system torque and
  • the scheduler moab

Additional you have to know on CRAY XE6 the user applications are always launched on the compute nodes using the application launcher, aprun, which submits applications to the Application Level Placement Scheduler (ALPS) for placement and execution.

Detailed information about how to use this system and many examples can be found in Cray Application Developer's Environment User's Guide and Workload Management and Application Placement for the Cray Linux Environment.

Running Jobs

Writing a submission script is typically the most convenient way to submit your job to the batch system. You generally interact with the batch system in two ways: through options specified in job submission scripts (these are detailed below in the examples) and by using torque or moab commands on the login nodes. There are three key commands used to interact with torque:

  • qsub
  • qstat
  • qdel

Check the man page of torque for more advanced commands and options

 man pbs 

The qsub command

To submit a job, type

 qsub my_batchjob_script.pbs

This will submit your job script "my_batchjob_script.pbs" to the job-queues.

A simple MPI job submission script for the XE6 would look like:

#!/bin/bash --login
#PBS -N job_name
#PBS -A account
#PBS -l mppwidth=32
#PBS -l mppnppn=4
#PBS -l walltime=00:20:00             
  
# Change to the direcotry that the job was submitted from
cd $PBS_O_WORKDIR

# Launch the parallel job
aprun -n 32 -N 4 ./my_mpi_executable.x arg1 arg2