Pbs Job Scheduler – Importants Commands1


PBS GUI Window
COMMAND PURPOSE
#xpbs PBS Graphical User Interface.
#xpbsmon PBS Graphical Monitoring Tool.

Qstat Command Special Option.
#qstat Get the Detail Information about
Server, Job ,Queue
#qstat -B Show Server Status
#qstat -Bf Show detail Information about Server Status
#qstat -Q List all the Queue
#qstat -Qf List Detail Information about Queue.
#qstat -f <JOB-ID> Show Detail Information about job.

qstat  Some Special New Option
-T Estimate Job starting time (When the job will start if it is Queue).
-p How much percentage job is completed.
-n Get List of nodes allocated to the job number(#qstat -n JOBID).

#pbs_probe - It will review installation files,directory,file permission and review the report the problem.

Job Script Generator Links:

Verification (or) Testing

Future Article Should Focus on.
1)How To Configure Queue Parameter on PBS.
Queue Parameters.
Create Queue.
#qmgr -c Create queue.
#qmgr -c 'p s' List the Queue.


1.1)PBS JOB SCRIPT.
username@hostname$ tracejob 33288
Job: 33288.login1
05/07/2014 12:25:37  L    Considering job to run
05/07/2014 12:25:37  <div style="background-color: black; color: lime; line-height: 18pt; width: 600px;">S    enqueuing into workq, state 1 hop 1
05/07/2014 12:25:37  S    Job Queued at request of username@hostnmae, owner = username@hostname,
                          job name = mpiblast, queue = workq
05/07/2014 12:25:37  S    Job Run at request of Scheduler@login1.cmmacs.ernet.in on exec_vnode (cn1063:ncpus=16)
05/07/2014 12:25:37  S    Job Modified at request of Scheduler@0.hostname.domamain.com
05/07/2014 12:25:37  L    Job run
05/07/2014 12:25:46  S    Obit received momhop:1 serverhop:1 state:4 substate:42
05/07/2014 12:27:22  S    Post job file processing error
05/07/2014 12:27:22  S    Exit_status=255 resources_used.cpupercent=0 resources_used.cput=00:00:05
                          resources_used.mem=0kb resources_used.ncpus=16 resources_used.vmem=0kb
                          resources_used.walltime=00:00:09
</div>
By  through Exit Status we may know whether jobs is completed successfully or not.

PBS_Job Which one is completed successfully where (EXIT_STATUS =0)_.
$ tracejob 44213
Job: 44213.login1
07/09/2014 12:24:04  L    Considering job to run
07/09/2014 12:24:04  S    enqueuing into workq, state 1 hop 1
07/09/2014 12:24:04  S    Job Queued at request of sbag@login1.cmmacs.ernet.in, owner = sbag@login1.cmmacs.ernet.in, job name = mpiblastx, queue =
                          workq
07/09/2014 12:24:04  S    Job Run at request of Scheduler@login1.cmmacs.ernet.in on exec_vnode
                          (cn0206:ncpus=16)+(cn0207:ncpus=16)+(cn0214:ncpus=16)+(cn0215:ncpus=16)+(cn0223:ncpus=16)+(cn0243:ncpus=16)+(cn0418:ncpus=16)+(cn0459:ncpus=16)+(cn0460:ncpus=16)+(cn0461:ncpus=16)+(cn0486:ncpus=16)+(cn0487:ncpus=16)
07/09/2014 12:24:05  S    Job Modified at request of Scheduler@login1.cmmacs.ernet.in
07/09/2014 12:24:05  L    Job run
07/09/2014 12:25:23  S    Obit received momhop:1 serverhop:1 state:4 substate:42
07/09/2014 12:25:56  S    Post job file processing error
07/09/2014 12:25:56  S    Exit_status=0 resources_used.cpupercent=0 resources_used.cput=00:14:34 resources_used.mem=6517280kb
                          resources_used.ncpus=192 resources_used.vmem=23267096kb resources_used.walltime=00:01:18

PBS Job which is not completed successfully(Exit STATUS=1). 
tracejob 44144

Job: 44144.login1

07/09/2014 10:00:11  L    Considering job to run
07/09/2014 10:00:11  S    enqueuing into workq, state 1 hop 1
07/09/2014 10:00:11  S    Job Queued at request of sbag@login1.cmmacs.ernet.in, owner = sbag@login1.cmmacs.ernet.in, job name = mpiblastx, queue =
                          workq
07/09/2014 10:00:11  S    Job Run at request of Scheduler@login1.cmmacs.ernet.in on exec_vnode
                          (cn0244:ncpus=16)+(cn0758:ncpus=16)+(cn0780:ncpus=16)+(cn0781:ncpus=16)+(cn0782:ncpus=16)+(cn0783:ncpus=16)+(cn0784:ncpus=16)+(cn0785:ncpus=16)+(cn0786:ncpus=16)+(cn0787:ncpus=16)+(cn0788:ncpus=16)+(cn0943:ncpus=16)
07/09/2014 10:00:11  S    Job Modified at request of Scheduler@login1.cmmacs.ernet.in
07/09/2014 10:00:11  L    Job run
07/09/2014 10:00:17  S    Obit received momhop:1 serverhop:1 state:4 substate:42
07/09/2014 10:00:50  S    Post job file processing error
07/09/2014 10:00:50  S    Exit_status=1 resources_used.cpupercent=0 resources_used.cput=00:00:00 resources_used.mem=952kb resources_used.ncpus=192
                          resources_used.vmem=13304kb resources_used.walltime=00:00:06

0 comments:

Post a Comment