PBS GUI Window
COMMAND | PURPOSE |
#xpbs | PBS Graphical User Interface. |
#xpbsmon | PBS Graphical Monitoring Tool. |
Qstat Command
COMMAND | PURPOSE |
#qstat | Show information about the server, jobs, and queues. |
#qstat -B | Show the server status. |
#qstat -Bf | Show detailed information about the server status. |
#qstat -Q | List all the queues. |
#qstat -Qf | List detailed information about the queues. |
#qstat -f <JOB-ID> | Show detailed information about a job. |
Special Options
-T | Estimate the job start time (when the job will start if it is still queued). |
-p | Show what percentage of the job is completed. |
-n | List the nodes allocated to the job (#qstat -n <JOB-ID>). |
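The three special options above can be combined into one small helper. This is only a sketch: the function name `job_overview` is hypothetical, and it assumes the `qstat` on your PATH accepts `-T`, `-p`, and `-n` together with a job ID, as listed above.

```shell
# Hypothetical helper (not part of PBS): run the three special
# qstat options from the list above against a single job ID.
job_overview() {
    job_id="${1:-}"
    if [ -z "$job_id" ]; then
        echo "usage: job_overview JOB-ID" >&2
        return 2
    fi
    # Fail early if no qstat command is available.
    if ! command -v qstat >/dev/null 2>&1; then
        echo "qstat not found in PATH" >&2
        return 127
    fi
    qstat -T "$job_id"   # estimated start time (useful while the job is queued)
    qstat -p "$job_id"   # percentage of the job completed
    qstat -n "$job_id"   # nodes allocated to the job
}
```

For example, `job_overview 33288` would run the three queries one after another for job 33288.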
#pbs_probe - Reviews the installation files, directories, and file permissions, and reports any problems it finds.
Job Script Generator Links:
Verification
(or) Testing
Future articles should focus on:
1) How to configure queue parameters on PBS.
Queue Parameters.
Create Queue.
#qmgr -c 'create queue <QUEUE-NAME>' | Create a queue. |
#qmgr -c 'p s' | Print the server configuration, which includes the queue definitions. |
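As a rough sketch of the queue creation step, the helper below only prints the `qmgr` commands instead of running them, so they can be reviewed first. The function name `queue_setup_commands` and the queue name `testq` are hypothetical, and the sketch assumes you want an execution queue that is enabled and started; adjust the attributes for your site.

```shell
# Hypothetical dry-run helper: print the qmgr commands that would
# create, enable, and start an execution queue. Nothing is executed.
queue_setup_commands() {
    queue="${1:-}"
    if [ -z "$queue" ]; then
        echo "usage: queue_setup_commands QUEUE-NAME" >&2
        return 2
    fi
    printf "qmgr -c 'create queue %s queue_type=execution'\n" "$queue"
    printf "qmgr -c 'set queue %s enabled=true'\n" "$queue"   # accept new jobs
    printf "qmgr -c 'set queue %s started=true'\n" "$queue"   # allow jobs to run
    printf "qmgr -c 'list queue %s'\n" "$queue"               # verify the result
}
```

Running `queue_setup_commands testq` prints four lines that can be pasted into a shell by a PBS administrator once they look right.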
The exit status tells us whether a job completed successfully or not.
1.1) PBS Job Script.
username@hostname$ tracejob 33288
Job: 33288.login1
05/07/2014 12:25:37 L Considering job to run
05/07/2014 12:25:37 S enqueuing into workq, state 1 hop 1
05/07/2014 12:25:37 S Job Queued at request of username@hostname, owner = username@hostname,
job name = mpiblast, queue = workq
05/07/2014 12:25:37 S Job Run at request of Scheduler@login1.cmmacs.ernet.in on exec_vnode (cn1063:ncpus=16)
05/07/2014 12:25:37 S Job Modified at request of Scheduler@0.hostname.domamain.com
05/07/2014 12:25:37 L Job run
05/07/2014 12:25:46 S Obit received momhop:1 serverhop:1 state:4 substate:42
05/07/2014 12:27:22 S Post job file processing error
05/07/2014 12:27:22 S Exit_status=255 resources_used.cpupercent=0 resources_used.cput=00:00:05
resources_used.mem=0kb resources_used.ncpus=16 resources_used.vmem=0kb
resources_used.walltime=00:00:09
PBS job that completed successfully (Exit_status=0).
$ tracejob 44213
Job: 44213.login1
07/09/2014 12:24:04 L Considering job to run
07/09/2014 12:24:04 S enqueuing into workq, state 1 hop 1
07/09/2014 12:24:04 S Job Queued at request of sbag@login1.cmmacs.ernet.in, owner = sbag@login1.cmmacs.ernet.in, job name = mpiblastx, queue =
workq
07/09/2014 12:24:04 S Job Run at request of Scheduler@login1.cmmacs.ernet.in on exec_vnode
(cn0206:ncpus=16)+(cn0207:ncpus=16)+(cn0214:ncpus=16)+(cn0215:ncpus=16)+(cn0223:ncpus=16)+(cn0243:ncpus=16)+(cn0418:ncpus=16)+(cn0459:ncpus=16)+(cn0460:ncpus=16)+(cn0461:ncpus=16)+(cn0486:ncpus=16)+(cn0487:ncpus=16)
07/09/2014 12:24:05 S Job Modified at request of Scheduler@login1.cmmacs.ernet.in
07/09/2014 12:24:05 L Job run
07/09/2014 12:25:23 S Obit received momhop:1 serverhop:1 state:4 substate:42
07/09/2014 12:25:56 S Post job file processing error
07/09/2014 12:25:56 S Exit_status=0 resources_used.cpupercent=0 resources_used.cput=00:14:34 resources_used.mem=6517280kb
resources_used.ncpus=192 resources_used.vmem=23267096kb resources_used.walltime=00:01:18
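The `resources_used.*` fields in the trace above can be pulled out with standard text tools. The following is a minimal sketch; the helper names `resource_used` and `hms_to_seconds` are hypothetical, and it assumes the fields appear as space-separated `name=value` pairs exactly as in the output shown.

```shell
# Hypothetical helper: read tracejob output on stdin and print the
# last value reported for resources_used.<name>.
resource_used() {
    sed -n "s/.*resources_used\.$1=\([^ ]*\).*/\1/p" | tail -n 1
}

# Hypothetical helper: convert an HH:MM:SS value to seconds.
# awk reads the fields as decimal numbers, so "08" is handled safely.
hms_to_seconds() {
    printf '%s\n' "$1" | awk -F: '{ print $1 * 3600 + $2 * 60 + $3 }'
}
```

For example, `tracejob 44213 | resource_used walltime` should print `00:01:18` for the job above, and `hms_to_seconds 00:01:18` prints `78`.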
PBS job that did not complete successfully (Exit_status=1).
$ tracejob 44144
Job: 44144.login1
07/09/2014 10:00:11 L Considering job to run
07/09/2014 10:00:11 S enqueuing into workq, state 1 hop 1
07/09/2014 10:00:11 S Job Queued at request of sbag@login1.cmmacs.ernet.in, owner = sbag@login1.cmmacs.ernet.in, job name = mpiblastx, queue =
workq
07/09/2014 10:00:11 S Job Run at request of Scheduler@login1.cmmacs.ernet.in on exec_vnode
(cn0244:ncpus=16)+(cn0758:ncpus=16)+(cn0780:ncpus=16)+(cn0781:ncpus=16)+(cn0782:ncpus=16)+(cn0783:ncpus=16)+(cn0784:ncpus=16)+(cn0785:ncpus=16)+(cn0786:ncpus=16)+(cn0787:ncpus=16)+(cn0788:ncpus=16)+(cn0943:ncpus=16)
07/09/2014 10:00:11 S Job Modified at request of Scheduler@login1.cmmacs.ernet.in
07/09/2014 10:00:11 L Job run
07/09/2014 10:00:17 S Obit received momhop:1 serverhop:1 state:4 substate:42
07/09/2014 10:00:50 S Post job file processing error
07/09/2014 10:00:50 S Exit_status=1 resources_used.cpupercent=0 resources_used.cput=00:00:00 resources_used.mem=952kb resources_used.ncpus=192
resources_used.vmem=13304kb resources_used.walltime=00:00:06
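Checking the exit status by eye works for one job, but it can also be scripted. This is a sketch under the assumption that the trace contains an `Exit_status=<number>` field as in the outputs above; the function names `exit_status_of` and `job_succeeded` are hypothetical.

```shell
# Hypothetical helper: read tracejob output on stdin and print the
# last Exit_status value found (prints nothing if there is none).
exit_status_of() {
    sed -n 's/.*Exit_status=\(-\{0,1\}[0-9][0-9]*\).*/\1/p' | tail -n 1
}

# Hypothetical helper: succeed only when the trace on stdin
# reports Exit_status=0.
job_succeeded() {
    code=$(exit_status_of)
    [ "$code" = "0" ]
}
```

A possible use is `tracejob 44144 | job_succeeded || echo "job 44144 failed"`, which would print the message for the failed job shown above.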