Computing/Linux/OpenPBS: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
No edit summary
No edit summary
Line 15: Line 15:
* [[offline-node]] - a simple shell wrapper to [pbsnodes]
* [[offline-node]] - a simple shell wrapper to [pbsnodes]
* [[pbsnodes]] - maintain which compute-nodes participate the queuing system
* [[pbsnodes]] - maintain which compute-nodes participate the queuing system
* for i in `qstat | grep <username> | awk -F. '{print $1}'` ; do qdel $i ; done -- delete all submitted jobs run by <username>
* checknode <nodename> -- check node with name <nodename>
* diagnose -n | grep -v nonspeedy -- look at all speedy nodes
* tracejob <jobid> -- look at the history of a particular job with ID <jobid> that was run
* showq -- show jobstart order
* diagnose -p -- show priorities


Other Important Queuing Bits
Other Important Queuing Bits

Revision as of 09:48, 24 August 2006

OpenPBS is a GPL'd batch queuing system. In brief, both the [Biowulf] and the [Roadrunner] clusters use OpenPBS to distribute compute-jobs across their several nodes.


Common OpenPBS Commands

  • qsub - submit a compute-job for execution
  • qstat - query status of the queues
  • qdel - remove a (possibly running) compute-job

UnCommon OpenPBS Commands

  • offline-node - a simple shell wrapper to [pbsnodes]
  • pbsnodes - maintain which compute-nodes participate the queuing system
  • for i in `qstat | grep <username> | awk -F. '{print $1}'` ; do qdel $i ; done -- delete all submitted jobs run by <username>
  • checknode <nodename> -- check node with name <nodename>
  • diagnose -n | grep -v nonspeedy -- look at all speedy nodes
  • tracejob <jobid> -- look at the history of a particular job with ID <jobid> that was run
  • showq -- show jobstart order
  • diagnose -p -- show priorities


Other Important Queuing Bits

  • What queues exist? --> qstat -q
  • showstart - When will my job run?
  • fairshare - What is the amount my group's cluster usage?

How to manage job output