Changeset 89 for trunk/oarutils


Timestamp:
Jun 5, 2012, 11:14:20 PM (12 years ago)
Author:
g7moreau
Message:
  • Example of transmit use
File:
1 edited

  • trunk/oarutils/oar-parexec

    r88  r89
    589  589
    590  590  C<oar-parexec> is compatible with the OAR checkpointing.
    591       Il you have 2000 small jobs that need 55h to be done on 6 cores,
         591  If you have 2000 small jobs that need 55h to be done on 6 cores,
    592  592  you can cut this in small parts.
    593  593

    631  631  They will be launch with the same command line at the next OAR run.
    632  632
         633  Example: if you have 50 small jobs that each need 72h to be done on 1 cores,
         634  you can cut this in 24h parts.
         635
         636  For this example, we suppose that each long job loop need about 20min...
         637  So, we send a checkpoint 30min before the end of the process
         638  to let C<oar-parexec> suspend the jobs started.
         639  After being checkpointed, C<oar-parexec> do not start any new small job.
         640
         641   oarsub -t idempotent -n test \
         642     -l /core=6,walltime=24:00:00 \
         643     --checkpoint 1800 \
         644     --transmit \
         645     "oar-parexec -f ./subjob.list.txt -l ./subjob.list.log"
         646
         647  After 23h30min, the OAR job will begin to stop launching new small job.
         648  When all running small job are suspend, it's exit.
         649  But as the OAR job is type C<idempotent>,
         650  OAR will re-submit it as long as all small job are not finished...
         651
    633  652  =head1 SEE ALSO
    634  653
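
The timing in the added example can be checked with a little arithmetic. The sketch below is not part of the changeset: it only reuses the figures quoted in the new documentation (a 24:00:00 walltime, `--checkpoint 1800`, and a job loop of about 20 min). The variable names and the `to_seconds` helper are hypothetical, introduced here for illustration.

```shell
#!/bin/sh
# Figures taken from the documentation example in this changeset.
walltime="24:00:00"      # value of walltime= in the oarsub request
checkpoint_delay=1800    # seconds, value given to --checkpoint
loop_time=1200           # seconds, "about 20min" per job loop

# Hypothetical helper: convert HH:MM:SS to seconds.
to_seconds() {
    echo "$1" | awk -F: '{ print $1 * 3600 + $2 * 60 + $3 }'
}

walltime_s=$(to_seconds "$walltime")

# The checkpoint is sent checkpoint_delay seconds before the walltime ends.
checkpoint_at=$(( walltime_s - checkpoint_delay ))

printf 'checkpoint sent after %dh%02dmin\n' \
    $(( checkpoint_at / 3600 )) $(( checkpoint_at % 3600 / 60 ))

# The margin should cover at least one full job loop, so that a running
# job can reach a point where it can be suspended before the walltime ends.
if [ "$loop_time" -le "$checkpoint_delay" ]; then
    echo "margin covers one job loop"
else
    echo "margin is shorter than one job loop"
fi
```

With these figures the checkpoint falls at 23h30min, which matches the text at new line 647, and the 30 min margin is longer than the assumed 20 min loop.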