|
Using Condor to Run Sims (VLSI, ULP and Arch Group) |
|---|
|
Overview: |
|
Our group currently has several server machines running the job sharing application “Condor.” Over the past few months I have been running many circuit simulations some of which can take days. As a result is makes sense to harness the power of this computing cluster. Condor is pretty simple, just take a script or other file that you have written and upload it to the cluster. Your job will automatically be sent to an available machine, and when its done all of the output will appear in the directory that you started the job from, and condor will email you to say your job is done. |
|
Submitting a Job |
|
- the executable. Condor can run any executable, however if you have lots of parallel jobs to do I recommend using a perl script with different command line parameters. Here is an example script [char.pl] that I used to test many different process tech configurations, here are the related hspice files[ring_a.sp, ring_l.sp]. The condor machines will automatically mount /home/cktcad - the command file. The next step before submitting a job is
to create a condor command file. This tells condor what files to
upload, and what program to execute. Here is an example,
ckt.cmd. Now you are ready to submit. Log on to one of the condor machines (see below) and cd into the subdirectory with your files (executable, input, command). Run the following command %condor_submit <command_file> Condor should report “submitted X jobs to cluster N” That's it your jobs are submitted! |
|
Checking on Job Status |
|
Use the following commands when logged into a condor machine to check on the status of your jobs. There are many others so check out the condor manual. %condor_q %condor_q -global %condor_q -analyze %condor_rm <job or cluster number> %condor_status %/usr/local/condor/bin/condor_prio -p <priority> <job
or cluster number> %condor_version |
|
|
|
Condor Machines (more to come) |
|
|
FAQs |
|
|
|
|
|
Links |
|