Difference: ProductionHowto (12 vs. 13)

Revision 132012-10-23 - DanProtopopescu

Line: 1 to 1
 
META TOPICPARENT name="WebHome"

NA62 Monte Carlo Production Howto

Line: 36 to 36
 This is an HTML form with many input filelds, most of them self-explanatory. The pre-filled values are inherited from the previous submission (which could have been a test job for example), so you must check that they fit the production round you are managing.
Added:
>
>
ALERT! There is now a simpler, production-only scripter interface. Use that for production! It has less settings than the one described below.
 Here is how the scripter interface looks like:

ScripterUI.001.png

Line: 44 to 46
 
  1. Description - this is a short description of the job. It must contain the production tag (get it from here if unsure), and it must contain the keyword "production". Should not contain quotes or any other non-text characters. Leave out the tag and replace production with test if you want to submit a few test jobs.
  2. Run interval - the start run is pre-filled with the next available run number (from the DB). Choose the upper limit such that you submit not more than 100 jobs at a time. Do not use job cloning for production at the moment.
Changed:
<
<
  1. Number of events - this is the number of events per job (run). We aim to keep the job runtime below 20 hours, so for channel 1 that means 1500 events per job. For other channels it could be more or less than that. Check previous production jobs to find the optimal number of events. Leave the random seed as it is, because it will be set automatically for each run.
  2. MC software version - you must use the latest software version (check here if unsure), and make sure the wrapper script supports this version. Take a look at the scripts of previous jobs to make sure (click here, select the last production job, then click the corresponding green box link on the "Exe" column and check the line vers="v?"). There is a grid "version" for each installed software revision (e.g. v6/r188), see this wiki.
>
>
  1. Number of events - this is the number of events per job (run). We aim to keep the job runtime below 20 hours, so for channel 1 an r188 that means 1500 events per job, for r193 and channel 1 the optimum would be 6000 events etc. For other channels it could be more or less than that. Check previous production jobs to find the optimal number of events. Leave the random seed as it is, because it will be set automatically for each run.
  2. MC software version - you must use the latest software version (check here if unsure), and make sure the wrapper script supports this version. Take a look at the scripts of previous jobs to make sure (click here, select the last production job, then click the corresponding green box link on the "Exe" column and check the line vers="v?"). There is a grid "version" for each installed software revision (e.g. v6/r188, v7/r193), see this wiki.
 
  1. Decay type - is the reaction channel to simulate. Choose from the drop-down menu and make sure it corresponds to the current production tag and description (check here if unsure). Leave default values for the remaining options, unless instructed otherwise.
Changed:
<
<
  1. Destination - tick here only the sites that have the chosen MC software version installed. Check this table to make sure. Check the jobs history to detect any problems at sites (e.g. jobs consistently finishing early, or going to status CLEARED without registering any output). ALERT! If jobs fail at a site, uncheck it here and notify the site admin!
>
>
  1. Destination - tick here only the sites that have the chosen MC software version installed. Check this table to make sure. Check the jobs history to detect any problems at sites (e.g. jobs consistently finishing early, or going to status CLEARED without registering any output). ALERT! If jobs fail at a site, uncheck it here, notify the site admin and add a comment in the logbook!
 
  1. Executable - automatically selected now - this is the name of the wrapper script that is executed on the worker node. It checks if the software is installed, runs the actual MC simulation, registers the output and triggers the FTS transfer(s). In single jobs mode, you can display commented lines from scripts in case you would like to check extra settings, comments etc. For production, leave this unchecked.
  2. User and password - for multiple job submissions, you need to tick the "Write scripts to disk" checkbox, and introduce your uid and password for this interface. ALERT! You must have registered and your credentials must have been validated for this to work. In single job mode, uid and password are not needed, since you will have to submit the (test) job with your credentials from your UI.
Changed:
<
<
  1. Click Prepare, and you are taken to a new page. ALERT! If the page says "There are scheduled submissions in there. Please try again in 10 minutes" it means that you have (or someone else has) just scheduled another batch of jobs and you have to wait for these to be actually submitted, else the scripts may be overwritten - with unpredictable results.
>
>
  1. Click Prepare, and you are taken to a new page. ALERT! If the page says "There are scheduled submissions in there. Please try again in 5 minutes" it means that you have (or someone else has) just scheduled another batch of jobs and you have to wait for these to be actually submitted, else the scripts may be overwritten - with unpredictable results.
 

Multiple Submissions

Line: 58 to 60
  ScripterUI2.002.png
Changed:
<
<
This example shows only two jobs. You can submit up to 100 at a time, but it is best to submit batches of 50 (these numbers may change, check this wiki before your shift). You can open the linked files to check is all settings are correct.
>
>
This example shows only two jobs. You can submit up to 60 at a time, but it is best to submit batches of 50 (these numbers may change, check this wiki before your shift). You can open the linked files to check is all settings are correct.
 Do not use manual submission. Click Schedule to send these jobs to the bot. Relax. A cronjob will pick these commands and execute them within the next 10 minutes. You will be able to see the result of your multiple submission by checking the jobs table.
 
This site is powered by the TWiki collaboration platform Powered by PerlCopyright © 2008-2020 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback