Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 7 to 7 | ||||||||
See at the bottom of the page for general conclusions.
Before Your Shift | ||||||||
Changed: | ||||||||
< < | To book your shifts, you need to use the Operation Task Planner (OTP)![]() | |||||||
> > | To book your shifts, you need to use the Operation Task Planner (OTP)![]() | |||||||
REMEMBER: if you book a shift and the box turns YELLOW, it's already booked by someone else! Unbook and choose a RED slot. |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 113 to 113 | ||||||||
| ||||||||
Added: | ||||||||
> > | NB: Cloud support often has to bew cc'd and the emails are hard to find.. cloud support emails are here![]() | |||||||
The DDM Dashboard tells the successes of transfers between sites, and for some reason seems to respond quicker to queries then the Production Dashboard - so if you're having a quick hunt through the clouds to try and find where a specific site is located, I'd recommend using this (or PANDA, which is also fairly responsive). If you find somewhere that's failing a lot of transfers, once you've narrowed it down to the site, click on the number of failures to get the error message, and then click on the '+' next to the site's name to see where the sources for these transfers are - it can sometimes be a case that the transfers are failing thanks to the source, not the site that's showing the errors (either through downtime, or some other reason) For both of the dashboards, keep an eye on the graphs at the top of the page, not only the numbers down at the bottom - the numbers may suggest you have a problem, but the graphs can tell you if it's still an ongoing concern - it might have been a temporary blip (it happened that once a site (BNL) was going for some scheduled downtime, a lot of failed transfers happened involving it). |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 47 to 47 | ||||||||
Even if you are doing a 'Trainee' shift, you should still read the previous shift report- it will tell you what's outstanding and urgent! Go to this page and sort it by "LAST MODIFIED": http://lhcweb01.pic.es/atlas/Shift_Summaries/reports/![]() | ||||||||
Changed: | ||||||||
< < | First, open the ADCoS TWiki![]() ![]() ![]() ![]() ![]() ![]() | |||||||
> > | First, open the ADCoS TWiki![]() ![]() ![]() ![]() ![]() ![]() | |||||||
| ||||||||
Line: 176 to 176 | ||||||||
0 PandaID=1031455144 and the test jobs appeared here: http://panda.cern.ch:25980/server/pandamon/query?job=*&type=test&hours=3 ![]() | ||||||||
Added: | ||||||||
> > | *
|
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 117 to 117 | ||||||||
In the Production Dashboard, there was a time when there was a 'None' cloud listed - this seemed to contain only sites from other clouds that were known to be having problems - I asked, and apparently 'it happens when an input dataset/file replica is not found in any cloud or not in a cloud where the task is assigned'. | ||||||||
Added: | ||||||||
> > |
Useful Links not on the ADCoS official pages:SAM![]() ![]() ![]() ![]() | |||||||
===================================================================================================== General Comments: The people in the Skype control room seem friendly enough, and will happily answer any questions you have - however, the process is still inherently a remote one. I don't know how they'll be able to tell when you are no longer a trainee and can be considered an expert. I felt a bit lost, and just wandered around the various websites looking for things that looked red (or not green) - with time it'll be easier to tell what's worth looking at and what's not, but it's made a bit harder by the fact you only see the end result of the expert's efforts - you don't get to see the processes and tricks he/she uses to quickly find and diagnose a problem. Once the eLog came out, I could follow the information contained in that and generally find the fault myself. I found a couple of things myself, but when I mentioned them in the control room the expert seemed to be about 5 pages ahead of me, though he didn't seem to mind telling me why he was leaving it, or not. It's also a bit easy to lose track of what's been dealt with and what hasn't. Partly because it's the expert, not me who was dealing with the problems, but also because it's also all very interconnected, and a fault at one place could be caused by a fault at another. |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 19 to 19 | ||||||||
| ||||||||
Changed: | ||||||||
< < |
| |||||||
> > |
| |||||||
| ||||||||
Added: | ||||||||
> > |
| |||||||
| ||||||||
Changed: | ||||||||
< < |
| |||||||
> > |
| |||||||
|
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 29 to 29 | ||||||||
| ||||||||
Changed: | ||||||||
< < |
| |||||||
> > |
| |||||||
| ||||||||
Changed: | ||||||||
< < |
| |||||||
> > |
| |||||||
Starting Your Shift |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 8 to 8 | ||||||||
Before Your ShiftTo book your shifts, you need to use the Operation Task Planner (OTP)![]() | ||||||||
Added: | ||||||||
> > | REMEMBER: if you book a shift and the box turns YELLOW, it's already booked by someone else! Unbook and choose a RED slot. | |||||||
You can see who is supposed to be on shift:
|
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 29 to 29 | ||||||||
Finally, Savannah:
| ||||||||
Changed: | ||||||||
< < |
| |||||||
> > |
| |||||||
Starting Your Shift |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 28 to 28 | ||||||||
Finally, Savannah:
| ||||||||
Added: | ||||||||
> > |
| |||||||
Starting Your Shift |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 38 to 38 | ||||||||
First, open the ADCoS TWiki![]() ![]() ![]() ![]() ![]() ![]() | ||||||||
Added: | ||||||||
> > |
| |||||||
Once you have all the windows open, probably best to check the eLog and previous shift summary (there will be a list of hot issues!) You may also find it useful to have a window open with your 'shifts' emails right by the browser. |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 6 to 6 | ||||||||
See at the bottom of the page for general conclusions. | ||||||||
Added: | ||||||||
> > | Before Your Shift | |||||||
To book your shifts, you need to use the Operation Task Planner (OTP)![]() | ||||||||
Line: 25 to 26 | ||||||||
| ||||||||
Added: | ||||||||
> > | Finally, Savannah:
Starting Your Shift | |||||||
So, starting your shift:
Bear in mind, if you are "senior shifter" you will have to submit a shift report https://twiki.cern.ch/twiki/bin/view/Atlas/ADCoS#Daily_SHIFT_report![]() | ||||||||
Added: | ||||||||
> > | Even if you are doing a 'Trainee' shift, you should still read the previous shift report- it will tell you what's outstanding and urgent! Go to this page and sort it by "LAST MODIFIED": http://lhcweb01.pic.es/atlas/Shift_Summaries/reports/![]() | |||||||
First, open the ADCoS TWiki![]() ![]() ![]() ![]() ![]() ![]() | ||||||||
Added: | ||||||||
> > | Frozen URL's - CAREFULI always RIGHT-CLICK to open a new tab/window when I am drilling down from one of the main screens. This is because, if you hit 'back' you mostly will end up with a FROZEN URL which will never update... please read on! | |||||||
ChrisCT got a reply from Graeme: apparently the dashboards do not really update very well at all- be careful! - If you go in to the cloud view and get a url like http://dashb-atlas-prodsys.cern.ch/dashboard/request.py/overview?grouping=cloud&start-date=2009-11-11%2000:00:00&end-date=2009-11-11%2012:59:59&grouping=cloud as opposed to: | ||||||||
Line: 47 to 58 | ||||||||
In the ADC monitoring window https://sls.cern.ch/sls/service.php?id=ADC_CS![]() | ||||||||
Added: | ||||||||
> > | ||||||||
Changed: | ||||||||
< < | Also have a look at the sheduled downtime for sites here: https://atlas-install.roma1.infn.it/atlas_install/list.php?sitename=AGLT2![]() ![]() | |||||||
> > | Also have a look at the sheduled downtime for sites here: https://goc.gridops.org/downtime?scope=ALL![]() <--https://atlas-install.roma1.infn.it/atlas_install/list.php?sitename=AGLT2or here: https://twiki.cern.ch/twiki///bin/view/Atlas/AtlasGridDowntime ![]() | |||||||
|
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 22 to 22 | ||||||||
| ||||||||
Changed: | ||||||||
< < |
| |||||||
> > |
| |||||||
|
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Changed: | ||||||||
< < | Report from Alistair. 27/10/2009. Modified by ChrisCT on 11/11/2009: | |||||||
> > | Report from Alistair. 27/10/2009. Modified by ChrisCT on 11/11/2009 and on 17/6/2010: | |||||||
See at the bottom of the page for general conclusions.
To book your shifts, you need to use the Operation Task Planner (OTP)![]() | ||||||||
Changed: | ||||||||
< < | You can see who is supposed to be on shift at the ADCPoint1Shift page: https://twiki.cern.ch/twiki/bin/view/Atlas/ADCPoint1Shift#Shift_schedule![]()
| |||||||
> > | You can see who is supposed to be on shift:
| |||||||
| ||||||||
Added: | ||||||||
> > |
| |||||||
Added: | ||||||||
> > | So, starting your shift:
Bear in mind, if you are "senior shifter" you will have to submit a shift report https://twiki.cern.ch/twiki/bin/view/Atlas/ADCoS#Daily_SHIFT_report![]() | |||||||
Changed: | ||||||||
< < | So, starting your shift: First, open the ADCoS TWiki![]() ![]() ![]() ![]() ![]() ![]() | |||||||
> > | First, open the ADCoS TWiki![]() ![]() ![]() ![]() ![]() ![]() | |||||||
Once you have all the windows open, probably best to check the eLog and previous shift summary (there will be a list of hot issues!) You may also find it useful to have a window open with your 'shifts' emails right by the browser. |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 36 to 36 | ||||||||
In the ADC monitoring window https://sls.cern.ch/sls/service.php?id=ADC_CS![]() ![]() | ||||||||
Added: | ||||||||
> > | NB: you might want to check the new EVO room:
http://evo.caltech.edu/evoGate/koala.jnlp?meeting=MtM8Ma2B2DD8Dv9D92Ds9t | |||||||
|
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 111 to 111 | ||||||||
#if isinstance(val,cx_Oracle.Timestamp): # val = datetime.datetime(val.year,val.month,val.day, # val.hour,val.minute,val.second) | ||||||||
Changed: | ||||||||
< < | Job was submitted. My output looked like: | |||||||
> > | Job was submitted:
python testG4sim15.py (with hardcoded site inside as site = 'TRIUMF' and cloud = 'NL')
My output looked like: | |||||||
--------------------- 0 PandaID=1031455144 |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 16 to 16 | ||||||||
| ||||||||
Changed: | ||||||||
< < | So, starting your shift: First, open the ADCoS TWiki![]() ![]() ![]() ![]() ![]() ![]() | |||||||
> > | So, starting your shift: First, open the ADCoS TWiki![]() ![]() ![]() ![]() ![]() ![]() | |||||||
Once you have all the windows open, probably best to check the eLog and previous shift summary (there will be a list of hot issues!) You may also find it useful to have a window open with your 'shifts' emails right by the browser. |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 78 to 78 | ||||||||
General Comments: The people in the Skype control room seem friendly enough, and will happily answer any questions you have - however, the process is still inherently a remote one. I don't know how they'll be able to tell when you are no longer a trainee and can be considered an expert. I felt a bit lost, and just wandered around the various websites looking for things that looked red (or not green) - with time it'll be easier to tell what's worth looking at and what's not, but it's made a bit harder by the fact you only see the end result of the expert's efforts - you don't get to see the processes and tricks he/she uses to quickly find and diagnose a problem. Once the eLog came out, I could follow the information contained in that and generally find the fault myself. I found a couple of things myself, but when I mentioned them in the control room the expert seemed to be about 5 pages ahead of me, though he didn't seem to mind telling me why he was leaving it, or not. It's also a bit easy to lose track of what's been dealt with and what hasn't. Partly because it's the expert, not me who was dealing with the problems, but also because it's also all very interconnected, and a fault at one place could be caused by a fault at another. But I'm pretty sure it'll get easier with a few shifts - and once I manage to digest the entire TWiki properly! \ No newline at end of file | ||||||||
Added: | ||||||||
> > |
=====================================================================================================
Specific issues: Chris CT 24/11/09: I was asked to submit some test jobs to cloud 'CA' site 'TRIUMF'. I followed the procedures but got a nasty python error! On lxplus, I did: source /afs/cern.ch/project/gd/LCG-share/current/etc/profile.d/grid_env.sh source /afs/cern.ch/atlas/offline/external/GRID/ddm/DQ2Clients/setup.zsh voms-proxy-init --voms atlas mkdir panda cd panda =svn co http://www.usatlas.bnl.gov/svn/panda/panda-server/current/pandaserver/test= ![]() =svn co http://www.usatlas.bnl.gov/svn/panda/panda-server/current/pandaserver/taskbuffer= ![]() =svn co http://www.usatlas.bnl.gov/svn/panda/panda-server/current/pandaserver/userinterface= ![]() cd test export PYTHONPATH ..:$PYTHONPATH= edited the file test/testG4sim15.py to make sure site='TRIUMF' and cloud='CA'
then ran:python testG4sim15.py got error: File "/afs/cern.ch/user/c/ccollins/panda/taskbuffer/FileSpec.py", line 94, in __getstate__ if isinstance(val,cx_Oracle.Timestamp): NameError: global name 'cx_Oracle' is not defined I was using python 2.5 (you can check this by typing python then CTRL-D to exit)
so the fix!...edit the file../taskbuffer/FileSpec.py and comment out these 3 lines below the comment line which is already there: # convert cx_Oracle.Timestamp to datetime. this is not needed since python 2.4 #if isinstance(val,cx_Oracle.Timestamp): # val = datetime.datetime(val.year,val.month,val.day, # val.hour,val.minute,val.second) Job was submitted. My output looked like: --------------------- 0 PandaID=1031455144 and the test jobs appeared here: http://panda.cern.ch:25980/server/pandamon/query?job=*&type=test&hours=3 ![]() |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 19 to 19 | ||||||||
So, starting your shift: First, open the ADCoS TWiki![]() ![]() ![]() ![]() ![]() ![]() | ||||||||
Added: | ||||||||
> > | You may also find it useful to have a window open with your 'shifts' emails right by the browser. | |||||||
ChrisCT got a reply from Graeme: apparently the dashboards do not really update very well at all- be careful! - If you go in to the cloud view and get a url like http://dashb-atlas-prodsys.cern.ch/dashboard/request.py/overview?grouping=cloud&start-date=2009-11-11%2000:00:00&end-date=2009-11-11%2012:59:59&grouping=cloud |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 38 to 38 | ||||||||
| ||||||||
Added: | ||||||||
> > |
| |||||||
First thing you should probably do is read the previous eLog and shift summary. |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 40 to 40 | ||||||||
| ||||||||
Changed: | ||||||||
< < | Also have a look at the sheduled downtime for sites here: https://atlas-install.roma1.infn.it/atlas_install/list.php?sitename=AGLT2![]() ![]() | |||||||
> > |
Also have a look at the sheduled downtime for sites here: https://atlas-install.roma1.infn.it/atlas_install/list.php?sitename=AGLT2![]() ![]() | |||||||
|
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 18 to 18 | ||||||||
So, starting your shift: First, open the ADCoS TWiki![]() ![]() ![]() ![]() ![]() ![]() | ||||||||
Added: | ||||||||
> > | Once you have all the windows open, probably best to check the eLog and previous shift summary (there will be a list of hot issues!) | |||||||
ChrisCT got a reply from Graeme: apparently the dashboards do not really update very well at all- be careful! - If you go in to the cloud view and get a url like http://dashb-atlas-prodsys.cern.ch/dashboard/request.py/overview?grouping=cloud&start-date=2009-11-11%2000:00:00&end-date=2009-11-11%2012:59:59&grouping=cloud as opposed to: |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Changed: | ||||||||
< < | Report from Alistair. 27/10/2009: | |||||||
> > | Report from Alistair. 27/10/2009. Modified by ChrisCT on 11/11/2009: | |||||||
See at the bottom of the page for general conclusions. | ||||||||
Line: 16 to 16 | ||||||||
| ||||||||
Changed: | ||||||||
< < | So, starting your shift: First, open the ADCoS TWiki![]() ![]() ![]() ![]() ![]() ![]() | |||||||
> > | So, starting your shift: First, open the ADCoS TWiki![]() ![]() ![]() ![]() ![]() ![]() http://dashb-atlas-prodsys.cern.ch/dashboard/request.py/overview?grouping=cloud&start-date=2009-11-11%2000:00:00&end-date=2009-11-11%2012:59:59&grouping=cloud as opposed to: http://dashb-atlas-prodsys.cern.ch/dashboard/request.py/overview?grouping=cloud then you have a 'frozen' url which will NEVER update! On the Panda monitor, look out for 'Waiting' jobs. These might come from jobs not able to get their input (e.g. AOD). you can tell this by
![]() | |||||||
Then sign into the ADC Control Room (if at CERN, refer here![]() | ||||||||
Line: 24 to 38 | ||||||||
| ||||||||
Added: | ||||||||
> > | Also have a look at the sheduled downtime for sites here: https://atlas-install.roma1.infn.it/atlas_install/list.php?sitename=AGLT2![]() ![]()
| |||||||
Everyone seems friendly, and will try to answer any questions - I spotted a couple of things and mentioned them during my time on shift, and invariably the expert had already seen them, but was happy to explain that he'd already dealt with it, or why it wasn't important. |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 21 to 21 | ||||||||
Then sign into the ADC Control Room (if at CERN, refer here![]() | ||||||||
Added: | ||||||||
> > |
| |||||||
Everyone seems friendly, and will try to answer any questions - I spotted a couple of things and mentioned them during my time on shift, and invariably the expert had already seen them, but was happy to explain that he'd already dealt with it, or why it wasn't important. |
Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
ADCoS shift summary | ||||||||
Line: 8 to 8 | ||||||||
To book your shifts, you need to use the Operation Task Planner (OTP)![]() | ||||||||
Changed: | ||||||||
< < | First, open the ADCoS TWiki![]() ![]() ![]() ![]() ![]() ![]() | |||||||
> > | You can see who is supposed to be on shift at the ADCPoint1Shift page: https://twiki.cern.ch/twiki/bin/view/Atlas/ADCPoint1Shift#Shift_schedule![]()
![]() ![]() ![]() ![]() ![]() ![]() | |||||||
Then sign into the ADC Control Room (if at CERN, refer here![]() |
Line: 1 to 1 | |||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Added: | |||||||||||||||||||||||||||
> > |
ADCoS shift summaryReport from Alistair. 27/10/2009: See at the bottom of the page for general conclusions. To book your shifts, you need to use the Operation Task Planner (OTP)![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|