MinutesDataManageJul21
Introduction
Minutes of the US ATLAS Data Management meeting, July 21, 2009
- Previous meetings and background : IntegrationProgram
- Coordinates: Tuesdays, Noon Central
- (309) 946-5300, Access code: 735188; Dial *6 to mute/un-mute.
Attending
- Meeting attendees: Shawn, Armen, Charles, John, Michael, Wensheng, Patrick, Hiro
- Apologies: Pedro, Wei
- Guests:
Topics for this week
- Local site movers - Charles
- UC, IU, NE, ITB: working well
- AGLT2: under development for Chimera
- BNL: Hiro and Paul working on using pnfsid
- Storage status - Armen
- BNL: space on all space tokens look ok, new extension to Thorr's being commissioned, maybe new 800 TB by end of week
- Continuing work on Bestman reproting
- BNL worker node storage retirement plan - Michael
- Going according to plan, in 1 week all acas storage will be gone when batch nodes upgrade to SL5
- Armen, Pedro working on details
- BNL deletion rate
- Central deletion should be limited to 1 Hz
- Hiro's deletion should be limited to 2 Hz
- Incorrect GUID's - Hiro
- Some Fast Reprocessing files had wrong GUID from pilot job recovery
- Hiro fixed some files manually
- Pilot will be fixed in next release
- Kaushik will let sites know which files are important to be fixed - otherwise ignore problem
- Dataset consistency
- Action item from last week - run Charles ccc at all Tier 2 sites, and look at results, to decide if we need to take this issue up with DQ2 team
- Discussion AGLT2 - increasing number of LFC orphans (in pnfs but not LFC), and some ghosts (in LFC but not dcache)
- Discussion MWT2 - using a new version, better html output, few problems found
- Charles - 30 min time cutoff to avoid transient issues, seems to be enough for now
- Charles - central deletion is making this complicated, can we get status from DQ2 while dataset is being deleted?
- Kaushik - will follow up on the issues found via email to DDM team
- Missing subscriptions - Hiro, Shawn
- AGLT2 - don't understand what was wrong, but working now, maybe hanging connections to CERN?
- Action item - continue debugging, follow up next week
- Hot issues
- AGLT2 has had a problem posted about a user dataset (see https://rt-racf.bnl.gov/rt/Ticket/Display.html?id=13599) User was unable to get dq2-get to work for them on dataset group09.PhotonAnalysis.mc08.105802.JF17_pythia_jet_filter.merge.AOD.e347_s462_d153_r643_t53.NTUP.rel15301.PAU-00-00-10.v1 I would like to understand if there is a DQ2 problem here. Using dq2-list-file-replicas for this dataset fails.
- AOB
--
KaushikDe - 20 Jul 2009
About This Site
Please note that this site is a content mirror of the BNL US ATLAS TWiki. To edit the content of this page, click the Edit this page button at the top of the page and log in with your US ATLAS computing account name and password.
Attachments
run_ccc.log (587.3K) |
ShawnMckee, 20 Jul 2009 - 15:43 | Top-level AGLT2 run log for ccc.py
ccc_20090720_005616.log (2.2K) |
ShawnMckee, 20 Jul 2009 - 15:43 | Today's logfile from running ccc.py at AGLT2
run_ccc_jul21.log (129.8K) |
ShawnMckee, 21 Jul 2009 - 11:23 | Run ccc log from July 21
ccc_20090721_005553.log (2.2K) |
ShawnMckee, 21 Jul 2009 - 11:24 | ccc.py log from July 21