Special topic: DQ2 0.6 upgrade (Hiro, Alexei, Miguel)
Hiro believes it's not stable enough for deployment - no stable version of the RPMs has been declared yet.
Alexei: no urgency for 0.6.2 upgrade. PIC and LYON upgraded - will run functional tests. For BNL - important that it's upgraded for functional tests - all sites will send data to BNL - BNL-OSG2_DATADISK.
Leave alone BNLDISK, BNLTAPE for now.
Alexei: functional tests (FT) will be happening over the next few weeks. Each T1 will subscribe to LYON, a few terabytes in size.
AOD distribution within the US - equipping Hiro with tools. But what other AODs are needed, e.g. Release 13 AODs? All AODs to Tier2s.
Alexei will update the monitoring page for the FDR data.
Pilots query both the site's LRC and the DQ2 catalog - Kaushik.
All M6 data has been replicated to BNL.
Following up from US ATLAS Transparent Distributed Facility Workshop
Follow-up - last week: Still need to add the SWT2_CPB. Still waiting for this to get done.
IU_OSG - Fred is following up.
Michael urges all sites to give this priority, and to check that information is reported correctly.
Throughput initiative - status (Shawn)
Next meeting? Probably next week.
Jay has generated graphs for the Tier2 --> BNL.
Panda release installation issues (Xin)
Any release installation issues to follow up?
Xin thinks we can start using it now. The issue is how to submit the pilots to do these jobs. One possible plan is to use autopilot, but there may be problems.
Xin will follow up with Torre.
Nagios Alerts - Focus review (Dantong)
Follow-up next.
RSV, Nagios, SAM (WLCG) site availability monitoring program (Tomasz)
Facility Nagios
Follow-up: Split of Nagios server into internal and external. Done.
Local RSV to Nagios publishing
Port now working at MWT2_IU - some problems to be cleaned up, but basically working.
RSV to SAM
Now working - needs review.
PROOF / Xrootd
See presentation from Sergey at last week's workshop.
There will be a meeting this Thursday, see ProofXrootd.
Hiro points out there is an xrootd daemon for a site's LRC.
T1: Gabriele working on stabilizing the dCache instance. Finding lots of hanging instances - perhaps related to the OS patches on the Thumpers. Hope the build process will be completed later today, fully completed by the end of the week. There are performance problems regarding job throughput through BNL. New machines came in, some machines going to other network segments, configuration changes through autopilot, etc - lots of things changed at once. Last week John Hover and Xin were able to talk with the Condor-G team - discovered problems within Condor-G that resolved part of the issues. Still not completely understood.
AGLT2: Tom - all okay. Monalisa at OSG being deprecated. Jay is going to host this - only needs configuration files.
NET2: all okay. Need to register
MWT2: dCache upgrade.
SWT2 (UTA): Moving to a Rocks install for upgrade to dpcc.
SWT2 (OU): all okay. Still waiting for 10G equipment from UTA; switch now in place. Will also have to upgrade Ibrix segment servers to RHEL5.
WT2: starting to work on implementation of space tokens for SRM v2.2 - working with xrootd developer, sent to Alex. Long power outage in April, last weekend 2/3 day.