Action item - Rob: Compare pledge amounts to current capacity. See: CapacitySummary
Next update - end of March.
Facility FDR analysis roundup (Nurcan)
Happy to report that all Tier2's have passed all FDR tests!
Will likely have to follow-up with different users, eg., Akiro.
NET2 now finishing fine.
HighPt package also running okay at OU.
Thanks to all, esp. Patrick, Horst!
ANALY_GLOW_ATLAS - will add. Not completely certified, jobs currently failing with config errors.
Major Facility milestone achieved.
Issue - special tags in the release..discuss / follow-up with Fred.
LRC, Adler32 updates at Tier2 sites (Hiro, all)
Follow-up - any remaining issues?
All sites updated. Any problems?
Adler32 update script for FDR data - have all sites updated? Reminder to do so.
Adler32 updates for Panda services (Kaushik)
Follow-up from last week:
Mark reports - there have been pilot-specific issues. Paul is on top of this. Updates to pilot3 are automatic. Pilot2 have to be done manually - will check.
No update.
Summary of problems at BNL
Analysis jobs running at BNL too slow. New autopilot + BNL gatekeper take 20 minutes to schedule jobs. Currently troubleshooting, expect update tonight/tomorrow.
Dantong suggested to Kaushik to revert to the old version for the time being.
Operations: Production (Kaushik)
Production summary (Michael)
FDR-2 preparations are now beginning.
Reprocess
Production shift report (Mark)
Barry on shift.
Operations: DDM (Kaushik/Hiro)
CCRC08 replication plan
Hiro started creating subscriptions - checking size.
Would like to run for 3 days. Setting up a DQ2 subscriptions monitor.
ATLAS requirements for storage elements
Now a formal requirement space tokens at the Tier2s. Outlined in Kor's document for CCRC08.
We can no longer afford only gridftp only for entry points.
Need srm v2.2 at Tier2s.
Need firm plan for equipping T2's with srm v2.2 by April 2
Space token - a method to reserve space for dedicated purposes. Necessary for managing the space. Basically a quota system.
See further ToA file for how tokens are used.
We will continue the prep phase for FDR2 in a mixed mode.
Role determines authorization to use the space. Will start with a single space token for analysis groups.
Lets reserve some time to discuss this at UNC.
What about Storm? Tightly coupled with LCG and glite.
LFC integration (John/Mark/Hiro)
Still waiting on a testsite installed in scheddb. Mark will get Torre.
Hiro and John working on migration problem. Takes a lot of time. There may be speedups by running locally.
T1: Gabriele - need to upgrade dCache again. Plan for next Tuesday. All day exercise. Upgrade will make pools start more quickly. 1.8p6, released on Monday. Need to consult Kaushik and Hong.
AGLT2: Scheduled maintenance shutdown tomorrow. Been getting lots of Nagios tickets lately - 50 seconds. Suggest forming a small task force to look into these details. Calibration center - need to decide whether you want to use a dedicated channel to the T0. This requires an FTS channel (maybe not?). Endpoint needs to be defined in the GOC database (for WLCG BDII). Bob will be in touch with Hiro.
NET2: Shutdown was on Tuesday - upgraded worker nodes to RHEL4. Production back up this evening; analysis queue setup.
MWT2: Postgres database filling up- caused a partition to fill.
SWT2_UTA: UTA_dpcc - building lost power, recovering. (Note - this was cause for many of the Nagios tickets). Will retire SWT2_UTA analysis queue.
SWT2_OU: Subscribed FDR data - in support of analysis queue. Ibrix crash - to be cleaned up.
WT2: Updating endpoint - doesn't work with dq2_get. The external commands used are not working on the slac srm server. It is hard-coded for BNL. lcg-copy
Please note that this site is a content mirror of the BNL US ATLAS TWiki. To edit the content of this page, click the Edit this page button at the top of the page and log in with your US ATLAS computing account name and password.