SWT2_UTA still being addressed. Need VORS registration - not yet. Hopefully this week, may take until January.
Still not in VORS correctly. Will get fixed this week.
BNL accounting info was lost - Xin investigating. There was confusion on the WLCG APEL site - having to do with the change in the Gratia site name - they appear to have static mappings. Xin still investigating.
Being sorted out by Xin. Converging, hope to have correct accounting by end of week.
John W is still clarifying w/ EGEE people on naming convention. Xin will continue to push the issue w/ John Weigand. Michael will push this with Ruth.
Schedule a phone call w/ Sue to get the US Facility view available. Not done - need to follow up (Rob).
Split of Nagios server into internal and external - still working on this. Work has now started.
Wisconsin LRC problem
Increase the timeouts for MWT2
RSV publishing to WLCG
Starting to publish this now
Dantong will follow up
Site news and issues (all sites)
T1: Getting prepared for FDR; discussed how to distribute data. Rearrange space usage in dCache. 210 TB, 20-30 to be used for data in front of HPSS. Retiring some worker nodes, so will need to figure out where data goes. Pinning system starting to give good results (3 files out of 200K lost).
AGLT2: Running well recently - up to 850 jobs. Issues with Dell switches, upgraded firmware. Lessons: switches do not work as documented, Dell support lagging. The switches are stacked, and had to take the
SWT2_UTA: new cluster nearly online (will have 75 TB online) - test jobs running. Running xrootd, running a gridftp door. Will add SRM doors w/ new purchase. (Note: need spare machines for PROOF clusters.) Finalizing purchase for next round - online in March (capacity 240 TB disk, >400 cores, ~100 servers).
SWT2_OU: all is well. Problems w/ motherboard on gridftp server (keeps crashing w/ dropped packets, not understood). Working on 10G upgrade.
WT2: using SRM load balancer from Bestman, working well. 2 new gridftp servers. External network - working on 10G network. Problem is identifying power for this - and Ganglia monitoring (for external viewing). Hardware - CPUs from purchase arrived (34 machines - 272 cores). Installing these now - expect them to come online by end of the month. Storage - not clear what the situation is, given funding.