All Tier 2s low on space; Tier 1 cleanup also needed
Stephane started central cleanup of aborted tasks at SWT2 and SLACT2
Seeing high rate of LFC dying at SWT2 since central deletion started (half a dozen times in 3-4 days)
Seen ~4TB cleaned up, no LFC crashes, but problem with Bestman crashes (may be unrelated - probably firewall related)
Many LFC entries deleted but the files themselves not yet deleted; file deletion is delayed by 1-2 days (statistics on how many files are left behind on storage after 1 week will be provided next week)
Problem noticed by Patrick - central deletion leaves metadata in LFC
What should the protection settings be for this space? (Who can write?) - consensus is usatlas1.
AGLT2 has set up space tokens for the new SCRATCHDISK area for '/atlas/Role=production', '/atlas/usatlas/Role=production', 'usatlas3', 'usatlas4'. See:
"File exists" problem. See https://rt-racf.bnl.gov/rt/Ticket/Display.html?id=12638. Almost all AGLT2 SRM failures are just "file exists" errors. Where are these coming from? On MCDISK and DATADISK - so they are from DQ2 SS. Could be due to dq2_recreate after SS restart.
AOB
Major failure at BNL last week, some corrupted files from that, Hiro cleaning up
AGLT2 seeing files coming from TRIUMF, FZK, LYON, INFN, NDGF... to MCDISK (follow up with Alexei, Hiro)