r2 - 09 Jun 2009 - 11:12:42 - RobertGardner



Meeting of the Facilities working group on analysis queue performance, June 9, 2009



  • Meeting attendees: Rob, Bob, Saul, Patrick, Tom
  • Apologies: none

Status of stress test jobs in US

  • last week
    • The failing DB access job at SLAC and SWT2 is now understood. Input files were copied correctly but could not be read. Tadashi found the problem: PyUtils/AthFile.py failed to check the file because it supports os and rfio only, and needs to support root:// (and dcap://) as well. Sebastien Binet provided a new tag, PyUtils-00-06-17. Nurcan tried to check out the Tools/PyUtils-00-06-17 package locally from SVN so that it would be compiled on the WNs with pathena. However, the package does not currently compile locally. Investigating.
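The protocol gap described above can be illustrated with a minimal sketch. This is not the PyUtils/AthFile.py code; the function name `classify_input` and the constant `REMOTE_PROTOCOLS` are hypothetical, and the sketch only assumes that input paths are plain local paths or URL-style names such as rfio:, root:// and dcap://.

```python
from urllib.parse import urlparse

# Hypothetical list of remote access protocols a file checker would accept.
# Per the minutes, only "os" (local) and rfio were handled originally;
# root and dcap are the ones that needed to be added.
REMOTE_PROTOCOLS = ("rfio", "root", "dcap")


def classify_input(path):
    """Return the access protocol implied by an input file name.

    Plain paths and file: URLs are reported as "os" (local POSIX access).
    Anything with an unrecognised scheme raises ValueError instead of
    being silently treated as a local file.
    """
    scheme = urlparse(path).scheme
    if scheme in REMOTE_PROTOCOLS:
        return scheme
    if scheme in ("", "file"):
        return "os"
    raise ValueError("unsupported protocol %r in %r" % (scheme, path))
```

Rejecting unknown schemes explicitly, rather than falling through to local access, is a design choice of this sketch: it would surface an unsupported protocol at check time instead of as a later read failure.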

  • this week
    • AGLT2 studies
    • finding lots of jobs sitting in IO_WAIT
    • dccp used as copy input tool at AGLT2, MWT2
    • Bob notes pilots seem to be using gt2
    • Problematic jobs using very little nice cpu.
    • finding disk contention on 8-core machines - maxing out the number of IOPS
    • Need to be able to find out which jobs are running on which nodes
    • Saul: can determine nodes from job types - send list
    • Saul - ANALY_NET2 doesn't see anything in IO_WAIT
    • Patrick - working on direct I/O from xrootd issues
    • I/O - no sustained WAN traffic.
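One way to quantify the IO_WAIT and nice-CPU observations on a worker node is to compare two samples of the aggregate "cpu" line from /proc/stat. The sketch below is an illustration only, not a tool used by the working group; the function names are hypothetical, and it assumes the standard Linux field order (user, nice, system, idle, iowait, irq, softirq, steal, ...).

```python
def parse_cpu_line(line):
    """Parse the aggregate 'cpu' line of /proc/stat into a list of tick counts."""
    fields = line.split()
    if len(fields) < 6 or fields[0] != "cpu":
        raise ValueError("not an aggregate /proc/stat cpu line: %r" % line)
    return [int(value) for value in fields[1:]]


def cpu_shares(before, after):
    """Return the fraction of CPU time spent in nice and iowait between two samples.

    Only the first eight counters (user through steal) are summed, because
    guest time is already accounted for inside the user counter.
    """
    deltas = [b - a for a, b in zip(parse_cpu_line(before), parse_cpu_line(after))]
    total = sum(deltas[:8])
    if total <= 0:
        return {"nice": 0.0, "iowait": 0.0}
    return {"nice": deltas[1] / total, "iowait": deltas[4] / total}
```

On a Linux node the two input strings would be the first line of /proc/stat read a few seconds apart. A high iowait share together with a low nice share would match the pattern reported above for the problematic jobs.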

-- RobertGardner - 08 Jun 2009
