r1 - 16 Apr 2008 - 05:01:51 - RobertGardner

OSGservicesP4

Background

Previous phases:

Goals for Phase IV

In this phase there are the following OSG-related software/service goals:

  1. RSV --> WLCG SAM: service availability monitoring of all OSG sites.
    • In SiteCertificationP4, check off whether RSV is reporting correctly to SAM. Instructions for how to do this are still needed; otherwise this issue carries over to Phase V.
  2. Evaluation of dCache 1.8 from the OSG-VDT storage group, including functional testing, on Integration Testbed sites (UC, BNL, AGLT2). Status: deployed, DONE.
  3. WLCG accounting portal view for the US ATLAS Facility. Status: delayed.
  4. A site-level RSV --> Nagios demonstrator: publish the results of RSV probes into a local Nagios instance. Status: prototype complete.
  5. Provisioning of OSG ITB 0.9 testbed sites at BNL, UC, OU. Status: delayed.
  6. Validation of ATLAS/Panda on ITB 0.9, in advance of the OSG 1.0 release (estimated May 1, 2008). Status: delayed.
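
The RSV --> Nagios demonstrator in goal 4 is not described in detail on this page. As a rough sketch only, one common way to feed externally produced results into Nagios is a passive check submitted through the Nagios external command file. The status names assumed for RSV probes, the host and service names, and the command file path are all illustrative assumptions, not the prototype's actual configuration.

```python
import time

# Nagios return codes 0-3 are standard; the RSV status names are an assumption.
NAGIOS_CODES = {"OK": 0, "WARNING": 1, "CRITICAL": 2, "UNKNOWN": 3}


def passive_check_line(host, service, rsv_status, detail, timestamp=None):
    """Format one Nagios PROCESS_SERVICE_CHECK_RESULT external command."""
    if timestamp is None:
        timestamp = int(time.time())
    # Unrecognized statuses are reported as UNKNOWN (3) rather than dropped.
    code = NAGIOS_CODES.get(rsv_status.upper(), 3)
    # Nagios splits fields on ';' and commands on newlines, so sanitize the text.
    clean = detail.replace(";", ",").replace("\n", " ")
    return "[%d] PROCESS_SERVICE_CHECK_RESULT;%s;%s;%d;%s" % (
        timestamp, host, service, code, clean)


def submit(line, command_file):
    """Write the command to the Nagios external command file (a named pipe)."""
    with open(command_file, "w") as pipe:
        pipe.write(line + "\n")
```

A caller would build one line per probe result and pass it to `submit()` together with the path of the local Nagios command file, which depends on how Nagios was installed. The matching service must be defined in Nagios with passive checks enabled.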

Schedule

  • RSV --> WLCG SAM: depends on OSG-GOC progress. When finished, only site-by-site validation would be required initially, with periodic monitoring afterwards.
  • dCache 1.8 evaluation: begin Feb 1, 2008; complete by Feb 15. Done at both AGLT2 and MWT2. DONE.
  • WLCG accounting: depends on WLCG action. No action needed by sites.
  • Site-level RSV: only valid for sites running Nagios monitoring consoles. Prototype done at MWT2_IU. DONE.
  • ITB 0.9 activities may carry over into Phase V. They will, given the OSG 1.0 schedule slippage.

Site Availability Monitoring (SAM)

All US ATLAS Distributed Facility Sites need to appear in the SAM plots at CERN. The site that WLCG is using to track availability is the GridView site: http://gridview.cern.ch/GRIDVIEW/same_index.php. This site is moderately difficult to navigate, so the instructions below suggest a different site for checking that your data is reaching the SAM system at CERN.

There are three steps to making your site report to SAM:

  1. Set up the RSV site availability monitoring probes.
  2. Set up a grid proxy for the probes.
  3. Check that your site is entering data in the database (note that the tests run only once every two hours).

To check that a site is reporting correctly, browse to https://lcg-sam.cern.ch:8443/sam/sam.py, select the OSGCE radio button, and click ShowSensorTests. NB: Accessing this page requires that your browser has a valid Grid or CERN certificate loaded. This produces a plot similar to SAM.pdf attached below (note that the plot shows both US ATLAS and US CMS sites).

As of April 9, 2008 the following US ATLAS Distributed Facility Sites are shown in SAM:

SITE          STATUS
AGLT2         no
IU_OSG        no
MWT2_IU       no
MWT2_UC       no
PROD_SLAC     no
UC_ATLAS_UC   yes
UTA_DPCC      no
bnl_atlas_1   yes

Only two sites (UC_ATLAS_UC and bnl_atlas_1) are passing all tests. Any site not listed in the table has probably not configured RSV yet.

The documentation to configure RSV is found at: http://rsv.grid.iu.edu/documentation/vdt-package.html. For help with debugging problems configuring RSV please contact goc at opensciencegrid dot org. You can also contact me: luehring at indiana dot edu.

The list of sites known to SAM is available in comma-delimited format at: http://oim.grid.iu.edu/publisher/get_osg_interop_monitoring_list.php. If your site is not on this list, please send an email to the GOC (goc at opensciencegrid dot org).
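
Checking the comma-delimited list by eye is error-prone, so a small script can help. The sketch below assumes only that each site name appears as one complete field somewhere in the listing; the actual column layout is not documented here, and the sample text and host names are made up for illustration.

```python
import csv
import io


def sites_in_listing(text):
    """Collect every non-empty field from comma-delimited text."""
    fields = set()
    for row in csv.reader(io.StringIO(text)):
        for field in row:
            field = field.strip()
            if field:
                fields.add(field)
    return fields


def missing_sites(text, expected):
    """Return the expected site names that do not appear as a field."""
    present = sites_in_listing(text)
    return [name for name in expected if name not in present]


# Made-up sample in an assumed layout (site name, then a host name).
sample = "AGLT2,gate01.example.org\nMWT2_IU,gk.example.org\n"
print(missing_sites(sample, ["AGLT2", "UTA_DPCC"]))  # prints ['UTA_DPCC']
```

In practice the text would come from downloading the URL above (for example with `urllib.request.urlopen`) instead of the sample string. Matching whole fields avoids false positives such as MWT2 matching MWT2_IU.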

Running RSV requires a grid proxy. To avoid renewing the proxy manually from time to time, a service certificate can be used, provided the proxy does not leave the local machine. A proxy based on a service certificate can be renewed by a cron job, whereas a proxy based on a user certificate requires the user to enter the pass phrase at each renewal. Do NOT create a proxy with a long expiration time using a user certificate.
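
As an illustration of the cron-driven renewal described above, the following sketch renews a proxy only when it is close to expiring. It assumes the Globus command-line tools `grid-proxy-info` and `grid-proxy-init` are on the PATH with the options shown, and that the service key has no pass phrase. The function names, file paths, and lifetimes are hypothetical choices, not RSV defaults.

```python
import subprocess


def seconds_left(proxy_file):
    """Ask grid-proxy-info how many seconds the proxy remains valid."""
    try:
        out = subprocess.run(
            ["grid-proxy-info", "-file", proxy_file, "-timeleft"],
            capture_output=True, text=True, check=True).stdout
        return max(int(out.strip()), 0)
    except (OSError, subprocess.CalledProcessError, ValueError):
        # Missing tool, missing proxy, or unparsable output: treat as expired.
        return 0


def renewal_command(cert, key, proxy_file, hours=12):
    """Build the grid-proxy-init call for a service certificate."""
    return ["grid-proxy-init", "-cert", cert, "-key", key,
            "-out", proxy_file, "-valid", "%d:00" % hours]


def renew_if_needed(cert, key, proxy_file, min_seconds=6 * 3600,
                    run=subprocess.run):
    """Renew the proxy when less than min_seconds of lifetime remain."""
    if seconds_left(proxy_file) >= min_seconds:
        return False
    run(renewal_command(cert, key, proxy_file), check=True)
    return True
```

A cron job could call `renew_if_needed()` every hour or so with the locations of the service certificate, its key, and the proxy file that the RSV probes read. Keeping the requested lifetime short (12 hours in this sketch) is in line with the warning above about long-lived proxies.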

NB: The GOC team is working on a modification to RSV to allow the use of a gridftp server that is not on the same node as the OSG Gatekeeper. This code should be available by the end of April 9.


-- FrederickLuehring - 09 Apr 2008
-- RobertGardner - 30 Jan 2008
-- RobertGardner - 16 Apr 2008

Attachments

 