ADA data access and movement

Contact: David Adams

ATLAS Offline Computing: [ Top | Grid | ATLAS grid | Analysis | ADA ]


Introduction
Data access and movement are critical issues for distributed analysis. Data is specified in terms of datasets with varying degrees of abstraction. Typically, the ultimate access is via physical files which are replicas of logical files. It is expected that users will work with datasets: a job definition includes the name or ID of and input dataset and the result of that job is an output dataset. A user wishing to examine the result obtains replicas of the relevant files (e.g. a ROOT histogram file) from the output dataset.

Users often will want to know if the data associated with a dataset exists and whether it is "nearby", i.e. rapidly accessible at low cost. A user may also want to localize the data in a manner that allows access without a network connection, e.g. to examine on a laptop computer.

ADA data management systems
The important data management systems for ADA are NFS/AFS, magda and Don Quijote.

Other products and systems
There are also a number of products or ideas that are are not specific to ATLAS.


Last modified 27jun05 by dla