next up previous contents index
Next: Accessing data / Notes Up: Data migration to dCache Previous: Preparation / Naming   Contents   Index


The Migration Procedure

The data has to be migrated to dCache from selected work group servers (see Infobox below) by the tool called migrate. It allows to recursively migrate a given source directory or single larger files into dCache. In common, the script call results in a creation of a tarball in the dCache tape instance1.4 and thus generates an archive copy of data. It can be executed in different manner as follows:

/opt/gamma/migrate --beamtime=<AppID> --pack or --copy [--stage] [--parent] [--split] /path/to/dir

or

/opt/gamma/migrate --beamtime=<AppID> --copy --format=tar [--stage] /path/to/file.tar1.5

or (most likely for future Nexus format)

/opt/gamma/migrate --beamtime=<AppID> --copy [--stage] /path/to/file

where

<AppID> denotes the Beamtime Application ID (NOT the proposal ID) for the dataset taken,
--pack option is used to create a tarball from the source directory,
--copy option can be used to directly copy a source file (! ONLY USE THIS FOR SOME SINGLE LARGE FILES [100MB $\le$ filesize $\le$ 300GB] such as tarballs or Nexus-Files! Smaller or larger file sizes will cause a tremendous increase of file access times [tape rewinds, tape changes etc.]). Typically one uses this option together with the format option given below.
--format option can be used as option when copying a single large file. It tells the script that the large file is a tarball and will cause a tarball extraction call when staging the data to dCache disc instance. Currently, you can provide uncompressed as well as compressed tarballs to the script. In case one wants to provide a compressed archive, it is recommended to use bzip2 compression scheme for compression (tar option 'j').1.6
--stage option is optional and means that the data should be staged to dCache disc instance in addition to bringing it to tape.
--parent option is optional and means that the data should be migrated by preserving the name of the upper directory (i.e. if migration partially failed).
--split option is optional and means that data migration should start from on the first subdirectory, i.e. the script does not create a single tarball for directories smaller than 350 GB. It treats the given directory in same manner as for directories larger than 350 GB.
Mandatory for data migration are the provision of the beamtime application ID $<$AppID$>$, the pack (or copy) option and the provision of a data data source. In common, the data source is the name of the 1$^\mathrm{st}$ level of a directory tree and thus the pack option is mandatory as well.
The most common call for data migration is:
/opt/gamma/migrate --beamtime=<AppID> --pack --split /path/to/dir

Current limitations / currently to take into account:

\fbox{\begin{minipage}[t]{.96\textwidth}
{\it Note:} FAQ here ? like e.g.:
\par
...
... other options exists
\par
A4: Beg for Nexus format to show up.\end{minipage}}


next up previous contents index
Next: Accessing data / Notes Up: Data migration to dCache Previous: Preparation / Naming   Contents   Index
Andre Rothkirch 2013-07-17