Knowing where all the important files are is really all that is needed. Sofistication can come later.
I would welcome a CCP4 database-assisted data archive system.

Here is my contribution to the discussion:

I agree with Paul Paukstelis that getting users to use any database-assisted data archive system is the biggest obstacle. I have had problems with compliance with my system, where all that the student has to do is to provide file and directory names each Friday to keep the database up to date.

It is a simple html based access system where through hyperlinks one can access the data anywhere where it is stored. Users need only provide the directories names of where the various pieces of data are stored within the accessible network and the data manager (any HTML competent individual) can then set-up the links to the main control platform (start-up html page). The advantage of such system is that it is platform independent and needs only a well configured browser.
It is backward compatible with any old data.

George Pelios may want to consider an automated system where mosflm, scala and all subsequent programs contribute to create and update a raw data retrieval file on the basis of the files they have used. When the project is finished a backup program should be able to retrieve all such files to be stored in a consolidated manner for transfer to a long term storage server.

A brief description of the system I use for synchrotron data collection:
========================================================================
Prior to the synchrotron trip, each sample taken to the synchrotron is entered in a table that represents its position in the puck with hyperlinks to a file describing its position in the crystallization tray (this file will have hyperlinks
to crystallization and all prior preparation steps).
As data is collected a short comment (resolution and number of frames is included if data has been collected) as the data is transfered in the home lab a link to the directory where the data is
stored is then added.
To give an idea of data quality Mosflm and gimp screen capture are used to create a jpg of the first data image (with the frame filename added) which is stored in the same directory as
the raw data frames. This image is accessed when clicking on the comment.
Compliance with the system can be checked by clicking on comments other than "not tested".

It is all manual but is not very time consuming once the initial html templates have been set up. Still I am looking foward to a simple CCP4 designed system that can do something similar
automatically.

I would also recommend looking at ispyb implemented at the ESRF which is also web based:
www.esrf.eu/UsersAndScience/Experiments/MX/Software/ispyb

Enrico.

--
Enrico A. Stura D.Phil. (Oxon) ,    Tel: 33 (0)1 69 08 4302 Office
Room 19, Bat.152,                   Tel: 33 (0)1 69 08 9449    Lab
LTMB, SIMOPRO, IBiTec-S, CE Saclay, 91191 Gif-sur-Yvette,   FRANCE
http://www-dsv.cea.fr/en/institutes/institute-of-biology-and-technology-saclay-ibitec-s/unites-de-recherche/department-of-molecular-engineering-of-proteins-simopro/molecular-toxinology-and-biotechnology-laboratory-ltmb/crystallogenesis-e.-stura
http://www.chem.gla.ac.uk/protein/mirror/stura/index2.html
e-mail: [email protected]                             Fax: 33 (0)1 69 08 90 71

Reply via email to