19th International CODATA Conference
Category: Data Archiving
Digital archiving of scientific information: Czech experience
Prof. Dr. Pavel Slavik (slavik@fel.cvut.cz), P. Mach, M. Snorek
MSc and PhD theses are a very interesting source of scientific data. Given a large collection of these documents accumulated over a certain period of time, it may be possible to obtain an overall picture of the context in which particular research was performed. As the majority of theses now exist in digital form, it is necessary to develop methods for preserving them. This problem is not new. Nevertheless, existing solutions are mostly directed towards the needs of digital libraries, which are responsible for the loan and distribution of these documents (systems such as ProQuest).
The approach selected at the
The solution developed is based on redundancy: each thesis is stored in three formats, which supports later digital archiving in an environment where documents are migrated. The main idea was to choose formats that are as independent as possible of software and hardware platforms and that allow easy reconstruction of a document if a migration runs into problems. The formats chosen are PDF, text and bitmap. A pilot implementation of the system has been completed, and the first practical experiments are currently under way.
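The redundancy idea can be illustrated with a minimal sketch. It is not the pilot implementation, whose details are not given here. It assumes that each thesis is held as three byte streams (PDF, text, bitmap) and that a checksum is recorded for each one, so that after a migration the renditions that survived intact can be identified and used for reconstruction. The use of SHA-256, the manifest layout, and the names build_manifest and intact_formats are assumptions made for illustration only.

```python
import hashlib

# Assumption: these labels stand in for the three formats named in the abstract.
FORMATS = ("pdf", "text", "bitmap")


def build_manifest(thesis_id, renditions):
    """Record a SHA-256 digest for each of the three renditions of one thesis."""
    missing = [fmt for fmt in FORMATS if fmt not in renditions]
    if missing:
        raise ValueError(f"missing renditions: {missing}")
    return {
        "thesis_id": thesis_id,
        "digests": {
            fmt: hashlib.sha256(renditions[fmt]).hexdigest() for fmt in FORMATS
        },
    }


def intact_formats(manifest, renditions):
    """Return the formats whose current bytes still match the recorded digest."""
    return [
        fmt
        for fmt in FORMATS
        if fmt in renditions
        and hashlib.sha256(renditions[fmt]).hexdigest() == manifest["digests"][fmt]
    ]


if __name__ == "__main__":
    # Placeholder byte strings, not real documents.
    original = {"pdf": b"%PDF-1.4 ...", "text": b"Thesis text", "bitmap": b"\x89PNG..."}
    manifest = build_manifest("thesis-0001", original)

    # Simulate a migration that damaged only the PDF rendition.
    damaged = dict(original, pdf=b"corrupted")
    print(intact_formats(manifest, damaged))  # ['text', 'bitmap']
```

In this sketch, a damaged PDF still leaves the text and bitmap renditions available, which is the kind of fallback that storing three independent formats is meant to provide.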