In order to help understand the possible interplay between transmission and digitization, a pilot project for the long-term preservation of research data in the social sciences and humanities (SSH) ...is presented by its two coordinators. The article provides some background context on transmission in digital form of past and present research in SSH. It shows the discrepancy between the increasing role of digital information and its fragility. It presents the standard abstract model for archival information systems and the way it was instantiated in the pilot project. It ends with some reflexive remarks on the factors that are bound to act upon the future of such projects: organizational behaviours, role of data and knowledge, communities of users, institutional issues and status of collective memory in SSH.
Pour permettre de comprendre les interactions possibles entre transmission et numérisation, un projet pilote d’archivage numérique pérenne est présenté par ses deux coordinateurs. L’article évoque le contexte actuel de transmission sous forme numérique des recherches passées et présentes en sciences humaines et sociales (SHS). Il souligne l’écart entre le rôle croissant des données numériques et leur fragilité. Il présente le modèle abstrait standard d’archivage numérique pérenne et la manière dont il a été instancié dans le projet pilote. Il termine par un retour réflexif sur les facteurs qui vont conditionner l’avenir de projets similaires: choix et comportements organisationnels, rôles respectifs des données et des connaissances, constitution et comportement des communautés d’utilisateurs, statut de la mémoire collective en SHS.
Data preservation deals with ensuring that digital data stored today can be read and interpreted tens or hundreds of years from now. As the amount of data that needs to be preserved has been growing ...significantly, thus a standardized preservation process into big data environment is required. One of the standardized process recognized by ISO 14721:2012 is Open Archival Information System (OAIS). This research proposed a framework for data preservation process in big data architecture by using OAIS main components: Ingest, Archival Storage, Preservation Planning, Access, and Data Management. Based on the analysis, four out of five OAIS main components working in the proposed framework in the big data environment, which are Archival Storage, Preservation Planning, Data Management, and Access.
This paper reviews the archival process at the Inter-university Consortium for Political and Social Research (ICPSR), a repository of digital social science data, and maps ICPSR's Ingest and Access ...operations to the Open Archival Information System (OAIS) Reference Model. The paper also assesses ICPSR's conformance with the archival responsibilities of "trusted" OAIS repositories, with the proviso that audit criteria for archival certification are still under development. The ICPSR to OAIS mapping exercise has benefits for the larger social science archiving community because it provides an interpretation of the reference model in the quantitative social science environment and points to preservation-related issues that may be salient for other social science archives. Building on the archives; long tradition of shared norms and cooperation, we may ultimately be able to design a federated system of trusted social science repositories that provides access to the global heritage. PUBLICATION ABSTRACT
Autori polaze od činjenice da bi proces uspostave digitalnog arhiva trebao biti planski i sistematično proveden. U radu uvodno obrazlažu potrebu za takvim pristupom. Potom opisuju Agenciju za ...lijekove i medicinske proizvode kao specifičnog stvaratelja gradiva, jer će ju kasnije koristiti kao primjer konkretne realizacije teorijskih uvjeta za uspostavu digitalnoga arhiva od povjerenja koji može dugoročno očuvati autentične elektroničke zapise. Pri tome opisuju trenutne postupke s dokumentacijom o lijeku te postojeće i nadolazeće standarde njezinoga elektroničkoga formata zapisa. Nakon utvrđivanja postojećeg stanja, autori analiziraju uvjete koje digitalni arhiv mora zadovoljiti kako bi bila uspostavljena i očuvana autentičnost elektroničkih zapisâ i digitalnog arhiva. U sklopu toga apliciraju apstraktni OAIS referentni model na konkretni planirani digitalni arhiv Agencije. Autori nastavljaju s pojašnjavanjem projektnoga pristupa izradi digitalnog arhiva kroz IPA projekt. Završno je prikazana konkretna realizacija apstraktnih zahtjeva analiziranih normi za uspostavu digitalnog arhiva od povjerenja kroz IPA projekt Agencije "Priprema za eCTD i uvođenje digitalnog arhivskog informacijskog sustava".
Archivists define and manage relations during the archival professional processes in different ways. Consequently, they understand them differently and in specific situations they implement them in ...different ways. Since the adoption of the first international archival standard ISAD(G), the concept of relations (i.e. relationships) has been systematically incorporated into the description of archival entities. Today’s archivists involved in the archival description must understand and implement them in accordance with the purpose of existing standards. The standards present basic practical framework for the use of relations in different environments. At the same time, quantitative relations are created mainly between archival top entities. As a consequence of increased number of relations, the complexity of the management of archival top-level entities arises. All that requires a relatively higher intellectual effort of archivists in comparison with non-standardized solutions for description of archival materials. However, when evaluating the actual strength of the archival profession, it is necessary to consider many relative factors and the related values. It includes the ability of managing and formalizing the relations between the top level archival entities, but also between their low level versions, and relates to the wholeness of archival descriptions. These abilities are dependent of the technology, archival information system, known theory and practice, archival individual experiences etc. By establishing different types of relations between archival entities, we can detect and define the extent of the intellectual work of archivists. It has not been negligible in the past. However, the standardization of the archival description only increased it. The number of relations between the so-called archival top entities had been increased with each new international archival standard during last three decades and has arisen to almost 800 within the RiC (Records in Context). Since there is no upward limit set up in this case we can expect further increase of the number and type of relations in the future. Complex relations created between archival descriptive entities represent the intellectually demanding task for modern archivists. This is why the requirement for higher technological support of the process in existing archival information systems is needed. We can soon expect the upgrading of the existing solutions with the implementation of artificial intelligence. Restrictions on human resources, available time, and existing technological and procedural solutions are becoming apparent by increasing number of descriptions in databases. That is why we can expect major changes in the near future in the field of data capture as well as in the field of creating their relations.
We propose a user-oriented approach for the automated and scheduled maintenance of Greenstone digital library collections. Existing systems require the user either to add new data manually to a ...collection, or to have programming knowledge in order to use existing application programming interfaces (APIs) in order to automate scheduled collection updates. The Greenstone Scheduler can automate the construction of any existing collection, and schedule the construction to occur periodically. This is accomplished through incorporating a module specific to this purpose into the Greenstone Librarian Interface.
RODA-in is an offline tool designed to easily create thousands of SIPs with gigabytes of data in an easy to use way. This is possible by using aggregation rules, which map files and folders to SIPs, ...and metadata association rules, which add metadata to the created SIPs. The basic workflow can be defined in a sequence of easy steps where the user starts by selecting the folders to be archived and then chooses which patterns will be used to transform the data in SIPs. As an optional step, it's possible to edit the generated SIPs to either enrich them or fix exceptions to the rule. Lastly, it's possible to export to two different formats: BagIt and E-ARK SIP. In this paper we present and discuss all the decisions and ideas taken to implement RODA-in like which workflow should be used, what aggregation and metadata association options are currently implemented, how the metadata templating system works and which other features can be used to enrich the SIPs.
Extending Greenstone for Institutional Repositories Bainbridge, David; Osborn, Wendy; Witten, Ian H. ...
Digital Libraries: Achievements, Challenges and Opportunities,
2006
Book Chapter, Conference Proceeding
Recenzirano
Odprti dostop
We examine the problem of designing a generalized system for building institutional repositories. Widely used schemes such as DSpace are tailored to a particular set of requirements: fixed metadata ...set; standard view when searching and browsing; pre-determined sequence for depositing items; built-in workflow for vetting new items. In contrast, Fedora builds in flexibility: institutional repositories are just one possible instantiation—however generality incurs a high overhead and uptake has been sluggish. This paper shows how existing components of the Greenstone software can be repurposed to provide a generalized institutional repository that falls between these extremes.
The Archivists' Toolkit is a software application currently in development and designed to support the creation and management of archival information. This article summarizes the development of the ...application, including some of the problems the application is designed to resolve. Primary emphasis is placed on describing the application's functional requirements and architecture, as well as options for its sustainability.
In contrast to standard digital libraries, systems addressing the specific requirements of cultural heritage need to deal with digitized material like scanned documents instead of borne digital ...items. Such systems aim at providing the means for domain experts, e.g. historians, to collaboratively work with the given material. To support their work, automatic indexing mechanisms for both textual and pictorial digitized documents need to be combined with retrieval methods exploiting the content as well as the context of information items for precise searches. In the COLLATE project we devised several access methods using textual contents, feature extraction from images, metadata, and annotations provided by the users.