The paper describes architecture of a distributed OAIS-based digital preservation system which uses HDFS as a file storage system and supports wide distribution on a number of cluster's nodes. It is ...based on Apache Hadoop framework - a reliable open source solution with well horizontally scalable distributed architecture. Novelty of the proposed system is defined by the fact that none of existing OAIS digital preservation systems use HDFS storage for both structured and unstructured data archiving. Implementation of the system's prototype and results of its testing are also shown.
The Consultative Committee for Space Data Systems (CCSDS), in 2002, released their first version of a Reference Model for an Open Archival Information System (OAIS). In 2003, the model was adopted by ...the International Standards Organization (ISO) as ISO 14721:2003. The CCSDS document was updated in 2012 with additional focus on verifying the authenticity of data and developing concepts of access rights and a security model. The OAIS model is the basis of research data management systems across institutions and disciplines around the world. The Organization for the Advancement of Structured Information Standards (OASIS), in 2006, released their first version of a Reference Model for Service Oriented Architecture (SOA). OASIS defines the SOA as “a paradigm for organizing and utilizing distributed capabilities that may be under the control of different ownership domains.” Systems designed around the SOA model benefit from improved scalability, flexibility, and agility. This paper applies the SOA model to the OAIS repository to describe how repositories can be implemented and extended through the use of services that may be internal or external to the host institution, including the consumption of network- or cloud-based services and resources. We use the Service Oriented Architecture (SOA) design paradigm to describe a set of potential extensions to OAIS Reference Model: purpose and justification for each extension, where and how each extension connects to the model, and an example of a specific service that meets the purpose.
Este trabalho apresenta o modelo de referência OAIS (Open Archival Information System) e sua relação com a preservação digital distribuída. Para tanto, tomou-se por base dados obtidos mediante ...revisão de literatura e utilizando o exemplo de ferramenta de preservação digital distribuída LOCKSS, em analogia aos preceitos observados pelo OAIS. Conclui-se que a ferramenta contempla os requisitos principais descritos no modelo de referência, e que a adoção da preservação de forma descentralizada pode ser vista como uma alternativa viável para a preservação dos documentos técnico-científicos, patrimoniais e culturais.
A number of approaches have been proposed for the problem of digital preservation, and the number of tools offering solutions is steadily increasing. However, the decision making procedures are still ...largely ad-hoc actions. Especially, the process of selecting the most suitable preservation action tool as one of the key issues in preservation planning has not been sufficiently standardised in practice. The Open Archival Information Systems (OAIS) model and corresponding criteria catalogues for trustworthy repositories specify requirements that such a process should fulfill, but do not provide concrete guidance. This article describes a systematic approach for evaluating potential alternatives for preservation actions and building thoroughly defined, accountable preservation plans for keeping digital content alive over time. In this approach, preservation planners empirically evaluate potential action components in a controlled environment and select the most suitable one with respect to the particular requirements of a given setting. The method follows a variation of utility analysis to support multi-criteria decision making procedures in digital preservation planning. The selection procedure leads to well-documented, well-argued and transparent decisions that can be reproduced and revisited at a later point of time. We describe the context and foundation of the approach, discuss the definition of a preservation plan and describe the components that we consider necessary to constitute a solid and complete preservation plan. We then describe a repeatable workflow for accountable decision making in preservation planning. We analyse and discuss experiences in applying this workflow in case studies. We further set the approach in relation to the OAIS model and show how it supports criteria for trustworthy repositories. Finally, we present a planning tool supporting the workflow and point out directions for future research.
El presente trabajo describe los problemas fundamentales a los que se enfrentan las bibliotecas en cuanto a la preservación a largo plazo de la información digital, dejando en evidencia la necesidad ...de tomar acciones para proteger el patrimonio digital de las amenazas a las que está expuesto. Se realiza una síntesis de las soluciones tecnológicas más importantes, las cuales constituyen puntos de referencia en cuanto a lo que se está realizando en el campo de los sistemas de preservación digital. Se valora de forma general el estado actual de la preservación digital a largo plazo en bibliotecas y las alternativas de software más importantes.
Muchas universidades en el mundo han comenzado a implementar repositorios digitales al disponer de herramientas de software libre para su construcción. Es necesario que paralelamente se implementen ...estrategias y modelos de preservación digital para evitar la pérdida de información en el futuro por diferentes factores tales como la obsolescencia de hardware o software. En este artículo se presenta un resumen de la evolución de los trabajos más importantes en preservación digital, así como los estándares y modelos que pueden ser implementados en el contexto de repositorios institucionales. Además se presentan las tendencias de investigación en esta área. El estudio se ha basado en la revisión de distintas publicaciones de referencia en esta temática.
Providing access to digital information for the indefinite future is the intention of long-term digital preservation systems. One application domain that certainly needs to implement such long-term ...digital preservation processes is the design and engineering industry. In this industry, products are designed, manufactured, and operated with the help of sophisticated software tools provided by product lifecycle management (PLM) systems. During all PLM phases, including geographically distributed cross-domain and cross-company collaboration, a huge amount of heterogeneous digital product data and metadata is created. Legal and economic requirements demand that this product data has to be archived and preserved for a long-time period. Unfortunately, the software that is able to interpret the data will become obsolete earlier than the data since the software and hardware lifecycle is relatively short-lived compared to a product lifecycle. Companies in the engineering industry begin to realize that their data is in danger of becoming unusable while the products are in operation for several decades. To address this issue, different academic and industrial initiatives have been initiated that try to solve this problem. This article provides an overview of these projects including their motivations, identified problems, and proposed solutions. The studied projects are also verified against a classification of important aspects regarding scope and functionality of digital preservation in the engineering industry. Finally, future research topics are identified.