Conference Agenda

Overview and details of the sessions of this conference. Please select a date or location to show only sessions at that day or location. Please select a single session for detailed view (with abstracts and downloads if available).

Session Overview
P2E: Developer track
Tuesday, 11/Jun/2019:
1:30pm - 3:30pm

Session Chair: Michael Giarlo, Stanford University
Location: Lecture Hall M
Universität Hamburg, Main Building, Edmund-Siemers-Alle 1

Show help for 'Increase or decrease the abstract text size'

Automating OAIS compliant digital preservation using Archivematica and DSpace

Hrafn Malmquist

University of Edinburgh, United Kingdom

Developing a comprehensive system for permanent digital preservation is a daunting task that entails taking due consideration to both preservation and access. Such a workflow can result in a multi-step manual process including steps for file normalisation and metadata creation/extraction which can be relatively time consuming. Currently there is open source software available that supports this but with limited in-built integration. Some content, notably archives of individuals, needs to be manually appraised by a digital archivist. Other content, such as documents created systematically as part of operating a large bureaucratic organisation follow a set prescription enabling a high degree of automation for archiving. At The University of Edinburgh we have been developing such an integrated workflow using open source software; Archivematica and Dspace optionally integrating with ArchivesSpace as well. This has meant developing Archivematica in cooperation with it’s main developer Artefactual culminating in support for DSpace REST integration being included in the Archivematica 1.8/Storage Service 0.13 release in the autumn of 2018.

Developer Track-P2E-316Malmquist_a.pptx

Using DSpace as backend service - Workflow-centric repository development in practice

Ari Häyrinen

University of Jyväskylä, Finland

We moved from a DSpace -centric development model to a workflow-centric model in order to speed up our repository development. In the workflow-centric model the starting point is not the workflows that the *system* can offer but the workflows that are needed. The organisation defines efficient workflows for content management and then implements tools that support those workflows independently from repository software. The result is a network of applications where the repository software is just one part of the network via REST-api.

The best part of the model is that it allows quick experimenting without any significant risks (financial or otherwise). The model also helps to separate tasks that are part of maintaining the core infrastructure from tasks that are content specific. This separation is essential so that we can be sure that libary's development resources are used in best possible ways.

In this session I'll demonstrate workflows that are in use in Jyväskylä University Digital Repository JYX and tools that make them possible.

Developer Track-P2E-170Häyrinen_a.pdf

Longleaf: a repository-independent utility for applying digital preservation processes to files

Benjamin Pennell, Jason Casden

University of North Carolina at Chapel Hill University Libraries, United States of America

Our institution has developed longleaf, a new portable, command-line, repository-agnostic, rules-based tool for monitoring, replicating, and applying preservation processes to files. As our digital collections infrastructure has grown over the past 20 years, we’ve found it difficult to apply digital preservation plans consistently across system-defined content boundaries. We chose to develop longleaf in order to address several ongoing technological preservation challenges that we feel are also common at other institutions, including the uneven application of preservation practices across systems and rising computational cost as collections grow. We argue that the complexity of digital preservation technologies and the manner in which they are coupled with repository management systems contribute significantly to these problems. Longleaf reduces the interference of repository system constraints in what should be a needs-based digital preservation planning process by applying preservation processes at the file and storage level rather than through a repository system intermediary. Files managed in temporary storage or non-preservation asset management systems can now benefit from the same replication and verification processes as those ingested into preservation repositories.

Developer Track-P2E-409Pennell_a.pptx

A Multi-Tenancy Cloud-Native Digital Library Platform

Yinlin Chen, Jim Tuttle, William A. Ingram

Virginia Tech, United States of America

Virginia Tech Libraries presents our next generation digital library platform. Our design and implementation addresses the maintainability, sustainability, modularity, and scalability of a digital repository using a Cloud- native architecture, in which the entire platform is deployed in a cloud environment - Amazon Web Services (AWS). Our next-gen digital library eschews the old model of multiple siloed systems and embraces a common, sustainable infrastructure. This approach facilitates a more maintainable approach to managing and providing access to collections allowing us to focus on content and user experience.

This platform is composed of a suite of microservices and cloud services. Microservices implemented as Lambda functions handle specific tasks and communicate with each other and other cloud services using lightweight asynchronous messaging. Cloud-native application development embodies the future of digital asset management and content delivery. Shared infrastructure throughout the stack and a clear demarcation between front- and back-end makes the platform more generalizable and supports independent replacement of components.

We share our experiences and lessons learned developing this digital library platform, including architecture design, microservice implementation, cloud integration, best practices, and practical strategies and directions for developing a Cloud-native repository.

Developer Track-P2E-461Tuttle_a.pptx

DSpace-Clustering via Puppet, HAProxy and CephFS

Bernd Nicklas, Paul Münch

Philipps-Universität Marburg, Germany

Running a DSpace repository in a single all-in-one installation is limited both

in hardware-resources and in maintainability. Therefore the Philipps-University

Marburg implemented a DSpace clustering solution based on an in-house devel-

oped Puppet module, a slightly modified DSpace installation process, HAProxy

and CephFS which allows for customization, horizontal scaling, better perfor-

mance and higher availability. The presentation features the short-comings of

all-in-one installations, the benefits and also the problems of clustering.

Developer Track-P2E-184Nicklas_a.pdf

Contact and Legal Notice · Contact Address:
Privacy Statement · Conference: OR2019
Conference Software - ConfTool Pro 2.6.129+TC
© 2001 - 2020 by Dr. H. Weinreich, Hamburg, Germany