Overview and details of the sessions of this conference. Please select a date or location to show only sessions at that day or location. Please select a single session for detailed view (with abstracts and downloads if available).
A Community Preservation Specification: Oxford Common Filesystem Layout
Andrew Woods1, Andrew Hankinson2, Rosalyn Metz3, Julian Morley4, Don Brower5
1DuraSpace, United States of America; 2Bodleian Libraries, Oxford; 3Emory University; 4Stanford; 5University of Notre Dame
This proposal describes a common approach to filesystem layout for institutional repositories, providing recommendations for how open repository systems should structure and store files on disk. It is developed under the name "Oxford Common Filesystem Layout" (OCFL) because the impetus for this effort grew out of discussions held at the Fedora / Samvera Camp held at the University of Oxford in September 2017. The panelists represent institutions where digital preservation practices have been established and proven over time or where significant work has been done to flesh out the digital preservation practices. This community of practitioners is surfacing and assessing successful preservation approaches designed to address a spectrum of digital preservation use cases. With this context as background, the OCFL will be described as the culmination of over two decades of existing standards and practices.
11:45am - 12:30pm
Enabling Computational Access at Scale: Are Repositories Serving Collections-as-Data?
Helen Bailey1, Hannah Frost2, Mark Jordan3, Katherine Lynch4, Sarah Potvin5
1Massachusetts Institute of Technology, United States of America; 2Stanford University Libraries, United States of America; 3Simon Fraser University, Canada; 4University of Pennsylvania, United States of America; 5Texas A&M University, United States of America
There is growing articulation on the part of scholars for access to repository resources -- content and metadata held by research libraries, archives, and museums -- in bulk and at scale, in order to apply algorithmically-based techniques of inquiry for new discoveries. Scholars from a broad and expanding range of disciplines want to work in this way, yet broadly speaking, they still face significant barriers to finding and obtaining collections-as-data material. While many of these barriers are not exclusively technical -- there are, for example, thorny rights and discovery issues -- technical barriers to delivery remain significant, and may be rooted in competing goals around repositories. This panel, organized by members of the “Always Already Computational: Collections as Data” (CAD) project team, will explore the role of the digital repository in the larger context of collections-as-data work and ask: how do today’s open source repository systems support or inhibit working with collections as data?