Status of students’ graduation (masters) theses in repositories of six European universities
1ST-OPEN, University of Split, Split, Croatia; 2Split University Library, Split, Croatia; 3Department of Research in Biomedicine and Health, University of Split School of Medicine, Split, Croatia
We studied the status of students’ graduation (masters) theses in repositories of six European universities, which are all partners in The European University of the Seas (SEA-EU) Association. Our interest in that subject stems from our work with the University of Split overlay+ journal ST-OPEN, which, transforms students’ graduation theses in research reports and publishes them as scholarly articles in the journal. ST-OPEN thus contributes to the personal advancement of young graduates, increases the quality of teaching at the graduate levels, and increases production and visibility of the University. We searched the official repository of universities of Brest (France), Cadiz (Spain), Gdansk (Poland), Kiel (Germany), Malta (Malta), and Split (Croatia). The first findings are not encouraging. Besides their organizational differences, the six universities differ immensely in their policies of posting graduation theses in their repositories. They have very different internal organizations, including the type and number of schools, institutes and educational programs. Also, they differ overall and within themselves in the openness of access to graduation theses.
Repositories and publishers: AgEcon Search forging new relationships
University of Minnesota-United States of America
Many repositories do not work with publishers on a regular basis but that is not true for AgEcon Search, the subject repository in agricultural and applied economics. With journal articles as the most common document type and over 40 professional societies among the 340 groups that contribute material, we have formed a number of different relationships with publishers. The most common is that AgEcon Search hosts material initially created and made available by a small (often professional society) publisher. In one uniquely beneficial case, we receive and upload pdfs of each individual article from a large commercial publisher after an embargo period. In an opposite, negative situation, we were asked by a publisher to remove years of articles after a society moved to a large commercial publisher and all older material was placed behind the publisher’s paywall. In recent, new collaborations, we assist publishers with creating DOIs, and are negotiating to serve as the pre-print server for one society journal that contracts with a commercial publisher. We encourage other repositories to consider how they might work together with publishers to make research materials more widely available, including institutional repositories whose faculty serve on national or international society journal editorial boards.
A Framework for an Arabic Terminology Management System (TMS) using Artificial Intelligence
Bibliotheca Alexandrina, Egypt
Many translation projects in the Arab world were not implemented, or were delayed, due to manual translation processes and/or budgetary reasons. The poster proposes the use of Artificial intelligence (AI) in the form of Arabic machine translation (MT) to develop an Arabic Terminology Management System (TMS), followed by human Post-editing of machine translation output. The system shall ease the translation process and generate output products, such as glossaries and taxonomies.
Advancing Hyku Project Update
1University of Virginia, United States of America; 2Ubiquity Press; 3The British Library, United Kingdom
Join a Year Two project update for the Advancing Hyku collaborative project, which aims to support the growth of green open access through institutional repositories. The deliverables of the project are to introduce significant structural improvements and new features to the Samvera Community's Hyku platform. The project partners are University of Virginia Library, Ubiquity Press and the British Library, with funding from Arcadia, a charitable fund of philanthropists Lisbet Rausing and Peter Baldwin. The project began October 2019 and is scheduled to conclude with a rollout of the Advanced Hyku platform community-wide after February 2022.
SANDIMS: South African National Geophysical Data and Instrumentation Management System
South African National Space Agency, South Africa
The archiving and dissemination of geophysical research data collected over Southern Africa, Antarctic research base (SANAE IV) and at the South African high latitude observatories on Marion Island and Gough Island, has until recently been fragmented and inaccessible to international researchers.
At the Space Science Directorate of the South African National Space Agency (SANSA) in Hermanus we have implemented a scientific data portal called the South African National Geophysical Data and Instrumentation Management System (SANDIMS) which for the first time makes the geophysical data collected and used by SANSA available through a single data portal.
Our aim is that the system will meet national and international obligations and expectations, as well as raise the standard of South African research. The system’s unique database will contain high-quality data from areas in space that, potentially, could supply information for unanswered scientific questions and enhance scientific development.
The paper will share insights from various topics including Data Policies, Licensing, Data Discovery and showcase tools for researchers and practitioners.
Geodisy -- New geospatial data discovery for Canadian research data
University of British Columbia, Canada
With the rapid proliferation of research data, it is vital to create innovative tools for data discovery and access. This is the goal of Portage’s Geodisy project, an open-source spatial discovery tool for Canadian interdisciplinary open research data. Geodisy provides a map-based search available alongside the Federated Research Data Repository (FRDR), a national discovery layer indexing over 70 Canadian open repositories. Geodisy is intended for users with diverse experience levels and subject interests and is designed to be accessible for those without GIS knowledge. Data and metadata are extracted from the native repositories and are discoverable based on their location, and individual geospatial files are previewed as visual overlays. For any research that relates to geospatial location, this tool provides a new and useful form of visual discovery.
Understanding IR Impact: What Do Users Do with Our Stuff?
University of Montana, Missoula, United States of America
Like many institutions, the University of Montana’s institutional strategy includes a focus on the impact of faculty and student work(1). It is relatively easy to use quantitative data such as download counts(2), location information(3), and citations(4,5) to help demonstrate the IR’s reach and impact; however, there are limits to using these numbers(5), especially when attempting to understand impact outside an academic context. Could knowing more about how or why users use IR content address our own curiosity, help us understand the IR’s impact more completely, and help us contextualize and describe its impact more effectively than we can by reporting quantitative data alone? In April 2018 we added a link to the cover pages that are included with most of the downloadable items in our IR. The link directs users to a form where they can tell us how access to the IR item benefitted them. To date, we have received just over 200 responses. While we now know more about the wide range of uses of our IR content, we are still determining if/how to utilize this information to help demonstrate impact. Our initial evaluation has raised a host of new, useful questions that will inform next steps.
Atrium Repository: diffusion of cardiology knowledge
1Instituto Nacional de Cardiologia - INC, Brazil; 2Fundação Oswaldo Cruz - FIOCRUZ, Brazil; 3Universidade Estadual do Rio de Janeiro , Brazil; 4Universidade Federal do Rio de Janeiro - UFRJ, Brazil; 5Universidade UNIGRANRIO, Brazil
The Instituto Nacional de Cardiologia (INC), a public tertiary referral cardiology centre located in Rio de Janeiro, Brazil, aims to structure and organize all available institutional data into one single space that is easily accessible to its workers and also to the wider scientific community and general public. At present, these data are dispersed in different sources, with little acessibility and low safety .
International experience suggests the most adequate way to preserve the institutional memory and to diffuse knowledge is through a structure called repository. Technological tools make the creation of a repository easier nowadays. Our Project aims to structure the ATRIUM, INC’s institutional repository .
The ATRIUM will store all relevant institutional data and allow members of the public safe and timely access to it.
RCIN - Digital Repository of Scientific Institutes in Poland
Poznan Supercomputing and Networking Center, Poland
Changes and impacts of the COVID-19 pandemic response on Arca
Fundação Oswaldo Cruz. Instituto de Comunicação e Informação Científica e Tecnológica em Saúde
This work presents the changes and impacts on Arca - Oswaldo Cruz Foundation’s (Fiocruz) Institutional Repository, of the COVID-19 response, which has been terrorizing the world. Due to the recognition from the World Health Organization, the importance of the scientific research of the Institution and the technological progress related to vaccines development, supplies and innovative treatments have become crucial. In this epidemiological scenario, including scientific documents and research material of this Institution related to COVID-19 on Arca was deemed critical and of large relevance. It is important to note that the changes, adaptations and focused efforts for the pandemic response, especially with the reduced issuing time, are intended to increase visibility, promote health with public goods and provide access to the knowledge produced by Fiocruz to the whole society.
Repository Re(volution): AgEcon Search Goes Global
University of Minnesota-Waite Library, United States of America
The AgEcon Search (http://ageconsearch.umn.edu/) subject repository has developed over 25 years from a tiny local repository with 50 papers, into an international body of literature with 155,000 papers in agricultural and applied economics. Many of its most active contributing members are in parts of Africa and Asia that have long had difficulty getting their work out into the world (Kelly & Eells 2015; Ihli 2019). AgEcon Search contains free, full-text papers of many types covering agriculture, development, energy, natural resources, food security, and related areas. It receives over 15,000 visits every day from nearly every country in the world and is the premier resource in agricultural economics. We will discuss the evolution of the repository, including the decision to migrate from DSpace to TIND IR in 2017, and financial sustainability challenges. We will highlight how AgEcon Search enables researchers and practitioners to distribute their work to the world while serving as a free resource to all. Repositories have a unique role in helping those who have difficulty accessing – either as producers or as users – the more formal (and expensive) literature, so it is critical to maintain relevance and financial stability as we move into the future.
PeruCRIS: A National Research Information Infrastructure based on DSpace-CRIS
1Concytec, Peru; 24Science, Italy; 3Semicrol, Spain
PeruCRIS is the Peruvian project for setting up and operating a National Information Network on Science, Technology and Technological Innovation. It is based on open source software and open standards, especifically as an extension of the already existing network of Peruvian open access repositories. The network is designed to be interoperable with CRIS systems and Open Access Repositories, and also to allow for direct submission into the National central hub (PeruCRIS Platform) from institutions and researchers. PeruCRIS Platform goes beyond mere aggregation, supporting data normalization, enrichment and curation for collected data, and direct editing in the Directorios. These roles are reserved for Concytec staff, and serve as a means for ensuring data completeness, accuracy and general quality. A particular CERIF profile and a set of controlled vocabularies are being developed in order to accommodate for particular information needs of national scope. An initial partial release of main directories is scheduled for Q2 2021.
Availability of open government data in the Maghreb countries
1Sultan Qaboos University, Oman, Tanta University (Egypt); 2Sultan Qaboos University, Oman, Minia University (Egypt)
Recently, many Arab governments have adopted open government data policies, whereby web technologies are harnessed to provide access to government data. The study's problem lies in the fact that open data practices in the Arab world are relatively recent and are still in their nasal stages, making those practices the subject of constant criticism and evaluation to develop and improve them to be consistent with international practices. Moreover, there is no research carried out to identify the extent of the Maghreb Arab states' progress in this field. The government open data portals in the Maghreb countries (Algeria, Libya, Mauritania, Morocco, and Tunisia) are considered the gateway to studying the reality of open data services in those countries. This study explores the current practices used in preserving and sharing data in open data portals in the Maghreb countries and assessing whether its structure and organization of data on these portals are consistent with the goals for which these portals were established. The study sample included three open data portals on the Internet: the Arab Maghreb countries' portals (Tunisia, Algeria, and Morocco). In order to analyse the portals selected in the study sample, a list of 47 criteria was developed.
Sherpa Romeo – our roadmap
Jisc, United Kingdom
This poster will present the Jisc Open Research team’s roadmap for the development and enhancement of Sherpa Romeo over the coming years, and invite feedback, comments and questions from the user community. We will be looking at the ways we plan to enhance the service, infrastructure and usability of Romeo, to ensure we are continuing to meet the needs of all our users throughout the changing OA policy landscape.
Discovery after migration
Haplo, United Kingdom
Institutions are often apprehensive when moving their repository to a new repository system, as this will affect hundreds of thousands of records collected over decades and the discoverability of their institution's research.
This poster will present statistics on item usage and discovery after repository migration, using data from an external independent repository monitoring service to show an increase in usage of up to 250% and explain how this can be achieved.
Introduce Impact-Pathways in a CRIS – support societal impact orientation in research projects and funding processes
1University of Kassel, Germany; 2Disy; 34Science
Societal challenges require research contributions to solve them. Accordingly, societal impact assessment is an object of increasing interest in publicly funded research. Some countries have build elaborated national systems, applied on the level of research institutions. The approach of the SynSICRIS project (Synergies for Societal Impact in Current Research Information Systems) focuses on societal impact creation and assessment in research projects. Therefore, a repository/CRIS system is being built with additional entities related to societal impact and functionalities to record the information during funding processes.
Our system is built upon the open source software DSpace-CRIS. The additional entities include process-oriented indicators that represent an increase in the likelihood of societal impact. The additional functionalities, allow planning, documenting and structuring contributions of a project to societal impact via interfaces to build impact pathways and working plans. The development built on a synthesis of existing approaches, participatory requirements analysis and agile software development.
Using such a system at the funding body enables to assess information related to societal impact without additional documentation burden for researchers, allows to manage sensitive project information and supports the dissemination, reusing and sharing of outputs and information tailored to actors in practice and society.
A Recommendation System for an Open Archive
Middle East Technical University, Turkey
In this study, we developed a recommender system for OpenMETU which is the open archive of Middle East Technical University. Our system recommends items by using a content-based approach. In the content-based approach, the properties of items are vital due to the recommendations are based on them. Our recommendation system is based on the author, abstract, title, and subject similarity. To calculate these similarities, first of all, we extract features of each item by using natural language processing algorithms such as TF-IDF and Universal sentence encoder. We use cosine distance to measure the similarity between two items. Later, we give different weights to each of the similarities to calculate the overall score. Our system recommends the most similar 5 items to the visitor who visits an item in our archive.
The Scholarship of The Ohio State University: Open for All
The Ohio State University, United States of America
The Ohio State University Libraries promotes innovative research and creative expression and curates and preserves information essential for scholarship and learning. Making the research and scholarship of Ohio State’s faculty, staff, and students openly available allows us to live our land grant mission – sharing knowledge and culture with the people of Ohio, the nation, and the world.
This 24x7 will discuss current developments with The Ohio State University Libraries’ "Transforming the Scholarly Publishing Economy" strategic initiative. It will highlight Ohio State's Open Access agreements, the ongoing development of partnerships across campus and with our consortia and peers, early outcomes of our faculty, scholarly society, and publisher engagements, and thoughts towards future opportunities and challenges.
Interoperability Standards for Distributed Collaboration in InvenioRDM
1Northwestern University, United States of America; 2European Organization for Nuclear Research (CERN), Geneve, Switzerland
CERN (The European Organization for Nuclear Research) has collaborated for the past 2 years with a widely distributed international community on the development of InvenioRDM, an extensible turn-key repository solution. Differing metadata standards utilized for description and access across the globe have led to fertile discussions among team regarding the base data model to employ and field-specific controlled vocabularies. As discussions evinced the need to consider increasing stakeholder needs and requirements, it became clear that a dedicated group was needed to support systematic recording and discussion of partner requirements in order to reach decisions on the data model that could have a positive impact on all adopters. A metadata interest group was formed, with membership open to all project stakeholders, and regular meetings set to discuss user needs, field-by-field. Discussions in this group have helped to reduce the boundaries between distributed stakeholders and bolster the repository community as a whole through a joint, democratic effort open to all, including technical and non-technical participants.
Zenodo spam detection using neural networks
Nobody wants to get something unwanted, like spam. The increase of spam content has become a problem in our digital era, and therefore it also affects digital repositories. Hosting spam can have an impact on a service, i.e. the actual hardware costs of storing it, getting skewed usage statistics, including distribution of material that violates copyright, and, most importantly, serving undesired content to users.
Zenodo is a generalist research repository fostering open science practices. As the barrier for submissions is low, it is an easy target for spam. The repository’s staff has spent many hours manually detecting spam content, a process now assisted by an automated spam classification system, which still does not produce satisfactory results.
Improvements of this classifier were based on an in-depth study of Zenodo’s data, a descriptive analysis, and feature extraction to corroborate expert knowledge gathered over years by Zenodo’s staff, as well as on a literature review of related topics such as spam classification in emails.
Several types of neural network models were tested, displaying promising results for future integration. However, as the false positive rate is still unacceptable, Random Forest classifiers still prevail over neural network models.
DataCORE - Grappling With Big Files and Big Problems
Indiana University, United States of America
IU DataCORE is a brand new Samvera-based repository at Indiana University focused exclusively on research data. This poster will show how the system works, including detailing how data flows in and out and some of the challenges we overcame in implementing it. The changing landscape of handling large data and how to move it around has necessitated some updates to our workflows that we will detail.
Is it interoperability or is it integration?
Jisc, United Kingdom
For those not of a technical bent, there is a great deal of confusion surrounding the terms 'interoperability' and 'integration', especially when it comes to the exchange of information in 'black-box' systems. The average depositor of data doesn't want (or need) to know how data and meta data move around a system as long as what they put in eventually gets to where it should be, in the form it should be and can be seen by the appropriate people. However, a basic understanding of the similarities and differences between 'interoperable' systems and 'integrated' systems and the pros and cons of each approach when it comes to depositing, preserving and discovering data will help users and administrators make informed decisions when it comes to the specification of data management systems, and will help inform their day-to-day data management practices.
This poster is intended to highlight those similarities and differences, pros and cons.
Interoperability Intersection: Partnerships in Open Science Infrastructure
Center for Open Science, United States of America
Enabling interoperability throughout the research lifecycle often aligns with or is a key component of the missions of open source tool providers. Even before the introduction of the FAIR framework, open source tools and infrastructure have emphasized opportunities to connect the systems and workflows that researchers rely on so that research communication can be faster, more efficient, and more secure.
OSF, like many of our friends in the open science infrastructure space, is strengthened both as a technical tool and as a ‘community of communities’ by integrations with other platforms and services. In this poster we will describe our product philosophy that emphasizes interoperability and meeting researchers where they are, while also discussing our experiences with partnership building across outstanding organizations and providers in research. We will also visualize current and upcoming integrations that strengthen both OSF and partner tools while also making it much easier for researchers to manage and collaborate across the research data, planning, and outcomes.
“Precedented”: Public Health, Open Access Infrastructure, and Interrogating Power in Repository Debates
1Hispanic American Periodicals Index, UCLA; 2University of Maryland Libraries
In January 2020, researchers released the initial DNA sequence for the COVID-19 virus on Twitter, bypassing both traditional publication models and institutional open infrastructure in the interest of expediency. The global pandemic has drawn open scientific publishing into the spotlight in the mainstream American press over the past year, and citizens outside the Western research community are gaining new exposure to longstanding scholarly communications debates about public health and open science. Unlike many aspects of the COVID-19 pandemic, however, this is not “unprecedented”--freely accessible health information was also the impetus for the open access movement in Latin America years ahead of comparable efforts in the United States, and it continues with great success today. However, discussions of these platforms in US-based publications have centered on questions of prestige and functionality, rather than reach and stability--concepts that are rooted in imperialist, English-first thinking. This poster will highlight the early history of open access repositories in Latin America, their focus on regionally-produced scientific research rather than institutional efforts, the current state of these projects, and apply a critical lens to Western discourse about these projects and their impact.
Bepress to DSpace in PJs: Migrating Two Open Repositories from Home
Cornell University Library, United States of America
In 2019, Cornell University Library (CUL) made the decision to migrate multiple institutional repositories from bepress into eCommons, its locally supported DSpace instance.
When migration project planning began in February of 2020, we could not have anticipated the impact the COVID-19 pandemic would soon have on our work. We'll share some background--how Cornell ended up with multiple institutional repositories and our rationale for consolidation, and then describe the work itself, with particular focus on the challenges and approaches we used to successfully complete this project during a pandemic, with all staff making an abrupt and unplanned transition to working remotely.