The why and how of research data archiving and sharing
Simone Sacchi, Ph.D.
Research data Librarian
simone.sacchi@eui.eu
We take an inclusive notion of research data at the core of our work:
Research data is all information intentionally collected, observed, generated or reused to validate research findings and substantiate scholarly claims

Data can take a broad variety of forms, for example: tables, texts, images, audio/video recordings, archival material and other sources or physical evidence.
Read more on the EUI Research Data Guide: What is Research Data
We organise our work around the idea that research data, much like research itself, goes through stages as identified in a research data lifecycle.
Today we are focusing on the Archive and Share stages, i.e. what shall happen towards the completion of a research project to elevate our work to its fullest!
Read more on the EUI Research Data Guide: Research Data Lifecycle
Research data archiving: organization, storage and the long-term preservation of research data after a project ends, ensuring its integrity, security, and accessibility for future use.
Research data sharing: the practice of making data, observations, and analytical code, accessible to other researchers and the public after a study is completed.

Read more on the EUI Research Data Guide: Register, archive and share data
As open as possible, and as closed as necessary
FAIR guiding principles for scientific data management and stewardship Wilkinson et al., 2016, developed to improve the Findability, Accessibility, Interoperability and Reusability of research data.
.jpg)


According to the Database Directive1, A dataset is considered an original work if:
OR
If any of the above is the case, you can claim intellectual paternity over a dataset, and therefore have copyright over it.
Tip
Reusing and integrating data from existing dataset does consitute an act of creation of an orginal work, if the structure is original (individual data points are considered “facts” and therefore are do not fall under copyright).
![]()
Tip
Apply the same priciples you would apply when reusing (e.g. quoting) from other publications
Special terms and conditions apply to access and use of personal data, including micro-socioeconomic and qualitative data.
Data Protection at the EUI is governed by President’s Decision No.10/2019, which was introduced following the adoption of the General Data Protection Regulation (GDPR).
The EUI adopts several policies and best practices, here are some pages where to find all relevant resources:
Data documentation could be defined as the clear description of everything that a new “data user” or “your future-self” would need to know in order to find, understand, reproduce and reuse your data, independently.
Clear and accurate documentation should include:
Tip
Consider the following elements1
Best practice:
The folder structure gives an overview of which information can be found where, enabling present as well as future stakeholders to understand what files have been produced in the project.
Folders should:
Tip
The top folder should have a README.txt file describing the folder structure and what files are contained within the folders.
When data are anonymised, individual research participants or third persons cannot be identified based on indirect identifiers or by combining the data with information available elsewhere.
When data are pseudonymised, unique records are replaced by consistent values either derived from the original values or independent of them so that specific data subjects are no longer identifiable.
Caution
Pseudonymised data can be anonymised by destroying the encryption key. Data should be deleted at end of retention period if not anonymised.
When it comes to file formats, you must distinguish between Working and Archiving file formats
You can opt for different options when deciding to archive your data, depending on your needs and research community:
Tip
You can search for the repository that works best for you using re3data.org a comprehensive registry of research data repositories that is global and covers all research disciplines.
Cadmus, the EUI Research Repository, collects, preserves, and provides access to the EUI research outputs
Cadmus, includes a Research Data Collection, where all members of the EUI community can archive and share (or simply register) their original datasets.
Discover here how to initiate a submission process.

Tip
Three options: 1) Archive your data; 2) archive and share your data; 3) Register (if archived elsewhere)
Archiving your dataset in a trusted research repository ensures that your data can be found by both humans and machines by assigning a unique and persistent identifier (such as a DOI or HANDLE) and standardised, machine-readable metadata.
Criteria for archiving your dataset:
Tip
If your dataset is already archived in a third-party trusted repository (e.g. Harvard Dataverse, Zenodo, etc.), you can register it in Cadmus and a link to the archived data will be added.
Once your dataset are archived in Cadmus, and your dataset qualifies (e.g. does not contain personal and sensitive data according to GDPR), you can make it open and enable its full reuse for the broadest academic community and society.
In Cadmus you can choose:
Tip
If your dataset contains sensitive and/or personal data, consider create an anonymised version
Research data are becoming more and more first class citizens in the scholarly communication landscape, let them stand out!
Care, care, care about all your scholarly outputs, including your data!
Think about your future self as your first and foremost collaborator
Everything you produce has potential future value for you, for the academic community and for society, so consider opening your research data to the world.
We are here to help you succeed in working with data and making sure you leverage them at best.
Simone Sacchi
Research Data Librarian, EUI Library
(Also Teams and BF-278)

Make your original datasets stand out: the why and how of research data archiving and sharing