1.6 How does an archive make data available?

To make data available and reusable in the best way, it needs to be prepared adequately. 

Illustration of distribution of data through a data archive

How an archive can make data available starts with the characteristics of the deposited data. When a researcher prepares the data to be deposited there are some requirements to the data provided that are set by the archive. Archives often support the data producer to prepare the data:

  • using file formats according to a preferred formats policy (see also What is archived?),
  • ensuring data is stored and shared with the consent from the participants and anonymised as presribed by (inter)national mandates,
  • using suitable license agreements,
  • determining the right access category, and
  • providing data with proper documentation using suitable metadata standards.

These characteristics of the data determine how the data can be made accessible and available for reuse.

Licences

A data producer decides under what license data can be shared for reuse by the archive. Depending on the license, the data can be reused by a more broadly or narrowly defined group of users for more broad or narrow purposes. An archive should clearly state what licenses they support, to make it clear to the data producer whether their needs can be met. For an overview of the process of assigning licenses and the choice of licenses see ’DMEG - Chapter 6 Archive & Publish - Licensing your data’ (CESSDA Training Team 2017 - 2022).

Access categories

The license that a depositor assigns to a data set determines the level of access and the availability for reuse that can be offered by the data archive. Many data archives for social science data offer the following categories of access:

  • open access: no restrictions to access
  • access for registered users only
  • restricted access, limited access upon request, for example in the case of sensitive data or if data can be used for specific purposes only
  • under embargo: access only after a certain period

Even if the data is only available for restricted access the metadata can be made openly available, so that the data is still easily discoverable.

Metadata standards

In the social sciences data is often described and documented using the Data Documentation Initiative (DDI) standards (DDI Alliance n.d.) and using codebooks. See also ‘DMEG - Chapter 2 Organize & Document - Documentation and metadata’ (CESSDA Training Team 2017 - 2022). Using machine-readable metadata is needed to archive the data in a meaningful way and it is the first step into making the data  discoverable.

Archives provide direct searching functionalities on the metadata provided. Furthermore, the discoverability of the data is highly improved by making the metadata available for harvesting by data portals using PMH endpoints. Common frameworks for this are OAI-PMH and ResourceSync (Open Archives Initiative n.d.), allowing the data to be harvested by repositories such as the CESSDA Data Catalogue (CESSDA n.d. [Accessed July 30, 2022a]) and the ‘Explore dashboard’ (OpenAIRE n.d. [Accessed July 30, 2022h]).

Promoting the reuse of data

Apart from offering search functionalities on their data and providing the metadata for harvesting, archives actively promote the discoverability and reuse of the data.

Examples of how archives can promote the use of new or good quality data are:

  • publish information on the data sets in their news channels.
  • promoting data citation by providing guidance for researchers on using PIDs for person names, projects, organisations in articles and metadata of the data, see for example the video ‘What are Persistent Identifiers and why to use them?' by FAIRsFAIR EU (2022) explaining the principle, and the ‘DMEG - Chapter 6 Archive & Publish - Citing your data’ (CESSDA Training Team 2017-2022).
  • organise events such as the ‘Dutch Data Prize’ (RDNL n.d. [Accessed July 30, 2022]).
  • organise webinars on the reuse of specific data, see the planned events of the UKDS (UK Data Service n.d. [Accessed July 30, 2022f]).
  • create statistics on the usage of the data (OpenAIRE n.d. [Accessed July 30, 2022j]).

For more information on this, see ‘DMEG - Chapter 6 Archive and Publish - Promoting your data’ (CESSDA Training Team 2017-2022).

Find out more about your archive

Here are some questions that you can ask yourself to learn more about your own archive:

  • What license and access categories does your archive support?
  • Are there agreements that depositors have to sign when archiving data?
  • Is the data of your data archive is harvested by other data repositories? What metadata service endpoint does your archive provide to facilitate this?
  • What activities does your data archive carry out to promote the use of the data?

 

Expert tips

Watch webinars from the UKDS on the reuse of data, for example “Key issues in reusing data” (UK Data Service 2020)