Skip to content

Commit

Permalink
Merge pull request #151 from jennyostrop/main
Browse files Browse the repository at this point in the history
minor updates
  • Loading branch information
korbinib authored Nov 21, 2024
2 parents ad6481e + 1d6fcaa commit b8eb043
Show file tree
Hide file tree
Showing 6 changed files with 11 additions and 14 deletions.
2 changes: 1 addition & 1 deletion pages/dmp_knowledge_base/difficult_faq.md
Original file line number Diff line number Diff line change
Expand Up @@ -38,7 +38,7 @@ Reading through different DMPs can be an useful exercise. Keep in mind that not
## Questions on level of the DMP

### How much of the DMP should I fill out at this stage?
Ideally, a DMP should be as filled out as much as possible before the active phase of the project starts and some aspects should alreday be in place at the proposal stage. Mapping out aspects of the research life cycle as early as possible will make it easier for you to have an idea of which data should be gathered while conducting the research project. For example; deciding early on which data repository you will submit the data to for long storage will give you an idea of which metadata will be necessary to gather in order to submit to the relevant archive.
Ideally, a DMP should be as filled out as much as possible before the active phase of the project starts and some aspects should already be in place at the proposal stage. Mapping out aspects of the research life cycle as early as possible will make it easier for you to have an idea of which data should be gathered while conducting the research project. For example; deciding early on which data repository you will submit the data to for long storage will give you an idea of which metadata will be necessary to gather in order to submit to the relevant archive.

The DMP should be updated as the project develops and decisions taken, it should be considered 'a living document'. Scheduling regular updates is recommended, either at given intervals or in connections with milestones in the project.

Expand Down
2 changes: 1 addition & 1 deletion pages/dmp_supporting_info/dmp_supporting_info.md
Original file line number Diff line number Diff line change
Expand Up @@ -31,4 +31,4 @@ The core section is structured according to our DMP template. For each chapter i
- [9 - Responsibilities and resources](/pages/support_09_responsibilities_resources)

Finally, we explain relevant terminology:
- [Research data tems](/pages/support_00_rdm_terms) gives definitions of relevant terms used in the context of data management plans.
- [Research data terms](/pages/support_00_rdm_terms) gives definitions of relevant terms used in the context of data management plans.
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ layout: page
<!--type: cheat_sheet-->
title: Pre-start/Pre-award planning considerations
search_exclude: false
contributors: [Jenny Ostrop]
contributors: [Jenny Ostrop, Svein Høier, Live Kvale]
page_id: support_00_planning_considerations
description: Supporting DMP Information, planning considerations, pre-start, pre-award, before you start
affiliations:
Expand Down
4 changes: 2 additions & 2 deletions pages/dmp_supporting_info/support_06_document.md
Original file line number Diff line number Diff line change
Expand Up @@ -35,7 +35,7 @@ This chapter includes information about how metadata and other accompanying info

## Question-specific guidance

### How will you link data and metadata?
### How will you connect data and respective metadata/data documentation?
Metadata is data about data, providing the necessary context that allows understanding or use of data. Providing this information in a structured way facilitates data reuse. Metadata can be descriptive (e.g. title, data content, date of creation), structural (e.g. explaining file organisation), inform about data provenance (e.g. data origin, versions), administrative (e.g. access permissions), legal (e.g. data license), or technical (e.g. data format, tools and software). A metadata standard is a predefined way of describing data.

Often there will be multiple ways in which data and metadata can be linked within a project. Basic descriptive techniques will be relevant to many projects: these include structured and consistent naming of files and folders, using a README-file to provide information, using embedded metadata in files, or using a separate metadata-file (a sidecar file for each file in the dataset). More advanced techniques that may be relevant include using a database system for linking metadata and data, or establishing a data/variable dictionary for the data in the project.
Expand All @@ -46,7 +46,7 @@ _Supplementary info: Almost all computer systems will provide some system metada


### Do suitable metadata standards exist for the data?
When planning and embarking on your project, you should familiarize yourself with, and choose, a suitable metadata standard if one exists. Many research fields have agreed upon, established and adopted a metadata standard suitable for discipline-specific needs. If available, applying a domain/dicipline-specific standard ensures that all necessary information facilitating data use and reuse is included. In addition to domain-specific standards, domain-agnostic standards also exist. The "Minimal Information Standard" describes a defined minimal set of metadata. A metadata standard can also include optional values.
When planning and embarking on your project, you should familiarize yourself with, and choose, a suitable metadata standard if one exists. Many research fields have agreed upon, established and adopted a metadata standard suitable for discipline-specific needs. If available, applying a domain/discipline-specific standard ensures that all necessary information facilitating data use and reuse is included. In addition to domain-specific standards, domain-agnostic standards also exist. The "Minimal Information Standard" describes a defined minimal set of metadata. A metadata standard can also include optional values.

Most research data repositories will implement specific standards. That is why the use of a particular archive often will lead to the use of a particular metadata standard.\
"Minimal Information Standards" can be important from the [FAIRsharing registry of standards](http://fairsharing.org/).
Expand Down
2 changes: 1 addition & 1 deletion pages/dmp_supporting_info/support_07_process_analyse.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@ title: Processing, analysing and interpreting data
search_exclude: false
contributors: [Korbinian Bösl, Jenny Ostrop, Ingrid Heggland]
page_id: support_07_process_analyse
description: Supporting DMP Information, process data, data processing, analyse data, data analysis, analyze, analyzis
description: Supporting DMP Information, process data, data processing, analyse data, data analysis, analyze, analysis
sidebar: dmp_supporting_info
dsw:
- name: Processing, analysing and interpreting data
Expand Down
13 changes: 5 additions & 8 deletions pages/dmp_supporting_info/support_08_preserve_publish.md
Original file line number Diff line number Diff line change
Expand Up @@ -49,13 +49,12 @@ You may browse or search for suitable archives in the [re3data registry](https:/
You are advised to choose a so-called [trustworthy archive](/pages/support_00_rdm_terms#trustworthy-repository-trustworthy-archive) for the data that meets minimum criteria on provision of persistent and unique identifiers (PIDs), metadata, data access and usage licenses, and data preservation. A common certification is the [Core Trust Seal](https://www.coretrustseal.org/) which certifies that the archive in question operates in accordance with the Core Trust Seal quality criteria, both in terms of its financial foundation and its quality assuring data curation routines.

### Restricted access to research data
Some data should not be made openly available, according to law regulations (including the General Data Protection Regulation (GDPR)) and/or ethical considerations. These may be data with person identifying information, or data that are sensitive for other reasons (national security or business information). It is still important to archive such data, to ensure their long term preserving. These data thus need to be archived with restricted access.
Some data should not be made openly available, according to law regulations (including the General Data Protection Regulation (GDPR) and/or ethical considerations. These may be data with person identifying information, or data that are sensitive for other reasons (national security or business information). It may still be important to ensure their long term preservation and regulate data access beyond the project period. In many cases data can be archived with restricted access, enabling access only to eligible actors.

In Norway, the [Sikt archive](https://sikt.no/en/archiving-research-data) may be suitable for this.

[FEGA (Federated European Genome-phenome Archive)](https://ega.elixir.no/) is another archive that may be suitable. FEGA is designed to help researchers securely store, access, and share sensitive human data across multiple countries, while adhering to local privacy regulations.

[CLARINO](https://repo.clarino.uib.no/xmlui/) may also be mentioned here. CLARINO is Norway's part of the European CLARIN infrastructure network, which stands for Common Language Resources and Technology Infrastructure. CLARINO includes a platform where researchers can store, share, and access language data in standardized formats. This also includes access with restricted access, since language research easily includes person identifying data.
Archives with restricted access in Norway:
* [Sikt archive](https://sikt.no/en/archiving-research-data) for data about people and society
* [FEGA (Federated European Genome-phenome Archive)](https://ega.elixir.no/) is designed to help researchers securely store, access, and share sensitive human data across multiple countries, while adhering to local privacy regulations
* [CLARINO](https://repo.clarino.uib.no/xmlui/) is Norway's part of the European CLARIN infrastructure network, which stands for Common Language Resources and Technology Infrastructure. CLARINO includes a platform where researchers can store, share, and access language data in standardized formats. This also includes access with restricted access, since language research easily includes person identifying data.

[Data Use Ontology](https://www.ga4gh.org/product/data-use-ontology-duo/) is a service that may be useful for data that cannot be openly available. This service is especially developed for sensitive human data.

Expand All @@ -75,8 +74,6 @@ If certain datasets cannot be made available or only be made available under res


### Identify archive(s) for publishing datasets
*add figure for flow chart*

This question, and its sub-questions, will assist you in finding archive(s) suitable to deposit the data.

Where do you plan to archive the research data? Remember that this can also include project results, or background material that you not immediately are thinking of as 'data'. As archive choice often decides what metadata standards to be followed, it is advisable to investigate this question early in the project. When identifying alternative archives to use, have a look at the metadata schema they use, to see if it is suitable to describe the data.
Expand Down

0 comments on commit b8eb043

Please sign in to comment.