From 891a6c7a55533b6bba921b42a57a73c52a639c71 Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Mon, 12 Jan 2026 16:39:15 +0100 Subject: [PATCH 01/20] Create research-data.qmd --- topics/research-data.qmd | 6 ++++++ 1 file changed, 6 insertions(+) create mode 100644 topics/research-data.qmd diff --git a/topics/research-data.qmd b/topics/research-data.qmd new file mode 100644 index 000000000..faa2c5ecf --- /dev/null +++ b/topics/research-data.qmd @@ -0,0 +1,6 @@ +--- +title: Research Data +categories: [Research Data] +--- + +## What is Research Data? From dc09602e261b75014baa59ccc64fc17171c63539 Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Tue, 13 Jan 2026 09:31:02 +0100 Subject: [PATCH 02/20] Update research-data.qmd --- topics/research-data.qmd | 20 +++++++++++++++++++- 1 file changed, 19 insertions(+), 1 deletion(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index faa2c5ecf..e6c3f78d4 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -3,4 +3,22 @@ title: Research Data categories: [Research Data] --- -## What is Research Data? +## What is Research Data? + +Research Data can be defined as information that is collected, observed, generated, or reused for the purpose of underpinning academic research. Depending on the discipline, research data may consist of, for example, text, images, audio recordings, video, spreadsheets, databases, statistical data, geographic data, or sensor measurements. + +When we refer to research data in this policy, we refer to the entirety of the data itself, including any associated metadata, documentation, and contextual information necessary to understand and reuse the data. + +There is a widely used definition of research data from the [OECD](): + +> “Research data are factual records (numerical scores, textual records, images, and sounds) used as primary sources for scientific research, and that are commonly accepted in the scientific community as necessary to validate research findings.” + +It is important to note that not all data used in research qualifies as research data. + +For example, administrative data such as HR records, email correspondence, or generic software logs are typically not considered research data unless they are explicitly collected or repurposed to answer a research question. + +Similarly, publicly available datasets that are used without modification are not newly created research data, although they may still require proper documentation, citation, and ethical consideration. + +Research data can be produced at different stages of the research lifecycle, including during data collection, processing and analysis, and validation of results. Different types of data may exist at each stage, such as raw data, processed data, and derived or aggregated datasets. + +_Materials adapted from..._ From ec9f5b53ba7a8bc593358715b8f99a01de215b6d Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Tue, 13 Jan 2026 14:40:22 +0100 Subject: [PATCH 03/20] Update research-data.qmd --- topics/research-data.qmd | 24 ++++++++++++++++++------ 1 file changed, 18 insertions(+), 6 deletions(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index e6c3f78d4..dae102244 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -5,20 +5,32 @@ categories: [Research Data] ## What is Research Data? -Research Data can be defined as information that is collected, observed, generated, or reused for the purpose of underpinning academic research. Depending on the discipline, research data may consist of, for example, text, images, audio recordings, video, spreadsheets, databases, statistical data, geographic data, or sensor measurements. +Research data can be defined as information that is collected, observed, generated, or reused for the purpose of underpinning academic research. Depending on the discipline, research data may consist of, for example, text, images, audio recordings, video, spreadsheets, databases, statistical data, geographic data, or sensor measurements. -When we refer to research data in this policy, we refer to the entirety of the data itself, including any associated metadata, documentation, and contextual information necessary to understand and reuse the data. +When we refer to research data in this policy, we mean the entirety of the data itself, including any associated metadata, documentation, and contextual information necessary to understand, interpret, and reuse the data. -There is a widely used definition of research data from the [OECD](): +A widely used definition of research data is provided by the Organisation for Economic Co-operation and Development (OECD): > “Research data are factual records (numerical scores, textual records, images, and sounds) used as primary sources for scientific research, and that are commonly accepted in the scientific community as necessary to validate research findings.” +This definition originates from the OECD *Principles and Guidelines for Access to Research Data from Public Funding* and is widely referenced in research data management guidance, for example by the [University of Geneva](https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche/). + +--- + +### What is *not* Research Data? + It is important to note that not all data used in research qualifies as research data. -For example, administrative data such as HR records, email correspondence, or generic software logs are typically not considered research data unless they are explicitly collected or repurposed to answer a research question. +For example, administrative data such as HR records, email correspondence, or generic software logs are typically **not** considered research data unless they are explicitly collected or repurposed to answer a research question. + +Similarly, publicly available datasets that are reused without modification are not newly created research data. However, their use may still require appropriate documentation, citation, and ethical or legal consideration. + +--- + +### Research Data Across the Research Lifecycle -Similarly, publicly available datasets that are used without modification are not newly created research data, although they may still require proper documentation, citation, and ethical consideration. +Research data can be produced at different stages of the research lifecycle, including data collection, processing and analysis, and validation of results. Multiple versions of data may exist at each stage, such as raw data, processed data, and derived or aggregated datasets. -Research data can be produced at different stages of the research lifecycle, including during data collection, processing and analysis, and validation of results. Different types of data may exist at each stage, such as raw data, processed data, and derived or aggregated datasets. +Managing these different forms of research data appropriately is essential for transparency, reproducibility, and long-term reuse. _Materials adapted from..._ From 4af0e275e753684ded2525f0a3ca841d9ff3fd0d Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Tue, 13 Jan 2026 14:48:27 +0100 Subject: [PATCH 04/20] Update research-data.qmd --- topics/research-data.qmd | 15 +++++++++++++-- 1 file changed, 13 insertions(+), 2 deletions(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index dae102244..9abd07a4c 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -9,7 +9,7 @@ Research data can be defined as information that is collected, observed, generat When we refer to research data in this policy, we mean the entirety of the data itself, including any associated metadata, documentation, and contextual information necessary to understand, interpret, and reuse the data. -A widely used definition of research data is provided by the Organisation for Economic Co-operation and Development (OECD): +A widely used definition of research data is provided by the Organisation for Economic Co-operation and Development [OECD](https://www.oecd.org/en/publications/frascati-manual-2015_9789264239012-en.html): > “Research data are factual records (numerical scores, textual records, images, and sounds) used as primary sources for scientific research, and that are commonly accepted in the scientific community as necessary to validate research findings.” @@ -33,4 +33,15 @@ Research data can be produced at different stages of the research lifecycle, inc Managing these different forms of research data appropriately is essential for transparency, reproducibility, and long-term reuse. -_Materials adapted from..._ +--- + +## References + +- Organisation for Economic Co-operation and Development (OECD). *Principles and Guidelines for Access to Research Data from Public Funding*. + Definition widely referenced via the [University of Geneva Research Data Management guide](https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche/). + +- Vrije Universiteit Amsterdam. *Research Data and Software Management (RDSM) Policy, version 3.0*. + https://rdm.vu.nl/public/policies-regulations/RDSM-policy-VU-EN-v3.0.pdf + +- Wilkinson, M. D., et al. (2016). *The FAIR Guiding Principles for scientific data management and stewardship*. Scientific Data, 3, 160018. + https://doi.org/10.1038/sdata.2016.18 From 5171d9a2a7413b362775e4adabe6fb1e721af70b Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Tue, 13 Jan 2026 14:57:14 +0100 Subject: [PATCH 05/20] Update research-data.qmd --- topics/research-data.qmd | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index 9abd07a4c..dcdaed160 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -5,7 +5,7 @@ categories: [Research Data] ## What is Research Data? -Research data can be defined as information that is collected, observed, generated, or reused for the purpose of underpinning academic research. Depending on the discipline, research data may consist of, for example, text, images, audio recordings, video, spreadsheets, databases, statistical data, geographic data, or sensor measurements. +[Research data](https://rdm.vu.nl/public/policies-regulations/RDSM-policy-VU-EN-v3.0.pdf) can be defined as information that is collected, observed, generated, or reused for the purpose of underpinning academic research. Depending on the discipline, research data may consist of, for example, text, images, audio recordings, video, spreadsheets, databases, statistical data, geographic data, or sensor measurements. When we refer to research data in this policy, we mean the entirety of the data itself, including any associated metadata, documentation, and contextual information necessary to understand, interpret, and reuse the data. From 352a383d9e1e71cd33eeecffacb94b95681eba89 Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Tue, 13 Jan 2026 15:11:51 +0100 Subject: [PATCH 06/20] Update research-data.qmd --- topics/research-data.qmd | 6 ++++-- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index dcdaed160..7d4ce9651 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -5,9 +5,11 @@ categories: [Research Data] ## What is Research Data? -[Research data](https://rdm.vu.nl/public/policies-regulations/RDSM-policy-VU-EN-v3.0.pdf) can be defined as information that is collected, observed, generated, or reused for the purpose of underpinning academic research. Depending on the discipline, research data may consist of, for example, text, images, audio recordings, video, spreadsheets, databases, statistical data, geographic data, or sensor measurements. +When we talk about Research Data at VU Amsterdam, it is defined as the following: -When we refer to research data in this policy, we mean the entirety of the data itself, including any associated metadata, documentation, and contextual information necessary to understand, interpret, and reuse the data. +> "Information that is collected, observed, generated, or reused for the purpose of underpinning academic research. Depending on the discipline, research data may consist of, for example, text, images, audio recordings, video, spreadsheets, databases, statistical data, geographic data, or sensor measurements." + +When we refer to research data in the [Research Data and Software Management Policy](https://rdm.vu.nl/public/policies-regulations/RDSM-policy-VU-EN-v3.0.pdf), we mean the entirety of the data itself, including any associated metadata, documentation, and contextual information necessary to understand, interpret, and reuse the data. A widely used definition of research data is provided by the Organisation for Economic Co-operation and Development [OECD](https://www.oecd.org/en/publications/frascati-manual-2015_9789264239012-en.html): From fe4a61f93e511f71a9bf0346614cf2c432c2c3d2 Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Tue, 13 Jan 2026 15:13:53 +0100 Subject: [PATCH 07/20] Update research-data.qmd --- topics/research-data.qmd | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index 7d4ce9651..e839ecb89 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -31,7 +31,7 @@ Similarly, publicly available datasets that are reused without modification are ### Research Data Across the Research Lifecycle -Research data can be produced at different stages of the research lifecycle, including data collection, processing and analysis, and validation of results. Multiple versions of data may exist at each stage, such as raw data, processed data, and derived or aggregated datasets. +Research data can be produced at different stages of the research lifecycle, including data collection, processing and analysis, and validation of results. Multiple versions of data may exist at each stage, such as raw data, processed data, and derived or aggregated datasets. Managing these different forms of research data appropriately is essential for transparency, reproducibility, and long-term reuse, in line with the FAIR data principles (Wilkinson et al., 2016). Managing these different forms of research data appropriately is essential for transparency, reproducibility, and long-term reuse. From 367d2d92cc9ddc7b0323fa4920816b7ab6b6b333 Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Tue, 13 Jan 2026 15:14:57 +0100 Subject: [PATCH 08/20] Update research-data.qmd --- topics/research-data.qmd | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index e839ecb89..6cadbf536 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -19,7 +19,7 @@ This definition originates from the OECD *Principles and Guidelines for Access t --- -### What is *not* Research Data? +## What is *not* Research Data? It is important to note that not all data used in research qualifies as research data. @@ -29,7 +29,7 @@ Similarly, publicly available datasets that are reused without modification are --- -### Research Data Across the Research Lifecycle +## Research Data Across the Research Lifecycle Research data can be produced at different stages of the research lifecycle, including data collection, processing and analysis, and validation of results. Multiple versions of data may exist at each stage, such as raw data, processed data, and derived or aggregated datasets. Managing these different forms of research data appropriately is essential for transparency, reproducibility, and long-term reuse, in line with the FAIR data principles (Wilkinson et al., 2016). From 5782e01a3e6a0c53c63337238472763b51437b03 Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Thu, 15 Jan 2026 09:49:54 +0100 Subject: [PATCH 09/20] Update research-data.qmd examples of research data still need some details --- topics/research-data.qmd | 26 ++++++++++++++++++++++++-- 1 file changed, 24 insertions(+), 2 deletions(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index 6cadbf536..389d6f7c1 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -11,11 +11,31 @@ When we talk about Research Data at VU Amsterdam, it is defined as the following When we refer to research data in the [Research Data and Software Management Policy](https://rdm.vu.nl/public/policies-regulations/RDSM-policy-VU-EN-v3.0.pdf), we mean the entirety of the data itself, including any associated metadata, documentation, and contextual information necessary to understand, interpret, and reuse the data. -A widely used definition of research data is provided by the Organisation for Economic Co-operation and Development [OECD](https://www.oecd.org/en/publications/frascati-manual-2015_9789264239012-en.html): +A widely used definition of research data is provided by the Organisation for Economic Co-operation and Development [OECD](https://www.oecd.org/en/publications/oecd-principles-and-guidelines-for-access-to-research-data-from-public-funding_9789264034020-en-fr.html): > “Research data are factual records (numerical scores, textual records, images, and sounds) used as primary sources for scientific research, and that are commonly accepted in the scientific community as necessary to validate research findings.” -This definition originates from the OECD *Principles and Guidelines for Access to Research Data from Public Funding* and is widely referenced in research data management guidance, for example by the [University of Geneva](https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche/). +--- + +## Examples of Research Data + +Research data can be created in many formats and through a wide range of research methods. Nearly all fields of study and academic disciplines generate research data, including mathmatics, social sciences, computer science, humanities, and law. + +Some examples of research data include: + +Text-based files such as documents, spreadsheets, and presentation slides +Images, photographs, films, and other visual materials +Survey data, interview transcripts, and codebooks +Physical samples and genomic or sequence data +Laboratory and field notebooks +Audio and video recordings +Computer code, algorithms, models, and scripts +Research methodologies, protocols, and workflows +Bibliographies and reference datasets + +Because research data takes on many forms, it can be difficult to clearly identify them. The pyramid below, developed by Andoefer (2015) and provided by the [University of Geneva] (https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche), provides a useful framework for understanding how research data function within the research process, especially in social sciences and humanities. + +Insert image here.... --- @@ -39,6 +59,8 @@ Managing these different forms of research data appropriately is essential for t ## References +- Andorfer, P. (2015). Forschen und Forschungsdaten in den Geisteswissenschaften: Zwischenbericht einer Interviewreihe. Niedersächsische Staats- und Universitätsbibliothek Göttingen. + - Organisation for Economic Co-operation and Development (OECD). *Principles and Guidelines for Access to Research Data from Public Funding*. Definition widely referenced via the [University of Geneva Research Data Management guide](https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche/). From 489deea22007328b4b86cc943f301f7a2b526fdb Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Thu, 15 Jan 2026 13:40:26 +0100 Subject: [PATCH 10/20] Update links and wording research-data.qmd --- topics/research-data.qmd | 38 +++++++++++++++++++------------------- 1 file changed, 19 insertions(+), 19 deletions(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index 389d6f7c1..876cc65ae 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -9,11 +9,11 @@ When we talk about Research Data at VU Amsterdam, it is defined as the following > "Information that is collected, observed, generated, or reused for the purpose of underpinning academic research. Depending on the discipline, research data may consist of, for example, text, images, audio recordings, video, spreadsheets, databases, statistical data, geographic data, or sensor measurements." -When we refer to research data in the [Research Data and Software Management Policy](https://rdm.vu.nl/public/policies-regulations/RDSM-policy-VU-EN-v3.0.pdf), we mean the entirety of the data itself, including any associated metadata, documentation, and contextual information necessary to understand, interpret, and reuse the data. +In the context of the [Research Data and Software Management Policy](https://rdm.vu.nl/public/policies-regulations/RDSM-policy-VU-EN-v3.0.pdf), research data refers to the **entirety of the data**, including associated metadata, documentation, and contextual information required to understand, interpret, and reuse the data. -A widely used definition of research data is provided by the Organisation for Economic Co-operation and Development [OECD](https://www.oecd.org/en/publications/oecd-principles-and-guidelines-for-access-to-research-data-from-public-funding_9789264034020-en-fr.html): +This understanding aligns with widely accepted international definitions. For example, the Organisation for Economic Co-operation and Development (OECD) defines research data as: -> “Research data are factual records (numerical scores, textual records, images, and sounds) used as primary sources for scientific research, and that are commonly accepted in the scientific community as necessary to validate research findings.” +> “Factual records (numerical scores, textual records, images, and sounds) used as primary sources for scientific research, and that are commonly accepted in the scientific community as necessary to validate research findings.” --- @@ -23,27 +23,27 @@ Research data can be created in many formats and through a wide range of researc Some examples of research data include: -Text-based files such as documents, spreadsheets, and presentation slides -Images, photographs, films, and other visual materials -Survey data, interview transcripts, and codebooks -Physical samples and genomic or sequence data -Laboratory and field notebooks -Audio and video recordings -Computer code, algorithms, models, and scripts -Research methodologies, protocols, and workflows -Bibliographies and reference datasets +- Text-based files such as documents, spreadsheets, and presentation slides +- Images, photographs, films, and other visual materials +- Survey data, interview transcripts, and codebooks +- Physical samples and genomic or sequence data +- Laboratory and field notebooks +- Audio and video recordings +- Computer code, algorithms, models, and scripts +- Research methodologies, protocols, and workflows +- Bibliographies and curated reference datasets -Because research data takes on many forms, it can be difficult to clearly identify them. The pyramid below, developed by Andoefer (2015) and provided by the [University of Geneva] (https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche), provides a useful framework for understanding how research data function within the research process, especially in social sciences and humanities. +Because research data can take many forms, it is not always easy to identify what qualifies as research data. The pyramid below, developed by Andorfer (2015) and provided by the [University of Geneva](https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche), offers a useful framework for understanding how different types of research data function within the research process, particularly in the social sciences and humanities. Insert image here.... --- -## What is *not* Research Data? +## What Is *Not* Research Data? -It is important to note that not all data used in research qualifies as research data. +Not all data used in a research context qualifies as research data. -For example, administrative data such as HR records, email correspondence, or generic software logs are typically **not** considered research data unless they are explicitly collected or repurposed to answer a research question. +Administrative or operational data—such as HR records, routine email correspondence, or generic software logs—are generally **not** considered research data unless they are explicitly collected or repurposed to answer a research question. Similarly, publicly available datasets that are reused without modification are not newly created research data. However, their use may still require appropriate documentation, citation, and ethical or legal consideration. @@ -51,15 +51,15 @@ Similarly, publicly available datasets that are reused without modification are ## Research Data Across the Research Lifecycle -Research data can be produced at different stages of the research lifecycle, including data collection, processing and analysis, and validation of results. Multiple versions of data may exist at each stage, such as raw data, processed data, and derived or aggregated datasets. Managing these different forms of research data appropriately is essential for transparency, reproducibility, and long-term reuse, in line with the FAIR data principles (Wilkinson et al., 2016). +Research data can be generated at multiple stages of the research lifecycle, including data collection, processing and analysis, and validation of results. At each stage, multiple versions of data may exist, such as raw data, cleaned or processed data, and derived or aggregated datasets. -Managing these different forms of research data appropriately is essential for transparency, reproducibility, and long-term reuse. +Appropriate management of these different forms of research data is essential for ensuring transparency, reproducibility, and long-term reuse, in line with the [FAIR data principles](https://rdm.vu.nl/topics/fair-principles.html) (Wilkinson et al., 2016). --- ## References -- Andorfer, P. (2015). Forschen und Forschungsdaten in den Geisteswissenschaften: Zwischenbericht einer Interviewreihe. Niedersächsische Staats- und Universitätsbibliothek Göttingen. +- Andorfer, P. (2015). *Forschen und Forschungsdaten in den Geisteswissenschaften: Zwischenbericht einer Interviewreihe*. Niedersächsische Staats- und Universitätsbibliothek Göttingen. - Organisation for Economic Co-operation and Development (OECD). *Principles and Guidelines for Access to Research Data from Public Funding*. Definition widely referenced via the [University of Geneva Research Data Management guide](https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche/). From 79bee7f5a938b5b0695bfa5ded542aea0a23e22b Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Mon, 19 Jan 2026 11:16:30 +0100 Subject: [PATCH 11/20] Update research-data.qmd categories of research data --- topics/research-data.qmd | 23 +++++++++++++++-------- 1 file changed, 15 insertions(+), 8 deletions(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index 876cc65ae..ba2aff111 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -11,7 +11,7 @@ When we talk about Research Data at VU Amsterdam, it is defined as the following In the context of the [Research Data and Software Management Policy](https://rdm.vu.nl/public/policies-regulations/RDSM-policy-VU-EN-v3.0.pdf), research data refers to the **entirety of the data**, including associated metadata, documentation, and contextual information required to understand, interpret, and reuse the data. -This understanding aligns with widely accepted international definitions. For example, the Organisation for Economic Co-operation and Development (OECD) defines research data as: +This understanding aligns with widely accepted international definitions. For example, the Organisation for Economic Co-operation and Development [OECD](https://www.oecd.org/en/publications/oecd-principles-and-guidelines-for-access-to-research-data-from-public-funding_9789264034020-en-fr.html) defines research data as: > “Factual records (numerical scores, textual records, images, and sounds) used as primary sources for scientific research, and that are commonly accepted in the scientific community as necessary to validate research findings.” @@ -31,12 +31,22 @@ Some examples of research data include: - Audio and video recordings - Computer code, algorithms, models, and scripts - Research methodologies, protocols, and workflows -- Bibliographies and curated reference datasets +- Bibliographies and reference datasets Because research data can take many forms, it is not always easy to identify what qualifies as research data. The pyramid below, developed by Andorfer (2015) and provided by the [University of Geneva](https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche), offers a useful framework for understanding how different types of research data function within the research process, particularly in the social sciences and humanities. Insert image here.... +### Five Categories of Research Data + +The [University of Bristol](https://data.blogs.bristol.ac.uk/bootcamp/data/) identifies five main catergories of research data based on how they are produced and whether they can be reproduced: + +- Observational: Data collected in real time within a specific context; typically unique and irreplaceable (e.g. neuroimaging, surveys, field recordings). +- Experimental: Data generated using laboratory instruments or standardized methods; potentially reproducible but often costly and time-consuming (e.g. gene sequences, animal experiments). +- Simulation / models: Data produced by computational or experimental models, where the model itself is often the primary research output (e.g. climate or economic models). +- Derived / compiled: Data created by processing, aggregating, or combining raw data (e.g. data mining outputs, compiled databases such as the UNSCdeb8 database). +- Reference: Curated corpora or collections that serve as authoritative resources in a field (e.g. gene databases, archival collections, historical image databases). + --- ## What Is *Not* Research Data? @@ -53,19 +63,16 @@ Similarly, publicly available datasets that are reused without modification are Research data can be generated at multiple stages of the research lifecycle, including data collection, processing and analysis, and validation of results. At each stage, multiple versions of data may exist, such as raw data, cleaned or processed data, and derived or aggregated datasets. -Appropriate management of these different forms of research data is essential for ensuring transparency, reproducibility, and long-term reuse, in line with the [FAIR data principles](https://rdm.vu.nl/topics/fair-principles.html) (Wilkinson et al., 2016). +Appropriate management of these different forms of research data is essential for ensuring transparency, reproducibility, and long-term reuse, in line with [FAIR data principles](https://rdm.vu.nl/topics/fair-principles.html) (Wilkinson et al., 2016). --- ## References - Andorfer, P. (2015). *Forschen und Forschungsdaten in den Geisteswissenschaften: Zwischenbericht einer Interviewreihe*. Niedersächsische Staats- und Universitätsbibliothek Göttingen. - -- Organisation for Economic Co-operation and Development (OECD). *Principles and Guidelines for Access to Research Data from Public Funding*. - Definition widely referenced via the [University of Geneva Research Data Management guide](https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche/). - +- Organisation for Economic Co-operation and Development. (2007). OECD principles and guidelines for access to research data from public funding. OECD Publishing. https://doi.org/10.1787/9789264034020-en-fr - Vrije Universiteit Amsterdam. *Research Data and Software Management (RDSM) Policy, version 3.0*. https://rdm.vu.nl/public/policies-regulations/RDSM-policy-VU-EN-v3.0.pdf - +- University of Geneva. (n.d.). Identify research data. Researchdata. https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche - Wilkinson, M. D., et al. (2016). *The FAIR Guiding Principles for scientific data management and stewardship*. Scientific Data, 3, 160018. https://doi.org/10.1038/sdata.2016.18 From e75dc1899e70ce5e48105368e4a8539f12d834bd Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Thu, 22 Jan 2026 15:55:43 +0100 Subject: [PATCH 12/20] Update research-data.qmd --- topics/research-data.qmd | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index ba2aff111..ca0934833 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -35,7 +35,8 @@ Some examples of research data include: Because research data can take many forms, it is not always easy to identify what qualifies as research data. The pyramid below, developed by Andorfer (2015) and provided by the [University of Geneva](https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche), offers a useful framework for understanding how different types of research data function within the research process, particularly in the social sciences and humanities. -Insert image here.... + ![A three layer pyramid that displays the different types of research data](public/research data pyramid-eng_universitedegeneve.png) + ### Five Categories of Research Data From 297a16a0ddf92fae4d5fb177f84f43a5721ddb62 Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Thu, 22 Jan 2026 16:01:10 +0100 Subject: [PATCH 13/20] Update research-data.qmd --- topics/research-data.qmd | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index ca0934833..d444b02e0 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -35,7 +35,7 @@ Some examples of research data include: Because research data can take many forms, it is not always easy to identify what qualifies as research data. The pyramid below, developed by Andorfer (2015) and provided by the [University of Geneva](https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche), offers a useful framework for understanding how different types of research data function within the research process, particularly in the social sciences and humanities. - ![A three layer pyramid that displays the different types of research data](public/research data pyramid-eng_universitedegeneve.png) +![A three-layer pyramid showing different types of research data](public/research-data-pyramid-eng_universitedegeneve.png) ### Five Categories of Research Data From 209e2aa3505de111710a2a4eff98c7702ebbcfa5 Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Thu, 22 Jan 2026 16:11:51 +0100 Subject: [PATCH 14/20] Update research-data.qmd descriptive text --- topics/research-data.qmd | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index d444b02e0..3973d755d 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -35,7 +35,7 @@ Some examples of research data include: Because research data can take many forms, it is not always easy to identify what qualifies as research data. The pyramid below, developed by Andorfer (2015) and provided by the [University of Geneva](https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche), offers a useful framework for understanding how different types of research data function within the research process, particularly in the social sciences and humanities. -![A three-layer pyramid showing different types of research data](public/research-data-pyramid-eng_universitedegeneve.png) +![Three-layer pyramid illustrating different types of research data, from primary sources at the base to publications at the top](public/research-data-pyramid-eng_universitedegeneve.png) ### Five Categories of Research Data From 3e4dbc3b839404e388e8d90e1b1825f8cac700d0 Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Fri, 23 Jan 2026 09:27:42 +0100 Subject: [PATCH 15/20] Update research-data.qmd --- topics/research-data.qmd | 13 +------------ 1 file changed, 1 insertion(+), 12 deletions(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index 3973d755d..8a2aee35c 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -35,18 +35,7 @@ Some examples of research data include: Because research data can take many forms, it is not always easy to identify what qualifies as research data. The pyramid below, developed by Andorfer (2015) and provided by the [University of Geneva](https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche), offers a useful framework for understanding how different types of research data function within the research process, particularly in the social sciences and humanities. -![Three-layer pyramid illustrating different types of research data, from primary sources at the base to publications at the top](public/research-data-pyramid-eng_universitedegeneve.png) - - -### Five Categories of Research Data - -The [University of Bristol](https://data.blogs.bristol.ac.uk/bootcamp/data/) identifies five main catergories of research data based on how they are produced and whether they can be reproduced: - -- Observational: Data collected in real time within a specific context; typically unique and irreplaceable (e.g. neuroimaging, surveys, field recordings). -- Experimental: Data generated using laboratory instruments or standardized methods; potentially reproducible but often costly and time-consuming (e.g. gene sequences, animal experiments). -- Simulation / models: Data produced by computational or experimental models, where the model itself is often the primary research output (e.g. climate or economic models). -- Derived / compiled: Data created by processing, aggregating, or combining raw data (e.g. data mining outputs, compiled databases such as the UNSCdeb8 database). -- Reference: Curated corpora or collections that serve as authoritative resources in a field (e.g. gene databases, archival collections, historical image databases). +![Three-layer pyramid illustrating different types of research data, from primary sources at the base to publications at the top.](public/research-data-pyramid-eng_universitedegeneve.png) --- From a434fb14efaa5decb43b6e0a50503d2adcee8b0d Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Mon, 23 Feb 2026 14:50:13 +0100 Subject: [PATCH 16/20] Add additional informatopm research-data.qmd I really liked reading through the RDM guidebook for students as it provided some additional context into different types and categories of research data that I thought would be beneficial to add here. --- topics/research-data.qmd | 83 ++++++++++++++++++++++++++++------------ 1 file changed, 59 insertions(+), 24 deletions(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index 8a2aee35c..7399d837b 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -5,13 +5,13 @@ categories: [Research Data] ## What is Research Data? -When we talk about Research Data at VU Amsterdam, it is defined as the following: +At VU Amsterdam, research data is defined as: > "Information that is collected, observed, generated, or reused for the purpose of underpinning academic research. Depending on the discipline, research data may consist of, for example, text, images, audio recordings, video, spreadsheets, databases, statistical data, geographic data, or sensor measurements." In the context of the [Research Data and Software Management Policy](https://rdm.vu.nl/public/policies-regulations/RDSM-policy-VU-EN-v3.0.pdf), research data refers to the **entirety of the data**, including associated metadata, documentation, and contextual information required to understand, interpret, and reuse the data. -This understanding aligns with widely accepted international definitions. For example, the Organisation for Economic Co-operation and Development [OECD](https://www.oecd.org/en/publications/oecd-principles-and-guidelines-for-access-to-research-data-from-public-funding_9789264034020-en-fr.html) defines research data as: +This definition aligns with widely accepted international standards. For example, the Organisation for Economic Co-operation and Development [OECD](https://www.oecd.org/en/publications/oecd-principles-and-guidelines-for-access-to-research-data-from-public-funding_9789264034020-en-fr.html) defines research data as: > “Factual records (numerical scores, textual records, images, and sounds) used as primary sources for scientific research, and that are commonly accepted in the scientific community as necessary to validate research findings.” @@ -19,21 +19,59 @@ This understanding aligns with widely accepted international definitions. For ex ## Examples of Research Data -Research data can be created in many formats and through a wide range of research methods. Nearly all fields of study and academic disciplines generate research data, including mathmatics, social sciences, computer science, humanities, and law. +Research data can be created in many formats and through a wide range of research methods. Nearly all academic disciplines produce research data, including mathematics, social sciences, computer science, the humanities, and law. -Some examples of research data include: +Examples include: -- Text-based files such as documents, spreadsheets, and presentation slides -- Images, photographs, films, and other visual materials -- Survey data, interview transcripts, and codebooks -- Physical samples and genomic or sequence data -- Laboratory and field notebooks -- Audio and video recordings -- Computer code, algorithms, models, and scripts -- Research methodologies, protocols, and workflows -- Bibliographies and reference datasets +- Text-based files such as documents, spreadsheets, and presentation slides +- Images, photographs, films, and other visual materials +- Survey data, interview transcripts, and codebooks +- Physical samples and genomic or sequence data +- Laboratory and field notebooks +- Audio and video recordings +- Computer code, algorithms, models, and scripts +- Research methodologies, protocols, and workflows +- Bibliographies and reference datasets -Because research data can take many forms, it is not always easy to identify what qualifies as research data. The pyramid below, developed by Andorfer (2015) and provided by the [University of Geneva](https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche), offers a useful framework for understanding how different types of research data function within the research process, particularly in the social sciences and humanities. +While these examples show the variety of formats, understanding research data also requires looking beyond format to how data are produced, sourced, and regulated. + +--- + +### Categorisation by Production Method + +The University of Bristol identifies five main categories of research data based on how they are produced and whether they can be reproduced: + +- **Observational**: Data collected in real time within a specific context; typically unique and irreplaceable (e.g. neuroimaging, surveys, field recordings). +- **Experimental**: Data generated using laboratory instruments or standardized methods; potentially reproducible but often costly and time-consuming (e.g. gene sequences, animal experiments). +- **Simulation / Models**: Data produced by computational or experimental models, where the model itself is often the primary research output (e.g. climate or economic models). +- **Derived / Compiled**: Data created by processing, aggregating, or combining raw data (e.g. data mining outputs, compiled databases). +- **Reference**: Curated corpora or collections that serve as authoritative resources in a field (e.g. gene databases, archival collections, historical image databases). + +These categories highlight that research data may include both raw materials and processed or curated outputs. + +--- + +### Categorisation by Source, Type, and Privacy Risk + +Research data can also be categorised in other ways, including: + +- **By source**: primary vs. secondary data +- **By type**: qualitative vs. quantitative data +- **By privacy risk**: personal vs. non-personal data; sensitive vs. non-sensitive data + +**Primary vs. Secondary Data** +Primary data are collected or generated by the researcher specifically for a particular project and are usually gathered for the first time. Secondary data are obtained from existing sources, such as researchers, institutions, companies, databases, publications, or the internet. Secondary data may also include data previously collected for other projects, including earlier work within the same research group. + +**Quantitative vs. Qualitative Data** +Quantitative data consist of numerical information that can be measured, counted, and statistically analyzed (e.g. age, income, temperature, survey ratings). Qualitative data consist of descriptive, non-numerical information such as interview transcripts, observations, open-ended responses, images, or archival documents. These data are typically analyzed using interpretative methods such as thematic or content analysis. + +**Personal vs. Non-Personal Data** +Personal data can be traced back to a living individual and are subject to legal and ethical requirements, including GDPR compliance in Europe. Their use may require additional safeguards, such as secure storage, GDPR registration, or a Data Protection Impact Assessment (DPIA). Ethical review and informed consent procedures are often required when working with personal data. Non-personal data cannot be linked to a living individual, although ethical considerations may still apply. + +**Special Categories of Personal Data** +Certain types of personal data are considered particularly sensitive and require enhanced protection measures. These include health information, genetic and biometric data, racial or ethnic origin, religious or philosophical beliefs, political opinions, sexual orientation, and trade union membership. Additional security measures and consultation with a privacy or data protection officer are recommended when working with these data. + +Because research data can take so many forms and functions, it is not always straightforward to determine what qualifies as research data within a specific project. The pyramid below, developed by Andorfer (2015), provided by the University of Geneva, offers a useful conceptual framework for understanding how different types of research data function within the research process, particularly in the social sciences and humanities. ![Three-layer pyramid illustrating different types of research data, from primary sources at the base to publications at the top.](public/research-data-pyramid-eng_universitedegeneve.png) @@ -41,9 +79,7 @@ Because research data can take many forms, it is not always easy to identify wha ## What Is *Not* Research Data? -Not all data used in a research context qualifies as research data. - -Administrative or operational data—such as HR records, routine email correspondence, or generic software logs—are generally **not** considered research data unless they are explicitly collected or repurposed to answer a research question. +Although the concept of research data is broad, it is not unlimited. Administrative or operational data—such as HR records, routine email correspondence, or generic software logs—are generally **not** considered research data unless they are explicitly collected or repurposed to answer a research question. Similarly, publicly available datasets that are reused without modification are not newly created research data. However, their use may still require appropriate documentation, citation, and ethical or legal consideration. @@ -51,18 +87,17 @@ Similarly, publicly available datasets that are reused without modification are ## Research Data Across the Research Lifecycle -Research data can be generated at multiple stages of the research lifecycle, including data collection, processing and analysis, and validation of results. At each stage, multiple versions of data may exist, such as raw data, cleaned or processed data, and derived or aggregated datasets. +Beyond defining and categorising research data, it is also important to consider how data evolve over the course of a research project. Research data can be generated at multiple stages, including data collection, processing and analysis, and validation of results. At each stage, multiple versions of data may exist, such as raw data, cleaned or processed data, and derived or aggregated datasets. Appropriate management of these different forms of research data is essential for ensuring transparency, reproducibility, and long-term reuse, in line with [FAIR data principles](https://rdm.vu.nl/topics/fair-principles.html) (Wilkinson et al., 2016). --- -## References +## Reference and Further Reading - Andorfer, P. (2015). *Forschen und Forschungsdaten in den Geisteswissenschaften: Zwischenbericht einer Interviewreihe*. Niedersächsische Staats- und Universitätsbibliothek Göttingen. +- Li, M., Marcoux, K., Nazareth, D., Nikuze, A., & Plomp, W. (2025, December). Research Data Management Guidebook for Students. Zenodo. https://doi.org/10.5281/zenodo.15576176 - Organisation for Economic Co-operation and Development. (2007). OECD principles and guidelines for access to research data from public funding. OECD Publishing. https://doi.org/10.1787/9789264034020-en-fr -- Vrije Universiteit Amsterdam. *Research Data and Software Management (RDSM) Policy, version 3.0*. - https://rdm.vu.nl/public/policies-regulations/RDSM-policy-VU-EN-v3.0.pdf -- University of Geneva. (n.d.). Identify research data. Researchdata. https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche -- Wilkinson, M. D., et al. (2016). *The FAIR Guiding Principles for scientific data management and stewardship*. Scientific Data, 3, 160018. - https://doi.org/10.1038/sdata.2016.18 +- University of Geneva. (n.d.). Identify research data. Researchdata. https://www.unige.ch/researchdata/generate-collect/identifier-donnees-de-recherche +- Vrije Universiteit Amsterdam. *Research Data and Software Management (RDSM) Policy, version 3.0*. https://rdm.vu.nl/public/policies-regulations/RDSM-policy-VU-EN-v3.0.pdf +- Wilkinson, M. D., et al. (2016). *The FAIR Guiding Principles for scientific data management and stewardship*. Scientific Data, 3, 160018. https://doi.org/10.1038/sdata.2016.18 From 1c6431a755402a77aa87c30e1409679149a1e6a4 Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Thu, 19 Mar 2026 11:23:53 +0100 Subject: [PATCH 17/20] Update research-data.qmd --- topics/research-data.qmd | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index 7399d837b..c6c3fc766 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -5,7 +5,7 @@ categories: [Research Data] ## What is Research Data? -At VU Amsterdam, research data is defined as: +When we talk about Research Data at VU Amsterdam, it is defined as the following: > "Information that is collected, observed, generated, or reused for the purpose of underpinning academic research. Depending on the discipline, research data may consist of, for example, text, images, audio recordings, video, spreadsheets, databases, statistical data, geographic data, or sensor measurements." From 4576f07edef99496737a33b9e24d885335280f17 Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Tue, 24 Mar 2026 16:20:23 +0100 Subject: [PATCH 18/20] Update research-data.qmd --- topics/research-data.qmd | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index c6c3fc766..2980a1cec 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -73,7 +73,7 @@ Certain types of personal data are considered particularly sensitive and require Because research data can take so many forms and functions, it is not always straightforward to determine what qualifies as research data within a specific project. The pyramid below, developed by Andorfer (2015), provided by the University of Geneva, offers a useful conceptual framework for understanding how different types of research data function within the research process, particularly in the social sciences and humanities. -![Three-layer pyramid illustrating different types of research data, from primary sources at the base to publications at the top.](public/research-data-pyramid-eng_universitedegeneve.png) +![Three-layer pyramid illustrating different types of research data, from primary sources at the base to publications at the top.](public/research data pyramid-eng_universitedegeneve.png) --- From d8136faec978a982c916596665b897f29dc592d0 Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Tue, 7 Apr 2026 13:56:48 +0200 Subject: [PATCH 19/20] Update research-data.qmd correct link to photo --- topics/research-data.qmd | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/topics/research-data.qmd b/topics/research-data.qmd index 2980a1cec..d2dcb1fcf 100644 --- a/topics/research-data.qmd +++ b/topics/research-data.qmd @@ -73,7 +73,7 @@ Certain types of personal data are considered particularly sensitive and require Because research data can take so many forms and functions, it is not always straightforward to determine what qualifies as research data within a specific project. The pyramid below, developed by Andorfer (2015), provided by the University of Geneva, offers a useful conceptual framework for understanding how different types of research data function within the research process, particularly in the social sciences and humanities. -![Three-layer pyramid illustrating different types of research data, from primary sources at the base to publications at the top.](public/research data pyramid-eng_universitedegeneve.png) +![Three-layer pyramid illustrating different types of research data, from primary sources at the base to publications at the top.](../public/research data pyramid-eng_universitedegeneve.png) --- From 08ae1bcb9a41d9323404a69d0a52ffc132a2fee9 Mon Sep 17 00:00:00 2001 From: mtpeterson901 Date: Tue, 7 Apr 2026 13:57:35 +0200 Subject: [PATCH 20/20] Update research-data.qmd