Project Overview
Trends toward open science practices, along with advances in technology, have promoted increased data archiving in recent years, thus bringing new attention to the reuse of archived qualitative data. Qualitative data reuse can increase efficiency and reduce the burden on research subjects, since new studies can be conducted without collecting new data. Qualitative data reuse also supports larger-scale, longitudinal research by combining datasets to analyze more participants. At the same time, qualitative research data can increasingly be collected from online sources. Social scientists can access and analyze personal narratives and social interactions through social media such as blogs, vlogs, online forums, and posts and interactions from social networking sites like Facebook and Twitter. These big social data have been celebrated as an unprecedented resource for data analytics, capable of producing insights about human behavior on a massive scale. However, both types of research also present key epistemological, ethical, and legal issues. This study explores the issues of context, data quality and trustworthiness, data comparability, informed consent, privacy and confidentiality, and intellectual property and data ownership, with a focus on data curation strategies. The research suggests that connecting qualitative researchers, big social researchers, and curators can enhance responsible practices for qualitative data reuse and big social research.
This study addressed the following research questions:
RQ1: How is big social data curation similar to and different from qualitative data curation?
RQ1a: How are epistemological, ethical, and legal issues different or similar for qualitative data reuse and big social research?
RQ1b: How can data curation practices such as metadata and archiving help address and resolve some of these epistemological and ethical issues?
RQ2: What are the implications of these similarities and differences for big social data curation and qualitative data curation, and what can we learn from combining these two conversations?
Data Description and Collection Overview
The data in this study were collected using semi-structured interviews centered on specific incidents of qualitative data archiving or reuse, big social research, or data curation. Participants were therefore drawn from three categories: researchers who had used big social data, qualitative researchers who had published or reused qualitative data, and data curators who had worked with one or both types of data. Six key issues were identified in a literature review and were then used to structure three interview guides for the semi-structured interviews. The six issues are context, data quality and trustworthiness, data comparability, informed consent, privacy and confidentiality, and intellectual property and data ownership.
Participants were limited to those working in the United States. Ten participants were interviewed from each of the three target populations: big social researchers, qualitative researchers who had published or reused data, and data curators. The interviews were conducted between March 11 and October 6, 2021. During scheduling, participants received an email asking them to identify a critical incident prior to the interview. The “incident” in the critical incident interviewing technique is a specific example that focuses a participant’s answers to the interview questions.
Participants were asked for permission to record the interviews, which were recorded using the built-in recording feature of the Zoom videoconferencing software. The author also took notes during the interviews. Otter.ai speech-to-text software was used to create initial transcriptions of the interview recordings, and a hired undergraduate student hand-edited the transcripts for accuracy. The transcripts were then manually de-identified.
The author analyzed the interview transcripts using a qualitative content analysis approach, which combined deductive and inductive coding. After reviewing the research questions, the author used NVivo software to identify chunks of text in the interview transcripts that represented key themes of the research. Because the interviews were structured around the six key issues identified in the literature review, the author deductively created a parent code for each issue: context, data quality and trustworthiness, data comparability, informed consent, privacy and confidentiality, and intellectual property and data ownership. The author then used inductive coding to create sub-codes beneath each of these parent codes.
Selection and Organization of Shared Data
The data files consist of 28 interview transcripts: transcripts from Big Social Researchers (BSR), Data Curators (DC), and Qualitative Researchers (QR). Two participants declined to have their data shared. Additional data files include the redacted interview analysis and participant summaries (Memos). The documentation files include the individual interview guides for each participant category, two versions of the consent form, a codebook, the emails sent to participants, a file of interview dates and durations, and the IRB protocol.