A SUMMARY OF USING AND REUSING DATA Welcome to my blog, where we explore how data curation transforms raw information into high-quality, sustainable digital assets. I am Edith Hope Chavula (MLIS0225), and I invite you to dive into the vital world of using and reusing research data to drive continuous scientific value. Together, we will discover how recycling verified data bypasses high collection costs and accelerates global breakthroughs.

May 27, 2026

Asummary of using and reusing data

Today, data curators dive into the heart of modern research ecosystems to explore a phase where scientific investments truly yield continuous, long-term value, using and reusing data. Within the broader research data lifecycle, data curation serves as the bedrock that transforms raw information into high-quality, sustainable digital assets. However, the true magic happens during the ultimate stage of this cycle the reuse phase. By ensuring data remains accessible to designated users on a day-to-day basis whether through openly published information or secured via robust access controls we open the door to endless collaboration. As highlighted by Yoon (2017), this phase allows independent secondary researchers to discover, access, and actively apply existing datasets to address fresh research questions or validate previous scientific discoveries. By systematically recycling verified scientific information, contemporary research institutions can completely bypass the exceptionally high financial costs of fresh data collection while significantly accelerating the overall speed of global scientific breakthroughs.

To ensure that data is in an optimal state where it can be effectively used and reused, specific and rigorous curatorial actions must be performed before the information is permanently archived. According to Johnston et al. (2023), data must first be subjected to meticulous file standardisation, meaning that professional curators must convert restrictive proprietary formats into open, accessible file extensions like comma-separated values (CSV) and extensible markup language (EML) to guarantee long-term software independence. Furthermore, comprehensive documentation and explicit data provenance must be appended directly to the core dataset. As emphasized by Faniel et al. (2022), secondary consumers cannot successfully reuse data unless they thoroughly understand the original data collection methods, variable definitions, and processing history. Therefore, creating robust, machine-readable metadata schemas is a mandatory operational step that curators must complete to make the data fully discoverable and interpretable for future autonomous users.

In addition to documentation, resolving underlying data quality anomalies and legal restrictions is highly essential for data to be reused safely and ethically. According to Mannheimer (2021), data curators must systematically cleanse datasets by identifying missing values, resolving formatting contradictions, and eliminating structural errors that routinely prevent software interoperability across different computing environments. Simultaneously, strict ethical safeguards must be established to navigate complex privacy concerns. According to DuBois et al. (2021), data containing sensitive or human subject information must undergo strict administrative de-identification and masking procedures before public release. Curators must also apply clear Creative Commons or customized user licenses so that secondary researchers legally understand their rights regarding data modification and citation, thereby preventing downstream intellectual property conflicts. Below is the link for data using and reusing vedio.

https://www.youtube.com/live/vDAk5iB-0uI?si=iXi83hpzAa8b1FIK.

In conclusion, the "using and reusing data" phase serves as the definitive benchmark for a dataset's long-term utility, credibility, and overall scientific value. For data to be successfully used and reused, researchers and repository curators must work together to standardize file formats, compile exhaustive metadata, eliminate quality defects, and establish clear legal boundaries. Adhering to these strict curation protocols ensures that shared digital repositories remain reliable, actionable, and legally secure for decades. Ultimately, proper execution of this lifecycle phase transforms isolated data points into enduring public assets that minimize institutional duplication and drive global collaborative research forward.

References

DuBois, J. M., Strait, M., & Walsh, H. (2021). How data curation enables epistemically responsible reuse of qualitative research data. The Qualitative Report, 26(6), 1801-1815.

Faniel, I. M., Austin, M. C., & Yakel, E. (2022). How do properties of data, their curation, and their funding relate to data reuse? Journal of the Association for Information Science and Technology, 73(10), 1433-1446.

Johnston, L. R., Hudson-Vitale, C., Imker, H. J., Kozlowski, W., Olendorf, R., & Sprout, S. (2023). Understanding the value of curation: A survey of researcher satisfaction with academic generalist data repositories. PLOS ONE, 18(11), e0293527.

Mannheimer, S. (2021). Data curation strategies to support responsible big social research and big social data reuse. International Journal of Digital Curation, 16(1), 1-12.

Yoon, A. (2017). Social science research data curation: Issues of reuse. Library & Information Science Research, 39(1), 30-39.