To connect data to a publication, you must upload your dataset to a recognized repository, obtain a permanent Digital Object Identifier (DOI), and formally cite this DOI within your manuscript's data availability statement.
Linking your underlying research data to your academic paper is a fundamental practice in modern open science. It improves transparency, allows other researchers to replicate your findings, and is increasingly mandated by major academic publishers.
Here is the standard process for properly connecting your dataset to your research paper:
Step 1: Choose a Trusted Data Repository
Avoid hosting data on personal websites or cloud storage links (like Google Drive), as these frequently break over time. Instead, upload your files to a recognized data repository. General-purpose repositories like Zenodo, Figshare, or Dryad are excellent choices for most fields. Alternatively, you can use discipline-specific databases, such as GenBank for molecular biology or ICPSR for social sciences.
Step 2: Prepare and Document Your Dataset
Before uploading, ensure your data is clean, anonymized (especially if dealing with human subjects), and saved in open, non-proprietary formats. For example, saving tabular data as a CSV rather than a proprietary Excel file ensures long-term accessibility. You should also include a detailed README file and clear metadata that explains your variables, units of measurement, and how the files correspond to specific figures or tables in your manuscript.
Step 3: Generate a Data DOI
Once your dataset is uploaded and published within the repository, the platform will assign it a unique DOI. Unlike a standard web URL, a DOI is a persistent identifier that guarantees your supplementary materials will always remain discoverable, even if the repository changes its web architecture in the future.
Step 4: Write a Data Availability Statement
Most academic journals now require a Data Availability Statement (DAS), usually located near the end of your manuscript just before the acknowledgments. This short section tells readers exactly where and how they can access your data. A standard DAS should include the name of the repository, any specific access conditions, and the dataset's DOI link.
Step 5: Cite the Dataset in Your References
To create a formal, trackable link between your paper and your data, treat the dataset like any other academic source and include it in your bibliography. When compiling your reference list, WisPaper's TrueCite automatically finds and verifies citations, eliminating the stress of manually formatting dataset references and ensuring your APA or MLA formatting is flawlessly aligned with the repository's metadata. This ensures the connection is officially recognized by global citation indices.

