Research Data Resources

The following sites have come to our attention through various sources; their inclusion in this section does not indicate endorsement by Scientific Library staff.  These sites include online and print resources to guide researchers in managing research data through data management plans (DMPs) and data standards, locating research datasets for reuse, and educational resources to learn about the research data lifecycle, data management, data analysis, and data visualization.  A list of cancer-related data resources related to NCI research is also included.

 

Data Sharing Policies and Data Management Plans

Data Standards

Finding Datasets

              Data Repositories

              Data Publications

              Using PubMed

Data Learning Resources

              Data Lifecycle

              Data Management

              Data Analysis and Visualization

   General Data Education Resources

   Print and E-Books about Data Science

NCI Data Resources

              Data Ecosystem Overview and Resource List

              Repositories and Data Portals

 

Data Sharing Policies and Data Management Plans

Find resources to identify research funder requirements related to data sharing, and use checklists and templates to create data management plans (DMPs) to meet funder requirements.

 

Data Standards

Locate resources for standardization of data formatting and metadata related to datasets.

 

Finding Datasets

Find datasets for reference and reuse through data repositories, data publications, and using PubMed.

 

Data Repositories

  • DataMed – A portal being developed for the NIH BD2K Data Discovery Index (DDI) by the bioCADDIE project team to search across biomedical datasets and repositories.
  • Dryad - Dryad is a nonprofit repository for data underlying the international scientific and medical literature.
  • Figshare  – A cross-disciplinary repository where users and institutions can upload datasets, supported by the technology company Digital Science. 
  • Google Dataset Search – A tool for searching datasets across the web.
  • NIH Data Sharing Resources- This page links to open NIH-supported domain-specific repositories, other NIH-supported domain-specific resources, and generalist repositories.
  • Re3Data – The Registry of Research Data Repositories (Re3Data) is a portal created by the non-profit organization DataCite.  The portal covers data registries from across many academic disciplines, and users can search by keyword or browse repositories by subject.

 

Data Publications

  • Data Papers and Data Journals –  A brief guide on data publications from Oregon State University that gives background on the rise of data papers, defines data papers, explains peer review for data papers, and discusses how the data from data papers may be stored.
  • DataShare Wiki – Sources of dataset peer review - A list of peer-reviewed data journals compiled by the University of Edinburgh.
  • GigaScience  - An open access, open data, open peer-review journal from Oxford University Press focusing on “big data” research from the life and biomedical sciences.
  • Scientific Data - Scientific Data is a peer-reviewed, open-access journal from Springer Nature that publishes descriptions of scientifically valuable datasets and research that advances the sharing and reuse of scientific data.

 

Using PubMed

The new version of PubMed has a filter option on the results page (under "Article Attributes") that allows users to filter results to locate articles with “Associated Data.”  Open the full PubMed record for the result, and view the "Associated Data" section to find links to data resources. These data links may be to records in other National Library of Medicine databases (e.g., GenBank or ClinicalTrials.gov) or external data repositories (e.g., Figshare, Dryad).

 

Data Learning Resources

Find online and print resources related to key data science concepts, such as the data lifecycle, data management, data analysis, and data visualization.

 

Data Lifecycle

  • Data Life Cycle at DataONE – Data Observation Network for Earth (DataONE) offers a data lifecycle chart with links to best practices for each of the eight lifecycle stages (plan, collect, assure, describe, preserve, discover, integrate, and analyze).
  • Data Life Cycles – The Network of the National Library of Medicine (NNLM) offers a collection of links related to the research data lifecycle.

 

Data Management

 

Data Analysis and Visualization

  • Data Analysis and VisualisationThis guide from the University of Sydney offers guidance and tips for creating a visualization and a toolkit listing analysis and visualization tools and training resources.
  • Data Visualization – Use this guide from George Mason University to identify best practices for data visualizations, data visualization tools, and links to tips and tutorials related to data visualization.
  • Getting Started in Data AnalysisThis guide from Princeton University Library offers links to tutorials, videos, articles, and other resources to learn data analysis techniques in Excel and other common data analysis software.

 

General Data Education Resources

  • BD2K Guide to the Fundamentals of Data Science – A series of recorded webinars (2016-2018) on YouTube about data science concepts underlying modern biomedical research. These webinars are part of the Big Data to Knowledge (BD2K) initiative.
  • DataONE Data Management Skillbuilding Hub - Data Observation Network for Earth (DataONE) offers a series of open educational resources, including lessons, best practices, and videos, on data management, data sharing, data management planning, data entry and manipulation, data quality control and assurance, protecting your data, metadata, data citation, and more. 
  • NCI Data Science Learning Exchange Learning Resources for Beginners - Find a list of resources for learning scripting and programming languages, Git/GitHub, data visualization, study groups and special interest groups to join, general tutorials and overviews, and NIH Listservs to join related to data science. Resources are also available for Intermediate and Advanced learners.
  • NIH Data Science Training Resources - Links to data science training resources from NIAID, The National Center for Biotechnology Information (NCBI) at the National Library of Medicine, The National Institute of General Medical Sciences, and more.

  • NNLM RD3: Resources for Data-Driven Discovery – The Network of the National Library of Medicine provides a portal for learning about a broad range of data science topics, including resources and best practices for managing, storing, and sharing data.  Use the Data Thesaurus to look up common data science terms.

 

Print and E-Books about Data Science

 

NCI Data Resources

Find cancer-related data (genomics, imaging, proteomics, cancer occurrence statistics, etc.) collected and shared by the National Cancer Institute.  Also check the Sharing Tools and Resources (STAR) page for data sharing and analysis tools developed by the National Cancer Institute at Frederick (NCI-F) and Frederick National Laboratory (FNL) researchers.

 

Data Ecosystem Overview and Resource List

 

Repositories and Data Portals

 

 

 



Date Last Updated: 10/14/2020