The following sites have come to our attention through various sources; their inclusion in this section does not indicate endorsement by Scientific Library staff. These sites include online and print resources to guide researchers in managing research data through data management plans (DMPs) and data standards, locating research datasets for reuse, and educational resources to learn about the research data lifecycle, data management, data analysis, and data visualization. A list of cancer-related data resources related to NCI research is also included.
Data Sharing Policies and Data Management Plans
Data Learning Resources
Data Analysis and Visualization
General Data Education Resources
Print and E-Books about Data Science
NCI Data Resources
Data Ecosystem Overview and Resource List
Repositories and Data Portals
Find resources to identify research funder requirements related to data sharing, and use checklists and templates to create data management plans (DMPs) to meet funder requirements.
Locate resources for standardization of data formatting and metadata related to datasets.
Find datasets for reference and reuse through data repositories, data publications, and using PubMed.
The new version of PubMed has a filter option on the results page (under "Article Attributes") that allows users to filter results to locate articles with “Associated Data.” Open the full PubMed record for the result, and view the "Associated Data" section to find links to data resources. These data links may be to records in other National Library of Medicine databases (e.g., GenBank or ClinicalTrials.gov) or external data repositories (e.g., Figshare, Dryad).
Find online and print resources related to key data science concepts, such as the data lifecycle, data management, data analysis, and data visualization.
NIH Data Science Training Resources - Links to data science training resources from NIAID, The National Center for Biotechnology Information (NCBI) at the National Library of Medicine, The National Institute of General Medical Sciences, and more.
Find cancer-related data (genomics, imaging, proteomics, cancer occurrence statistics, etc.) collected and shared by the National Cancer Institute. Also check the Sharing Tools and Resources (STAR) page for data sharing and analysis tools developed by the National Cancer Institute at Frederick (NCI-F) and Frederick National Laboratory (FNL) researchers.
NCI Data Catalog - A listing of data collections produced by major NCI initiatives and other widely used data sets.
Cancer Research Data Commons (CRDC) - The CRDC is a virtual, expandable infrastructure that provides access to data from NCI programs such as The Cancer Genome Atlas (TCGA) and its pediatric counterpart, Therapeutically Applicable Research to Generate Effective Treatments (TARGET), and The Clinical Proteomics Tumor Analysis Consortium (CPTAC), through the Genomic Data Commons (GDC), NCI Cloud Resources, and the Proteomic Data Commons (PDC).