Nutrition Science Data and Biospecimen Resources Portal

The National Institutes of Health (NIH) Office of Nutrition Research is committed to advancing nutrition science research through promoting access to publicly available datasets, biospecimens, and data analysis tools and resources. In each of the topic areas, the list below provides information and links to resources that may be of interest to nutrition science researchers.

Common Data Elements and Measures

Data Analysis Tools and Resources

Repositories for Data and Biospecimens

Research Study Query and Reporting Tools


Common Data Elements and Measures:

National Collaborative on Childhood Obesity Research (NCCOR) Measures Registry  The Measures Registry is a searchable database of diet and physical activity measures relevant to childhood obesity research.  Its purpose is to standardize use of common measures and research methods across childhood obesity research at the individual, community, and population levels.  Measures are tools and methodologies used to assess individuals’ diet, physical activity, and the environments in which these behaviors occur.  Examples of measures include questionnaires, instruments, diaries, logs, electronic devices, direct observations of people or environments, protocols, GIS and analytic techniques.

NIH Common Data Elements Repository  The NIH Common Data Elements (CDE) Repository has been designed to provide access to structured human and machine-readable definitions of data elements that have been recommended or required by NIH Institutes and Centers and other organizations for use in research and for other purposes.

PhenX Toolkit  The PhenX Toolkit (consensus measures for Phenotypes and eXposures) provides recommended standard data collection protocols for conducting biomedical research. The protocols are selected by Working Groups of domain experts using a consensus process, which includes the scientific community. The Toolkit provides detailed protocols for collecting data and tools to help investigators incorporate these protocols into their studies. Using protocols from the PhenX Toolkit facilitates cross-study analysis, potentially increasing the scientific impact of individual studies.

Data Analysis Tools and Resources:

All of Us Researcher Workbench  The Researcher Workbench platform and its suite of custom tools are available to approved researchers. The Researcher Workbench provides access to Registered Tier data.  Its powerful tools support data analysis and collaboration. The workbench also provides integrated help and educational resources through the Workbench User Support Hub.

Behavioral Risk Factor Surveillance System (BRFSS)  The Behavioral Risk Factor Surveillance System (BRFSS) is the nation’s premier system of health-related telephone surveys that collect state data about U.S. residents regarding their health-related risk behaviors, chronic health conditions, and use of preventive services. Established in 1984 with 15 states, BRFSS now collects data in all 50 states as well as the District of Columbia and three U.S. territories. BRFSS completes more than 400,000 adult interviews each year, making it the largest continuously conducted health survey system in the world.

COMETS Analytics  COMETS Analytics is a cloud-based, self-serviced analytic platform developed for consortium-based metabolomics analyses that can be applied to large-scale studies, including those focusing on diet and nutrition. The software was specifically designed to simplify meta-analysis at the consortia level. Users first prepare a standardized data input, which includes clear definitions for covariables and models that can be applied to all cohorts. Next, the software checks the data and model integrity, including automated fixing of model issues (e.g., singularity), ensuring that the models are executable. Finally, COMETS Analytics runs the models and aggregates results in a standardized format. All analyses can be performed on a local server or through our secure web platform.  

The Common Fund Data Ecosystem Portal  The Common Fund (CF) Data Ecosystem (CFDE) is developing an online portal that will allow researchers to access and work across multiple Common Fund (CF) program data sets within a digital cloud environment. CF programs generate a wide range of diverse and valuable data sets designed to be used by the research community to accelerate discovery. The CFDE portal will create a shared resource that helps make CF data sets FAIR (Findable, Accessible, Interoperable, and Reusable) and enables researchers to ask scientific and clinical questions from a single access point.

The Database for Annotation, Visualization and Integrated Discovery (DAVID) The Database for Annotation, Visualization and Integrated Discovery provides a comprehensive set of functional annotation tools for investigators to understand the biological meaning behind large lists of genes. These tools are powered by the comprehensive DAVID Knowledgebase built upon the DAVID Gene concept which pulls together multiple sources of functional annotations.

Metabolomics Workbench  The Metabolomics Workbench serves as a national and international repository for metabolomics data and metadata and provides analysis tools and access to metabolite standards, protocols, tutorials, training, and more.

National Cancer Institute (NCI) Dietary Assessment Research Resources  The National Cancer Institute’s (NCI) dietary assessment research resources provides access to dietary collection resources, data collection tools, food composition databases, and diet analysis tools.

National Cancer Institute (NCI) Resources for Researchers  The National Cancer Institute’s (NCI) Resources for Researchers is a directory of NCI-supported tools and services for cancer researchers. Resources are developed and maintained by NCI scientists or were created with grant funding.  Most resources are free and available to anyone.

National Center for Biotechnology and Information (NCBI)  The National Center for Biotechnology Information (NCBI) advances science and health by providing access to biomedical and genomic information. The NCBI supports several popular data resources including, PubMed and PubMed Central, Bookshelf, BLAST, Nucleotide, Genome, SNP, Gene, Protein, and PubChem.

National Heart, Lung, and Blood Institute BioData Catalyst  NHLBI BioData Catalyst is a cloud-based platform providing tools, applications, and workflows in secure workspaces. By increasing access to NHLBI datasets and innovative data analysis capabilities, BioData Catalyst accelerates efficient biomedical research that drives discovery and scientific advancement, leading to novel diagnostic tools, therapeutics, and prevention strategies for heart, lung, blood, and sleep disorders. BioData Catalyst allows researchers to find, access, share, store, and compute on large scale datasets.

Repositories for Data and Biospecimens:

Database of Genotypes and Phenotypes (dbGaP)  The database of Genotypes and Phenotypes (dbGaP) was developed to archive and distribute the data and results from studies that have investigated the interaction of genotype and phenotype in humans.

Data Sharing for Demographic Research (DSDR)  The Data Sharing for Demographic Research (DSDR) is a data sharing project providing curation and archiving services for the demographic and population sciences community. It focuses on data collected through funding from the NICHD Population Dynamics Branch (PDB), but also provides these services for other data sets that fall within the scientific mission of PDB.

Dietary Supplement Ingredient Database (DSID)  The Dietary Supplement Ingredient Database (DSID) provides estimated levels of ingredients in dietary supplement products sold in the United States. These statistically predicted estimates may differ from labeled amounts and are based on chemical analysis of nationally representative products.  The DSID is intended primarily for research applications. These data are appropriate for use in population studies of nutrient intake rather than for assessing content of individual products.

Eunice Kennedy Shriver National Institute of Child Health and Human Development Data and Specimen Hub (DASH)  The NICHD DASH is a centralized resource that allows researchers to share and access de-identified data from studies funded by NICHD. DASH also serves as a portal for requesting biospecimens from selected studies.

National Cancer Institute (NCI) Cancer Epidemiology Descriptive Cohort Database (CEDCD)  The Cancer Epidemiology Descriptive Cohort Database (CEDCD) contains descriptive information about cohort studies that follow groups of persons over time for cancer incidence, mortality, and other health outcomes. The CEDCD is a searchable database that contains general study information (e.g., eligibility criteria and size), the type of data collected at baseline, cancer sites, number of participants diagnosed with cancer, and biospecimen information. All data included in this database are aggregated for each cohort; there are no individual level data. The goal of the CEDCD is to facilitate collaboration and highlight the opportunities for research within existing cohort studies.

National Collaborative on Childhood Obesity Research (NCCOR) Catalogue of Surveillance Systems  The Catalogue of Surveillance Systems provides one-stop access to over 100 publicly available datasets relevant to childhood obesity research. The datasets profiled in the Catalogue include information on obesity-related health behaviors, outcomes, and determinants as well as policies and environmental factors.

National Heart, Lung, and Blood Institute Biologic Specimen and Data Repository Information Coordinating Center (BioLINCC)  The mission of BioLINCC is to facilitate access and maximize the scientific value of the biorepository and data repositories, and to promote the availability and use of other NHLBI-funded population-based biospecimen and data resources.

National Health and Nutrition Examination Survey (NHANES)  The National Health and Nutrition Examination Survey (NHANES) is a program of studies designed to assess the health and nutritional status of adults and children in the United States. The survey is unique in that it combines interviews and physical examinations. NHANES is a major program of the National Center for Health Statistics (NCHS). NCHS is part of the Centers for Disease Control and Prevention (CDC) and has the responsibility for producing vital and health statistics for the Nation.

National Institute on Aging (NIA) AgingResearchBiobank  The National Institute on Aging (NIA) conducts and funds various longitudinal and clinical studies on aging that have generated a collection of biospecimens and related phenotypic and clinical data. In 2018, NIA established the AgingResearchBiobank to provide a state-of-the-art inventory system for the storage and distribution of these collections to the broader scientific community.

National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) Central Repository  The NIDDK Central Repository stores biosamples, genetic and other data collected in designated NIDDK-funded clinical studies. The purpose of the NIDDK Central Repository is to expand the usefulness of these studies by allowing a wider research community to access data and materials beyond the end of the study.

NIH Office of Dietary Supplements (ODS) Dietary Supplement Label Database (DSLD)  The Dietary Supplement Label Database (DSLD) was developed by the NIH Office of Dietary Supplements (ODS) and catalogs all information printed on labels of dietary supplement products sold in the United States.  The database resulted from specific recommendations to NIH from Congress in 2004 that encouraged the ODS to develop, create, regularly update, maintain, and make available to government and research entities a database of all supplement labels sold in the United States.

United States Department of Agriculture FoodData Central  FoodData Central is an integrated data system that provides expanded nutrient profile data and links to related agricultural and experimental research.

Wide-ranging Data for Epidemiologic Research (WONDER)  The Center for Disease Control and Prevention (CDC) WONDER furthers CDC's mission of health promotion and disease prevention by speeding and simplifying access to public health information for state and local health departments, the Public Health Service, and the academic public health community. CDC WONDER is valuable in public health research, decision making, priority setting, program evaluation, and resource allocation.

Research Study Query and Reporting Tools:

National Center for Biotechnology Information PubMed  PubMed® comprises more than 33 million citations for biomedical literature from MEDLINE, life science journals, and online books. Citations may include links to full text content from PubMed Central and publisher web sites.

National Library of Medicine’s is a database of privately and publicly funded clinical studies conducted around the world. organizes information for each registered study as an integrated unit, displaying the study protocol information and, if available, the corresponding results information on the same page under different tabs.

NIH Research Portfolio Online Reporting Tools (RePORT)  The NIH Research Portfolio Online Reporting Tools (RePORT) website provides access to a variety of reporting tools, reports, data, and analyses of NIH research activities. One of the tools available on the RePORT site is the RePORT Expenditures and Results (RePORTER) module. RePORTER is an electronic tool that allows users to search a repository of NIH-funded research projects and access publications and patents resulting from NIH funding.

This page last reviewed on