CENTER FOR CLINICAL OBSERVATIONAL INVESTIGATIONS

The National Library of Medicine (NLM) provides information about clinical datasets. Inclusion on an NLM Web page does not imply endorsement of, or agreement with, the contents by NLM or the National Institutes of Health. Learn more: NLM Web Policies (nih.gov).

Datasets

Dataset List

NIH

logo for All of Us The All of Us Research Program is a large-scale United States based research program that began nationwide enrollment in May 2018 and intends to recruit more than one million participants. All of Us integrates electronic health records (EHR) data with survey questionnaires, genomic data and wearable device data to develop a diverse, information rich database that serves as a central point for many secondary research studies and reduce the need for developing individual single use study specific data collection protocols. The program includes two tiers of data access the Registered tier and the more restricted Controlled tier.

thumbnail graphic for AgingResearchBiobank AgingResearchBiobank is a United Sates based data and biospecimen repository sharing clinical and phenotypic data (images included) and their corresponding biospecimens of various longitudinal and clinical studies on aging. These studies have been funded by the National Institute on Aging (NIA) at the National Institutes of Health (NIH) and include both completed and ongoing studies.

thumbnail graphic for N3C The National Clinical Cohort Collaborative (N3C) COVID Enclave developed by National Center for Advancing Translational Sciences (NCATS) systematically and regularly collects data derived from the electronic health records of people who tested positive for COVID-19 or had related symptoms. In addition, data on non-COVID patients, controls, are collected for a case/control ratio of 1/2. The collected data are then harmonized into single OMOP Common Data Model format and made available for comparative studies. The data set includes such information as demographics, symptoms, lab test results, procedures, medications, medical conditions, physical measurements and more. These profiles used data from Release 184 and counts less than 20 were suppressed.

thumbnail graphic for SEER The Surveillance, Epidemiology, and End Results (SEER) Program provides information on cancer statistics to inform efforts to reduce the cancer burden among the U.S. population. Supported by the Surveillance Research Program (SRP) in NCI's Division of Cancer Control and Population Sciences (DCCPS), SEER offers access to Research Data through a downloadable application called SEER*Stat. The information provided here reflects the Research Data available in SEER*Stat that includes registries with data from multiple years ranging from 1975 to 2021.

non-NIH

thumbnail graphic for CMS VRDC The Centers for Medicare & Medicaid Services (CMS) Virtual Research Data Center (VRDC) collection contains populated claim forms and administrative meta data describing individual providers, facilities, patients, care plans and transactions known to CMS. The data is sourced from Medicare, Medicaid, Child Health Insurance Program (CHIP) and Social Security Disability Insurance (SSDI) encounters among others.

thumbnail graphic for CPRD AURUM Clinical Practice Research Datalink (CPRD) is a research service provided by the Medicines and Healthcare products Regulatory Agency with support from the National Institute for Health and Care Research (NIHR), as part of the United Kingdom (UK) Department of Health and Social Care, CPRD includes de-identified electronic health record patient-level data from a network of general practitioners from the UK.

thumbnail graphic for NAMCS The National Ambulatory Medical Care Survey (NAMCS) is an ongoing national survey of ambulatory medical services provided to patients, conducted by the CDC National Center for Health Statistics. The Health Center Component collects electronic health record (EHR) data from federally qualified health centers (FQHCs) and FQHC Look-Alikes (LALs) that serve areas and groups with limited access to medical care.

thumbnail graphic for NEMSIS The National Emergency Medical Services Information System (NEMSIS) is a US-based national EMS database recognized as the universal resource for patient-care information resulting from 911 emergency calls.

Specifically, NEMSIS provides the framework for collecting, storing, and sharing standardized EMS data from States nationwide. These EMS data have been used for epidemiology and research studies as well as by local, state, and national EMS stakeholders to more accurately assess EMS needs and performance, as well as support better strategic planning for the EMS systems. In addition, data from NEMSIS is also used to help benchmark performance, determine the effectiveness of clinical interventions, and facilitate cost-benefit analyses.

thumbnail graphic for PedsNET PEDSnet is a learning health system (LHS) network comprised of multiple children's hospitals with a goal to accelerate the study of child health. Each site contributes electronic health record (EHR) data and collaborates with the PEDSnet Coordinating Center to continually assess and strengthen data infrastructure to support patient-centered outcomes research, development of new methods using real-world data, pragmatic clinical trials, comparative effectiveness research, population surveillance, and health services research.

thumbnail graphic for UK Biobank The UK Biobank program is a large health and biomedical database that serves multiple retrospective, observational studies. UK Biobank includes data from over half a million participants between the ages of 40 and 69 from the United Kingdom. UK Biobank contains a combination of health, questionnaire and genetic data that is regularly updated and enriched with new data fields.

Last updated Dec 12, 2024