🚀 We're looking for ML Engineers and Medical Reviewers! Join the OpenPHR Mission →
Back to Marketplace
Dataset

UK Biobank

Genomics / EHR / Imaging General Restricted Access / Application Required De-identified
N/A GitHub Stars
N/A Open Issues
N/A Docker Support
N/A Last Updated

Technical Summary

The UK Biobank is a large-scale biomedical database and research resource, containing in-depth genetic and health information from half a million UK participants.

Key Capabilities

  • Unprecedented Scale and Depth: It provides whole-exome and whole-genome sequencing (WGS) data linked directly to detailed electronic health records (EHR), medical imaging (MRI, DEXA), and longitudinal lifestyle surveys for 500,000 individuals over more than a decade.
  • Multi-Modal AI Training: The combination of genetics, imaging, and clinical outcomes makes it the ultimate dataset for training multi-modal AI models capable of complex disease risk prediction.
  • Phenome-Wide Association Studies (PheWAS): Allows researchers to seamlessly query whether a specific genetic variant is correlated with literally thousands of different diseases or traits across a massive population.

Usage in Healthcare

UK Biobank is the cornerstone of modern precision medicine research. Pharmaceutical companies use it to validate new drug targets (by showing that individuals with natural mutations in a target gene have lower disease risk and no adverse side effects). AI researchers use it to build polygenic risk scores (PRS) that can predict an individual’s lifetime risk of developing diseases like breast cancer or heart disease based on their DNA.