mkurman's Collections

Medical Pre-Training Datasets

updated Aug 23, 2025

A collection of medical datasets suitable for LLM pretraining.

  • openmed-community/TheBlueScrubs-v1-fixed

    Viewer • Updated Aug 29, 2025 • 11.1M • 270 • 12

  • mkurman/hindawi-journals-2007-2023

    Viewer • Updated Jun 9, 2025 • 298k • 644 • 5

  • epfl-llm/guidelines

    Viewer • Updated Mar 7, 2024 • 38k • 1.33k • 141

  • ncbi/Open-Patients

    Viewer • Updated May 11, 2025 • 180k • 306 • 23

  • AGBonnet/augmented-clinical-notes

    Viewer • Updated Jan 24, 2024 • 30k • 824 • 59

  • harishnair04/mtsamples

    Viewer • Updated Nov 7, 2024 • 5k • 191 • 1

  • Tonic/Health-Bench-Eval-OSS-2025-07

    Viewer • Updated May 17, 2025 • 9.67k • 326 • 2

  • zeroshot/arxiv-biology

    Viewer • Updated Jan 5, 2023 • 1.28k • 206 • 14