News Research Operational Efficiency Clinical Trials

Can AI-Extracted EHR Data Be Trusted? The VALID Framework Takes Aim at a Growing Problem

April 08, 2026 By Meg Barbor, MPH 5 min read
Share Share via Email Share on Facebook Share on LinkedIn Share on Twitter

5 Key Takeaways

  • 1

    The VALID framework was proposed to ensure the accuracy and reliability of LLM-/ML-extracted clinical data from electronic health records.

  • 2

    VALID employs a three-pronged approach, including variable-level performance metrics, verification checks, and replication analyses.

  • 3

    The framework assesses bias and variability in EHR systems to ensure equitable conclusions from real-world data.

  • 4

    Small errors in data extraction do not always affect clinical outcomes, as demonstrated by high agreement rates between LLM and human curation.

  • 5

    The VALID framework aims to standardize validation processes for AI-extracted data, emphasizing transparency and clinical relevance.

ASCO AI in Oncology is published by Conexiant under a license arrangement with the American Society of Clinical Oncology, Inc. (ASCO®). The ideas and opinions expressed in ASCO AI in Oncology do not necessarily reflect those of Conexiant or ASCO. For more information, see Policies.

KOL Commentary
Watch

Related Content