Clinical decision support based on Bevacizumab cancer trials and pushing the limitations of advanced LLMs
Description
An exhaustive study was needed to test the limits of leading Large Language Models (LLMs) using numerous real-world clinical trial outcomes. It was also necessary to provide extensive hallucination studies based on both extracting main points and providing novel AI clinical decision support. Here, 100 Bevacizumab cancer therapy articles representing over 900K words were summarized by the 3.5 Sonnet model into 49K words, which completed a detailed and complex problem of several cancer and study types to press the capabilities of the ChatGPT o1 reasoning model. Report summaries in general followed an effective format, with guardrails to de-identify patient information and numerical data sources attributions to ground the output to the input. The main takeaway was that both LLMs typically remained in-context with the input data when more structured prompts were used, while precise quotations and author name citations were less prominent. These errors were likely due to LLM pressure towards achieving coherence vs. exact recall based on manufacturer inference-time compute settings. Overall, ChatGPT o1 provided state-of-the-art evidence-based Bevacizumab insights regarding clinical efficacy across indications, dosing recommendations, combination therapies, and biomarker-driven selections.
Files
Clinical decision support based on Bevacizumab cancer trials and pushing the limitations of advanced LLMs.pdf
Files
(5.8 MB)
Name | Size | Download all |
---|---|---|
md5:d70b838a22cdc8627263ff01761942c7
|
2.1 MB | Preview Download |
md5:bd753885693b31f167a3c2634aba1abd
|
335.0 kB | Preview Download |
md5:3c88e84fa080b9315ea975659ad6576c
|
2.4 MB | Preview Download |
md5:d5d81ab6302032375d7effe712efb47d
|
165.9 kB | Preview Download |
md5:74cd95e42551754a50aee067d4adfcc6
|
811.2 kB | Preview Download |
Additional details
Dates
- Created
-
2025-01-27