Published May 20, 2026 | Version 1.5.0
Software Open

Langextract v1.5.0

Authors/Creators

  • 1. Google LLC

Description

LangExtract is a Python library that uses Large Language Models (LLMs) to extract structured information from unstructured text documents based on user-defined instructions. It processes materials such as clinical notes or reports, identifying and organizing key details while ensuring the extracted data corresponds to the source text through precise source grounding and visual highlighting.

Notes

This is the initial Zenodo deposit for LangExtract to establish the concept DOI.

Files

Files (276.9 kB)

Name Size Download all
md5:2cd33651445c3c5774383499132cfc19
142.1 kB Download
md5:032b96b68d15033ae60c5ea914eb89b0
134.8 kB Download

Additional details

Related works