Published September 24, 2024 | Version v1
Conference paper Open

Leveraging large language models for automated knowledge graphs generation in non-destructive testing

Description

This paper presents an innovative approach for the automatic generation of Knowledge Graphs (KGs) from heterogeneous scientific articles in the domain of Non-Destructive Testing (NDT) applied to building materials.


Our methodology leverages large language models (LLMs) to extract and semantically relate concepts from diverse sources. We developed material-specific agents for concrete, wood, steel, and bricks, each equipped with a curated glossary of terms to ensure domain accuracy. These agents process PDF documents, extracting relevant
information on deterioration mechanisms, physical changes, and applicable NDT methods. The extracted data is then normalized, validated, and structured into a Neo4j graph database, forming a comprehensive KG. Our results demonstrate the system’s ability to automatically discover and represent intricate relationships between materials, deterioration mechanisms, physical changes, and NDT techniques. The generated KG successfully captures complex interactions, such as the applicability of specific NDT methods to various materials under different deterioration conditions. This work not only highlights the potential of KGs in enhancing knowledge
discovery and representation in NDT research but also provides a scalable framework for extending this approach to other scientific domains.

Files

Leveraging large language models .....pdf

Files (672.3 kB)

Name Size Download all
md5:d1789d2f8f0febc3f7362fe1e51dd8f5
672.3 kB Preview Download

Additional details

Funding

European Commission
Reincarnate - Reincarnation of construction products and materials by slowing down and extending cycles 101056773

Dates

Accepted
2024-09-24