Leveraging large language models for automated knowledge graphs generation in non-destructive testing
Description
This paper presents an innovative approach for the automatic generation of Knowledge Graphs (KGs) from heterogeneous scientific articles in the domain of Non-Destructive Testing (NDT) applied to building materials. Our methodology leverages large language models (LLMs) to extract and semantically relate concepts from diverse sources. We developed material-specific agents for concrete, wood, steel, and bricks, each equipped with a curated glossary of terms to ensure domain accuracy. These agents process PDF documents, extracting relevant information on deterioration mechanisms, physical changes, and applicable NDT methods. The extracted data is then normalized, validated, and structured into a Neo4j graph database, forming a comprehensive KG. Our results demonstrate the system’s ability to automatically discover and represent intricate relationships between materials, deterioration mechanisms, physical changes, and NDT techniques. The generated KG successfully captures complex interactions, such as the applicability of specific NDT methods to various materials under different deterioration conditions. This work not only highlights the potential of KGs in enhancing knowledge discovery and representation in NDT research but also provides a scalable framework for extending this approach to other scientific domains.
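As a rough illustration of the normalize, validate, and structure steps described above, the following Python sketch maps raw extracted triples onto a glossary and emits parameterized Cypher `MERGE` statements. It is not the authors' implementation: the glossary entries, the relation names (`UNDERGOES`, `CAUSES`, `DETECTED_BY`), the `Entity` label, and all function names are assumptions made for this example, and the LLM extraction step is replaced by a hand-written list of triples.

```python
from dataclasses import dataclass

# Hypothetical glossary for a "concrete" agent: surface form -> canonical term.
CONCRETE_GLOSSARY = {
    "rebar corrosion": "Corrosion",
    "corrosion": "Corrosion",
    "cracking": "Cracking",
    "cracks": "Cracking",
    "gpr": "Ground Penetrating Radar",
    "ground penetrating radar": "Ground Penetrating Radar",
}

# Hypothetical relation whitelist; the paper does not list its schema here.
ALLOWED_RELATIONS = {"UNDERGOES", "CAUSES", "DETECTED_BY"}


@dataclass(frozen=True)
class Triple:
    subject: str
    relation: str
    obj: str


def resolve(term, material, glossary):
    """Return the canonical form of a term, or None if it is unknown."""
    key = " ".join(term.lower().split())  # lowercase and collapse whitespace
    if key == material.lower():
        return material
    return glossary.get(key)


def validate(raw_triples, material, glossary):
    """Keep triples with an allowed relation and resolvable terms, without duplicates."""
    seen, result = set(), []
    for subj, rel, obj in raw_triples:
        s = resolve(subj, material, glossary)
        o = resolve(obj, material, glossary)
        if rel not in ALLOWED_RELATIONS or s is None or o is None:
            continue  # drop anything the glossary or schema cannot account for
        triple = Triple(s, rel, o)
        if triple not in seen:
            seen.add(triple)
            result.append(triple)
    return result


def to_cypher(triple):
    """Build a MERGE query and its parameters for one validated triple.

    The relation name is interpolated into the query text, which is only
    acceptable here because validate() has already checked it against
    ALLOWED_RELATIONS.
    """
    query = (
        "MERGE (a:Entity {name: $subject}) "
        "MERGE (b:Entity {name: $object}) "
        f"MERGE (a)-[:{triple.relation}]->(b)"
    )
    return query, {"subject": triple.subject, "object": triple.obj}


# Stand-in for LLM output extracted from a PDF (hand-written for this sketch).
raw = [
    ("Concrete", "UNDERGOES", "rebar  corrosion"),
    ("Corrosion", "CAUSES", "cracks"),
    ("cracks", "DETECTED_BY", "GPR"),
    ("cracks", "DETECTED_BY", "gpr"),      # duplicate after normalization
    ("Concrete", "LIKES", "cracks"),       # relation not in the whitelist
    ("Concrete", "UNDERGOES", "spalling"), # term missing from the glossary
]

triples = validate(raw, "Concrete", CONCRETE_GLOSSARY)
statements = [to_cypher(t) for t in triples]
```

Each `(query, params)` pair in `statements` could then be passed to a Neo4j session. Using `MERGE` rather than `CREATE` means that re-processing the same article would not duplicate nodes or relationships.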
Files
| Name | Size | Checksum |
|---|---|---|
| Leveraging large language models .....pdf | 672.3 kB | md5:d1789d2f8f0febc3f7362fe1e51dd8f5 |
Additional details
Dates
- Accepted: 2024-09-24