US NTSB Aviation Accident and Incident Final Reports Dataset (2016–2023)
Authors/Creators
Description
This dataset consolidates information from final reports on aviation accidents and incidents occurring between 2016 and 2023, as provided by the U.S. National Transportation Safety Board (NTSB) and available through the NTSB CAROL platform as of December 24, 2024. It covers 7,462 individual occurrences and integrates both structured and unstructured data extracted from the official reports.
The dataset includes detailed data for each occurrence, such as event identification (e.g., NtsbNo, EventID, ReportNo), occurrence details (EventDate, City, State, Country, Latitude, Longitude), aircraft information (Make, Model, AirCraftCategory, NumberOfEngines, EngineType), flight and operator data (Operator, PurposeOfFlight, Scheduled, FAR), injury and damage statistics (FatalInjuryCount, SeriousInjuryCount, MinorInjuryCount, OnGroundInjuryCount, AirCraftDamage), environmental conditions (WeatherCondition), and investigation-related information (ProbableCause, Findings, BroadPhaseofFlight, ReportStatus, ReportUrl, DocketUrl). The dataset also include full text content of the associated reports, extracted from the original PDF files (rep_text).
This dataset is intended to support research on aviation safety, risk analysis, and natural language processing (NLP) applications.
Files
final_reports_2016-23_cons_2024-12-24.csv
Files
(103.7 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:c0c39953a9a4cbb0438803db3b7d716c
|
86.0 MB | Preview Download |
|
md5:22116221c2fd0415b575ae98fa17e744
|
17.7 MB | Download |