Published April 26, 2023 | Version v1
Conference paper Open

Towards Intelligent Log Parsing: Developing a Taxonomy for Advanced Parsing Techniques

Creators

Description

This replication package has been prepared to support further investigation and validation of our study on log parsing errors. It contains detailed information about log parsing results and the analysis of log characteristics for each log parser evaluated in our research. The package is organized into separate zipped folders. Inside the parsing results folder, the ground truth for each dataset is also provided for reference.

The package is organized as follows:

  1. Parsing results: There are separate folders for each log parsing tool, such as ULP, Drain, etc. Inside each folder, you will find individual CSV files containing the parsing results for each dataset analyzed in our study. These files are named as "Android_2k.log_structured.csv", "Apache_2k.log_structured.csv", and so on.

  2. Analysis of log characteristics: In addition to the log parsing results, the package includes an analysis of log characteristics for each log parser. There are separate folders for the charactertistics analysis for each log parsing tool,  Inside each folder (for a specific log parser) , you will find individual CSV files containing the result of the analysis. In these files, a '1' is placed in front of the characteristic if it is found in the corresponding log event.This analysis provides a comprehensive examination of the factors affecting the parsing process, which can be valuable for researchers and practitioners interested in improving log parsing techniques.

Please note that the analysis results have been double-checked by three researchers to ensure accuracy and reliability of the findings.

To use this replication package, follow these steps:

  1. Download the package from Zenodo and extract it to a desired location on your local machine.

  2. Navigate to the folder corresponding to the log parsing tool of interest.

  3. Open the CSV files containing the parsing results for individual datasets and examine the data according to your research objectives.

  4. Review the analysis of log characteristics provided for each log parser to gain insights into the factors affecting the parsing process.

By providing this replication package, we aim to promote transparency, reproducibility, and further advancements in log parsing research. We encourage researchers and practitioners to utilize this package to validate our findings, build upon our work, and contribute to the development of more efficient and accurate log parsing techniques.

Files

Characteristics_analysis.zip

Files (9.0 MB)

Name Size Download all
md5:f0e8abb0045a1adff6324a57cdbcbac0
2.7 MB Preview Download
md5:6b489ae0ed151e706ae09fb76ae5287e
6.3 MB Preview Download