Published December 9, 2025 | Version 2.0.0
Dataset Open

AI-augmented Cybersecurity Requirements Generation using LLMs | Reproducible Research Package

  • 1. ROR icon Universidad Politécnica de Madrid

Description

AI-augmented Cybersecurity Requirements Generation using LLMs | Reproducible Research Package

This repository accompanies the paper “Experimental Evaluation of AI-Augmented Cybersecurity Requirements Generation Leveraging LLMs’ Capabilities” (10.1109/ACCESS.2026.3658339). It contains every script, dataset, prompt template and result needed to fully reproduce our empirical study.

Research Description

This project investigates the practical use of state‑of‑the‑art Large Language Models (LLMs) to transform high‑level, standard‑driven cyber‑security controls into concrete, system‑specific requirements. Using a synthetic yet industrially plausible case study—AI4I4, an IoT‑enabled automotive logistics platform—we benchmark thirteen frontier models (GPT‑4, LLaMa 3, Mistral, QWen, etc.), representing tge state of the art as of September 2024, across four prompting pipelines and three temperature regimes.

Key contributions include:

1. Annotated benchmark of 54 ISO‑27002 clauses with placeholder semantics suitable for automatic instantiation.
2. LangChain pipelines that decompose the task into applicability filtering, domain‑element search, requirement generation, and JSON formatting.
3. Comprehensive evaluation of accuracy (precision, recall, F2), creativity (F2‑synthetic), and consistency (Jaccard overlap across runs).
4. Prompt library enumerating >180 templates, showing how subtle changes in instruction design affect hallucination rate and coverage.

The artefacts and scripts below allow full replication—from raw prompts to final figures—on any infrastructure with access to the referenced models.
 
For more information on the repository structure, reproducibility, licensing, and contact details, please refer to the README.

Files

ai_requirements_generation_rr.zip

Files (1.2 MB)

Name Size Download all
md5:cf26b1677a1e71564bc2a334f3747bbc
1.2 MB Preview Download

Additional details

Related works

Is supplement to
Journal article: 10.1109/ACCESS.2026.3658339 (DOI)

Funding

Ministerio de Ciencia, Innovación y Universidades
PRESECREL Modelos y plataformas para sistema informáticos industriales predecibles, seguros y confiables PID2021-124502OB-C43

Dates

Available
2025-08-10
Initial public release

Software

Repository URL
https://github.com/STRAST-UPM/ai_requirements_generation_rr
Programming language
Python
Development Status
Active