Published August 30, 2023 | Version 0.0.1
Dataset Open

Brazilian Secondary School Exam (ENEM) Questions Dataset

  • 1. Federal University of Campina Grande

Description

Dataset description

Dataset extracted from the tests available on the INEP website.

- https://www.gov.br/inep/pt-br/areas-de-atuacao/avaliacao-e-exames-educacionais/enem/provas-e-gabaritos

The available data were obtained from tests in PDF and scripts were used to read and process the files to extract the texts related to the questions and alternatives.

The extraction was made for enem tests from the years 2010 to 2022

 

Description of the dataframe.

Columns:

- description (str): containing the text relative to the question

- alternatives (list[str]): list of string containing the alternatives to the question

- year (int): year of the application of the question

- subject (str): the area of subject [Linguagens, códigos e suas tecnologias; Ciências humanas e suas tecnologias; Ciências da natureza e suas tecnologias; Matemática e suas tecnologias]

- ground_truth (str): correct alternative of the question

 

Code used

All the code used to process the data can be found at

- https://github.com/wineone/tcc-matheus-lisboa

Files

enem_questions.zip

Files (412.0 kB)

Name Size Download all
md5:0389950866be9f8466e2016dcad311be
412.0 kB Preview Download