Published March 28, 2024 | Version v1.0
Dataset Open

iRead4Skills - Basic Lexicons per Complexity Level

Description

The iRead4Skills Basic lexicons per Complexity Level consists of three basic lexicons per complexity level for French, Spanish, and Portuguese, provided in .xlsx format. These lexicons were compiled under the scope of the project iReadSkills – Intelligent Reading Improvement System for Fundamental and Transversal Skills Development, funded by the European Commission (grant number: 1010094837). The project aims to enhance reading skills within the adult population by creating an intelligent system that assesses text complexity and recommends suitable reading materials to adults with low literacy skills, contributing to reducing skills gaps and facilitating access to information and culture (https://iread4skills.com/).

Each lexicon covers the complexity levels deemed relevant for the project - Very Easy (approximately A1), Easy (approximately A2), and Plain (approximately  B1) -, and will contribute to the complexity analysis systems for the three languages of the project: French, Portuguese, and Spanish. The data files are accompanied by a description of the data. The baselines for each lexicon definition can be consulted here: iRead4Skills - Baselines for complexity lexicons definition (https://doi.org/10.5281/zenodo.10069793)

 

French lexicon: 10103 entries 

Portuguese lexicon:  2 729 entries

Spanish lexicon: 3 033 entries 

Files

iRead4Skills_D3.6 Basic lexicons per complexity levels.zip

Files (544.3 kB)

Additional details

Related works

Is supplement to
Project deliverable: 10.5281/zenodo.10069793 (DOI)

Funding

HORIZON-CL2-2022-TRANSFORMATIONS-01-07 – Conditions for the successful development of skills matched to needs 1010094837
European Commission