Dialogue and Argumentation for Cultural Literacy Learning in Schools: Multilingual Data Corpus
Creators
- Rapanta, Chrysi1
- Cascalheira, Dilar1
- Gil, Beatriz1
- Gonçalves, Cláudia1
- Garcia, D'Jamila1
- Morais, Rita1
- Pereira, João Rui1
- Čermáková, Anna2
- Maine, Fiona2
- Peck, Julia2
- Brummernhenrich, Benjamin3
- Jucks, Regina3
- Petronytė, Miglė4
- Valančienė, Daina4
- Juskiene, Vaiva4
- Badaukienė, Ramunė4
- Eigminienė, Dainora4
- Stonkuviene, Irena4
- Zaleskienė, Irena4
- Garcia-Mila, Mercè5
- Remesal, Ana5
- Castells, Nuria5
- Gilabert, Sandra5
- Miralda-Banda, Andrea5
- Luna, Jose5
- Vrikki, Maria6
- Evagorou, Maria6
- Chatzianastasi, Maria6
- Karousiou, Christiana6
- Papanastasiou, Elena6
- Stylianou-Georgiou, Agni6
- Rodosthenous, Marina6
- Talli, Cedar7
- Cohen, Irit7
- Shalom Greenberg, Chaim7
- Bar, Noa7
- Sarfati, Neta7
- Schwarz, Baruch7
- 1. Universidade Nova de Lisboa, Portugal
- 2. University of Cambridge, UK
- 3. Westfälische Wilhelms-Universität Münster, Germany
- 4. Vilniaus Universitetas, Lithuania
- 5. Universitat de Barcelona, Spain
- 6. University of Nicosia, Cyprus
- 7. The Hebrew University of Jerusalem, Israel
Description
This dataset is the Multilingual Corpus of the DIALLS (DIalogue and Argumentation for Literacy Learning in Schools) project (dialls2020.eu/) and consists of a set of transcripts of classroom interactions of students from ages 5 to 15 years old. These classroom interactions took place in seven DIALLS participant countries (UK, Portugal, Germany, Lithuania, Spain, Cyprus, and Israel).
The corpus is a set of 202 transcripts in the participant countries’ native language (English, Portuguese, German, Lithuanian, Catalan, Cypriot Greek, and Hebrew). The transcripts in each native language range from a maximum of 35 for Hebrew (more than 17% of the overall corpus) to a minimum of 19 transcripts for Cypriot Greek (10% of the corpus). More than 50% of the transcripts in a language different from English (90 transcripts) have associated their English translation. The topic of the project is cultural literacy through dialogue and argumentation in school children.
The Multilingual Corpus is relevant to the following areas of research: Educational dialogue, Citizenship education, Argumentation and learning, Multimodal literacy, Dialogic teaching, Dialogue/discourse analysis, Arts-based education, Cultural studies, Teacher professional development and communities.
The dataset is organised in two main sections: an Excel file, and a zip folder with .csv files matching the excel file. A description of the corpus and further information on the dataset can be accessed in a .pdf file.
Files
1.DIALLS_Multilingual_Corpus_description.V2.pdf
Additional details
Related works
- Cites
- Project deliverable: https://dialls2020.eu/wp-content/uploads/2019/06/D3.1-final.pdf (URL)
- Project deliverable: https://dialls2020.eu/wp-content/uploads/2019/01/D2.3_final-submitted.pdf (URL)