{ "access": { "embargo": { "active": false, "reason": null }, "files": "public", "record": "public", "status": "open" }, "created": "2021-06-02T18:52:07.117228+00:00", "custom_fields": {}, "deletion_status": { "is_deleted": false, "status": "P" }, "files": { "count": 4, "enabled": true, "entries": { "Corpus_Ladder_31052021.xml": { "checksum": "md5:25ce59a27da0c516a39415ce94276b86", "ext": "xml", "id": "7a8c3e02-e55b-4c61-9f66-edcff7244d22", "key": "Corpus_Ladder_31052021.xml", "metadata": null, "mimetype": "text/xml", "size": 1224525 }, "Corpus_Ladder_31052021_useLb.xml": { "checksum": "md5:e0aa8b11dfd7b18995751c756600d0ed", "ext": "xml", "id": "459d38fd-1f2a-461f-bac8-246f0e4f159a", "key": "Corpus_Ladder_31052021_useLb.xml", "metadata": null, "mimetype": "text/xml", "size": 1230035 }, "Instant Messaging Corpus.csv": { "checksum": "md5:53deb81b49d4f2cf20a35bcb991df86c", "ext": "csv", "id": "df11079a-ef0b-4f8e-ade6-d640936bc64e", "key": "Instant Messaging Corpus.csv", "metadata": null, "mimetype": "text/csv", "size": 378063 }, "Mail Corpus.csv": { "checksum": "md5:f773cbe8df56f0b67c32bdb9a6ffbcc6", "ext": "csv", "id": "9f7358f6-c361-47c3-a9b0-616238ef17e4", "key": "Mail Corpus.csv", "metadata": null, "mimetype": "text/csv", "size": 145418 } }, "order": [], "total_bytes": 2978041 }, "id": "4895219", "is_draft": false, "is_published": true, "links": { "access": "https://zenodo.org/api/records/4895219/access", "access_links": "https://zenodo.org/api/records/4895219/access/links", "access_request": "https://zenodo.org/api/records/4895219/access/request", "access_users": "https://zenodo.org/api/records/4895219/access/users", "archive": "https://zenodo.org/api/records/4895219/files-archive", "archive_media": "https://zenodo.org/api/records/4895219/media-files-archive", "communities": "https://zenodo.org/api/records/4895219/communities", "communities-suggestions": "https://zenodo.org/api/records/4895219/communities-suggestions", "doi": "https://doi.org/10.5281/zenodo.4895219", "draft": "https://zenodo.org/api/records/4895219/draft", "files": "https://zenodo.org/api/records/4895219/files", "latest": "https://zenodo.org/api/records/4895219/versions/latest", "latest_html": "https://zenodo.org/records/4895219/latest", "media_files": "https://zenodo.org/api/records/4895219/media-files", "parent": "https://zenodo.org/api/records/4895218", "parent_doi": "https://zenodo.org/doi/10.5281/zenodo.4895218", "parent_html": "https://zenodo.org/records/4895218", "requests": "https://zenodo.org/api/records/4895219/requests", "reserve_doi": "https://zenodo.org/api/records/4895219/draft/pids/doi", "self": "https://zenodo.org/api/records/4895219", "self_doi": "https://zenodo.org/doi/10.5281/zenodo.4895219", "self_html": "https://zenodo.org/records/4895219", "self_iiif_manifest": "https://zenodo.org/api/iiif/record:4895219/manifest", "self_iiif_sequence": "https://zenodo.org/api/iiif/record:4895219/sequence/default", "versions": "https://zenodo.org/api/records/4895219/versions" }, "media_files": { "count": 0, "enabled": false, "entries": {}, "order": [], "total_bytes": 0 }, "metadata": { "additional_descriptions": [ { "description": "Note: Many thanks go to Joseph Wang-Kathrein, Research Institute Brenner-Archives, University of Innsbruck, for the consulting and technical help about the long time archiving. Thanks to the Digital Science Center Innsbruck for the financial support.", "type": { "id": "notes", "title": { "de": "Anmerkungen", "en": "Notes" } } } ], "creators": [ { "affiliations": [ { "name": "University of Innsbruck" } ], "person_or_org": { "family_name": "Brocca", "given_name": "Nicola", "identifiers": [ { "identifier": "0000-0001-5318-0162", "scheme": "orcid" } ], "name": "Brocca, Nicola", "type": "personal" } } ], "description": "
\n\n
Ladder. A Corpus of Computer-Mediated Communication for the Analysis of the Acquisition of Pragmalinguistic Competences by German-Speaking Learners of Italian.
\n\n\n\n
\n\n
\n\n
Project description:
\n\nMany recent research projects (Artoni, Benigni, & Nuzzo, 2020; Cortés Velásquez & Nuzzo, 2017; Nuzzo & Cortés Velásquez, 2020) have underlined the usefulness of creating and analyzing corpora for teaching pragmatics, which, unlike other linguistic levels such as syntax, cannot be explained by rules but only by reference to tendential values or more or less appropriate choices in a given context. This is even more true for interactions via digital media, such as email and instant-messaging services, which have little place in manuals or L2 courses and for which learners have few reference models (Brocca, 2021; Trubnikova & Garofolin, 2020).
\n\nData collection:
\n\nData were collected from April 2020 to April 2021 with the help of a discourse completion task (DCT). The data consists of emails and instant messages. The informants are (i) German learners of Italian between A2-C1 level according to the CEFR and most of them are students living in Tyrol (Austria) and (ii) native speakers of Italian most of whom are students from Rome (Italy). The data of the learners were collected by students of the undergraduate seminar “Insegnare la pragmatica” which is part of the compulsory module 2b for student teachers at the Institute of Didactics of the University of Innsbruck. The data of the native speakers were collected in large part from students in foreign languages at the University RomaTre thanks to the collaboration with Prof. Elena Nuzzo.
\n\nThe DCTs have been conducted with online questionnaires. Along with the texts, metadata were also registered with the help of an online questionnaire giving sociolinguistic information about the informant (age, self-assessed language level, place of residence, native language, etc.). The DCTs aim to elicit linguistic acts of request and refusal in increasing levels of social distance and different media (Taguchi & Roever, 2017, pp. 85, 231; Hinger et al. 2018: 148). The DCTs elicit different speech acts (requests and refusals) with different degrees of formality (study/work or free time), directed at different people (lecturer, friend, boss) and in different media (mail or instant messaging). The scenarios represent authentic circumstances for the students. The following table shows the situations that were studied:
\n\n\n\n
high level of social distance between sender and recipient
\n\nScenario 1: Sender is asking for something that he/she is not entitled to
\n\nScenario 2: Sender is asking for something that he/she is entitled to
\n\nWhatsApp messages
\n\na) low level of social distance between sender and recipient
\n\nScenario 1: Request
\n\nScenario 2: Rejecting a request
\n\nScenario 3: Short-notice cancellation of an invitation
\n\nb) medium level of social distance between sender and recipient
\n\nScenario 4: Request
\n\nScenario 5: Rejecting a request
\n\nScenario 6: Short-term rejection of an invitation
\n\n\n\n
\n\n
The WhatsApp messages, which are exemplary of the text type instant messaging, were produced directly with the cell phone. The metadata were subsequently associated with the respective messages in an Excel spreadsheet. All personal data were anonymized.
\n\nThe prompts were presented in Italian, as follows:
\n\nMail a) Immagina di star facendo un corso con il Dr. Nicola Brocca. Domani devi fare una presentazione in classe. Non hai avuto tempo per studiare perché dovevi prepararti a un esame di inglese e ti accorgi che il materiale da presentare è più di quello che avevi previsto. Scrivi una mail al professore: la tua speranza è spostare la presentazione.
\n\nEngl: Imagine you are taking a course with Dr. Nicola Brocca. Tomorrow you have to give a presentation in class. You had no time to study because you had to prepare for an English exam, and you realize that there is more material to present than you had imagined. You write an email to the professor: your hope is to reschedule the presentation.
\n\nMail b) Hai fatto un corso con il Dr. Brocca. Hai consegnato il tuo portfolio il 01.02.2020 adesso è il 01.03.2020 e non hai ancora ricevuto il voto. Ti serve il voto per registrarti per una borsa di studio. Manda una mail al prof.: il tuo obiettivo è ricevere il voto al più presto
\n\nEngl: You have taken a course with Dr. Brocca. You turned in your portfolio on 02/01/2020, it is now 03/01/2020 and you have not received the grade yet. You need the grade to register for a scholarship. Send an email to the professor: your goal is to receive the grade as soon as possible.
\n\n\n\n
WhatsApp messages
\n\n1. Sei in Erasmus in Italia. Avete creato una chat con 10 compagni di corso. Hai perso la tua tessera della biblioteca a vuoi chiedere se qualcuno ti può aiutare perché ti serve un libro entro domani...per esempio prestandoti la sua. Cosa scrivi?
\n\nEngl: You are taking part in the Erasmus program in Italy. You have created a chat with 10 classmates. You lost your library card and want to ask if someone can help you because you need a book by tomorrow.... E.g. by lending you their card. What do you write?
\n\n2. Ricevi questo messaggio da un amico/a che fa un seminario con te: "Ciao, sono a corto di tempo. Ho visto che hai preso 30 all'esame. Potresti darmi una mano e restare con me in biblioteca oggi?" Non vuoi aiutare il tuo amico. Come reagisci?
\n\nEngl: You receive this message from a friend who is attending a seminar with you: "Hello, I'm running out of time. I saw that you got a 30 on the exam. Could you help me and stay with me in the library today?" You don't want to help the friend. How do you respond?
\n\n3. Cinque giorni fa hai promesso ad un/a amico/a che questa sera sareste andati al cinema assieme. Però hai cambiato idea. Cosa fai? Cosa scrivi?
\n\nEngl: Five days ago, you promised a friend that tonight you would go to the movies together. But you changed your mind. What would you do? What do you write?
\n\n\n\n
4. Sei al lavoro e hai smarrito il documento elettronico per entrare nel parcheggio. Sei nuovo in questo gruppo di lavoro e hai solo il numero del tuo diretto superiore. Gli mandi un messaggio per chiedergli se ti può aiutare.
\n\nEngl: You are at work and have lost your electronic badge to enter the parking lot. You are new to this work group and only have the number of your direct supervisor. You send him/her a message and ask if he/she can help you.
\n\n\n\n
5. Ricevi questo messaggio dal/la tuo/a superiore. "Gentile collega, domani c'è una scadenza importante. Per caso sarebbe in grado di restare oggi in ufficio oltre l'orario?" Non vuoi restare in ufficio oltre il normale. Come reagisci?
\n\nEngl: You receive this message from your supervisor. "Dear colleague, tomorrow is an important appointment. Would you be able to stay in the office after hours today?" You don't want to stay in the office beyond normal working hours. How do you respond?
\n\n\n\n
6. Cinque giorni fa hai promesso al/la tuo/a superiore che oggi saresti andato a una cena di lavoro. Però devi disdire. Cosa fai?
\n\nEngl: Five days ago, you promised your superior that you would go to a business dinner today. However, you have to cancel. What do you do?
\n\n\n\n
The corpus, which was first collected in .xlsx format, was exported to XML format and CSV format in cooperation with Joseph Wang-Kathrein (Brenner Archive Research Center). It was ensured that the emoticons and special characters were also transferred unchanged in the conversion process. These formats allow long-term archiving and significantly facilitate data exchange.
\n\nThe size of the corpus (as of May 2021, version Ladder 1.0):
\n\nThe LADDER corpus includes emails and instant-messaging messages amounting to 18,935 tokens and 33,966 tokens respectively. The corpus of WhatsApp messages consists of a total of 1,204 messages from 80 native speakers and 114 learners. The corpus of emails consists of a total of 235 emails from 78 native-speaker informants and 38 learners. The amount of data allows a qualitatively relevant comparison in sub-corpora e.g. language levels.
\n\nThe size of the corpus is necessarily limited quantitatively, as data collection must be done manually through individual DCT management and metadata checking. The major bottleneck is currently the annotation of socio-pragmatic aspects, a process that is difficult to automate and that needs to be conducted through cross-annotation by multiple annotators.
\n\nSome students' works on the corpus have been collected and are accessible via the following link: https://ladder.hypotheses.org/
\n\n\n\n
Bibliography:
\n\nArtoni, D., Benigni, V., & Nuzzo, E. (2020), "Pragmatic instruction in L2-Russian: a study on requests and advice" in Instructed Second Language Acquisition, 4(1), 62-95. doi:10.1558/isla.39864
\n\nBrocca, N. (2021), "LADDER: La costruzione e analisi di un corpus di scritture digitali per l’insegnamento della pragmatica in L2" in Italiano Lingua Due, 13(1 (2021)).
\n\nCortés Velásquez, D., & Nuzzo, E. (2017), "Disdire un appuntamento: spunti per la didattica dell'italiano L2 a partire da un corpus di parlanti nativi" in Italiano Lingua Due, 1, 17-36.
\n\nHinger, B., Stadler, W., Schmiderer, K., Bauer, M., (Hrg.) (2018). Testen und Bewerten fremdsprachlicher Kompetenzen. Tübingen: Narr Francke Attempto Verlag.
\n\nNuzzo, E., & Cortés Velásquez, D. (2020), "Canceling Last Minute in Italian and Colombian Spanish: A Cross-Cultural Account of Pragmalinguistic Strategies" in Corpus Pragmatics, 4, 1-26. doi:10.1007/s41701-020-00084-y
\n\nTaguchi, N., & Roever, C. (2017), Second language pragmatics: Oxford: Oxford University Press.
\n\nTrubnikova, V., & Garofolin, B. (2020), Lingua e interazione. Insegnare la pragmatica a scuola. Pisa: ETS.
\n\n", "publication_date": "2021-06-02", "publisher": "Zenodo", "resource_type": { "id": "dataset", "title": { "de": "Datensatz", "en": "Dataset" } }, "rights": [ { "description": { "en": "The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited." }, "icon": "cc-by-icon", "id": "cc-by-4.0", "props": { "scheme": "spdx", "url": "https://creativecommons.org/licenses/by/4.0/legalcode" }, "title": { "en": "Creative Commons Attribution 4.0 International" } } ], "subjects": [ { "subject": "Italian as L2" }, { "subject": "Computer Mediated Communication" }, { "subject": "Sociopragmatics" }, { "subject": "Speech Acts" }, { "subject": "Learning Analytics" }, { "subject": "Theacher Education" }, { "subject": "Pragmatics Teaching" }, { "subject": "Italian pragmatics" }, { "subject": "Intercultural pragmatics" }, { "subject": "corpuspragmatics" }, { "subject": "Italian spoken by Germanophones" }, { "subject": "corpus" }, { "subject": "DDL" }, { "subject": "Data Driven Learning" } ], "title": "LADDER. Learners' digital communication: a corpus for pragmatic competences in Italian L1/L2", "version": "2021.06.02" }, "parent": { "access": { "owned_by": { "user": 222147 } }, "communities": { "default": "f759b883-29f6-4fd2-8c95-24e9b19173a1", "entries": [ { "access": { "member_policy": "open", "members_visibility": "public", "record_policy": "open", "review_policy": "open", "visibility": "public" }, "children": { "allow": false }, "created": "2021-09-21T11:26:42.797822+00:00", "custom_fields": {}, "deletion_status": { "is_deleted": false, "status": "P" }, "id": "f759b883-29f6-4fd2-8c95-24e9b19173a1", "links": {}, "metadata": { "curation_policy": "
Note that we only accept contributions in which also the accompanying data and code are made publicly available. If a paper, for example, is provided as open access, uploaded to Zenodo, and then requested for inclusion in our community, but uses data and code which are not shared, we won't include it in this collection. In case of having been rejected, feel free to contact calc@digling.org to discuss the reasons for rejection.
\r\n", "page": "Collection of papers, presentations, datasets, and source code for digital applications in historical linguistics. Supersedes the DIGHL community, which has been spammed by too many unrelated contributions in the past and won't be maintained any longer in the future.
", "title": "Digital Historical Linguistics" }, "revision_id": 0, "slug": "digling", "updated": "2021-09-21T11:26:43.015441+00:00" } ], "ids": [ "f759b883-29f6-4fd2-8c95-24e9b19173a1" ] }, "id": "4895218", "pids": { "doi": { "client": "datacite", "identifier": "10.5281/zenodo.4895218", "provider": "datacite" } } }, "pids": { "doi": { "client": "datacite", "identifier": "10.5281/zenodo.4895219", "provider": "datacite" }, "oai": { "identifier": "oai:zenodo.org:4895219", "provider": "oai" } }, "revision_id": 14, "stats": { "all_versions": { "data_volume": 106274140.0, "downloads": 148, "unique_downloads": 94, "unique_views": 769, "views": 821 }, "this_version": { "data_volume": 64628761.0, "downloads": 89, "unique_downloads": 51, "unique_views": 628, "views": 665 } }, "status": "published", "updated": "2022-03-28T12:45:24.549282+00:00", "versions": { "index": 1, "is_latest": false } }