Study on the Domain Adaption of Korean Speech Act using Daily Conversation Dataset and Petition Corpus

Song, Youngsook; Cho, Won Ik

doi:10.5281/zenodo.10722020

There is a newer version of the record available.

Published February 28, 2024 | Version v1

Preprint Open

Study on the Domain Adaption of Korean Speech Act using Daily Conversation Dataset and Petition Corpus

In Korean, quantitative speech act studies have usually been conducted on single utterances with unspecified sources. In this study, we annotate sentences from the National Institute of Korean Language's Messenger Corpus and the National Petition Corpus, as well as example sentences from an academic paper on contemporary Korean vlogging, and check the discrepancy between human annotation and model prediction. In particular, for sentences with differences in locutionary and illocutionary forces, we analyze the causes of errors to see if stylistic features used in a particular domain affect the correct inference of speech act. Through this, we see the necessity to build and analyze a balanced corpus in various text domains, taking into account cases with different usage roles, e.g., messenger conversations belonging to private conversations and petition corpus/vlogging script that have an unspecified audience.

Files

NLP4DH_2023CopyCRCopy____Journal_ver_ 0228v1.pdf

Files (1.9 MB)

Name	Size	Download all
NLP4DH_2023__Copy____CR__Copy____Journal_ver_ 0228v1.pdf md5:75614e504959f856585c9f5e5c785d0c	1.9 MB	Preview Download

197

Views

Downloads

Show more details

	All versions	This version
Views	197	48
Downloads	1,385	48
Data volume	3.2 GB	117.0 MB

More info on how stats are collected....

DOI

Resource type

Preprint

Publisher

Zenodo

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: February 28, 2024
Modified: July 7, 2024

Study on the Domain Adaption of Korean Speech Act using Daily Conversation Dataset and Petition Corpus

Authors/Creators

Description

Files

NLP4DH_2023__Copy____CR__Copy____Journal_ver_ 0228v1.pdf

Files (1.9 MB)

NLP4DH_2023CopyCRCopy____Journal_ver_ 0228v1.pdf