Published October 1, 2015
| Version 2015-10
Dataset
Open
CLiPS Stylometry Investigation (CSI) Corpus
Contributors
- 1. University of Antwerp
Description
The CSI corpus is a yearly expanded corpus of student texts in two genres: essays and reviews. The purpose of this corpus lies primarily in stylometric research, but other applications are possible. There is a vast amount of meta-data available, both on the author (gender, age, sexual orientation, region of origin, personality profile) and on the document (timestamp, genre, veracity, sentiment, grade). The current version of the corpus was assembled in February 2016. Previous versions of the corpus are available from the authors via e-mail request.
Files
csicorpus.zip
Files
(3.4 MB)
Name | Size | Download all |
---|---|---|
md5:4389d6ff8ecf4f2fdf436ac7aee2e366
|
3.4 MB | Preview Download |