Published May 23, 2025 | Version v2

Bilkent Turkish Writings Dataset

  • 1. Imperial College London

Description

A compilation of Turkish creative writings from Bilkent University's Turkish 101 and Turkish 102 courses (2014-2025). The dataset contains 9,119 writings authored by Bilkent University students and instructors, with a focus on developing creativity, content, composition, grammar, spelling, and punctuation. The writings were originally published publicly by Bilkent University and have been systematically collected, processed, and structured into a research dataset. Version 2.0 expands the dataset with 33% more content than the initial release, making it one of the largest publicly available Turkish creative writing corpora for academic research. Note: This is a compilation of existing publicly available content; the original creative works were authored by Bilkent University students and instructors.

Files

bilkent-turkish-writings-dataset-master.zip

Files (109.0 MB)

Checksum                               Size
md5:1ad4ef4fcd2ef65d84d5951270df3a39   29.8 MB
md5:bab65cdeac28139820bc5d66afa01467   34.8 MB
md5:b4e5beb9a7f6151467b8565cf1fe5cb5   44.5 MB
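
The md5 values listed above can be used to confirm that a downloaded file arrived intact. The following is a minimal sketch in Python using only the standard library. The helper names md5_of_file and matches_listed_checksum are hypothetical and not part of the dataset's repository, and because the record does not show which file name belongs to which checksum, the caller has to supply the local path.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large archives are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a local file against a checksum written as 'md5:<hex>' or '<hex>'."""
    # Drop the optional 'md5:' prefix used in the file listing.
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, calling matches_listed_checksum with the path of a downloaded file and the string "md5:1ad4ef4fcd2ef65d84d5951270df3a39" should return True only if that file is the 29.8 MB item from the listing and was downloaded without corruption.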

Additional details

Software

Repository URL
https://github.com/selimfirat/bilkent-turkish-writings-dataset/
Programming language
Python, CSV
Development Status
Active