There is a newer version of this record available.

Report Open Access

Keyboard Layout Analysis: Creating the Corpus, Bigram Chains, and Shakespeare's Monkeys

Douglas, Ian

 

This report describes the process of creating a corpus suitable for evaluating computer keyboard layouts optimised for typing English and computer program code. After suitable texts are sourced, sampled and cleaned, they are processed to extract bigrams, which are then used to generate sample input texts of a desired length. Although these texts look random, their character distribution and letter sequences closely match those of either English or computer programs. The resulting texts are well suited to evaluating keyboard layouts. Corpus analysis is included.
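The bigram-chain idea in the abstract can be illustrated with a minimal sketch. It assumes a plain first-order character chain, where each next character is drawn in proportion to how often it followed the current character in the source text. The function names `count_bigrams` and `generate_text`, the `seed` parameter, and the restart-on-dead-end rule are illustrative assumptions, not taken from the report, whose actual procedure may differ.

```python
import random
from collections import Counter, defaultdict


def count_bigrams(text):
    """Count how often each character is followed by each other character."""
    followers = defaultdict(Counter)
    for first, second in zip(text, text[1:]):
        followers[first][second] += 1
    return followers


def generate_text(followers, length, seed=None):
    """Walk the bigram chain to produce `length` characters.

    Assumption for this sketch: the start character (and any restart after
    a dead end) is chosen uniformly from the characters that have followers.
    """
    if length <= 0 or not followers:
        return ""
    rng = random.Random(seed)
    starts = sorted(followers)
    current = rng.choice(starts)
    output = [current]
    while len(output) < length:
        options = followers.get(current)
        if not options:
            # Dead end: this character was only seen at the very end of the source.
            current = rng.choice(starts)
        else:
            chars = list(options)
            weights = [options[c] for c in chars]
            # Weighted draw: frequent bigrams are reproduced more often.
            current = rng.choices(chars, weights=weights, k=1)[0]
        output.append(current)
    return "".join(output)
```

Calling `generate_text(count_bigrams(corpus), 1000)` on a cleaned corpus string would yield 1,000 characters in which every adjacent pair, apart from any dead-end restarts, also occurs in the source. That property is why such text can resemble English or program code to a keyboard-layout evaluator while reading as nonsense.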


Includes related data files, but not the actual corpora, to avoid any copyright issues.

Version history:
28 March 2021, 1.0.0: Initial version.
29 March 2021, 1.0.1: Added 4-, 5-, 6-, 7-, 8- and 9-grams; made tables more compact; added Appendix A.
Files (18.3 MB)
Name, Size, Checksum
Keyboard-Layout-Analysis-101.zip, 8.6 MB, md5:e1d0d87c1f554830b1b22fa61f0cb2f9
Keyboard-Layout-Analysis-Createng-the-Corpus-Bigram-Chains-and-Shakespeares-Monkeys-1.0.0.pdf, 1.8 MB, md5:88336dbb04093ddf1ddbd9463c3db81c
Keyboard-Layout-Analysis-Createng-the-Corpus-Bigram-Chains-and-Shakespeares-Monkeys-1.0.1.pdf, 3.6 MB, md5:348d630f431f36695f32647b72197e52
Keyboard-Layout-Analysis.zip, 4.2 MB, md5:69b35b4cbbdd1c6f275bf362c3779464
Usage statistics (all versions / this version)
Views: 2,102 / 1,382
Downloads: 949 / 492
Data volume: 5.5 GB / 3.3 GB
Unique views: 1,834 / 1,274
Unique downloads: 784 / 414
