Published August 5, 2022 | Version v1
Journal article Open

PROTOTYPING NLP NON-WORD DETECTION SYSTEM FOR DINKA USING DICTIONARY LOOKUP APPROACH

  • 1. University of Juba
  • 2. Computing at Kampala International University

Description

Spellcheckers are computer software used for non-word or real word error detection. The Dinka text
editors have been developed, however, no one has developed their spellcheckers. The research entitled
Prototyping NLP Non-Word Detection System for Dinka Using Dictionary Lookup Approach was a
solution to Dinka spellchecking. The study objectives were: requirements gathering and analysis. The
computer keyboard was customized to accept the Dinka characters. Dinka lexicon was created with 6,976
words. The prototype was implemented using java programming language and dictionary lookup approach
was used for non-word detection. The accuracy of detection (detecting real words and non-words) gave
98.10%, and the accuracy of non-word detection (detection of non-words only) was 91.36%. The True
Positive Rate (TPR) was 99.10% and the True Negative Rate (TNR) was 91.36 %. The speed of non-word
detection which was found as1, 044 Hz was slow.

Files

11322ijnlc04.pdf

Files (739.9 kB)

Name Size Download all
md5:23be6e2225bae62cb75e4c4e0139bde4
739.9 kB Preview Download