
Document Type Master's Dissertation Author Liang, Hsuan Lorraine liangh@unisa.ac.za URN etd-06252009-163007 Document Title Spell checkers and correctors : a unified treatment Degree MSc Department Computer Science Supervisor
Advisor Name Title Prof D G Kourie Committee Co-Chair Prof B W Watson Supervisor Keywords
- performance
- n-gram
- FSA
- formal concept analysis
- edit distance
- dictionary lookup
- classification
- spell checking
- spell correcting
Date 2009-04-15 Availability unrestricted Abstract The aim of this dissertation is to provide a unified treatment of various spell checkers and correctors. Firstly, the spell checking and correcting problems are formally described in mathematics in order to provide a better understanding of these tasks. An approach that is similar to the way in which denotational semantics used to describe programming languages is adopted. Secondly, the various attributes of existing spell checking and correcting techniques are discussed. Extensive studies on selected spell checking/correcting algorithms and packages are then performed. Lastly, an empirical investigation of various spell checking/correcting packages is presented. It provides a comparison and suggests a classification of these packages in terms of their functionalities, implementation strategies, and performance. The investigation was conducted on packages for spell checking and correcting in English as well as in Northern Sotho and Chinese. The classification provides a unified presentation of the strengths and weaknesses of the techniques studied in the research. The findings provide a better understanding of these techniques in order to assist in improving some existing spell checking/correcting applications and future spell checking/correcting package designs and implementations.
ŠUniversity of Pretoria 2008
Please cite as follows
Liang, HL 2008, Spell checkers and correctors : a unified treatment, MSc dissertation, University of Pretoria, Pretoria, viewed yymmdd < http://upetd.up.ac.za/thesis/available/etd-06252009-163007/ >
E1305/gmFiles
Filename Size Approximate Download Time (Hours:Minutes:Seconds)
28.8 Modem 56K Modem ISDN (64 Kb) ISDN (128 Kb) Higher-speed Access dissertation.pdf 744.44 Kb 00:03:26 00:01:46 00:01:33 00:00:46 00:00:03