Abstract:
This paper proposes novel approaches for detecting and correcting spelling errors in Sinhala essays written by candidates for the grade five scholarship examination. These students have no proper mechanism for identifying spelling mistakes in their essays by themselves. Their spelling errors may arise from violations of spelling rules, missing or added letters, missing modifiers, inaccurate spelling of words with a similar structure, and confusion between similar-sounding letters. To address these challenges, a Sinhala corpus file has been developed to distinguish accurate from inaccurate spellings of written words. The application identifies the correct and incorrect words entered by the user and generates the most likely correct words as suggestions for the incorrect ones. This paper introduces three novel approaches for detecting correctly spelled words in Sinhala essays, namely the object word checker method, the suffixes checker method, and the similar word checker method. In addition, the paper discusses three approaches for generating accurate suggestions, one of which is novel. In evaluating the accuracy of the spelling error detection and correction module, the overall precision, recall, and F-measure were recorded as 83.05%, 85.57%, and 86.62%, respectively.
Citation:
P. M. Samarasinghe, W. B. I. Sewwandi, L. Ranathunga and W. A. S. N. Wijetunge, "A Data Driven Approach for Detection and Correction of Spelling Errors in Sinhala Essays," 2021 6th International Conference on Information Technology Research (ICITR), 2021, pp. 1-5, doi: 10.1109/ICITR54349.2021.9657458.