Using large N-gram for Vietnamese spell checking
Advances in Intelligent Systems and Computing, 2015 (Volume 326, pages 617-627)
ISSN: 21945357
DOI: 10.1007/978-3-319-11680-8_49
Indexed in: Scopus
Conference Paper
English
Keywords: Intelligent systems; F-score; Large corpora; Large N; N-gram modeling; N-grams; Spell-checking; System use; Vietnamese; Systems engineering
English abstract
Spell checking is the process of detecting misspelled words and either correcting them or suggesting corrections. In this paper, we present our context-based spell checking system and our experimental results on Vietnamese. The system uses an N-gram model built from a large corpus. The N-grams are compressed to save memory. Furthermore, we use the context on both sides of each syllable to improve the system's performance. Our system achieved high accuracy, with an F-score of approximately 94% on Vietnamese text. © Springer International Publishing Switzerland 2015.
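
The idea of using context on both sides of a syllable can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a toy four-sentence corpus, plain bigram counts with add-one smoothing, and a hand-supplied candidate list, and it omits N-gram compression entirely. The names `train_bigrams`, `context_score`, and `best_candidate` are hypothetical.

```python
from collections import Counter


def train_bigrams(sentences):
    """Count unigrams and bigrams over whitespace-separated syllables."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in sentences:
        syllables = ["<s>"] + sentence.split() + ["</s>"]
        unigrams.update(syllables)
        bigrams.update(zip(syllables, syllables[1:]))
    return unigrams, bigrams


def context_score(left, candidate, right, unigrams, bigrams):
    """Score a candidate using both its left and right neighbour.

    Assumption: add-one smoothing, chosen only to keep the sketch short.
    """
    vocab = len(unigrams)
    p_left = (bigrams[(left, candidate)] + 1) / (unigrams[left] + vocab)
    p_right = (bigrams[(candidate, right)] + 1) / (unigrams[candidate] + vocab)
    return p_left * p_right


def best_candidate(left, candidates, right, unigrams, bigrams):
    """Return the candidate syllable that best fits the two-sided context."""
    return max(
        candidates,
        key=lambda c: context_score(left, c, right, unigrams, bigrams),
    )


# Toy corpus (an assumption for illustration; a real system needs a large one).
corpus = ["tôi đi học", "em đi học", "tôi đi làm", "tôi đã học"]
unigrams, bigrams = train_bigrams(corpus)

# Choose between a misspelling and its correction in "tôi _ học".
print(best_candidate("tôi", ["di", "đi"], "học", unigrams, bigrams))
```

In this sketch, a candidate that never occurs between the given neighbours receives only the smoothing mass on both sides, so an attested syllable is preferred whenever the corpus covers the context.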