This paper presents a simple but effective method to improve the quality of WordNet synsets and extract glosses for synsets. We translate the Princeton WordNet and other intermediate WordNets to a target language using a machine translator, then the correct candidates are selected by applying different ranking methods: occurrence count, cosine similarity between words, cosine similarity between word embeddings and cosine similarity between Doc2Vec of sentences. Our approaches may be applicable to build WordNets in any language which has some bilingual dictionaries and at least a monolingual corpus in the target language.
Tạp chí: International Exchange and Planning Workshop on PPB and Farmers's Seed System for Sustainable Development in China and Soueast Asia, Nanning - Guangxi, China 15-18 October 2019
Tạp chí: International conference on International Learning Instruction and Teacher Education, Ha Noi National University of Education, 14-15th December, 2019
Tạp chí: 21st PATTAYA International Conference on Agricultural, Environmental and Biological Sciences (PAEBS-19) - Enviromental Sciences and Engineering, Thailan, 9-10/12/2019
Tạp chí khoa học Trường Đại học Cần Thơ
Lầu 4, Nhà Điều Hành, Khu II, đường 3/2, P. Xuân Khánh, Q. Ninh Kiều, TP. Cần Thơ
Điện thoại: (0292) 3 872 157; Email: tapchidhct@ctu.edu.vn
Chương trình chạy tốt nhất trên trình duyệt IE 9+ & FF 16+, độ phân giải màn hình 1024x768 trở lên