This paper proposes a new method to estimate impression of short sentences considering adjectives. In the proposed system; first; an input sentence is analyzed and preprocessed to obtain keywords. Next; adjectives are taken out from the data which is queried from Google N-gram corpus using keywords-based templates. The semantic similarity scores between the keywords and adjectives are then computed by combining several computational measurements such as Jaccard coefficient; Dice coefficient; Overlap coefficient; and Pointwise mutual information. In the next step; the library sentiment of patterns.en - natural language processing toolkit is utilized to check the sentiment polarity (positive or negative) of adjectives and sentences. Finally; adjectives are ranked and top na adjectives (in this paper na is 5) are chosen according to the estimated values. We carried out subjective experiments and obtained fairly good results. For example; when the input sentence is “It is snowy”; selected adjectives and their scores are: white (0.70); light (0.49); cold (0.43); solid (0.38) and scenic (0.37).
Tạp chí khoa học Trường Đại học Cần Thơ
Lầu 4, Nhà Điều Hành, Khu II, đường 3/2, P. Xuân Khánh, Q. Ninh Kiều, TP. Cần Thơ
Điện thoại: (0292) 3 872 157; Email: tapchidhct@ctu.edu.vn
Chương trình chạy tốt nhất trên trình duyệt IE 9+ & FF 16+, độ phân giải màn hình 1024x768 trở lên