Vietnamese Text Classification with TextRank and Jaccard Similarity Coefficient


Journal issue: 5(2020) Pages: 363-369

Authors: Huỳnh Tuấn Hảo, Trương Quốc Định, Huỳnh Xuân Hiệp, Duong Trung Nghia

Journal: Advances in Science, Technology and Engineering Systems Journal

Link: https://astesj.com/v05/i06/p44/

Abstract

Text classification is considered one of the most fundamental and essential problems; it deals with automatically classifying textual resources into pre-defined categories. Numerous algorithms, datasets, and evaluation measurements have been proposed to address the task. In an era of information redundancy, manually engineering a sizable amount of multilingual data is challenging and time-consuming. Moreover, it is time-consuming to consider all the words in a text rather than only several key tokens. In this work, we propose an effective method to classify Vietnamese texts that leverages the TextRank algorithm and the Jaccard similarity coefficient. TextRank ranks words and sentences according to their contribution value and extracts the most representative keywords. First, we collected textual sources from a wide range of Vietnamese news websites. We then applied data preprocessing, extracted keywords with the TextRank algorithm, measured similarity scores with the Jaccard distance, and predicted categories. We conducted numerous experiments, and the proposed method achieved an accuracy of 90.07% on real-world datasets. We have shown that it is applicable in practice.
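The pipeline outlined in the abstract (keyword extraction with TextRank, then category prediction by Jaccard similarity) could be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names, the window size, the damping factor, the number of keywords, and the toy category keyword sets are all assumptions, and the input is assumed to be already word-segmented Vietnamese tokens with stop words removed.

```python
from collections import defaultdict


def textrank_keywords(tokens, top_k=5, window=2, damping=0.85, iters=50):
    """Rank tokens with a simplified TextRank and return the top_k as a set."""
    # Build an undirected co-occurrence graph over a sliding window.
    neighbors = defaultdict(set)
    for i, word in enumerate(tokens):
        for other in tokens[i + 1 : i + window + 1]:
            if other != word:
                neighbors[word].add(other)
                neighbors[other].add(word)
    # Repeatedly apply the PageRank update on the unweighted graph.
    scores = {w: 1.0 for w in neighbors}
    for _ in range(iters):
        scores = {
            w: (1 - damping)
            + damping * sum(scores[n] / len(neighbors[n]) for n in neighbors[w])
            for w in neighbors
        }
    # Sort by descending score; break ties alphabetically for determinism.
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return set(ranked[:top_k])


def jaccard(a, b):
    """Jaccard similarity coefficient |A & B| / |A | B| of two sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def classify(tokens, category_keywords, top_k=5):
    """Return the category whose keyword set is most similar to the text's."""
    keywords = textrank_keywords(tokens, top_k=top_k)
    return max(
        sorted(category_keywords),
        key=lambda c: jaccard(keywords, category_keywords[c]),
    )


# Hypothetical toy data: pre-segmented tokens and hand-picked category keywords.
example_categories = {
    "sports": {"bóng_đá", "trận_đấu", "cầu_thủ"},
    "economy": {"ngân_hàng", "lãi_suất", "thị_trường"},
}
example_tokens = ["cầu_thủ", "ghi", "bàn", "trận_đấu", "bóng_đá", "cầu_thủ", "trận_đấu"]
predicted = classify(example_tokens, example_categories)
print(predicted)
```

In this sketch each category is represented by a fixed keyword set; how the paper builds its category representations and which preprocessing it applies are not stated in the abstract, so those parts are placeholders.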

Other articles

Factors Affecting Job Searching Ability of Fresh University Graduates in the Mekong Delta, Vietnam

Journal issue: 7(2020) Pages: 548-552

Authors: Nguyễn Quốc Nghi, Lê Thị Diệu Hiền

Journal: International Journal of Research and Review

Plant Identification Using New Architecture Convolutional Neural Networks Combine with Replacing the Red of Color Channel Image by Vein Morphology Leaf

Journal issue: 7(2020) Pages: 197–208

Authors: Huỳnh Xuân Hiệp, Trương Quốc Bảo, Trương Quốc Định, Nguyen Thanh Tan Kiet

Journal: Vietnam Journal of Computer Science

Analyze Production Efficiency and Scale Efficiency of Rice Farming Households in Hau Giang Province

Journal issue: 4(2020) Pages: 525-527

Authors: Nguyễn Quốc Nghi, La Nguyễn Thùy Dung

Journal: International Journal of Trend in Scientific Research and Development

Enhancing Pedagogical Profession and Personal Improvement for Vietnamese Student Teachers through Reality-experienced Internship Program in Thailand

Journal issue: 8(2020) Pages: 112-118

Authors: Huỳnh Thị Thúy Diễm, Nguyễn Kỳ Tuấn Sơn, Chaninan Pruekpramool, Kamonwan Kanyaprasith, Nason Phonphok

Journal: Universal Journal of Educational Research

Synthesis of TiO2/Cellulose Nanocomposites and its Application for Degradation of Methylene Blue

Journal issue: 7(2020) Pages: 30-35

Authors: Trần Thị Bích Quyên, Đoàn Văn Hồng Thiện, Ngo Nguyen Tra My, Phan Van Hoang Khang

Journal: International Journal of Trend in Research and Development

A Pilot Study on Language Learning Strategies Used by Low and High Proficiency University Students at a Vietnamese University

Journal issue: 8(2020) Pages: 250-256

Authors: Lý Hồng Thái

Journal: The International Journal of Humanities & Social Studies

Factors Influencing the Entrepreneurial Behaviour

Journal issue: 5(2020) Pages: 393-397

Authors: Lê Nguyễn Đoan Khôi

Journal: International Journal of Trend in Scientific Research and Development

Understanding the 4.0 Industrial Revolution Impacts on How Students Aware of Opportunities and Challenges

Journal issue: 7(2020) Pages: 263-267

Authors: Nguyễn Quốc Nghi, Lê Thị Diệu Hiền

Journal: International Journal of Research and Scientific Innovation

Production Efficiency Of Hoaloc-Mango Gardeners In The Southern Vietnam

Journal issue: 7(2020) Pages: 1465-1473

Authors: Trương Hồng Võ Tuấn Kiệt, Nguyễn Thị Kim Thoa, Phạm Thị Nguyên

Journal: European Journal of Molecular and Clinical Medicine

Recommender Systems Using Collaborative Tagging

Journal issue: 16(2020) Pages: 10(18)

Authors: Latha Banda, Huỳnh Xuân Hiệp, Mohamed Abdel-Basset, Lê Hoàng Sơn, David Taniar, Phạm Huy Thông, Karan Singh

Journal: International Journal of Data Warehousing and Mining

Recommender Systems Based on Resonance Relationship of Criteria With Choquet Operation

Journal issue: 16(2020) Pages: 44-62

Authors: Huỳnh Xuân Hiệp, Luong Hoang Huong, Cù Nguyên Giáp, Lê Hoàng Sơn, Huỳnh Minh Trí

Journal: International Journal of Data Warehousing and Mining

The S-Shaped Relationship Between Internationalization and Performance: Empirical Evidence from Laos

Journal issue: 7(2020) Pages: 357-366

Authors: Phan Anh Tú, Nguyen Thi Kim Thuy, Phan Minh Triet

Journal: Journal of Asian Finance, Economics and Business

Anticancer and Antioxidant of Chloroform Extracts from Medical Plants in the Mekong Delta, Vietnam

Journal issue: 19(2020) Pages: 398-405

Authors: Đỗ Tấn Khang, Nguyễn Trọng Tuân, Trần Thanh Mến, Nguyễn Văn Ây, Nguyen Khanh Dung, Tu Le Ngoc Thao, Bi Truong Giang, Huynh Diet Dieu, Huỳnh Văn Lợi, Lê Thị Thủy Tiên, Nguyễn Phương Thúy

Journal: Asian Journal of Plant Sciences

Bank Credit, Trade Credit and Growth of Listed Agricultural Firms in Vietnam

Journal issue: 7(2020) Pages: 303-315

Authors: Lê Khương Ninh, Phan Anh Tú, Bùi Tuấn Anh

Journal: Journal of Asian Finance, Economics and Business

An integrated approach of ISM and fuzzy TOPSIS for supplier selection

Journal issue: 13(2020) Pages: 701-735

Authors: Trần Thị Thắm, Trần Thị Mỹ Dung, Nguyễn Hồng Phúc, Nguyễn Trọng Trí Đức

Journal: International Journal of Procurement Management

On the cosmic-ray energy scale of the LOFAR radio telescope

Journal issue: 2020(2020) Pages:

Authors: K. Mulrey, Trịnh Thị Ngọc Gia, G.K. Krampah, H. Pandya, B.M. Hare, T. Winchen, P. Mitra, J.R. Hörandel, T. Huege, S. Thoudam, S. ter Veen, A. Nelles, O. Scholten, J.P. Rachen, H. Falcke, A. Corstanje, S. Buitink

Journal: Journal of Cosmology and Astroparticle Physics

The Impact of Occupational Stress on Job Satisfaction and Job Performance of Banking Credit Officers

Journal issue: 10(2020) Pages: 3891-3898

Authors: Nguyễn Quốc Nghi, Hoàng Thị Hồng Lộc, Nguyễn Du Hạ Long

Journal: Management Science Letters

Helicteres binhthuanensis V.S.Dang (Malvaceae, Helicteroideae), a new species from southern Vietnam

Journal issue: 2020(2020) Pages: 87-95

Authors: Đặng Văn Sơn, Đặng Minh Quân, Hoàng Nghĩa Sơn

Journal: PhytoKeys

Biocomposite scaffold preparation from hydroxyapatite extracted from waste bovine bone

Journal issue: 9(2020) Pages: 37-47

Authors: Hồ Quốc Phong, Huỳnh Liên Hương, Tao The Duong, Meng-Jiy Wang

Journal: Green Processing and Synthesis

Mechano-chemical stability and water effect on gas selectivity in mixed-metal zeolitic imidazolate frameworks: a systematic investigation from van der Waals corrected density functional theory

Journal issue: 22(2020) Pages:

Authors: Diem Thi-Xuan Dang, Nguyễn Thị Tuyết Nhung, Duc Nguyen-Manh, Jer-Lai Kuo, Nam Thoai, Huong Thi-Diem Nguyen

Journal: Physical Chemistry Chemical Physics

BPH Sensor Network Optimization Based on Cellular Automata and Honeycomb Structure

Journal issue: 25(2020) Pages: 1140–1150

Authors: Huỳnh Xuân Hiệp, Ông Thị Mỹ Linh, Huỳnh Phụng Toàn, Đặng Quang Huy, Luong Hoang Huong, Pham Van Huy, Duong Trung Nghia, Bernard Pottier

Journal: Mobile Networks and Applications

A Novel Single Valued Neutrosophic Hesitant Fuzzy Time Series Model: Applications in Indonesian and Argentinian Stock Index Forecasting

Journal issue: 8(2020) Pages: 60126-60141

Authors: Billy Tanuwijaya, Huỳnh Xuân Hiệp, Ganeshsree Selvachandran, Mahmoud Ismail, Phạm Văn Huy, Lê Hoàng Sơn, Mohamed Abdel-Basset

Journal: IEEE Access

Context-Similarity Collaborative Filtering Recommendation

Journal issue: 8(2020) Pages: 33342-33351

Authors: Huỳnh Xuân Hiệp, Mahmoud Ismail, Phạm Văn Huy, Mohamed Abdel-Basset, Lê Hoàng Sơn, Phạm Mộng Nghi, Phan Quốc Nghĩa

Journal: IEEE Access

A simple and efficient transfection protocol for Cryptosporidium parvum using Polyethylenimine (PEI) and Octaarginine

Journal issue: 147(2020) Pages: 1065-1070

Authors: Nguyễn Hồ Bảo Trân, Dieter Seebach, Wanpeng Zheng, Maxi Berberich, Faustin Kamena, Arwid Daugschies

Journal: Parasitology

Functional characterization of myeloid differentiation factor 88 in Nile tilapia (Oreochromis niloticus)

Journal issue: 250(2020) Pages:

Authors: Nguyễn Bảo Trung, Po-Tsang Lee

Journal: Comparative Biochemistry and Physiology Part B: Biochemistry and Molecular Biology

Developmental toxicity of Clerodendrum cyrtophyllum turcz ethanol extract in zebrafish embryo

Journal issue: 1(2020) Pages:

Authors: Thu Hang Nguyen, Nguyễn Phúc Đảm, Patrick Kestemont, Hai The Pham, Duong Thi Ly Huong, Marc Muller, Joëlle Quetin-Leclercq

Journal: Journal of Ethnopharmacology

Karush-Kuhn-Tucker optimality conditions and duality for semi-infinite programming problems with vanishing constraints

Journal issue: 4(2020) Pages: 319-336

Authors: Lê Thanh Tùng

Journal: Journal of Nonlinear and Variational Analysis

Expression profile, subcellular localization and signaling pathway analysis of fish-specific TLR25 in Nile tilapia (Oreochromis niloticus)

Journal issue: 104(2020) Pages: 141-154

Authors: Po-Tsang Lee, Nguyễn Bảo Trung, Po-Yu Chiu, Yu-Lin Lin, Hồ Thị Hằng

Journal: Fish & Shellfish Immunology

Growing Self-Organizing Maps for Metagenomic Visualizations Supporting Disease Classification

Journal issue: Tran Khanh Dang, Josef Küng, Makoto Takizawa, Tai M. Chung (2020) Pages: 151-166

Authors: Nguyễn Thanh Hải, Trương Quốc Định, Nguyễn Ngọc Mỹ, Phung Duong Linh, Banh Ngoc Thuy Thao, Nguyen Anh Bang, Nguyễn Chí Linh

Journal: Lecture Notes in Computer Science


Can Tho University Journal of Science
Campus II, Can Tho University, 3/2 Street, Ninh Kiều Ward, Cần Thơ City, Vietnam
Phone: (0292) 3 872 157; Email: tapchidhct@ctu.edu.vn
