An Approach for Web Content Classification with FastText

Hướng dẫn

Tìm kiếm nâng cao

Tựa bài viết

Tìm

Tác giả

Năm xuất bản

Tóm tắt

Lĩnh vực

Phân loại

Số tạp chí

Bản tin định kỳ

Báo cáo thường niên

Tạp chí khoa học ĐHCT

Tạp chí tiếng anh ĐHCT

Tạp chí trong nước

Tạp chí quốc tế

Kỷ yếu HN trong nước

Kỷ yếu HN quốc tế

Book chapter

An Approach for Web Content Classification with FastText

Số tạp chí 5(2024) Trang: 765

Tác giả: Lương Hoàng Hướng, Lan Thu Thi Le, Nguyễn Thanh Hải

Tạp chí: Lecture Notes in Computer Science

Liên kết: https://doi.org/10.1007/s42979-024-03155-y

Tóm tắt

Nowadays, with the Internet infrastructure and nearly global access, the amount and diversity of data are increasing rapidly. Many tasks require information retrieval and data collection for machine learn- ing, research, and survey reports in various fields such as meteorology, science, geography, literature, and more. However, manual data collection and classification can be time-consuming and prone to errors. Addition- ally, AI assistants used for drafting or writing can sometimes be corrected regarding writing style and inappropriate language for the given con- text. Faced with these needs, In this article, Vietnamese documents are classified using the TF-IDF method, TF-IDF combined with SVD, and FastText at three levels: word level, n-gram level, and character level. For this approach, 15 categories were gathered from various online news sources. The dataset was preprocessed and trained using machine learn- ing models such as SVM, Naive Bayes, Neural Network, and Random Forest to find the most effective method. The Random Forest combined with the FastText method was highly evaluated, achieving a success rate of 82% when measured against essential evaluation criteria of accuracy, precision, and F1 score.

Các bài báo khác

Climate risks and resilience in urbanizing areas of the Vietnamese Mekong Delta: future action-orientated research needs

Số tạp chí Edward Park(2024) Trang: 131-156

Tác giả: Nigel Keith Downes, Nguyễn Đình Giang Nam, Văn Phạm Đăng Trí, Huỳnh Văn Đà, Nguyễn Ánh Minh, Vo Dao Chi, Le Thanh Sang, Phạm Thanh Vũ, Bao Thanh

Tạp chí: The Mekong Delta Environmental Research Guidebook

Tóm tắt

Feminist Approaches to Situated Knowledge Production: Urban Flood Management in Can Tho City, Vietnam

Số tạp chí in Eward Park, Ho Huu Loc, & Dung Duc Tran(2024) Trang: 231 - 259

Tác giả: Lý Quốc Đẳng, Nozomi Kawarazuka

Tạp chí: The Mekong Delta Environmental Research Guidebook

Tóm tắt

An Approach for Object Recognition in Videos for Vocabulary Extraction

Số tạp chí Ozgur AkanPaolo BellavistaJiannong CaoGeoffrey CoulsonFalko DresslerDomenico FerrariMario GerlaHisashi KobayashiSergio PalazzoSartaj SahniXuemin ShenMircea StanXiaohua JiaAlbert Y. Zomaya(2024) Trang: 36–51

Tác giả: Nguyễn Thanh Hải, Anh Bao Nguyen Le, Chi Bao Nguyen, Quoc Cuong Dang, Be Hai Danh, Huynh Nhu Le, Lương Hoàng Hướng

Tạp chí: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

Tóm tắt

Quantification of ecosystem benifits of community plantation and ots impact on well-bening

Số tạp chí Shamik Chakraborty, Amit Chatterjee, Pankaj Kumar(2024) Trang:

Tác giả: Pankai Kumar, Huỳnh Vương Thu Minh, Tomoki Yagasaki, Gowhar Meraj, Shamik Chakraborty, Rajarshi Dasgupta, Amit Chatterjee, Binaya Kumar Mishra, Ram Avtar, Osamu Saito, Kazuhiko Takeuchi

Tạp chí: Urban Water Ecosystems in Africa and Asia

Tóm tắt

Building resilience to climate change through water retention solutions in Ca Mau City, Vietnam

Số tạp chí Shamik Chakraborty, Amit Chatterjee, Pankaj Kumar(2024) Trang:

Tác giả: Huỳnh Vương Thu Minh, Lê Anh Tuấn, Nguyễn Đình Giang Nam, Trần Văn Tỷ, Kim Lavane, Pankai Kumar, Nigel Keith Downes

Tạp chí: Urban Water Ecosystems in Africa and Asia

Tóm tắt

Rising Waters , Stagnant Paths: Gendered Experiences of Flooding and Restricted Mobility in Can Tho City, Vietnam

Số tạp chí 28 August 2024(2024) Trang: 69-88

Tác giả: Danang Nizar, Lý Quốc Đẳng

Tạp chí: Climate-Related Human Mobility in Asia and the Pacific

Tóm tắt

Deep Learning for Fashion Consulting

Số tạp chí In: Thai-Nghe, N., Do, TN., Penferhat, S (eds),. Intelligent Systems and Data Science. ISDS 2024(2024) Trang: 59-66

Tác giả: Trương Quốc Định, Nguyễn Bá Duy, Dinh Thanh Nhan, Phạm Thị Diễm

Tạp chí: Communications in Computer and Information Science

Tóm tắt

An Approach to Instrumental Song Classification Utilizing Spectrogram and Convolutional Neural Networks

Số tạp chí 543(2024) Trang: 221–233

Tác giả: Anh Tuan Le, Nguyễn Thanh Hải, Nguyễn Thị Thanh Hiền, Nguyễn Hữu Hòa

Tạp chí: Studies in Systems, Decision and Control

Tóm tắt

Promoting STEM-Integrated Learning Through Engineering Design: High School Students’ Automatic Hand Washers

Số tạp chí 543(2024) Trang: 93–110

Tác giả: Pham Thi Thuy Linh, Nguyễn Thanh Hải, Nguyễn Hữu Hòa, Long Diep, N. C. Thi Tran

Tạp chí: Studies in Systems, Decision and Control

Tóm tắt

Feature Selection Based on Ranking Metagenomic Relative Abundance for Inflammatory Bowel Disease Prediction

Số tạp chí 87(2024) Trang: 94–105

Tác giả: Nguyễn Thị Thanh Hiền, Lê Nguyễn Hạt, Nguyễn Thanh Hải

Tạp chí: Lecture Notes on Data Engineering and Communications Technologies

Tóm tắt

Fake Face Recognition on Images Generated by Various Deepfakes Tools

Số tạp chí 14479(2024) Trang: pp 51- 62

Tác giả: Nguyễn Lê Bảo Anh, Nguyễn Thị Thanh Hiền, Sử Kim Anh, Nguyễn Thanh Hải

Tạp chí: Computational Data and Social Networks

Tóm tắt

Comparative Analysis of Fine-Tuned MobileNet Versions on Fish Disease Detection

Số tạp chí 214(2024) Trang: 201–212

Tác giả: Nguyễn Văn Hiền, Huỳnh Quốc Thịnh, Nguyễn Minh Nhật, Sử Kim Anh, Nguyễn Thanh Hải

Tạp chí: Lecture Notes on Data Engineering and Communications Technologies

Tóm tắt

Deep Residual Networks for Pigmented Skin Lesions Diagnosis

Số tạp chí 14748(2024) Trang: 323–334

Tác giả: Nguyễn Thanh Hải, Pham Thi Thuy Linh, Phạm Thị Ngọc Diễm, Trần Thanh Điền, Chau Ngoc Ha

Tạp chí: Lecture Notes in Computer Science

Tóm tắt

Interpreting Results of VGG-16 for COVID-19 Diagnosis on CT Images

Số tạp chí Ngoc Thanh Nguyen · Bogdan Franczyk · André Ludwig · Manuel Núñez · Jan Treur · Gottfried Vossen · Adrianna Kozierkiewicz(2024) Trang: 157-169

Tác giả: Nguyễn Thanh Hải, Huỳnh Ngọc Tuyết, Phan Tấn Tài, Hoang Thanh Huynh, Kha Van Nguyen, Phạm Huỳnh Ngọc

Tạp chí: Lecture Notes in Computer Science

Tóm tắt

XÂY DỰNG HÀNH CHÍNH ĐIỆN TỬ VÀ NHỮNG TÁC ĐỘNG TỚI SỰ PHÁT TRIỂN BỀN VỮNG ĐỒNG BẰNG SÔNG CỬU LONG

Số tạp chí PGS. TS Nguyễn Chí Ngôn(2024) Trang: 346-369

Tác giả: Lê Hoàng Thảo, Dương Nguyễn Phú Cường, Phạm Đăng Khôi

Tạp chí: Công nghệ kỹ thuật và công nghệ thông tin trong tiến trình công nghiệp hóa - hiện đại hóa Đồng bằng Sông Cửu Long

Tóm tắt

Việc làm, thu nhập và chất lượng cuộc sống

Số tạp chí Trong Đặng Kiều Nhân và Nguyễn Ánh Minh(2024) Trang: 225-244

Tác giả: Nguyễn Thanh Bình, Lê Kim Ngân, Nguyễn Ánh Minh, Đặng Kiều Nhân

Tạp chí: Đặc trưng và đổi mới kinh tế - xã hội - văn hóa của đồng bằng sông Cửu Long trong bối cảnh mới

Tóm tắt

Child Abuse Behaviors Identification from Surveillance Videos

Số tạp chí In: Leonard Barolli(2024) Trang: 106-118

Tác giả: Phạm Thị Ngọc Diễm, Phan Bá Đại Phúc, Trần Thanh Điền

Tạp chí: Lecture Notes on Data Engineering and Communications Technologies

Tóm tắt

Hyperparameter Tuning on Classical Machine Learning Models in Orthopedic Disease Prediction on Biomechanical Features

Số tạp chí Leonard Barolli(2024) Trang: 48-59

Tác giả: Nguyễn Thanh Hải, Phan Tấn Tài, Nguyen Minh Hong, Pham Thi Bich Nhu, Pham Thi Thuy Linh

Tạp chí: Lecture Notes on Data Engineering and Communications Technologies

Tóm tắt

Experimental Study on Spectrometric Features of Mud Crabs for Automatic Internal Quality Grading

Số tạp chí 2191(2024) Trang: 3-14

Tác giả: Võ Hải Đăng, Trần Nhựt Thanh, Masayuki Fukuzawa

Tạp chí: Communications in Computer and Information Science

Tóm tắt

Machine Learning-Based Acoustic System for Maturity Classification of Durian Fruit Before Harvesting

Số tạp chí Thai-Nghe, N., Do, TN., Benferhat, S.(2024) Trang: 33-46

Tác giả: Huu-Phuoc Nguyen, Viet-Lam Huynh, Thanh-Phong Duong, Nguyễn Chánh Nghiệm, Trần Nhựt Thanh

Tạp chí: Communications in Computer and Information Science

Tóm tắt

An Embedded System for Eggs Freshness Detection

Số tạp chí Thai-Nghe, N., Do, TN., Benferhat, S(2024) Trang: 47-58

Tác giả: Quoc-Hung Pham, Thanh-Nhan Nguyen, Huy-Hoang Vo, Duy-Khanh Nguyen, Nhat-Tan Pham, Trần Nhựt Thanh

Tạp chí: Communications in Computer and Information Science

Tóm tắt

An IoT-Based Indoor Fire Early Warning System Using Far Infrared Thermal Sensors

Số tạp chí In: Hamdan, A.(2024) Trang: 295-306

Tác giả: Lương Vinh Quốc Danh, Lê Hoàng Thảo, Trần Hữu Danh, Cù Vĩnh Lộc, Trương Xuân Việt, Trần Nhựt Khải Hoàn, Lê Thành Phiêu, Nguyễn Chí Ngôn

Tạp chí: Studies in Big Data

Tóm tắt

Fake Face Detection with Separable Convolutions

Số tạp chí Nguyen Hoang Phuong Nguyen Thi Huyen Chau Vladik Kreinovich(2024) Trang: 135-147

Tác giả: Nguyễn Thanh Hải, Nguyen Tien Dat, Trần Thanh Thiên, Nguyễn Hữu Hòa, Nguyễn Thái Nghe

Tạp chí: Studies in Systems, Decision and Control

Tóm tắt

The Genetic Algorithm and its Application in Calculating the Kinetic Parameters of the Thermoluminescence Curve

Số tạp chí Yann-Henri Chemin(2024) Trang: 73-85

Tác giả: Nguyễn Duy Sang

Tạp chí: Genetic Algorithms Theory, Design and Programming

Tóm tắt

The Soft Power Impact of China in Strategic Competition With the United States in Vietnam

Số tạp chí Mohamad Zreik(2024) Trang: 314-331

Tác giả: Lê Hoàng Kiệt, Nguyễn Ánh Minh, Trần Xuân Hiệp, Nguyễn Hữu Phúc

Tạp chí: Soft Power and Diplomatic Strategies in Asia and the Middle East

Tóm tắt

Vietnamese University Lecturers’ Experiences and Perspectives on Student Assessment within the Outcome-Based Education Framework

Số tạp chí Hoang-Yen Phuong and Thanh-Thao Le(2024) Trang: 1-20

Tác giả: Phương Hoàng Yến, Nguyễn Anh Thi, Nguyễn Hương Trà, Huỳnh Thị Anh Thư, Phạm Trút Thùy, Lê Thanh Thảo

Tạp chí: Exploring Teacher Beliefs: INSIGHTS AND PERSPECTIVES

Tóm tắt

Summative or Formative Assessment? Diversity in EFL Learners’ Perspectives in Teachers’ Assessment Practices

Số tạp chí Thao Quoc Tran and Tham My Duong(2024) Trang: 1-20

Tác giả: Phương Hoàng Yến, Nguyễn Anh Thi, Lê Thanh Thảo, Nguyễn Hương Trà, Huỳnh Thị Anh Thư, Phạm Trút Thùy

Tạp chí: Addressing Issues of Learner Diversity in English Language Education

Tóm tắt

Biodiversity in the Mekong River Basin: Dynamics of planktonic communities and fishes in the Vietnamese Mekong Delta

Số tạp chí Eric Wolanski(2024) Trang: 355-392

Tác giả: Trần Đắc Định, Vũ Ngọc Út, Trần Xuân Lợi, Đinh Minh Quang, Dương Văn Ni

Tạp chí: The Mekong River basin

Tóm tắt

FIRMS, CONTEXT, AND BRIBERY IN A TRANSITION ECONOMY

Số tạp chí Cheng-Few Lee and Min-Teh Yu(2024) Trang: 287-312

Tác giả: Phan Anh Tú

Tạp chí: Advances in Pacific Basin Business, Economics and Finance

Tóm tắt

1 2 Tiếp Cuối

Vietnamese | English

Tạp chí khoa học Trường Đại học Cần Thơ
Khu II, Đại học Cần Thơ, Đường 3/2, Phường Ninh Kiều, Thành phố Cần Thơ, Việt Nam
Điện thoại: (0292) 3 872 157; Email: tapchidhct@ctu.edu.vn

Chương trình chạy tốt nhất trên trình duyệt IE 9+ & FF 16+, độ phân giải màn hình 1024x768 trở lên

Vui lòng chờ...