An Approach for Web Content Classification with FastText

Hướng dẫn

Tìm kiếm nâng cao

Tựa bài viết

Tìm

Tác giả

Năm xuất bản

Tóm tắt

Lĩnh vực

Phân loại

Số tạp chí

Bản tin định kỳ

Báo cáo thường niên

Tạp chí khoa học ĐHCT

Tạp chí tiếng anh ĐHCT

Tạp chí trong nước

Tạp chí quốc tế

Kỷ yếu HN trong nước

Kỷ yếu HN quốc tế

Book chapter

An Approach for Web Content Classification with FastText

5 (2024) Trang: 765

Tác giả: Lương Hoàng Hướng, Lan Thu Thi Le, Nguyễn Thanh Hải

Tạp chí: Lecture Notes in Computer Science

Liên kết: https://doi.org/10.1007/s42979-024-03155-y

Tóm tắt

Nowadays, with the Internet infrastructure and nearly global access, the amount and diversity of data are increasing rapidly. Many tasks require information retrieval and data collection for machine learn- ing, research, and survey reports in various fields such as meteorology, science, geography, literature, and more. However, manual data collection and classification can be time-consuming and prone to errors. Addition- ally, AI assistants used for drafting or writing can sometimes be corrected regarding writing style and inappropriate language for the given con- text. Faced with these needs, In this article, Vietnamese documents are classified using the TF-IDF method, TF-IDF combined with SVD, and FastText at three levels: word level, n-gram level, and character level. For this approach, 15 categories were gathered from various online news sources. The dataset was preprocessed and trained using machine learn- ing models such as SVM, Naive Bayes, Neural Network, and Random Forest to find the most effective method. The Random Forest combined with the FastText method was highly evaluated, achieving a success rate of 82% when measured against essential evaluation criteria of accuracy, precision, and F1 score.

Các bài báo khác

ACTIVE LEARNING ON FINE-TUNED RESNET101 FOR BREAST CANCER PREDICTION IMPROVEMENT

(2024) Trang: 36-43

Tác giả: Lương Hoàng Hướng, Nguyễn Thanh Hải, Nguyễn Thái Nghe

Tạp chí: Hội nghị khoa học quốc gia lần thứ XVII về Nghiên cứu cơ bản và ứng dụng Công nghệ thông tin

Tóm tắt

BCCNetAttention: Enhancing Breast Cancer Report Through Image Captioning with Convolution Neural Network and Transformers Architecture

15417 (2025) Trang: 188–199

Tác giả: Lương Hoàng Hướng, Nguyễn Thái Nghe, Nguyễn Thanh Hải

Tạp chí: Lecture Notes in Computer Science

Tóm tắt

EfficientPhoCaption: An Enhanced PhoBERT-Based Image Captioning Framework for Breast Cancer Diagnosis

6 (2025) Trang: 1-14

Tác giả: Lương Hoàng Hướng, Nguyễn Thanh Hải, Nguyễn Thái Nghe

Tạp chí: SN Computer Science

Tóm tắt

Utilising Unet3+ for Tooth Segmentation on X-Ray Image

1863 (2023) Trang: 181–192

Tác giả: Lương Hoàng Hướng, Duy Khanh Nguyen, Hao Van Tran, Phuc Tan Huynh, Bang Do Huu Dang, Dat Tuan Ly, Nguyễn Thanh Hải

Tạp chí: Communications in Computer and Information Science

Tóm tắt

Toward Supporting Breast Cancer Diagnosis Based on Captioning Mammogram and Ultrasound Images

(2024) Trang: 71-85

Tác giả: Lương Hoàng Hướng, Nguyễn Thanh Hải, Nguyễn Thái Nghe

Tạp chí: Communications in Computer and Information Science

Tóm tắt

Detection and classification of breast cancer in mammographic images with fine-tuned convolutional neural networks

8 (2024) Trang: 1-28

Tác giả: Lương Hoàng Hướng, Nguyễn Thanh Hải, Nguyễn Thái Nghe

Tạp chí: Journal of Information and Telecommunication (JIT)

Tóm tắt

A Combination of Active Learning and Deep Learning for Improving Breast Cancer Prediction

In: Nghia, P.T., Thai, V.D., Thuy, N.T., Son, L.H., Huynh, VN. (eds) (2023) Trang: 3-10

Tác giả: Lương Hoàng Hướng, Nguyễn Thanh Hải, Nguyễn Thái Nghe

Tạp chí: Lecture Notes in Networks and Systems

Tóm tắt

Fine-Tuning MobileNet for Breast Cancer Diagnosis

563 (2023) Trang: 841–856

Tác giả: Lương Hoàng Hướng, Nghia Trong, Toai Dinh, Thuan Dang, Tong Nguyen, Nguyễn Thanh Hải, Tin Duong

Tạp chí: Lecture Notes in Networks and Systems book series

Tóm tắt

Fine-Tuning VGG16 for Alzheimer’s Disease Diagnosis

176 (2023) Trang: 68–79

Tác giả: Lương Hoàng Hướng, Phong vo, Hau Phan, Nam Tran, Hung Le, Nguyễn Thanh Hải

Tạp chí: Lecture Notes on Data Engineering and Communications Technologies book series

Tóm tắt

Vietnamese | English

Tạp chí khoa học Trường Đại học Cần Thơ
Khu II, Đại học Cần Thơ, Đường 3/2, Phường Ninh Kiều, Thành phố Cần Thơ, Việt Nam
Điện thoại: (0292) 3 872 157; Email: tapchidhct@ctu.edu.vn

Chương trình chạy tốt nhất trên trình duyệt IE 9+ & FF 16+, độ phân giải màn hình 1024x768 trở lên

Vui lòng chờ...