Automatically describing the content of an image plays an essential role in numerous applications, such as providing more accurate and detailed descriptions in image-based information search or video surveillance systems. The technique learns from image-caption pairs to generate picture captions that are generally semantically informative and grammatically correct. Humans use natural language to describe scenes because it is short and compact; machine vision systems, on the other hand, characterize a scene by capturing an image as a two-dimensional array. The idea is to embed images and captions in a common space and then map from the image to sentences. This study proposes a merge model that combines the image vector and the partial caption. It is implemented in three steps: processing the word sequence from the text, extracting the feature vector from the image, and decoding the output by concatenating the two preceding layers. To evaluate model performance, we generate multiple candidate sentences with Beam Search and score them with BLEU. The experiments show that the method can generate captions with relatively accurate content while requiring less training memory. We use the Flickr8k dataset, which consists of 8,000 images, each paired with five different captions that provide precise descriptions of the salient entities and events. For training we use 6,000 images, with 1,000 for testing and 1,000 for development; the accompanying Flickr8k text corpus includes text files listing the training and test splits. The best results were obtained when evaluating with BLEU-1: Greedy search and Beam Search with k = 5 or k = 7 all scored above 60.
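To make the three-step merge architecture concrete, below is a minimal sketch in Keras/TensorFlow. The 4096-dimensional image feature, the 256-unit layer sizes, and the vocab_size and max_length values are illustrative assumptions, not values taken from this study.

```python
# Minimal sketch of a merge model: an image branch and a partial-caption
# branch are concatenated and decoded into a next-word distribution.
# All dimensions below are illustrative placeholders.
from tensorflow.keras.layers import (Input, Dense, LSTM, Embedding,
                                     Dropout, Concatenate)
from tensorflow.keras.models import Model

vocab_size = 7579   # hypothetical vocabulary size
max_length = 34     # hypothetical maximum caption length in words

# Step 1: image branch, fed a precomputed CNN feature vector (assumed 4096-d)
image_input = Input(shape=(4096,))
fe1 = Dropout(0.5)(image_input)
fe2 = Dense(256, activation='relu')(fe1)

# Step 2: text branch, fed the partial caption as a padded word-id sequence
caption_input = Input(shape=(max_length,))
se1 = Embedding(vocab_size, 256, mask_zero=True)(caption_input)
se2 = Dropout(0.5)(se1)
se3 = LSTM(256)(se2)

# Step 3: decoder, concatenating the two branches and predicting the next word
merged = Concatenate()([fe2, se3])
decoder = Dense(256, activation='relu')(merged)
output = Dense(vocab_size, activation='softmax')(decoder)

model = Model(inputs=[image_input, caption_input], outputs=output)
model.compile(loss='categorical_crossentropy', optimizer='adam')
```

The model is trained on (image feature, partial caption, next word) triples, so a single caption of n words yields n training examples for its image.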
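Caption generation then expands the most promising partial captions step by step. The following is a hedged sketch of Beam Search over a trained next-word model; `predict_step`, `start_id`, and `end_id` are assumed interfaces standing in for the model's prediction call and the special start/end tokens, not names from the paper.

```python
# Beam Search sketch: keep the k most probable partial captions at each step.
import numpy as np

def beam_search_caption(predict_step, start_id, end_id, max_length, k=5):
    """predict_step(seq) is assumed to return a probability distribution
    over the vocabulary for the next word given the partial caption seq."""
    beams = [([start_id], 0.0)]  # (token sequence, cumulative log-probability)
    for _ in range(max_length):
        candidates = []
        for seq, score in beams:
            if seq[-1] == end_id:          # finished captions carry over
                candidates.append((seq, score))
                continue
            probs = predict_step(seq)
            for wid in np.argsort(probs)[-k:]:   # k most probable next words
                candidates.append((seq + [int(wid)],
                                   score + np.log(probs[wid] + 1e-12)))
        # retain only the k best hypotheses for the next step
        beams = sorted(candidates, key=lambda b: b[1], reverse=True)[:k]
    return beams[0][0]
```

Greedy search is the special case k = 1; larger beams such as k = 5 or k = 7 explore more candidate captions at a higher computational cost.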
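For evaluation, BLEU-1 compares the generated captions against the five reference captions per image using unigram precision. A small example with NLTK's corpus-level BLEU follows; the tokenized sentences are invented placeholders, not data from Flickr8k.

```python
# Scoring generated captions with corpus-level BLEU-1 via NLTK.
from nltk.translate.bleu_score import corpus_bleu

# Each image has a list of tokenized reference captions (placeholders here).
references = [[['a', 'dog', 'runs', 'on', 'the', 'grass'],
               ['a', 'brown', 'dog', 'is', 'running', 'outside']]]
hypotheses = [['the', 'dog', 'is', 'running', 'in', 'grass']]

# BLEU-1 places all weight on unigram precision.
bleu1 = corpus_bleu(references, hypotheses, weights=(1.0, 0.0, 0.0, 0.0))
print(f"BLEU-1: {bleu1 * 100:.1f}")
```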