Đăng nhập
 
Tìm kiếm nâng cao
 
Tên bài báo
Tác giả
Năm xuất bản
Tóm tắt
Lĩnh vực
Phân loại
Số tạp chí
 

Bản tin định kỳ
Báo cáo thường niên
Tạp chí khoa học ĐHCT
Tạp chí tiếng anh ĐHCT
Tạp chí trong nước
Tạp chí quốc tế
Kỷ yếu HN trong nước
Kỷ yếu HN quốc tế
Book chapter
Book chapter 2022
Số tạp chí Nguyen Hoang Phuong, Vladik Kreinovich(2022) Trang: 225–238
Tạp chí: Studies in Computational Intelligence

Automatically describing the content of an image holds an essential role in numerous applications. Some practical applications include providing more accurate and detailed images or videos in scenarios like information search by image or video surveillance systems. The technique generates picture captions that are generally semantically informative and grammatically accurate by acquiring image and caption pairings. Humans use natural languages to describe scenes because they are short and compact. On the other hand, machine vision systems characterize the scene by capturing an image that is a two-dimensional array. The concept is to combine the image and captions into one area and then map from the image to the sentences. This study proposes a merge model to combine the image vector and the partial caption. It can be implemented through three steps: processing the sequence from the text, extracting the feature vector from the image, and decoding the output by concatenating the above two layers. Besides that, to evaluate model performance, we generate multiple sentences using Beam Search and BLEU. The experiments show that the method can generate captions with relatively accurate content and less training memory. We use the Flickr8k dataset consisting of 8000 images paired with five different captions, which provide precise descriptions of the salient entities and events. For training, we use 6000 images, 1000 for test and 1000 for development, while Flickr8k text includes text files describing train set, test set. The best results were obtained when testing the model with BLEU-1, Greedy, and Beam Search with k == 5 or k == 7 all over 60 in BLEU scores.

Các bài báo khác
Số tạp chí Ngoc Thanh Nguyen, Tien Khoa Tran, Ualsher Tukayev, Tzung-Pei Hong, Bogdan Trawi´nski, Edward Szczerbicki(2022) Trang: 302-312
Tạp chí: Intelligent Information and Database Systems
Số tạp chí 124(2022) Trang: 85-96
Tạp chí: Lecture Notes on Data Engineering and Communications Technologies
Số tạp chí 1688(2022) Trang: 462-476
Tạp chí: Communications in Computer and Information Science
Số tạp chí In Tran Khanh Dang · Josef Küng · Tai M. Chung (Eds.)(2022) Trang: 588-600
Tạp chí: Communications in Computer and Information Science
Số tạp chí Health Information Science 11th International Conference, HIS 2022 Virtual Event, October 28–30, 2022 Proceedings(2022) Trang: 157–164
Tạp chí: Lecture Notes in Computer Science
Số tạp chí Tran Khanh Dang, Josef Küng, Tai M. Chung(2022) Trang: 714-722
Tạp chí: Communications in Computer and Information Science
Số tạp chí Tran Khanh Dang, Josef Küng, Tai M. Chung(2022) Trang: 377-392
Tạp chí: Communications in Computer and Information Science
Số tạp chí In Abrar Ullah · Sajid Anwar · Álvaro Rocha · Steve Gill(2022) Trang: 449–460
Tạp chí: Lecture Notes in Networks and Systems
Số tạp chí In Hamido Fujita · Philippe Fournier-Viger · Moonis Ali · Yinglin Wang(2022) Trang: 737-746
Tạp chí: Lecture Notes in Computer Science
Số tạp chí Tran Khanh Dang·Josef Küng·Tai M. Chung(2022) Trang: 131-144
Tạp chí: Communications in Computer and Information Science
Số tạp chí Ngoc Le Anh, Seok-Joo Koh, Thi Dieu Linh Nguyen, Jaime Lloret, Thanh Tung Nguyen(2022) Trang: 669–678
Tạp chí: Lecture Notes in Networks and Systems
Số tạp chí Ngoc Le Anh, Seok-Joo Koh, Thi Dieu Linh Nguyen, Jaime Lloret, Thanh Tung Nguyen(2022) Trang: 279–286
Tạp chí: Lecture Notes in Networks and Systems
Số tạp chí Ngoc Le Anh, Seok-Joo Koh, Thi Dieu Linh Nguyen, Jaime Lloret, Thanh Tung Nguyen(2022) Trang: 402–409
Tạp chí: Lecture Notes in Networks and Systems
Số tạp chí Hamido Fujita, Philippe Fournier-Viger, Moonis Ali, Yinglin Wang(2022) Trang: 785–796
Tạp chí: Lecture Notes in Computer Science
Số tạp chí Costin Bădică, Jan Treur, Djamal Benslimane, Bogumiła Hnatkowska, Marek Krótkiewicz(2022) Trang: 317–329
Tạp chí: Communications in Computer and Information Science
Số tạp chí Nguyen Hoang Phuong, Vladik Kreinovich(2022) Trang: 179–190
Tạp chí: Studies in Computational Intelligence
Số tạp chí Nguyen Hoang Phuong, Vladik Kreinovich(2022) Trang: 213–223
Tạp chí: Studies in Computational Intelligence
Số tạp chí Nguyen Hoang Phuong, Vladik Kreinovich(2022) Trang: 239–250
Tạp chí: Studies in Computational Intelligence
Số tạp chí Rosdiazli Ibrahim, K. Porkumaran, Ramani Kannan, Nursyarizal Mohd Nor, S. Prabakar(2022) Trang: 1073–1084
Tạp chí: Lecture Notes in Electrical Engineering
Số tạp chí Ngoc Hoang Thanh Dang, Yu-Dong Zhang, João Manuel R. S. Tavares, Bo-Hao Chen(2022) Trang: 659-670
Tạp chí: Artificial Intelligence in Data and Big Data Processing
Số tạp chí Hamido Fujita Philippe Fournier-Viger Moonis Ali Yinglin Wang (Eds.)(2022) Trang: 173-183
Tạp chí: Lecture Notes in Computer Science
Số tạp chí Hamido Fujita, Yutaka Watanobe, Takuya Azumi(2022) Trang: 499-506
Tạp chí: Frontiers in Artificial Intelligence and Applications
Số tạp chí Tran Khanh Dang, Josef Küng, and Tai M. Chung(2022) Trang: 706-713
Tạp chí: Future Data and Security Engineering Big Data, Security and Privacy, Smart City and Industry 4.0 Applications
Số tạp chí Tran Khanh Dang, Josef Küng, and Tai M. Chung(2022) Trang: 419-431
Tạp chí: Future Data and Security Engineering Big Data, Security and Privacy, Smart City and Industry 4.0 Applications


Vietnamese | English






 
 
Vui lòng chờ...