English is the most common language globally, and it is increasingly important. English has been compiled in most online docu- ments, information, and contents. However, with a considerable vocabu- lary, learning English is difficult for many people to remember. Therefore, many modern technologies have been proposed to support English learn- ing, such as English learning technology through word-matching games to help children become excited and easily approach English from an early age. In addition, translation tools can help users look up vocabularies, antonyms, synonyms, and examples. This study presents a method to support learning English via object detection in videos, images, or even live-stream videos in real-time using deep learning architectures such as You Look Only Once (YOLO) - one of the finest families of object detec- tion models with state-of-the-art performances. The method to obtain an mAP is 55.6 with 17GFlops. The results are vocabulary, meaning, and making sentences with that. Our method has good accuracy in data of 2786 images belonging to 59 classes.
Số tạp chí Ngoc Thanh Nguyen · Bogdan Franczyk · André Ludwig · Manuel Núñez · Jan Treur · Gottfried Vossen · Adrianna Kozierkiewicz(2024) Trang: 157-169
Tạp chí khoa học Trường Đại học Cần Thơ
Lầu 4, Nhà Điều Hành, Khu II, đường 3/2, P. Xuân Khánh, Q. Ninh Kiều, TP. Cần Thơ
Điện thoại: (0292) 3 872 157; Email: tapchidhct@ctu.edu.vn
Chương trình chạy tốt nhất trên trình duyệt IE 9+ & FF 16+, độ phân giải màn hình 1024x768 trở lên