Bui, Anh Dai, Quoc Trung Nguyen, Thanh Nha Tran, and Viet Hung Nguyen. “AN IMPROVED MULTI-VISION CONTEXTUAL ATTENTION MODEL FOR VIETNAMESE VISUAL-BASED QUESTION ANSWERING”. Ho Chi Minh City University of Education Journal of Science 22, no. 2 (February 28, 2025): 247–259. Accessed April 22, 2026. https://journal.hcmue.edu.vn/index.php/hcmuejos/article/view/4328.