Publications

I am an early-stage researcher working at the intersection of theory and practice in modern NLP systems. My work is driven by the goal of building research artifacts that are scientifically rigorous, empirically sound, and practically grounded, with careful evaluation across diverse real-world scenarios. I actively work toward producing high-quality research contributions and contributing to the NLP community through publications at leading international venues.

2026
  1. Chu Minh Tam, Pham Le Ngoc Ngan, Hua Tue Minh, Nguyen Xuan An, Nguyen Phuc Thinh, Le Ngoc Hung Dung, Nguyen Song Thien Long, Bui Cong Tuan, and Quan Thanh Tho. BKAlign: A Context-Enriched Semantic Ranking Framework for Vietnamese Entity-Relation Alignment with Knowledge Graph Constraints. FISU Joint Conference on Artificial Intelligence (FJCAI), 2026. (National Conference)
  2. Long S. T. Nguyen*, Quan M. Bui*, Tin T. Ngo, Quynh T. N. Vo, Dung N. H. Le, and Tho T. Quan. ViHERMES: A Graph-Grounded Multihop Question Answering Benchmark and System for Vietnamese Healthcare Regulations. Asian Conference on Intelligent Information and Database Systems (ACIIDS), 2026. (B-ranked Conference) [preprint]
  3. Long S. T. Nguyen and Tho T. Quan. Which Works Best for Vietnamese? A Practical Study of Information Retrieval Methods across Domains. Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2026. (A-ranked Conference)
  4. Hung Luu, Long S. T. Nguyen, Trung Pham, Hieu Pham, and Tho Quan. HiGraAgent: Dual-Agent Adaptive Reasoning over Hierarchical Knowledge Graph for Open-Domain Multi-hop Question Answering. Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2026. (A-ranked Conference)
  5. Long Nguyen, Duc Nguyen, Quan Bui, Dung Phan, Dung Le, Khanh Nguyen, Quynh Vo, Khang Vo, Nam Duong, Anh Dinh, Tri Trinh, Chi Phan, An Nguyen, Thai Nguyen, Dang Le, Vinh Dang, and Tho Quan. Catching the First Light of Tomorrow: A Hackathon-Based Framework for Introducing High School Students to AI Agents. Symposium on Educational Advances in Artificial Intelligence (EAAI), co-located with AAAI, 2026. (Symposium at A*-ranked Conference)
  6. Dung T. Phan*, Chi N. L. Phan*, Long S. T. Nguyen*, Phuc T. Dao, Quan M. Bui, Tin T. Ngo, Thi T. Nguyen, and Tho T. Quan. MAFIA-NeT: Multi-Agent Framework for Interactive Agricultural Negotiation and Trading Systems. International Conference on Agents and Artificial Intelligence (ICAART), 2026. (B-ranked Conference)
2025
  1. Hua Phuoc Truong, Ha Xuan Son, Le Ngoc Hung Dung, Vu Huy Hoang, Nguyen Song Thien Long, and Quan Thanh Tho. Preventing Large-Scale Internet Broadcast Signal Piracy Using Digital Watermarking. National Conference on Electronics, Communications and Information Technology (REV-ECIT), 2025. (National Conference) [pdf]
  2. Long S. T. Nguyen, Quynh T. N. Vo, Thi T. Nguyen, and Tho T. Quan. URAG 2.0: An Agentic Dual Retrieval Framework for Enhanced Reasoning in RAG-based QA Systems. Symposium on Information and Communication Technology (SoICT), 2025.
  3. Chien Vu Manh, Bao Anh Tran, Viet Phuong Ngo, Luan Le Chi, Anh Quang Nguyen, Long S. T. Nguyen, and Anh Nguyen-Duc. An Empirical Study of Multi-Agent RAG for Real-World University Admissions Counseling. Symposium on Information and Communication Technology (SoICT), 2025.
  4. Hieu M. Pham, Trung M. Pham, Vi K. Nguyen, Long S. T. Nguyen, Truong D. Tran, Duc Q. Nguyen, Tuong H. Nguyen, Long H. K. Nguyen, Huong T. T. Ha, and Tho T. Quan. GAFB-MKL: Adaptive Filter Banks via Genetic Algorithm and Sparse Multiple Kernel Learning for EEG-based Motor Imagery Classification. Symposium on Information and Communication Technology (SoICT), 2025.
  5. Tai Q. To*, Long S. T. Nguyen*, Hung C. Nguyen, Nguyen B. Le, Tam M. Nguyen, Tung T. Nguyen, Chattrakul Sombattheera, and Tho T. Quan. AI Conferences Made Easy by AI - A Case Study at MIWAI 2025. IEEE-RIVF International Conference on Computing and Communication Technologies (RIVF), 2025. [pdf] [bib] [website]
  6. Long S. T. Nguyen, Quynh T. N. Vo, Hung C. Luu, and Tho T. Quan. When in Doubt, Ask First: A Unified Retrieval Agent-Based System for Ambiguous and Unanswerable Question Answering. International Joint Conference on Natural Language Processing & Asia-Pacific Chapter of the Association for Computational Linguistics (IJCNLP-AACL), 2025. (B-ranked Conference) [pdf] [bib] [website]
  7. Long S. T. Nguyen, Hung C. Luu, Quynh T. N. Vo, Hy N. G. La, Hoai M. Tran, Anh T. D. Dinh, Tuan H. Nguyen, Tri N. Ho, and Tho T. Quan. Can Small Language Models Handle Vietnamese Legal Reasoning? Insights from Multi-Task Evaluation. Workshop on Vietnamese Language and Speech Processing (VLSP), at INLG, 2025. [pdf] [bib] [website]
  8. Vinh Q. Vo, Bao G. Quach, Quyen T. Bui, Khai Q. Truong, Long S. T. Nguyen, Fabien Baldacci, and Tho T. Quan. Towards Cost-Effective Voice Cloning System for Vietnamese TTS: A Case Study at HCMUT. Conference on Multi-disciplinary Trends in Artificial Intelligence (MIWAI), 2025. [pdf] [bib] [website]
  9. Trung M. Pham, Hieu M. Pham, Vi K. Nguyen, Truong D. Tran, Long S. T. Nguyen, Duc Q. Nguyen, Huong T. T. Ha, and Tho T. Quan. FA-GPNet: When Gaussian Process Meets Auto-Encoder and FBCSP - A Hybrid Model for Motor Imagery Classification. Conference on Multi-disciplinary Trends in Artificial Intelligence (MIWAI), 2025. [pdf] [bib] [website]
  10. Long S. T. Nguyen*, Khang H. N. Vo*, Thu H. A. Nguyen*, Tuan C. Bui, Duc Q. Nguyen, Thanh-Tung Tran, Anh D. Nguyen, Minh L. Nguyen, Fabien Baldacci, Thang H. Bui, Emanuel Di Nardo, Angelo Ciaramella, Son H. Le, Ihsan Ullah, Lorenzo Di Rocco, and Tho T. Quan. Bridging LLMs and Symbolic Reasoning in Educational QA Systems: Insights from the XAI Challenge at IJCNN 2025. Italian Conference on Big Data and Data Science (ITADATA), 2025. [pdf] [bib] [website]
  11. Tuan Bui, An Nguyen, Phat Thai, Minh Hua, Ngan L. N. Pham, Ngan T. B. Pham, Dung Le, Long Nguyen, Thanh-Tung Tran, Thang Bui, and Tho Quan. Formal Reasoning for Intelligent QA Systems: A Case Study in the Educational Domain. ACM Workshop on AI-powered Question & Answering Systems (AIQAM), at ACMMM, 2025. ️(Workshop at A*-ranked Conference) [pdf] [bib] [website]
  12. Long S. T. Nguyen, Truong P. Hua, Thanh M. Nguyen, Toan Q. Pham, Nam K. Ngo, An X. Nguyen, Nghi D. M. Pham, Nghia H. Nguyen, and Tho T. Quan. A Benchmark Dataset and Evaluation Framework for Vietnamese Large Language Models in Customer Support. International Conference on Computational Collective Intelligence (ICCCI), 2025. (B-ranked Conference) [pdf] [bib] [website]
  13. Duc Nguyen, Dong Le, Long Nguyen, Quyen Vu, Tran Le, Dung Nguyen, Nga Huynh, Huong Nguyen, Phat Tran, Dang Le, Sang Truong, Sanmi Koyejo, Cuong Le, and Tho Quan. Riding on The Back of A Whale: A Hackathon Framework for Introducing High School Students to Large Language Models. Conference on Artificial Intelligence in Education (AIED), 2025. (A-ranked Conference) [pdf] [bib] [website]
  14. Hung D. Bui, Phuc P. H. Nguyen, Vinh Q. Dang, Nga Huynh, Hung D. Vo, Long S. T. Nguyen, and Tho T. Quan. Auto-ARM: An Autonomous Adaptive Mask Refinement Mechanism for Enhancing Naturalness in Virtual Try-On Models. Conference on Information Technology and its Applications (CITA), 2025. [pdf] [bib] [website]
  15. Hung D. Vo, Long S. T. Nguyen, Tri H. Trinh, Khang H. N. Vo, and Tho T. Quan. VIEACT-TTS-VC: A Vietnamese End-to-End Text-to-Speech and Voice Conversion Framework Using Low-Resource Adaptable Style Transfer. Intelligent Systems Conference (IntelliSys), 2025. [pdf] [bib] [website]
  16. Long S. T. Nguyen, Tran T. B. Le, Huong P. N. Nguyen, Quynh T. N. Vo, Phong H. N. Nguyen, and Tho T. Quan. Serving the Underserved: Leveraging BARTBahnar Language Model for Bahnaric-Vietnamese Translation. Workshop on Language Models for Underserved Communities (LM4UC), at NAACL, 2025. (Workshop at A-ranked Conference) [pdf] [bib] [website]
2024
  1. Long S. T. Nguyen and Tho T. Quan. URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots – A Case Study at HCMUT. Symposium on Information and Communication Technology (SoICT), 2024. [pdf] [bib] [website]
  2. Long S. T. Nguyen*, Huy G. Nguyen*, Bao G. Khuu, Huy A. T. Luu, Huy Q. Le, Tuan T. Nguyen, and Tho T. Quan. RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval. Symposium on Information and Communication Technology (SoICT), 2024. [pdf] [bib] [website]
  3. Tran Ngoc Oanh, Bui Cong Tuan, Nguyen Viet Phuong, Ho Nguyen Ngoc Bao, Nguyen Song Thien Long, Bui Hoai Thang, and Quan Thanh Tho. Development of Intelligent Virtual Assistants Using Large Language Models to Support Academic Activities. Hong Bang International University Journal of Science, Special Issue on National Conference of 05/2024, 2024. (National Conference) [pdf] [bib][website]
  4. Tuan Bui, Oanh Tran, Phuong Nguyen, Bao Ho, Long Nguyen, Thang Bui, and Tho Quan. Cross-Data Knowledge Graph Construction for LLM-enabled Educational Question-Answering System: A Case Study at HCMUT. ACM Workshop on AI-Powered Q&A Systems for Multimedia (AIQAM), at ICMR, 2024. [pdf] [bib] [website]
non-archival
  1. Long S. T. Nguyen, Dat T. Truong, Nhan D. Tran, Quynh T. N. Vo, Quy T. Nguyen, and Tho T. Quan. Not All Data Augmentation Works: A Typology-Aware Study for Low-Resource Neural Machine Translation in Vietnamese Ethnic Minority Languages Workshop on Language Models for Underserved Communities (LM4UC), at AAAI, 2026. (Workshop at A*-ranked Conference) [pdf] [bib] [website]
  2. Thi Ty Nguyen, Phat T. Tran-Truong, Long S. T. Nguyen, Tan Sang Nguyen, and Tho T. Quan. Sentence-Aware Bahnaric-Vietnamese Lexical Mapping with Contrastive Contextual Representations. Workshop on Language Models for Underserved Communities (LM4UC), at AAAI, 2026. (Workshop at A*-ranked Conference) [pdf] [bib] [website]