About me

I am currently a research associate at the Forward Data Lab at UIUC, supervised by Professor Kevin Chen-chuan Chang. I am also fortunate to be supervised by Hy Truong Son, Anh Totti Nguyen and Thiago Serra. My research focuses on applications of NLP, particularly LLMs, in healthcare and information retrieval. I am also interested in quantifying and understanding the limitations of Vision-Language Models.

  1. LLMs in Information Retrieval: I am currently developing methods that use LLMs to bridge the semantic gap between complex queries and products.
  2. LLMs in Healthcare: I worked on improving speech summarization with LLM-generated synthetic data and on leveraging LLM reasoning to enhance the interpretability and performance of medical sentiment analysis.
  3. Vision-Language Models: Some tasks are simple for humans yet hard for VLMs. My current project aims to quantify, understand and close this performance gap.

In the summer of 2024, I was an intern on the Machine Learning Research team at CodaMetrix, where I worked on improving the performance of automated ICD-10 extreme multi-label classification systems.

Publications

♠ denotes equal contribution

Real-time Speech Summarization for Medical Conversations
Khai Le-Duc, Khai-Nguyen Nguyen, Long Vo-Dang, Truong-Son Hy
Interspeech 2024

Sentiment Reasoning for Healthcare
Khai-Nguyen Nguyen, Khai Le-Duc, Bach Phan Tat, Duy Le, Long Vo-Dang, Truong-Son Hy
Workshop on Advancements In Medical Foundation Models, NeurIPS 2024

Getting away with more network pruning: From sparsity to geometry and linear regions
Jeffrey Cai, Khai-Nguyen Nguyen, Nishant Shrestha, Aidan Good, Ruisen Tu, Xin Yu, Shandian Zhe, Thiago Serra
Workshop on Sparsity in Neural Networks, ICLR 2023

Medical Spoken Named Entity Recognition
Khai Le-Duc, David Thulke, Hung-Phong Tran, Long Vo-Dang, Khai-Nguyen Nguyen, Truong-Son Hy, Ralf Schluter
Submitted to COLING 2024

Like a bilingual baby: The advantage of visually grounding a bilingual language model
Khai-Nguyen Nguyen, Zixin Tang, Ankur Mali, M Alex Kelly
arXiv, 2022

Important and Difficult Topics in CS2: An Expert Consensus via Delphi Study
Lea Wittie, Anastasia Kurdia, Meriel Huggard, Khai-Nguyen Nguyen
ASEE Annual Conference and Exposition 2023