AI & Fundamentals
Exploring Trustworthy Foundation Models: Benchmarking, Finetuning and Reasoning - Bo Han, Associate Professor, Hong Kong Baptist University
DATE: Tue, July 15, 2025 - 11:30 am
LOCATION: UBC Vancouver Campus, Fried Kaiser (KAIS) building, Room 2020/2030, 2332 Main Mall
DETAILS
Abstract:
In the current landscape of machine learning, where foundation models must navigate imperfect real-world conditions such as noisy data and unexpected inputs, ensuring their trustworthiness through rigorousbenchmarking, safety-focused finetuning, and robust reasoning is more critical than ever. In this talk, I will focus on three recent research advancements that collectively advance these dimensions, offering a comprehensive approach to building trustworthy foundation models. For benchmarking, I will introduce CounterAnimal, a dataset designed to systematically evaluate CLIP’s vulnerability to realistic spurious correlations, revealing that scaling models or data quality can mitigate these biases, yet scaling data alone does not effectively address them. Transitioning to finetuning, we delve deep into the process of unlearning undesirable model behaviors. We propose a general framework to examine and understand the limitations of current unlearning methods and suggest enhanced revisions for more effective unlearning. Furthermore, addressing reasoning, we investigate the reasoning robustness under noisy rationales by constructing the NoRa dataset and propose contrastive denoising with noisy chain-of-thought, a method that markedly improves denoising-reasoning capabilities by contrasting noisy inputs with minimal clean supervision.
This talk is a part of a full day event. Please see the event page for the full schedule.