BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//https://caida.ubc.ca//NONSGML iCalcreator 2.41.92//
CALSCALE:GREGORIAN
METHOD:PUBLISH
UID:37353166-3766-4432-b433-363134323336
X-WR-RELCALID:efc09d74-9c93-479e-a94f-485231ddccde
X-WR-TIMEZONE:America/Vancouver
X-WR-CALNAME:Challenges and Opportunities with Multimodal LLMs - Vicente Ordóñez-Román\, Associate Professor\, Rice University
BEGIN:VTIMEZONE
TZID:America/Vancouver
TZUNTIL:20251102T090000Z
BEGIN:STANDARD
TZNAME:PST
DTSTART:20231105T020000
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
RDATE:20241103T020000
END:STANDARD
BEGIN:DAYLIGHT
TZNAME:PDT
DTSTART:20240310T020000
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
RDATE:20250309T020000
END:DAYLIGHT
END:VTIMEZONE
BEGIN:VEVENT
UID:0156d004-77bf-4c64-969f-3423da77543b
DTSTAMP:20260228T023909Z
CLASS:PUBLIC
CREATED:20240409T195954Z
DESCRIPTION:Abstract: In this talk I will provide an overview of how the field of computer vision has been impacted by the recent success of Multimodal LLMs\, and what are some of the challenges and opportunities associated with these models. I will present some of our recent work leveraging Multimodal LLMs\, including our SCoRD model that turns a multimodal LLM into a subject-conditional visual relation prediction and grounding model through enhanced text supervision. SCoRD takes as input an image and a subject and predicts an exhaustive list of all the objects interacting with the subject along with the…
DTSTART;TZID=America/Vancouver:20240415T100000
DTEND;TZID=America/Vancouver:20240415T110000
LAST-MODIFIED:20240409T200725Z
LOCATION:UBC Vancouver Campus\, ICCS X836
SUMMARY:Challenges and Opportunities with Multimodal LLMs - Vicente Ordóñez-Román\, Associate Professor\, Rice University
TRANSP:OPAQUE
URL:https://caida.ubc.ca/event/challenges-and-opportunities-multimodal-llms-vicente-ordonez-roman-associate-professor-rice
END:VEVENT
END:VCALENDAR