ViperGPT: Visual Inference via Python Execution for Reasoning

About this Session

Answering queries about visual inputs is a complex task that requires both visual processing and reasoning. In this talk, Sachit will demonstrate how large language models can be instrumental in reasoning within such settings, which extend beyond traditional language tasks. ViperGPT utilizes a provided API to access computer vision modules and composes them by generating Python code that is subsequently executed. This simple approach requires no additional training and achieves state-of-the-art results across various complex visual tasks. Sachit will also discuss how ViperGPT inspired the development of code-based agents and share insights on the future potential of such agents.
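
The idea of composing vision modules through generated Python can be illustrated with a toy sketch. Everything below is a hypothetical stand-in rather than ViperGPT's actual API: `ToyImagePatch` fakes a vision module by filtering a hand-written list of labels, `GENERATED_PROGRAM` is a hard-coded string standing in for code a language model would write from the query and the API description, and `run_generated_program` is an assumed helper that executes it.

```python
class ToyImagePatch:
    """Hypothetical stand-in for a vision API; a real system would wrap detectors here."""

    def __init__(self, labels):
        # Assumption: the "image" is just a list of object labels.
        self.labels = list(labels)

    def find(self, name):
        # A real module would run an object detector; this only filters labels.
        return [ToyImagePatch([label]) for label in self.labels if label == name]

    def exists(self, name):
        return len(self.find(name)) > 0


# Stand-in for the program a language model would generate for the query
# "How many muffins can each kid have for it to be fair?"
GENERATED_PROGRAM = '''
def execute_command(image_patch):
    muffins = image_patch.find("muffin")
    kids = image_patch.find("kid")
    if not kids:
        return "no kids in the image"
    return len(muffins) // len(kids)
'''


def run_generated_program(source, image_patch):
    # Define the generated function in a fresh namespace, then call it.
    namespace = {}
    exec(source, namespace)
    return namespace["execute_command"](image_patch)


scene = ToyImagePatch(["muffin", "muffin", "muffin", "muffin", "kid", "kid"])
answer = run_generated_program(GENERATED_PROGRAM, scene)
print(answer)
```

In this sketch the reasoning (counting and dividing) lives in ordinary Python, while perception is delegated to the API calls, which is why no additional training is needed to combine the two. Code produced by a model would normally be run in a sandbox rather than with a bare `exec`.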

About the Speaker

Sachit Menon is a PhD student in Computer Science at Columbia University advised by Professor Carl Vondrick. His research centres around models trained at scale and ways to use them for novel tasks, such as using large language models to perform visual reasoning.

About Tech Talks

A regular series by Soroco, Tech Talks are expert-led technical sessions that dive deep into a specific area of technology and give engineers valuable insights and tools. They also examine fascinating research and use cases, and facilitate larger conversations around cutting-edge tech.

Registration is now closed for this Tech Talk
