Skip to main navigation Skip to search Skip to main content

PropTest: Automatic Property Testing for Improved Visual Programming

  • Rice University
  • Columbia University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Visual Programming has recently emerged as an alternative to end-to-end black-box visual reasoning models.This type of method leverages Large Language Models (LLMs) to generate the source code for an executable computer program that solves a given problem.This strategy has the advantage of offering an interpretable reasoning path and does not require finetuning a model with task-specific data.We propose PropTest, a general strategy that improves visual programming by further using an LLM to generate code that tests for visual properties in an initial round of proposed solutions.Our method generates tests for data-type consistency, output syntax, and semantic properties.PropTest achieves comparable results to state-of-the-art methods while using publicly available LLMs.This is demonstrated across different benchmarks on visual question answering and referring expression comprehension.Particularly, PropTest improves ViperGPT by obtaining 46.1% accuracy (+6.0%) on GQA using Llama3-8B and 59.5% (+8.1%) on RefCOCO+ using CodeLlama-34B.

Original languageEnglish
Title of host publicationEMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024
EditorsYaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
PublisherAssociation for Computational Linguistics (ACL)
Pages8241-8256
Number of pages16
ISBN (Electronic)9798891761681
DOIs
StatePublished - 2024
Event2024 Findings of the Association for Computational Linguistics, EMNLP 2024 - Hybrid, Miami, United States
Duration: Nov 12 2024Nov 16 2024

Publication series

NameEMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024

Conference

Conference2024 Findings of the Association for Computational Linguistics, EMNLP 2024
Country/TerritoryUnited States
CityHybrid, Miami
Period11/12/2411/16/24

Fingerprint

Dive into the research topics of 'PropTest: Automatic Property Testing for Improved Visual Programming'. Together they form a unique fingerprint.

Cite this