Skip to main navigation Skip to search Skip to main content

A Landmark-Aware Visual Navigation Dataset for Map Representation Learning

  • Faith Johnson
  • , Kristin Dana
  • , Bryan Bo Cao
  • , Shubham Jain
  • , Ashwin Ashok
  • Rutgers - The State University of New Jersey, New Brunswick
  • Stony Brook University
  • Georgia State University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Map representations learned by expert demonstrations have shown promising research value. However, the field of visual navigation still faces challenges due to the lack of real-world human-navigation datasets that can support efficient, supervised, representation learning of environments. We present a Landmark-Aware Visual Navigation (LAVN) dataset to allow for supervised learning of human-centric exploration policies and map building. We collect RGBD observation and human point-click pairs as a human annotator explores virtual and real-world environments with the goal of full coverage exploration of the space. The human annotators also provide distinct landmark examples along each trajectory, which we intuit will simplify the task of map or graph building and localization. These human point-clicks serve as direct supervision for waypoint prediction when learning to explore in environments. Our dataset covers a wide spectrum of scenes, including rooms in indoor environments, as well as walkways outdoors. We releaseour dataset with detailed documentation at https://huggingface.co/datasets/visnavdataset/lavn (DOI: l0.57967/hf/2386) and a plan for long-term preservation.

Original languageEnglish
Title of host publicationHRI 2025 - Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction
PublisherIEEE Computer Society
Pages1026-1031
Number of pages6
ISBN (Electronic)9798350378931
DOIs
StatePublished - 2025
Event20th Annual ACM/IEEE International Conference on Human-Robot Interaction, HRI 2025 - Melbourne, Australia
Duration: Mar 4 2025Mar 6 2025

Publication series

NameACM/IEEE International Conference on Human-Robot Interaction
ISSN (Electronic)2167-2148

Conference

Conference20th Annual ACM/IEEE International Conference on Human-Robot Interaction, HRI 2025
Country/TerritoryAustralia
CityMelbourne
Period03/4/2503/6/25

Keywords

  • Dataset
  • Gaze Behavior Generation
  • Graph Representation
  • Human-in-the-Loop
  • Implicit Behavior Cloning
  • Landmark
  • Map Representation
  • Visual Navigation

Fingerprint

Dive into the research topics of 'A Landmark-Aware Visual Navigation Dataset for Map Representation Learning'. Together they form a unique fingerprint.

Cite this