Skip to main navigation Skip to search Skip to main content

ViFiT: Reconstructing Vision Trajectories from IMU and Wi-Fi Fine Time Measurements

  • Bryan Bo Cao
  • , Abrar Alali
  • , Hansi Liu
  • , Nicholas Meegan
  • , Marco Gruteser
  • , Kristin Dana
  • , Ashwin Ashok
  • , Shubham Jain
  • Stony Brook University
  • Old Dominion University
  • Saudi Electronic University
  • Rutgers University
  • Georgia State University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Tracking subjects in videos is one of the most widely used functions in camera-based IoT applications such as security surveillance, smart city traffic safety enhancement, vehicle to pedestrian communication and so on. In computer vision domain, tracking is usually achieved by first detecting subjects, then associating detected bounding boxes across video frames. Typically, frames are transmitted to a remote site for processing, incurring high latency and network costs. To address this, we propose ViFiT, a transformer-based model that reconstructs vision bounding box trajectories from phone data (IMU and Fine Time Measurements). It leverages a transformer's ability of better modeling long-term time series data. ViFiT is evaluated on Vi-Fi Dataset, a large-scale multimodal dataset in 5 diverse real world scenes, including indoor and outdoor environments. Results demonstrate that ViFiT outperforms the state-of-the-art approach for cross-modal reconstruction in LSTM Encoder-Decoder architecture X-Translator and achieves a high frame reduction rate as 97.76% with IMU and Wi-Fi data.

Original languageEnglish
Title of host publicationISACom 2023 - Proceedings of the 2023 3rd ACM MobiCom Workshop on Integrated Sensing and Communication Systems
PublisherAssociation for Computing Machinery, Inc
Pages13-18
Number of pages6
ISBN (Electronic)9798400703645
DOIs
StatePublished - Oct 2 2023
Event3rd ACM MobiCom Workshop on Integrated Sensing and Communication Systems, ISACom 2023 - Madrid, Spain
Duration: Oct 6 2023Oct 6 2023

Publication series

NameISACom 2023 - Proceedings of the 2023 3rd ACM MobiCom Workshop on Integrated Sensing and Communication Systems

Conference

Conference3rd ACM MobiCom Workshop on Integrated Sensing and Communication Systems, ISACom 2023
Country/TerritorySpain
CityMadrid
Period10/6/2310/6/23

Keywords

  • Efficient Video System
  • IMU
  • Multimodal Learning
  • Multimodal Reconstruction
  • Object Detection
  • Tracking
  • Transformer

Fingerprint

Dive into the research topics of 'ViFiT: Reconstructing Vision Trajectories from IMU and Wi-Fi Fine Time Measurements'. Together they form a unique fingerprint.

Cite this