First-Person Activity Recognition: Feature, Temporal Structure, and Prediction

  • Jet Propulsion Laboratory, California Institute of Technology

Research output: Contribution to journal › Article › peer-review

34 Scopus citations

Abstract

This paper discusses the problem of recognizing interaction-level human activities from a first-person viewpoint. The goal is to enable an observer (e.g., a robot or a wearable camera) to understand ‘what activity others are performing to it’ from continuous video inputs. These include friendly interactions such as ‘a person hugging the observer’ as well as hostile interactions like ‘punching the observer’ or ‘throwing objects at the observer’, whose videos involve a large amount of camera ego-motion caused by physical interactions. The paper investigates multi-channel kernels to integrate global and local motion information, and presents a new activity learning/recognition methodology that explicitly considers temporal structures displayed in first-person activity videos. Furthermore, we present a novel algorithm for early recognition (i.e., prediction) of activities from first-person videos, which allows us to infer ongoing activities at their early stage. In our experiments, we not only show classification results with segmented videos, but also confirm that our new approach is able to detect activities from continuous videos and perform early recognition reliably.

Original language: English
Pages (from-to): 307-328
Number of pages: 22
Journal: International Journal of Computer Vision
Volume: 119
Issue number: 3
State: Published - Sep 1 2016
