Skip to main navigation Skip to search Skip to main content

HYBRID RANDOM FEATURES

  • Krzysztof Choromanski
  • , Haoxian Chen
  • , Han Lin
  • , Yuanzhe Ma
  • , Arijit Sehanobish
  • , Deepali Jain
  • , Michael S. Ryoo
  • , Jake Varley
  • , Andy Zeng
  • , Valerii Likhosherstov
  • , Dmitry Kalashnikov
  • , Vikas Sindhwani
  • , Adrian Weller
  • Google Brain Robotics
  • Columbia University
  • Yale University
  • University of Cambridge
  • Alan Turing Institute

Research output: Contribution to conferencePaperpeer-review

5 Scopus citations

Abstract

We propose a new class of random feature methods for linearizing softmax and Gaussian kernels called hybrid random features (HRFs) that automatically adapt the quality of kernel estimation to provide most accurate approximation in the defined regions of interest. Special instantiations of HRFs lead to well-known methods such as trigonometric (Rahimi & Recht, 2007) or (recently introduced in the context of linear-attention Transformers) positive random features (Choromanski et al., 2021b). By generalizing Bochner's Theorem for softmax/Gaussian kernels and leveraging random features for compositional kernels, the HRF-mechanism provides strong theoretical guarantees - unbiased approximation and strictly smaller worst-case relative errors than its counterparts. We conduct exhaustive empirical evaluation of HRF ranging from pointwise kernel estimation experiments, through tests on data admitting clustering structure to benchmarking implicit-attention Transformers (also for downstream Robotics applications), demonstrating its quality in a wide spectrum of machine learning problems.

Original languageEnglish
StatePublished - 2022
Event10th International Conference on Learning Representations, ICLR 2022 - Virtual, Online
Duration: Apr 25 2022Apr 29 2022

Conference

Conference10th International Conference on Learning Representations, ICLR 2022
CityVirtual, Online
Period04/25/2204/29/22

Fingerprint

Dive into the research topics of 'HYBRID RANDOM FEATURES'. Together they form a unique fingerprint.

Cite this