Natural Language Processing & Speech

PUBLICATIONS

PLUE: Language Understanding Evaluation Benchmark for Privacy Policies in English

To address these problems and encourage research on NLU technologies in the privacy policy domain, we introduce the Privacy Policy Language Understanding Evaluation (PLUE) benchmark, which evaluates privacy policy language understanding across six tasks, including text classification, question answering, semantic parsing, and named-entity recognition.

GCT: Gated Contextual Transformer For Sequential Audio Tagging

We propose a new neural network architecture for the task of sequential audio tagging. "Sequential audio tagging" means determining which types of acoustic events (e.g., dog bark, car engine) occur in an audio recording and in what order they occur.

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision

Visual speech recognition (VSR), also known as lip reading, is the task of recognizing speech content based on visual lip movements. VSR has a wide range of applications in real-world scenarios, such as helping the hearing-impaired perceive human speech and improving automatic speech recognition (ASR) in noisy environments.

Scaling Speech Technology to 1,000+ Languages

We build a new dataset comprising a moderate amount of labeled data for 1,107 languages and another dataset of unlabeled speech in 3,809 languages (§3). We leverage ....