[Paper] Google’s 1.3 MiB On-Device Model Brings High-Performance Disfluency Detection Down to Size

Google Research proposes small, fast, on-device disfluency detection models based on the BERT architecture. The smallest model is only 1.3 MiB, a size reduction of two orders of magnitude and an eightfold reduction in inference latency compared to state-of-the-art BERT-based models.
