On April 6, Google launched the experimental voice input app Google AI Edge Eloquent on iOS, highlighting offline operation and intelligent text refinement. The app uses Google’s own Gemma4 series ASR models (E2B/E4B variants) for local speech-to-text conversion, automatically filters out filler words and repetitions, and offers four text style options: Key Points, Formal, Brief, and Complete. Users can optionally enable the cloud-based Gemini model for more advanced cleanup, and can import contacts and terminology from Gmail to build a personalized vocabulary. The app is currently free with no subscription, in contrast to its competitor SuperWhisper, which charges $85 annually. An Android version is planned, along with system keyboard integration and a floating button. The launch is a concrete step for Google in on-device voice processing and demonstrates that Gemma models can be deployed in lightweight form on mobile devices.
Author and source: AIBase
On Monday, April 6, local time, Google quietly launched an experimental voice input app called "Google AI Edge Eloquent" on the iOS platform. The app emphasizes "offline-first" operation and "intelligent refinement," aiming to use on-device AI to convert natural spoken language into polished, professional text in real time. The launch marks Google’s entry into the premium AI speech-to-text market, which is currently led by Wispr Flow and SuperWhisper.
Core Technologies and Key Features:
Eloquent is powered by automatic speech recognition (ASR) models from Google's newly released Gemma4 series (E2B/E4B variants). The models run fully offline: once the model package has been downloaded, transcription happens locally, which protects privacy and reduces latency. The app also includes a "Smart Cleanup" function that automatically identifies and removes filler words such as "um" and "uh," as well as repetitions and self-corrections, and outputs logically coherent text.
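To make the cleanup idea concrete, here is a minimal rule-based sketch in Python. It is only an illustration: the app's actual cleanup is model-driven and its method is not public, and the filler list and the function name clean_transcript are assumptions introduced here.

```python
import re

# Hypothetical filler list; the app's real list and method are not public.
FILLERS = {"um", "uh", "er", "hmm"}


def _bare(word: str) -> str:
    """Lowercase a token and strip punctuation so tokens can be compared."""
    return re.sub(r"[^\w']", "", word).lower()


def clean_transcript(text: str) -> str:
    """Drop standalone filler words and collapse immediate word repetitions."""
    kept = []
    for word in text.split():
        bare = _bare(word)
        if bare in FILLERS:
            continue  # filler such as "um" or "uh"
        if kept and _bare(kept[-1]) == bare:
            continue  # immediate repetition such as "I I"
        kept.append(word)
    return " ".join(kept)


print(clean_transcript("Um, I I think uh we should should ship it"))
```

A heuristic like this also removes legitimate repetitions (for example "that that"), which is one reason a language model is a better fit for the task than fixed rules.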
Product Integration and Interaction:
- Multiple style modes: Offers four text processing modes: “Key Points,” “Formal,” “Brief,” and “Complete.”
- Cloud collaboration (optional): When cloud mode is enabled, the app uses the cloud-based Gemini model for more advanced text cleanup.
- Personalized context: Supports importing user-specific keywords, names, and terms from Gmail, and allows users to create custom glossaries.
- Productivity statistics: Displays dictation word count, words per minute (WPM), and a history of past sessions in real time.
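The words-per-minute figure in the statistics above is simple arithmetic: word count divided by dictation time in minutes. A minimal Python sketch, assuming words are counted by whitespace splitting (how the app actually counts words is not documented, and the function name is hypothetical):

```python
def words_per_minute(text: str, duration_seconds: float) -> float:
    """Return dictation speed as whitespace-separated words per minute."""
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    word_count = len(text.split())
    return word_count * 60.0 / duration_seconds


# Six words dictated in three seconds.
print(words_per_minute("one two three four five six", 3.0))
```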
Market Strategy and Future Planning:
The app is currently available for free on the iOS App Store, with no subscription fees or usage limits, putting pressure on rivals such as SuperWhisper, which charges $85 annually. Although it launched first on iOS, the official description confirms plans for an Android version and previews system-wide keyboard integration and a floating button similar to Wispr Flow's. As part of Google’s AI Edge brand, Eloquent is more than an experiment with a utility app: it also serves as a showcase for Google’s ability to deploy Gemma models on mobile devices.
