Built a WhatsApp Audio Summarizer using Groq & Whisper-v3. The speed is insane

www.reddit.com

eifachposteMB to AI (Reddit RSS) · English · 14 hours ago

Original Reddit post

Just wanted to share a project I finished. I was frustrated with the latency of most transcription apps, so I integrated Whisper-large-v3 via the Groq API into an Android app called CakeNotes AI. The goal was to make “sharing to summarize” feel as fast as a native feature.

Current stack:
• Kotlin / Compose
• Room (for local history)
• Groq API (inference is almost sub-second for 1-minute audio clips)

If you are into LLM implementations for daily utility, give it a spin. I’m especially curious about how it handles different accents!

Play Store: https://play.google.com/store/apps/details?id=com.cakenotes.ai

Thanks!
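For anyone wondering what the transcription call in a stack like this might look like, here is a minimal sketch. It is not the app's actual code (the app is Kotlin, and its source is not shown in the post); it is written in Python with only the standard library to keep it short. It assumes Groq's OpenAI-compatible transcription endpoint and the `whisper-large-v3` model id. The function names, the `User-Agent` string, and the 30-second timeout are made up for illustration, and the summarization step is not covered.

```python
import json
import urllib.request
import uuid

# Assumed: Groq's OpenAI-compatible transcription endpoint and model id.
GROQ_URL = "https://api.groq.com/openai/v1/audio/transcriptions"
MODEL = "whisper-large-v3"


def build_transcription_request(audio: bytes, filename: str, api_key: str) -> urllib.request.Request:
    """Build a multipart/form-data POST carrying the model name and the audio file."""
    boundary = uuid.uuid4().hex
    # First form field: the model name. Second form field: the raw audio bytes.
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{MODEL}\r\n"
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        f"Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return urllib.request.Request(
        GROQ_URL,
        data=head + audio + tail,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
            # Hypothetical agent string; set explicitly in case the default one is rejected.
            "User-Agent": "audio-summarizer-sketch/0.1",
        },
    )


def transcribe(audio: bytes, filename: str, api_key: str) -> str:
    """Send the audio and return the transcript from the JSON response's "text" field."""
    request = build_transcription_request(audio, filename, api_key)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read())["text"]
```

Usage would be something like `transcribe(open("voice_note.ogg", "rb").read(), "voice_note.ogg", api_key)`, with the transcript then handed to whatever model does the summarizing. Keeping the request builder separate from the network call makes it easy to check the payload without hitting the API.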

Originally posted by u/Cultural_Swimming_94 on r/ArtificialInteligence
