Just wanted to share a project I finished. I was frustrated with the latency of most transcription apps, so I implemented Whisper-large-v3 via Groq API for an Android app called CakeNotes AI. The goal was to make “sharing to summarize” feel as fast as a native feature. Current Stack:•Kotlin / Compose•Room (for local history)•Groq API (Inference is almost sub-second for 1min audios)If you are into LLM implementations for daily utility, give it a spin. I’m especially curious about how it handles different accents! Play Store: https://play.google.com/store/apps/details?id=com.cakenotes.ai Tx submitted by /u/Cultural_Swimming_94
Originally posted by u/Cultural_Swimming_94 on r/ArtificialInteligence
You must log in or # to comment.
