Regularly I try to use the live voice modes on different services like ChatGPT, Perplexity and Grok, but the experience is always so bad. Why don’t they use the models they use when doing stuff in text? It’s probably because of trying to maintain a low latency during the chat, but why not say “Let me research that for you…” or have 2 agents running and 1 reporting back during the other agent thinking. The live models are so lazy and thus unusable in 90% of the cases for me. What do you think? submitted by /u/HoarderOfBytes
Originally posted by u/HoarderOfBytes on r/ArtificialInteligence
You must log in or # to comment.
