Original Reddit post

So basically: "Every word, punctuation mark, and part of a word is counted as tokens when AI thinks." With current AIs, the workflow is that they give the answer immediately. But what if I don't want the answer instantly, but slowly... say after 10-15 minutes, or even an hour? Shouldn't that mean the AI servers get more time to compute and can allocate resources in a way that uses tokens more efficiently?

I mean, kind of like how the "good, fast, cheap" triangle works. There are options for fast and good models, and there are options for cheap and fast models, but I don't think I have seen a good and cheap model, one that is slow but gives good results. I feel like we are being pushed toward AI use where it's always fast. Nobody has the time to wait anymore.

I hope I was able to get my point across. Basically, it's OK if my assignment takes 20 minutes or my code takes more than 2-3 hours to write, if that means it will be cheaper.

https://preview.redd.it/0j9a2iw9m68h1.png?width=1200&format=png&auto=webp&s=be1f8471ae2de931a4a54e55de2c7bf72275dc9d

Originally posted by u/mynamehere_99 on r/ArtificialInteligence