Just a short ask. In general, if an app includes some level of AI integration, it typically either charges for “tokens” or API use, or requires BYO_API_TOKEN to use the AI. It seems most apps charge for AI use.

I am fine-tuning a small specialized model (internal to my app). I am curious whether I should limit how many calls can be made even though it runs locally (ideally on 4GB to 8GB of GPU VRAM). Should I have a “free tier” of something like 2 prompts an hour, and then a subscription plan like $10 a month for 20 requests, or $20 for unlimited?

To be fair, I bought a DGX for $4200 and paid $2K+ working through multiple teachers/distillation and fine-tuning the LLM. It offers MUCH faster (and, for me, no-cost) responses on decent (8GB VRAM) hardware. But given how much money and time I have already spent, plus the future (never-ending???) updates to fine-tuning/distillation/etc., if the model returns useful, time-saving responses that enhance my app's overall workflow, would it be insane to ask for a little compensation with a small monthly subscription fee?

I am trying to understand what seems to be the future of AI integration into apps and how best to go about this. I am one guy, out of a job for a bit and in need of some income, eating through my savings to build this. I was hoping the idea of asking for a few bucks a month per user would not come across as “What an asshole… how dare he charge us for this time saving feature he spent his savings on”.

submitted by /u/Tiny-Sink-9290
Originally posted by u/Tiny-Sink-9290 on r/ArtificialInteligence
