Original Reddit post

I was able to fine-tune a BERT model with more than 400M parameters on 500,000 sentences in under 30 mins and for less than $2.00 of compute. It’s a crypto sentiment model with 3 classes negative, neutral and positive. Right now I’m at 89% across my test dataset with my best-of-5 seed. I built an API in for it in less than a day burning through tokens in Cursor, this obviously cost a lot more but relatively speaking, trying to build this out even 2 years ago would have been weeks of work. This time next year I can’t even begin to imagine what we’ll be able to do. submitted by /u/spicyboi97

Originally posted by u/spicyboi97 on r/ArtificialInteligence