Original Reddit post

Hey guys, for the last week I have been building an AI short-form content generator. It uses Claude Sonnet to generate Remotion components to achieve this motion graphics style. It works in two steps. First, it generates a video plan with voice-over lines and a visual idea for each scene. Then it goes through the plan scene by scene, generating the voice-over with ElevenLabs and the Remotion code with Sonnet. Currently the cost is still pretty high, around 50 cents per video, because every call carries a lot of input tokens for context and instructions, and each scene produces around 3,000-5,000 output tokens. What do you think of it? Do you have any ideas on how I could optimize it or bring down the cost?
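
To see where the roughly 50 cents might come from, here is a rough back-of-the-envelope sketch of the per-video LLM cost. Everything numeric in it is an assumption rather than something stated in the post: the prices ($3 per million input tokens and $15 per million output tokens) are placeholders to be checked against the current Sonnet rate card, the scene count and the input tokens per scene are made up for illustration, and the ElevenLabs cost is left out entirely. Only the 3,000-5,000 output tokens per scene comes from the post.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    # ASSUMED prices in USD per million tokens; not taken from the post.
    input_per_mtok: float = 3.00
    output_per_mtok: float = 15.00


def scene_cost(input_tokens: int, output_tokens: int, pricing: Pricing) -> float:
    """LLM cost in USD for generating one scene's Remotion code."""
    return (
        input_tokens * pricing.input_per_mtok
        + output_tokens * pricing.output_per_mtok
    ) / 1_000_000


def video_cost(
    scenes: int,
    input_tokens_per_scene: int,
    output_tokens_per_scene: int,
    pricing: Pricing = Pricing(),
) -> float:
    """LLM cost in USD for a whole video, one call per scene (voice-over not included)."""
    return scenes * scene_cost(input_tokens_per_scene, output_tokens_per_scene, pricing)


if __name__ == "__main__":
    # HYPOTHETICAL workload: 6 scenes, 10,000 input tokens of context per scene,
    # and 4,000 output tokens per scene (the midpoint of the post's range).
    total = video_cost(scenes=6, input_tokens_per_scene=10_000, output_tokens_per_scene=4_000)
    print(f"estimated LLM cost per video: ${total:.2f}")
```

Under these assumed numbers the output tokens account for about two thirds of the cost, so plugging in the real scene count and token counts from the API usage logs would show which side is worth optimizing first.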

Originally posted by u/GustavooIV on r/ArtificialInteligence