Original Reddit post

I’m looking to fine tune a video generation model that takes in video as input and produces video as output, ideally also being able to include some text to describe the change If this doesn’t exist one that is video in, video out that can be fine tuned and a separate one that is video and text in, and video out would be great. submitted by /u/mczarnek

Originally posted by u/mczarnek on r/ArtificialInteligence