We all know that AI models are trained using Copyrighted data. Even in the cases where it wasn’t explicitly copyrighted, if it was private data which was appropriated illegally by a third party and hosted publicly, which was then scraped by AI companies, it is then illegal in itself even without getting into any sort of fair use arguments regarding training. And regarding that said fair use argument about training, it is absolutely insane that anyone is taking that seriously. If it was you or I who pirated the entire internet we would be locked up faster than Americans or Israelis’ bombing children even before we could build a product let alone commercialize it. The only reason it is being allowed to continue is because these companies have very expensive lawyers and can put off the legal pressure for longer than it would take for them to literally change the law so that it “becomes” not theft. Am I missing something here? submitted by /u/AryanEmbered
Originally posted by u/AryanEmbered on r/ArtificialInteligence
