Original Reddit post

Building a book to speech pipeline where users upload books, a parser extracts texts/tables/diagrams and converts into markdown, then passes to the TTS model. Currently running this as a stateless flow like upload book -> parse -> TTS output but wondering if adding a database here makes sense. Would parsed markdown, processing status, maybe cache TTS outputs for repeated responses?? Like would it be an overkill for a simple tool or does it become necessary once you are handling multiple uploads, retry logic and partial processing states? submitted by /u/lucasbennett_1

Originally posted by u/lucasbennett_1 on r/ArtificialInteligence