Original Reddit post

Building something that talks to an LLM and I keep going back and forth. Half the time I just send messages and hope the right stuff is in the window. The other half I want to see what’s in there - system prompt, history, RAG chunks - before the next call, so I can debug and tune. Do you bother exposing the full context (or a snapshot) in your apps? Or do you treat it as an implementation detail and only care about inputs/outputs? What’s been your experience when things go wrong (wrong answer, weird drift) - did having visibility help? submitted by /u/hack_the_developer

Originally posted by u/hack_the_developer on r/ArtificialInteligence