My team is currently looking to integrate large language models into our customer support workflow, but we are hitting a wall. Every week there is a new framework or a better performing open-source model, and we cannot decide between fine-tuning something like Llama 3 or just sticking with expensive API calls. We need a system that handles retrieval augmented generation without hallucinating internal data, but our internal devs are already stretched thin. Has anyone navigated this successfully without wasting months on R&D? submitted by /u/Sirwanga
Originally posted by u/Sirwanga on r/ArtificialInteligence
You must log in or # to comment.
