Epstein Files x GraphRAG - what would your architecture/workflow be like?

www.reddit.com

Epstein Files x GraphRAG - what would your architecture/workflow be like?

www.reddit.com

eifachposteMB to AI (Reddit RSS)English · 10 days ago

Original Reddit post

If you were to implement GraphRAG for Epstein Files, what would your technical workflow be like? Given the files are mostly PDFs, the extraction workflow is the one that would take considerable thought/time. Although there are datasets on HF of the OCR data, but that’s only ~20k records Next considerable design decision would go into how to set up the graph from extracted data. Using LLMs would be expensive and inaccurate. Setting up vector DB would be the easiest of all I believe. I think this might be a good project to showcase graphRAG on large unstructured data. Hmu if want to work on this together! submitted by /u/adityashukla8

Originally posted by u/adityashukla8 on r/ArtificialInteligence

You must log in or # to comment.

Chat