Original Reddit post

I’ve been experimenting with a 3D visualization of LLM inference where nodes represent components like attention layers, FFN, KV cache, etc. As tokens are generated, activation paths animate across a network (kind of like lightning chains), and node intensity reflects activity. The goal is to make the inference process feel more intuitive, but I’m not sure how accurate/useful this abstraction is. submitted by /u/ABHISHEK7846

Originally posted by u/ABHISHEK7846 on r/ArtificialInteligence