When Karpathy’s LLM-wiki gist blew up in April, I’d been quietly running a version of it: an automated compiler that turns my Claude Code session transcripts into a tagged, cross-linked Obsidian wiki. After 30 days it sits at 553 articles (509 concepts, 43 connection articles, 1 long-form Q&A), all written by the compiler, none by me. The architecture matches what’s now becoming standard: raw/ , immutable source zone. Session transcripts, project docs, plans get rsync-mirrored in. Never edited after ingest, so the compiled layer can always be rebuilt. knowledge/ , compiler-owned. Concept articles, connection articles, a master index. I never write here. One schema file (AGENTS.md) that defines article structure, a 6-namespace tag taxonomy (project/source/type/topic/status/stack), and compilation rules. The whole lifecycle is 4 slash commands: kb_sync , mirror sources + ingest + compile kb_health , lint: broken wikilinks, orphans, stale articles, near-duplicates kb_qa , query: the LLM reads the index (571 lines) and picks articles to answer from kb_commit , stage + auto-message + push Things I learned that the tutorials skip: The schema file does all the work. The folders are trivial. What makes 553 articles coherent is the schema: what counts as a concept, how tags namespace, when an article gets split. Skip this and you get a pile of summaries, not a wiki. Index-guided retrieval beats vector search at this scale. No embeddings anywhere. The model reads the index and picks 3-5 articles. At 50-500 articles this is more accurate than RAG and much simpler to debug. Maintenance is the real cost. One new source updates 10-15 related pages, which also means orphan articles and near-duplicates accumulate. My health pass caught duplicate pairs and 100+ broken wikilinks after the first heavy week. Without lint, the wiki rots by week four. Session transcripts are the highest-value input. Decisions, failure modes, “we tried X and it broke because Y” , all of it normally evaporates at session end. Compiled, it compounds: the wiki now answers questions my past self already solved. Happy to share the schema structure or the compile prompts if anyone wants to build their own. What are you all doing with your session history? submitted by /u/Accomplished_Snow_78
Originally posted by u/Accomplished_Snow_78 on r/ClaudeCode
