I’m trying to calibrate whether what I was attempting to build is actually achievable with current AI tooling, or whether I was effectively trying to build something meaningfully beyond the current state of the art. Context: I’m a PM in a very complex, ambiguous environment with a huge amount of information flowing at me constantly. I have a strong strategic/systems background but not a deep technical one. I spent a significant amount of time building a personal “operating system” using Claude Code plus various memory/retrieval approaches. The goal was NOT just “chat with my notes.” I was trying to build something that could:
- ingest recordings/transcripts from all my meetings
- store and retrieve long-term context reliably
- understand my goals and projects
- dynamically maintain workback plans and dependencies
- connect information across meetings/docs/messages
- surface relevant context when I’m making decisions or writing docs
- function as a trustworthy external memory/project management layer I iterated on this repeatedly:
- different retrieval approaches
- different memory structures
- project/task extraction
- multiple refactors based on discovered failure modes
- architecture changes suggested by Claude itself And I kept hitting the same problem: the system sounded coherent, but in practice it would miss critical context, fail to retrieve important things, consistently miss second order effects, lose nuance, fail to connect dots I expected it to connect, or confidently produce flawed conclusions. The more I audited it, the less I trusted it. At this point I’m honestly considering abandoning most of it and going back to much more manual systems because the overhead of supervising the AI became so high. My question for people who are deeper in this space: Was I trying to build something genuinely beyond what current LLM systems can reliably support today? Or is this actually achievable right now and I just architected it badly / used the wrong tools / approached it incorrectly? I’m especially interested in answers from people who have attempted something similar in high-context operational environments, not just lightweight “second brain” setups. submitted by /u/Reasonable-Two-141
Originally posted by u/Reasonable-Two-141 on r/ClaudeCode
You must log in or # to comment.
