Skip to Content

The Zero-Cost RAG: Why My Obsidian Vault is the Ultimate AI Context Layer

How I built a zero-rent, file-system-based Retrieval-Augmented Generation (RAG) system using Obsidian and Git—and why vector databases are completely over-engineered for personal AI context.

The Zero-Cost RAG: Why My Obsidian Vault is the Ultimate AI Context Layer

If you ask a modern AI engineer how to give an agent context, they will immediately tell you to set up a vector database (like Pinecone, Chroma, or pgvector), chunk your text, run it through an embeddings model, and write a semantic search query. This is classic tech over-engineering. For a personal knowledge base, it's completely unnecessary, expensive, and slow.

Instead, I built a Zero-Cost, File-System-Based RAG using nothing but Obsidian and Git. It costs me absolutely nothing, has zero latency, and provides perfect deterministic control over what my agents can see and do.

The 3 Layers of File-System RAG

My RAG architecture consists of three clean, straightforward layers: 

  1. The Retrieval Layer (Obsidian Knowledge Base): All my documentation—from server infrastructure maps and security rules to my neurological operating manual—is written in clean Markdown files inside my Obsidian vault. This is the master database.
  2. The Ingestion Layer (Git-Backed Grounding): When an AI agent boots up in my workspace, the system configuration forces the agent to read my active context files directly from the local file system. There are no API keys or databases in the middle; the agent reads the Markdown directly.
  3. The Generation Layer (Bounded Execution): The agent generates code or drafts text that is strictly bounded by the boundaries defined in the retrieved files. If a rule says "Never publish directly to production," the agent is physically blocked from generating a script that does so.

💡 Why Git-Backed Markdown is Superior

- No Semantic Noise: Traditional RAG chops text into tiny chunks, often losing context. By reading whole markdown files, agents parse the complete, continuous logic flow.
- Version-Controlled Brain: Since the vault is backed by Git, my AI's context is version-controlled. If I change a server port, I commit the change, and the AI's "brain" is instantly upgraded.
- Zero Latency & Cost: The local OS file system serves as the database. It is secure, local, and completely offline.


Operating the Machine

This RAG setup is the "cognitive exoskeleton" that lets me direct a digital workforce. Because my seizures wiped my syntax memory, I don't spend my cognitive energy trying to remember how to format an Odoo API call or write a bash script. The agents retrieve the syntax rules from my RAG, and I act as the Solutions Architect, auditing their work and designing the boundaries. It is a perfect division of labor.

🔗 Related Resources

  • Internal Link: To see the master branch of my career codebase, read my post on Version-Controlled Career.
  • External Link: Learn more about standard RAG architectures and when to use vector versus structured retrieval in the Jina AI RAG Guide.
The Zero-Cost RAG: Why My Obsidian Vault is the Ultimate AI Context Layer
Ramon Rios Jr. June 14, 2026
Share this post
Archive
Sign in to leave a comment
Building a Brain: How I Try to Give AI Context
Why my early experiments with AI agents failed due to a lack of context, and how I started building a dedicated repository to help them understand my servers.