Change an auth helper signature without breaking callers, tests, or prior project decisions.
Run the proof before your coding agent edits.
Same task, same coding agent, better repo evidence. Sverklo gives Claude Code, Cursor, Windsurf, and Codex-style agents your repo's files, symbols, callers, tests, dependencies, diffs, and git-pinned decisions before they edit.
Inspect the public 180-task benchmark, then run a receipt on a real repo. MIT, no API keys, no code upload; first run downloads a local ONNX model. Caveat: grep still wins exact strings.
Run the copied command from a real repo, follow the proof guide, then paste the markdown receipt into the proof thread. No project files or MCP config are written; first run may cache the local ONNX model/index under ~/.sverklo.
What changes when the agent gets repo evidence first?
The comparison is intentionally plain: ask the same coding agent to change an auth helper. Baseline tools make it search and guess. Sverklo hands it the repo relationships before the edit.
The agent greps likely names, reads a few files, and may miss indirect callers or stale decisions after context compaction.
impact, refs, test_map, and recall show real callers, covered tests, dependencies, and git-pinned decisions.
Files, symbols, callers, tests, dependencies, diffs, and decisions appear before the model writes code.
Fewer blind edits on relationship-heavy tasks. If you already know the exact string or the repo is tiny, grep is still the sharper first move.
Proof source: public 180-task retrieval benchmark plus maintainer-seeded receipts. External same-task receipts are still the top proof gap.
/* 90-second walkthrough — terminal then live Claude Code MCP integration */
Don't trust the claim. Run the proof.
sverklo prove --markdown prints a shareable receipt from the current repo: central files, one real symbol, real callers, and the exact prompt to paste into your coding agent.
These examples are maintainer-seeded so you can inspect the shape before installing. They are not third-party validation; external receipts and corrections are the current ask.
TypeScript MCP server, CLI, benchmark harness, telemetry worker, and site worktree in one receipt.
Middleware-heavy JavaScript repo: real entry points, core symbols, and caller relationships.
Large Node framework with plugin and route structure that rewards graph-aware retrieval.
TypeScript validation library where symbol precision matters more than keyword matching.
Compact state library showing whether the proof remains useful when the repo is not huge.
/* what happened when we published the bench */
A public benchmark, two competitors, fixes shipped on both sides in 36 hours.
- Apr 28 Published 180-task bench across 6 codebases with two competing local-first MCP code-intel servers (jcodemunch-mcp, GitNexus) and the slices where sverklo lost — including FastAPI P5 dead-code (0.00 vs grep's 1.00).
- May 2 → 3 jcodemunch maintainer @jgravelle shipped v1.80.7 / 1.80.8 / 1.80.9 against specific bench findings. P5 recall 0.00 → 1.00; lodash P1 0/10 → 9/10.
- May 4 Adding lodash to the bench exposed the symmetric blind spot in sverklo's parser. sverklo v0.20.2 ships the fix. Sverklo P1 0.30 → 0.73; overall F1 0.45 → 0.56 on that run.
"the bigger opportunity here is the potential genesis of an 'MCP Server Arena' on par with what the leading AI/LLM/Chatbot arenas provide…"
— Jake Gravelle, jcodemunch-mcp maintainer, on r/mcp
Sverklo doesn't replace grep. It complements it.
- ✓ Exploratory questions ("how does the auth flow work?")
- ✓ Refactor blast radius (
impact) - ✓ Large interconnected codebases
- ✓ Memory across sessions, tied to git SHAs
- ✓ Project audits — god nodes, dead code
- — Focused diff review
- — Exact string matching
- — Reading file contents
- — Build & test verification
- — Anything where you already know the symbol
Sverklo is the right tool when you don't know exactly what to search for. When you do know, grep is fine.
your agent
reviews like
a senior dev.
src/auth/session.ts
78
src/api/billing.ts
52
src/lib/jwt.ts
34
src/utils/clock.ts
12
v0.17 → the GitHub Action posts inline review comments on every PR alongside the sticky summary, anchored to the heuristic-flagged lines.
risk score
Every changed file gets a 0–100 score combining untested status, security-sensitive paths, importer fan-in, caller count, dangling references, and churn. Reasons are explicit — no black box.
missing-test detection
test_map walks the diff, the import graph, and filename conventions to flag changed code without matching tests — ranked by risk.
dangling-ref check
Every removed symbol is checked against the symbol-reference graph. If a caller still exists, the merge is blocked with file:line evidence.
mcp prompts
Five workflows: /sverklo:review-changes, /sverklo:pre-merge, /sverklo:onboard, /sverklo:architecture-map, /sverklo:debug-issue.
benchmark/ — reproducible.
impact render on React's most-called symbol. Sub-millisecond on every repo we tested — it's an indexed SQL join, not a 200-grep-match scan.
indexed.
ranked.
remembered.
tests · deps · decisions
channel weights, path × 1.5
not a blind context dump
hybrid search
BM25 for precision, semantic embeddings for recall, PageRank for structural importance — fused via RRF. Faster and more accurate than grep.
local embeddings
all-MiniLM-L6-v2 via ONNX runtime. 384-dimensional vectors generated on your machine. No API calls, no data leaves your laptop.
pagerank ranking
Files that are imported by many others rank higher. Your agent finds the actually-important code first, not just keyword matches.
persistent memory
Save decisions, patterns, and preferences with git-state linking. Stale memories flagged automatically when referenced files change.
multi-repo + cross impact
One MCP server serves all your repos via a global registry. Trace how a change ripples across dependent repos. No per-project config needed — sverklo init in any repo, done.
configurable pagerank
Drop a .sverklo.yaml in your project root to tune PageRank weights, boost or penalize paths, and customize ranking to match your codebase's shape.
incremental
File watcher updates the index on every save. Dependency graph and PageRank recompute in real time. Always fresh.
eleven languages
TypeScript, JavaScript, Python, Go, Rust, Java, C, C++, Ruby, PHP, Vue.
your agent
finally
remembers.
remember scope:"workspace" writes once, surfaces across every repo in the workspace.
sverklo init imports your existing project knowledge:
CLAUDE.md(12).cursorrules(3)docs/adr/001-prisma.mddocs/adr/002-auth.mdCONTRIBUTING.md(1)
v0.17 →
sverklo memory export pushes them to markdown / Notion / JSON.
bring your own docs
Auto-imports memories from CLAUDE.md, .cursorrules, AGENTS.md, CONTRIBUTING.md, and ADRs on init. Your existing project knowledge becomes semantically searchable instantly.
bi-temporal git memory
Every memory has valid_from_sha and valid_until_sha. Memories are never deleted, only superseded — so you can query "what we believed at commit X" and see when code drift made that advice stale.
staleness detection
If a memory references a file that no longer exists, it's flagged as stale. No more advice based on deleted code.
semantic recall
Memories are embedded and searched the same way as code. Ask "what did we decide about auth?" and get the relevant memory.
auto-inject on session start
An MCP resource surfaces top memories to Claude before you type anything. Your decisions travel across sessions automatically.
what Sverklo bundles
in one local MCP server.
| Tool | Code-native | Local-first | MCP drop-in | Git-aware memory | Symbol graph | Bi-temporal | Cross-repo |
|---|---|---|---|---|---|---|---|
| ▸sverklo | ● | ● | ● | ● | ● | ● | ● |
| mempalace | — | ● | ● | — | — | ● | — |
| claude-mem | — | ● | CC only | — | — | — | — |
| Mem0 | — | crippled | SDK | — | — | — | — |
| Zep / Graphiti | — | Neo4j | ● | — | — | ● | — |
| Augment Code | ● | cloud | ● | — | ● | — | — |
| Greptile | ● | cloud | 3rd party | — | ● | — | — |
| Aider repo-map | ● | ● | — | — | — | — | — |
| Zilliz claude-context | ● | Milvus | ● | — | — | — | — |
| XRAY MCP | ● | ● | ● | — | partial | — | — |
The wedge: Sverklo ties code evidence and memory to git state, then warns when the code has drifted since a decision. If you want conversation memory, use mempalace. If you want local repo evidence before agent edits, run a Sverklo proof receipt.
Want a deeper dive? See side-by-side comparisons → against Serena, GitNexus, codebase-memory-mcp, Cursor @codebase, and Sourcegraph Cody.
install.
prove.
code.
questions,
honestly
answered.
How do I stop Claude Code from hallucinating function names that don't exist in my codebase?
Claude Code hallucinates function names because it generates from training-data patterns rather than your actual symbol graph. It will write getUserByEmail() when your code uses findByEmail(), invent imports for packages you don't depend on, and forget yesterday's design decision because context was compacted. Sverklo solves this with a 37-tool MCP retrieval layer the agent calls before writing code: lookup resolves a name to its definition with file:line, refs proves whether a symbol exists with caller context, verify lets the agent re-check that a quoted span is still present at the cited git SHA. Run npm exec --yes --package=sverklo@latest -- sverklo prove --no-write --guided --markdown to see central files and a real caller graph before writing MCP config. Local-first, MIT-licensed, no API keys.
Why would I install another MCP server?
Most MCP servers are single-purpose wrappers around an API. Sverklo is local-first code intelligence: hybrid search (BM25 + ONNX embeddings + PageRank), symbol-level impact analysis, diff-aware PR review, and bi-temporal memory — all running on your laptop with no API keys. It works with Claude Code, Cursor, Windsurf, Zed, VS Code, JetBrains, and Google Antigravity, so if you already have an MCP client, sverklo just appears as 37 new tools alongside whatever else you run.
How is Sverklo different from Cursor's @codebase or Claude Context?
Cursor's @codebase indexing is cloud-based and tied to the Cursor editor. Claude Context (Zilliz) requires a Milvus database. Sverklo runs entirely on your laptop with SQLite and ONNX embeddings, works across every major AI coding agent via the MCP protocol, and adds symbol-level impact analysis and bi-temporal memory that neither offers.
Does Sverklo work offline?
Yes after the first-run model download. Sverklo downloads the ONNX embedding model once (~90MB, cached locally). After that you can disconnect from the internet and indexing, search, embeddings, memory, and dashboard continue to work locally. Telemetry is opt-in and off by default. Your code never leaves your laptop.
Which AI coding agents does Sverklo support?
Sverklo works with any AI coding agent that speaks the Model Context Protocol (MCP): Claude Code, Cursor, Windsurf, Zed, VS Code, JetBrains, and Google Antigravity. The sverklo init command auto-detects which clients you have installed and writes the right config files.
Is Sverklo free and open source?
Yes. The current server is MIT licensed. All 37 tools in the current OSS server have no usage limits and opt-in telemetry is off by default (see /security). A future Sverklo Pro tier may add smart auto-capture and larger embedding models, and Sverklo Team may add shared team memory; current OSS functionality should not be gated by those tiers.
How do I install Sverklo in Claude Code?
Run npm exec --yes --package=sverklo@latest -- sverklo prove --no-write --guided --markdown from your project first. It prints a real repo-memory receipt without writing project files or MCP config. If the proof looks useful, run npm install -g sverklo, sverklo init --dry-run, then sverklo init. Restart Claude Code and the sverklo tools appear in the /mcp list.
What programming languages does Sverklo support?
24 languages total. 10 first-class with structural parsing: TypeScript/TSX, JavaScript, Python, Go, Rust, C# (tree-sitter), plus Vue (SFC), Markdown, and Jupyter notebooks (custom parsers). 14 more via regex fallback: Java, C, C++, Ruby, PHP, Kotlin, Scala, Swift, Dart, Elixir, Lua, Zig, Haskell, Clojure, OCaml. Hybrid search works across all 24; symbol-level impact analysis is sharpest on the first-class 10.
When is Sverklo the wrong tool?
Sverklo is the right tool when you don't know exactly what string to search for. When you do know the literal string, plain grep is faster and more reliable. On a 30-file repo, you don't need Sverklo. It earns its place on large interconnected codebases where AI coding agents waste thousands of tokens reading the wrong files.
prove it on your repo.
Run the receipt, inspect the callers/tests/decisions it finds, and share the part that is wrong or useful. Local-first, MIT, no API keys, no code upload.