How do I stop Cursor from hallucinating my codebase?

Same root cause as Claude Code: Cursor generates from training data when it doesn't have authoritative retrieval. Sverklo runs alongside Cursor as an MCP server (Cursor 0.42+ supports MCP) and adds the symbol graph, blast radius, and diff-aware risk-scored review that Cursor's @codebase indexing doesn't expose. sverklo init auto-detects Cursor and writes the right config files.

How does Sverklo handle MCP code search?

Sverklo is a local-first MCP server that combines hybrid code search (BM25 + ONNX embeddings + PageRank), symbol-level impact analysis, diff-aware PR review, and bi-temporal memory in one tool. It ships 37 MCP tools total — defaulting to a compact 'core' profile that covers most code-intel sessions — and runs entirely on your laptop with no API keys. Works with Claude Code, Cursor, Windsurf, Zed, VS Code, JetBrains, and Google Antigravity.

How does Sverklo compare with Sourcegraph Cody?

Sverklo is the open-source local-first alternative to Sourcegraph Cody for codebase question-answering. Cody is source-available with an enterprise deployment model and per-developer pricing; Sverklo is MIT-licensed, single-machine, and free. Same retrieval surface (hybrid BM25 + vector + PageRank), different deployment model. Sverklo doesn't try to be a hosted IDE plugin — it's a 37-tool MCP retrieval layer that any AI coding agent can call.

How do I give an AI coding agent persistent memory across sessions?

Sverklo's remember and recall ship a bi-temporal memory layer pinned to git SHAs. Every memory carries valid_from_sha and valid_until_sha; updating a memory inserts a new row and links the old one via superseded_by, so the timeline view can answer 'what did this team believe about the auth flow at commit abc123?' Recall is hybrid (FTS5 + cosine over an ONNX embedding) and runs entirely in embedded SQLite — no Zilliz, Milvus, or managed Postgres+pgvector required.

What local-first code-intelligence evidence does Sverklo publish?

Sverklo is MIT-licensed and ships hybrid retrieval (BM25 + ONNX embeddings + PageRank), symbol-graph impact analysis, diff-aware risk-scored review, and bi-temporal memory pinned to git SHAs in one zero-config install. It runs on a laptop with embedded SQLite and a local ONNX model. No cloud, no API keys, no signup. Numbers are reproducible on the published 180-task bench at sverklo.com/bench.

/* proof receipt ◦ public benchmark ◦ visible losses ◦ local-first MIT */

Run the proof before your coding agent edits.

Name: Sverklo
Author: Sverklo

Same task, same coding agent, better repo evidence. Sverklo gives Claude Code, Cursor, Windsurf, and Codex-style agents your repo's files, symbols, callers, tests, dependencies, diffs, and git-pinned decisions before they edit.

Inspect the public 180-task benchmark, then run a receipt on a real repo. MIT, no API keys, no code upload; first run downloads a local ONNX model. Caveat: grep still wins exact strings.

compare same task → proof guide → repo memory → read benchmark →

Run the copied command from a real repo, follow the proof guide, then paste the markdown receipt into the proof thread. No project files or MCP config are written; first run may cache the local ONNX model/index under ~/.sverklo.

180hand-verified retrieval tasks
6OSS codebases in the bench
5baselines, including grep
0code upload
MITlicensed

~/your-project claude code

# prove first; no project files or MCP config written $ npm exec --yes --package=sverklo@latest -- sverklo prove --no-write --guided --markdown ✓ trial mode ← no project files or MCP config written ✓ guided selection ← why this symbol was picked ✓ repo memory proof ← central files + real caller graph ✓ markdown receipt ← paste into GitHub, Discord, Reddit # paste the proof prompt into Claude/Cursor/Codex > Use sverklo impact on validateToken and tell me what would break if I changed its signature. › impact symbol="validateToken" ─ src/routes/auth.ts:18 requireAuth() direct caller ─ src/middleware/auth.ts:42 authMiddleware() direct caller ─ src/tests/auth.test.ts:31 login flow test coverage ✓ caller graph in 23ms — signature change is not local

# install sverklo $ npm install -g sverklo # add to .cursor/mcp.json (or Cursor Settings → MCP) $ cat .cursor/mcp.json { "mcpServers": { "sverklo": { "command": "sverklo", "args": ["mcp", "."] } } } # Cursor agent picks up sverklo on next session > where do we handle stripe webhooks? › context task="stripe webhook handling" ─ src/api/webhooks/stripe.ts:14 handleStripeEvent() score: 0.92 ─ src/billing/events.ts:31 processCharge() score: 0.78 ✓ bundle: 4 files, 2 memories — 1,840 tokens

# install sverklo and run init — Antigravity is auto-detected $ npm install -g sverklo $ cd your-project && sverklo init ✓ ~/.gemini/antigravity/mcp_config.json ← global MCP config ✓ added sverklo (project: ~/your-project) Restart Antigravity to pick up the new MCP server. # Gemini in Antigravity uses sverklo via MCP > review my branch before I merge to main › review_diff ref="main..HEAD" ─ ◆ risk 52 src/api/billing.ts 2 removed symbols still referenced; 8 importers › test_map ref="main..HEAD" ─ 5 covered · 1 uncovered (risk 78) verdict: BLOCKED — fix dangling refs in 3 files

# one server, all your repos — no per-project config needed $ npm install -g sverklo $ cd ~/api && sverklo init # auto-registers $ cd ~/frontend && sverklo init # auto-registers $ sverklo setup --global # one-time global MCP config # now every AI editor has access to all your repos $ sverklo list api ~/api 37 tools fresh frontend ~/frontend 37 tools fresh

same task, same agent

What changes when the agent gets repo evidence first?

The comparison is intentionally plain: ask the same coding agent to change an auth helper. Baseline tools make it search and guess. Sverklo hands it the repo relationships before the edit.

Task

Change an auth helper signature without breaking callers, tests, or prior project decisions.

Baseline path

The agent greps likely names, reads a few files, and may miss indirect callers or stale decisions after context compaction.

Sverklo path

impact, refs, test_map, and recall show real callers, covered tests, dependencies, and git-pinned decisions.

Evidence changed

Files, symbols, callers, tests, dependencies, diffs, and decisions appear before the model writes code.

Outcome + caveat

Fewer blind edits on relationship-heavy tasks. If you already know the exact string or the repo is tiny, grep is still the sharper first move.

Proof source: public 180-task retrieval benchmark plus maintainer-seeded receipts. External same-task receipts are still the top proof gap.

/* 90-second walkthrough — terminal then live Claude Code MCP integration */

seeded receipts from real repos

Don't trust the claim. Run the proof.

sverklo prove --markdown prints a shareable receipt from the current repo: central files, one real symbol, real callers, and the exact prompt to paste into your coding agent.

These examples are maintainer-seeded so you can inspect the shape before installing. They are not third-party validation; external receipts and corrections are the current ask.

Sverklo

maintainer-seeded example

TypeScript MCP server, CLI, benchmark harness, telemetry worker, and site worktree in one receipt.

#79

Express

maintainer-seeded example

Middleware-heavy JavaScript repo: real entry points, core symbols, and caller relationships.

#79

Fastify

maintainer-seeded example

Large Node framework with plugin and route structure that rewards graph-aware retrieval.

comment

Zod

maintainer-seeded example

TypeScript validation library where symbol precision matters more than keyword matching.

comment

Zustand

maintainer-seeded example

Compact state library showing whether the proof remains useful when the repo is not huge.

comment

paste your receipt → see benchmark →

/* what happened when we published the bench */

A public benchmark, two competitors, fixes shipped on both sides in 36 hours.

Apr 28 Published 180-task bench across 6 codebases with two competing local-first MCP code-intel servers (jcodemunch-mcp, GitNexus) and the slices where sverklo lost — including FastAPI P5 dead-code (0.00 vs grep's 1.00).
May 2 → 3 jcodemunch maintainer @jgravelle shipped v1.80.7 / 1.80.8 / 1.80.9 against specific bench findings. P5 recall 0.00 → 1.00; lodash P1 0/10 → 9/10.
May 4 Adding lodash to the bench exposed the symmetric blind spot in sverklo's parser. sverklo v0.20.2 ships the fix. Sverklo P1 0.30 → 0.73; overall F1 0.45 → 0.56 on that run.

"the bigger opportunity here is the potential genesis of an 'MCP Server Arena' on par with what the leading AI/LLM/Chatbot arenas provide…"
— Jake Gravelle, jcodemunch-mcp maintainer, on r/mcp

180

hand-verified tasks

6 OSS codebases · 5 baselines

0.58

overall F1

canonical May 13 bench run

35×

fewer input tokens

vs naive grep on that bench

losses

published too

grep and baselines still win slices

methodology + losses → · sverklo-benchground truth + reproducer · preprint · Zenodo · CC BY 4.0 →

/* honest about when this helps */

Sverklo doesn't replace grep. It complements it.

SVERKLO WINS

✓ Exploratory questions ("how does the auth flow work?")
✓ Refactor blast radius (impact)
✓ Large interconnected codebases
✓ Memory across sessions, tied to git SHAs
✓ Project audits — god nodes, dead code

GREP WINS

— Focused diff review
— Exact string matching
— Reading file contents
— Build & test verification
— Anything where you already know the symbol

Sverklo is the right tool when you don't know exactly what to search for. When you do know, grep is fine.

/* diff-aware mr review */

your agent
reviews like
a senior dev.

~/your-project review main..HEAD

> /sverklo:pre-merge › review_diff ref="main..HEAD" ─ 7 files changed (12 added, 3 removed, 4 modified) ## ⚠️ Highest-risk files ─ ◆ risk 78 (critical) src/auth/session.ts no matching tests; security-sensitive; 14 importers ─ ◆ risk 52 (high) src/api/billing.ts 2 removed symbols still referenced; 8 importers › test_map ref="main..HEAD" ─ 5 covered · 2 uncovered ─ ◆ risk 78 src/auth/session.ts ← block merge › impact symbol="oldValidateToken" ─ 4 dangling references in 3 files — must fix verdict: BLOCKED — 1 critical, 4 dangling refs

per-file risk · main..HEAD low med high

src/auth/session.ts 78

no tests · 14 importers · sec-sensitive

src/api/billing.ts 52

2 dangling refs · 8 importers

src/lib/jwt.ts 34

covered · 6 importers

src/utils/clock.ts 12

covered · 1 importer

score = importance × untested-ness × churn × dangling-refs

v0.17 → the GitHub Action posts inline review comments on every PR alongside the sticky summary, anchored to the heuristic-flagged lines.

risk score

Every changed file gets a 0–100 score combining untested status, security-sensitive paths, importer fan-in, caller count, dangling references, and churn. Reasons are explicit — no black box.

missing-test detection

test_map walks the diff, the import graph, and filename conventions to flag changed code without matching tests — ranked by risk.

dangling-ref check

Every removed symbol is checked against the symbol-reference graph. If a caller still exists, the merge is blocked with file:line evidence.

mcp prompts

Five workflows: /sverklo:review-changes, /sverklo:pre-merge, /sverklo:onboard, /sverklo:architecture-map, /sverklo:debug-issue.

/* review-quality benchmark — published, not claimed */

METHOD

Real merge requests across 5 OSS repos (Express, FastAPI, etc). Same Claude model, same prompt, with and without sverklo's diff-aware tools. F1 vs human-written review. Harness in benchmark/ — reproducible.

RESULTS

+29% F1 after parser improvements on P1/P2 categories. 2–4× fewer tokens per review on most repos. Mixed results on small focused diffs (built-in tools win there). Honest receipts beat marketing math.

WHY WE PUBLISH MIXED

Competitors claim "5–10× fewer tokens" with no methodology. We'd rather be the tool you trust than the one you screenshot. The benchmark harness is in the repo — clone it, run it on your codebase, decide for yourself.

/* performance benchmark — three real codebases, one laptop */

SEARCH p95

26 ms

On facebook/react (4,368 files, 20k chunks). On smaller repos (gin, nestjs) it drops to 11–14 ms. Sverklo lives in the agent's hot loop — the latency budget allows it.

IMPACT ANALYSIS

<1.2 ms

impact render on React's most-called symbol. Sub-millisecond on every repo we tested — it's an indexed SQL join, not a 200-grep-match scan.

COLD INDEX (REACT)

152 s

2.5 minutes for a 4k-file monorepo, ~7 ms/chunk. Pay it once per project; incremental refresh after that only re-processes changed files. The honest trade-off — listed first because we think you'd rather know.

      Run on Apple Silicon. Steady-state RAM ~200 MB; peak during indexing 400–700 MB while the embedder batches. Full methodology, all metrics, and the reproducer harness in BENCHMARKS.md. See the MCP benchmark comparison.
    

/* repo evidence graph */

indexed.
ranked.
remembered.

01 · task

"change this auth helper safely"

02 · evidence

files · symbols · callers
tests · deps · decisions

03 · rank

PageRank + RRF fusion
channel weights, path × 1.5

04 · agent prompt

edit plan with caveats
not a blind context dump

hybrid search

BM25 for precision, semantic embeddings for recall, PageRank for structural importance — fused via RRF. Faster and more accurate than grep.

local embeddings

all-MiniLM-L6-v2 via ONNX runtime. 384-dimensional vectors generated on your machine. No API calls, no data leaves your laptop.

pagerank ranking

Files that are imported by many others rank higher. Your agent finds the actually-important code first, not just keyword matches.

persistent memory

Save decisions, patterns, and preferences with git-state linking. Stale memories flagged automatically when referenced files change.

multi-repo + cross impact

One MCP server serves all your repos via a global registry. Trace how a change ripples across dependent repos. No per-project config needed — sverklo init in any repo, done.

configurable pagerank

Drop a .sverklo.yaml in your project root to tune PageRank weights, boost or penalize paths, and customize ranking to match your codebase's shape.

incremental

File watcher updates the index on every save. Dependency graph and PageRank recompute in real time. Always fresh.

eleven languages

TypeScript, JavaScript, Python, Go, Rust, Java, C, C++, Ruby, PHP, Vue.

/* persistent memory */

your agent
finally
remembers.

Every AI coding session starts from zero. Decisions vanish after context compaction. Sverklo fixes this with persistent memory linked to git state.

On sverklo init, your existing CLAUDE.md, .cursorrules, and ADRs get parsed into searchable memories — categorized, tagged, and embedded automatically.

Tell your agent to remember new things as they come up. Ask later. It's already there.

tuesday → thursday · two conversations, one memory remember + recall

# tuesday — agent saves the decision > we picked Prisma for the ORM because of better TypeScript types › remember category="decision" ✓ memory #42 saved · main@a3f29e1 # thursday — new conversation, fresh context window ───────────────────────────────────────────────────── Loaded 3 memories: ORM choice, auth pattern, API conventions > add a new API endpoint for user profiles # Claude already knows: Prisma + JWT (from memories #42, #43) › writes endpoint with Prisma + JWT — no rediscovery

#42 decision

"We chose Prisma over Drizzle for better TypeScript types."

kind: semantic
scope: workspace [ws]
tags: database · orm
git: main@a3f29e1
valid_from: a3f29e1
confidence: 1.0

v0.17 → remember scope:"workspace" writes once, surfaces across every repo in the workspace.

init on import

sverklo init imports your existing project knowledge:

CLAUDE.md (12)
.cursorrules (3)
docs/adr/001-prisma.md
docs/adr/002-auth.md
CONTRIBUTING.md (1)

18 memories · categorized, tagged, embedded

v0.17 → sverklo memory export pushes them to markdown / Notion / JSON.

bring your own docs

Auto-imports memories from CLAUDE.md, .cursorrules, AGENTS.md, CONTRIBUTING.md, and ADRs on init. Your existing project knowledge becomes semantically searchable instantly.

bi-temporal git memory

Every memory has valid_from_sha and valid_until_sha. Memories are never deleted, only superseded — so you can query "what we believed at commit X" and see when code drift made that advice stale.

staleness detection

If a memory references a file that no longer exists, it's flagged as stale. No more advice based on deleted code.

semantic recall

Memories are embedded and searched the same way as code. Ask "what did we decide about auth?" and get the relevant memory.

auto-inject on session start

An MCP resource surfaces top memories to Claude before you type anything. Your decisions travel across sessions automatically.

      37 MCP tools for search, impact analysis, review, and memory — defaults to a compact core profile for common code-intelligence sessions. Full reference on GitHub.
    

/* comparison matrix */

what Sverklo bundles
in one local MCP server.

Use this as a starting matrix, not a winner-takes-all claim. Some tools optimize for hosted review, some for editor-native search, and some for memory only. Sverklo's lane is local repo evidence for agents before they edit.

Tool	Code-native	Local-first	MCP drop-in	Git-aware memory	Symbol graph	Bi-temporal	Cross-repo
▸sverklo	●	●	●	●	●	●	●
mempalace	—	●	●	—	—	●	—
claude-mem	—	●	CC only	—	—	—	—
Mem0	—	crippled	SDK	—	—	—	—
Zep / Graphiti	—	Neo4j	●	—	—	●	—
Augment Code	●	cloud	●	—	●	—	—
Greptile	●	cloud	3rd party	—	●	—	—
Aider repo-map	●	●	—	—	—	—	—
Zilliz claude-context	●	Milvus	●	—	—	—	—
XRAY MCP	●	●	●	—	partial	—	—

Different lanes: Most "AI memory" tools (mempalace, claude-mem, Mem0, Zep) are built to remember conversations. Sverklo is built to understand code. We index 24 languages (10 first-class with structural parsing, 14 via regex fallback), build a symbol-level dependency graph, and run hybrid BM25 + vector + PageRank search — then bolt a bi-temporal memory layer on top that's tied to git commits, not wall-clock time.

The wedge: Sverklo ties code evidence and memory to git state, then warns when the code has drifted since a decision. If you want conversation memory, use mempalace. If you want local repo evidence before agent edits, run a Sverklo proof receipt.

Want a deeper dive? See side-by-side comparisons → against Serena, GitNexus, codebase-memory-mcp, Cursor @codebase, and Sourcegraph Cody.

/* install */

install.
prove.
code.

      WORKS WITH
      CClaude Code
      Cursor
      WWindsurf
      ZZed
      VVS Code
      JJetBrains
      AAntigravity
      & any MCP client
    

01 — prove

Run a no-write proof receipt

Requires Node 24+. Does not write project files, MCP config, or agent instructions. Read the proof guide.

02 — preview

Inspect setup writes first

Shows the exact project and home-directory config files before changing anything.

03 — wire

Initialize only after proof looks useful

Writes MCP config, agent instructions, and setup checks. Use a different agent name for Cursor, Codex, Windsurf, VS Code, Zed, or Antigravity.

/* questions we get asked */

questions,
honestly
answered.

How do I stop Claude Code from hallucinating function names that don't exist in my codebase?

Claude Code hallucinates function names because it generates from training-data patterns rather than your actual symbol graph. It will write getUserByEmail() when your code uses findByEmail(), invent imports for packages you don't depend on, and forget yesterday's design decision because context was compacted. Sverklo solves this with a 37-tool MCP retrieval layer the agent calls before writing code: lookup resolves a name to its definition with file:line, refs proves whether a symbol exists with caller context, verify lets the agent re-check that a quoted span is still present at the cited git SHA. Run npm exec --yes --package=sverklo@latest -- sverklo prove --no-write --guided --markdown to see central files and a real caller graph before writing MCP config. Local-first, MIT-licensed, no API keys.

Why would I install another MCP server?

Most MCP servers are single-purpose wrappers around an API. Sverklo is local-first code intelligence: hybrid search (BM25 + ONNX embeddings + PageRank), symbol-level impact analysis, diff-aware PR review, and bi-temporal memory — all running on your laptop with no API keys. It works with Claude Code, Cursor, Windsurf, Zed, VS Code, JetBrains, and Google Antigravity, so if you already have an MCP client, sverklo just appears as 37 new tools alongside whatever else you run.

How is Sverklo different from Cursor's @codebase or Claude Context?

Cursor's @codebase indexing is cloud-based and tied to the Cursor editor. Claude Context (Zilliz) requires a Milvus database. Sverklo runs entirely on your laptop with SQLite and ONNX embeddings, works across every major AI coding agent via the MCP protocol, and adds symbol-level impact analysis and bi-temporal memory that neither offers.

Does Sverklo work offline?

Yes after the first-run model download. Sverklo downloads the ONNX embedding model once (~90MB, cached locally). After that you can disconnect from the internet and indexing, search, embeddings, memory, and dashboard continue to work locally. Telemetry is opt-in and off by default. Your code never leaves your laptop.

Which AI coding agents does Sverklo support?

Sverklo works with any AI coding agent that speaks the Model Context Protocol (MCP): Claude Code, Cursor, Windsurf, Zed, VS Code, JetBrains, and Google Antigravity. The sverklo init command auto-detects which clients you have installed and writes the right config files.

Is Sverklo free and open source?

Yes. The current server is MIT licensed. All 37 tools in the current OSS server have no usage limits and opt-in telemetry is off by default (see /security). A future Sverklo Pro tier may add smart auto-capture and larger embedding models, and Sverklo Team may add shared team memory; current OSS functionality should not be gated by those tiers.

How do I install Sverklo in Claude Code?

Run npm exec --yes --package=sverklo@latest -- sverklo prove --no-write --guided --markdown from your project first. It prints a real repo-memory receipt without writing project files or MCP config. If the proof looks useful, run npm install -g sverklo, sverklo init --dry-run, then sverklo init. Restart Claude Code and the sverklo tools appear in the /mcp list.

What programming languages does Sverklo support?

24 languages total. 10 first-class with structural parsing: TypeScript/TSX, JavaScript, Python, Go, Rust, C# (tree-sitter), plus Vue (SFC), Markdown, and Jupyter notebooks (custom parsers). 14 more via regex fallback: Java, C, C++, Ruby, PHP, Kotlin, Scala, Swift, Dart, Elixir, Lua, Zig, Haskell, Clojure, OCaml. Hybrid search works across all 24; symbol-level impact analysis is sharpest on the first-class 10.

When is Sverklo the wrong tool?

Sverklo is the right tool when you don't know exactly what string to search for. When you do know the literal string, plain grep is faster and more reliable. On a 30-file repo, you don't need Sverklo. It earns its place on large interconnected codebases where AI coding agents waste thousands of tokens reading the wrong files.

/* one proof receipt, MIT-licensed, stays on your laptop */

prove it on your repo.

Run the receipt, inspect the callers/tests/decisions it finds, and share the part that is wrong or useful. Local-first, MIT, no API keys, no code upload.

inspect receipts →

Run the proof before your coding agent edits.

What changes when the agent gets repo evidence first?

Don't trust the claim. Run the proof.

A public benchmark, two competitors, fixes shipped on both sides in 36 hours.

Sverklo doesn't replace grep. It complements it.

your agentreviews likea senior dev.

risk score

missing-test detection

dangling-ref check

mcp prompts

indexed.ranked.remembered.

hybrid search

local embeddings

pagerank ranking

persistent memory

multi-repo + cross impact

configurable pagerank

incremental

eleven languages

your agentfinallyremembers.

bring your own docs

bi-temporal git memory

staleness detection

semantic recall

auto-inject on session start

what Sverklo bundlesin one local MCP server.

install.prove.code.

questions,honestlyanswered.

prove it on your repo.

your agent
reviews like
a senior dev.

indexed.
ranked.
remembered.

your agent
finally
remembers.

what Sverklo bundles
in one local MCP server.

install.
prove.
code.

questions,
honestly
answered.