AxonAtlas logoAxonAtlas
PlatformPricingDocumentation
Login
Get Started

Contents

Platform Documentation

AxonAtlas Architecture & Intelligence

A comprehensive technical guide to the AxonAtlas Code Intelligence Platform — how it works, what it builds, and why it represents a fundamental shift in how engineering teams understand their software systems.

Code Intelligence PlatformSoftware Knowledge GraphMCP Native14-Phase PipelineMulti-Repository

What is AxonAtlas

AxonAtlas is a semantic code intelligence engine that transforms raw source code repositories into a queryable, semantically-rich knowledge graph. It goes far beyond surface-level linting or traditional static analysis — AxonAtlas constructs a deep structural and semantic model of an entire codebase, resolving function calls, propagating types, detecting architectural communities, mapping execution flows, and identifying dead code — all within a unified graph that AI agents and developers can query in real time.

Think of AxonAtlas as a Software Knowledge Engine: an always-on intelligence substrate that understands not just what code exists, but how it connects, why it changes together, and where risk concentrates. Traditional tools show you a list of warnings. AxonAtlas shows you the living architecture of your system.

What AxonAtlas Does

  • →Parses entire codebases using Tree-sitter grammars, extracting functions, classes, methods, interfaces, types, enums, and variables
  • →Resolves inter-symbol dependencies using multi-tier reasoning — local scope, import resolution, receiver-method de-aliasing, and global fuzzy matching
  • →Enriches the graph with worklist-based type propagation, Leiden community detection, execution process flow tracing, and temporal coupling from git history
  • →Persists the knowledge graph into KuzuDB — a high-performance embedded graph database — supporting full-text search, vector similarity, and raw Cypher queries
  • →Exposes intelligence through the Model Context Protocol (MCP), enabling AI agents, IDEs, and CI systems to query code structure in real time

Who AxonAtlas Serves

AI Coding Agents

Structured code context for accurate modifications — callers, callees, types, communities, blast radius. Agents make fewer mistakes when they can see the full picture.

Engineering Leaders

Architectural visibility — community maps, coupling hotspots, dead code quantification. Make data-driven decisions about technical debt and system health.

Developer Experience Teams

Onboarding acceleration — narrative symbol explanations, process flow visualization. New engineers understand the system in hours, not months.

Platform & DevOps

PR risk scoring, test impact analysis, incremental CI intelligence. Know exactly which tests to run and which reviews need extra attention.

Supported Languages & Frameworks

AxonAtlas provides first-class support for Python and TypeScript/JavaScript, with framework-aware entry point detection that understands the conventions of modern web and CLI frameworks:

EcosystemFrameworksEntry Point Detection
PythonFlask, FastAPI, Django, Click, Celery, APSchedulerDecorators, main(), __main__.py, argparse
TypeScriptExpress, Next.js, Node.jsRoute handlers, pages, layouts, event listeners

Why Code Intelligence Matters

The software industry is undergoing a structural transformation. AI coding agents can now generate thousands of lines of code per hour — but this unprecedented velocity has created a new category of engineering risk: architectural drift at machine speed.

Traditional static analysis tools were designed for a world where humans wrote code slowly and reviewed it carefully. They answer narrow questions: “Is this variable unused?” “Does this type match?” But they cannot answer the questions that matter most in an agent-assisted world: “If I change this function, what breaks downstream?” “Which files always change together but have no import relationship?” “Is this code actually reachable from any entry point?”

Code Intelligence is the answer to this gap. It goes beyond syntax-level checks to build a living model of your system's architecture — one that understands relationships, dependencies, coupling patterns, and execution flows. It is the difference between having a spell-checker and having an editor who understands your entire narrative.

Static Analysis vs. Code Intelligence

The distinction between traditional static analysis and a code intelligence platform is fundamental. Static analysis operates on individual files or narrow scopes. Code intelligence operates on the entire system as a connected graph.

CapabilityTraditional Static AnalysisAxonAtlas Code Intelligence
RepresentationFlat warning lists, AST dumpsUnified knowledge graph (11 node types, 17 edge types)
Semantic DepthSingle-pass name matchingMulti-tier call resolution with confidence scoring
Type IntelligenceNo type propagationCompiler-grade engines (Pyright/TSC) + heuristic fallback
Architectural InsightNoneLeiden community detection, process flow tracing
Change IntelligenceNoneTemporal coupling (git history) + structural coupling
Incremental UpdatesFull re-scan or dirty file onlyDependency-aware cascade — transitive importers re-analyzed
Agent IntegrationCLI-only or JSON outputNative MCP protocol with 17 tools + 3 resources
Multi-RepositorySingle-repo scopeCross-repo symbol resolution, workspace-level intelligence

The Developer Productivity Crisis

Studies consistently show that developers spend more time understanding existing code than writing new code. In large organizations, this ratio can reach 70:30 — seventy percent of engineering time spent reading, navigating, and comprehending code that already exists. This is the invisible tax on every feature, every refactor, every bug fix.

AxonAtlas directly addresses this productivity crisis by providing a continuously updated, queryable modelof the entire codebase. Instead of manually tracing call chains through dozens of files, an engineer (or an AI agent) can ask: “Show me every caller of this function, what community it belongs to, and what processes flow through it” — and receive an answer in milliseconds.

This is not incremental improvement. This is a category-level shift in how engineering teams interact with their own systems.

System Architecture Overview

AxonAtlas is organized into four clearly separated layers, each with distinct responsibilities. This layered architecture ensures that the intelligence pipeline, the storage engine, and the protocol interface can evolve independently without breaking contracts between them.

At the highest level, the MCP Protocol Layer handles all external communication — exposing 17 tools and 3 resources over both stdio and Streamable HTTP transports. Below it, the Intelligence Services layer provides the reasoning engine: query planning, impact analysis, narrative explanations, and risk scoring. The Semantic Core manages symbol indexes, type intelligence, and workspace-level coordination. And at the foundation, the Ingestion Pipeline and Storage Layer convert raw source code into a persistent, queryable knowledge graph.

System Architecture
┌──────────────────────────────────────────────────────────────┐
│                     MCP Protocol Layer                        │
│  ┌──────────┐  ┌────────────┐  ┌────────┐  ┌──────────────┐ │
│  │ 17 Tools │  │ 3 Resources│  │Security│  │ HTTP + stdio │ │
│  └────┬─────┘  └─────┬──────┘  └───┬────┘  └──────┬───────┘ │
└───────┼──────────────┼─────────────┼───────────────┼─────────┘
        │              │             │               │
┌───────┼──────────────┼─────────────┼───────────────┼─────────┐
│       ▼              ▼             ▼               ▼         │
│                Intelligence Services                         │
│  ┌───────────┐  ┌────────┐  ┌─────────┐  ┌─────────────┐    │
│  │ Query     │  │ Impact │  │ Explain │  │ Review Risk │    │
│  │ Planning  │  │Analysis│  │Narrative│  │  Scoring    │    │
│  └─────┬─────┘  └───┬────┘  └────┬────┘  └──────┬──────┘    │
└────────┼────────────┼────────────┼───────────────┼───────────┘
         │            │            │               │
┌────────┼────────────┼────────────┼───────────────┼───────────┐
│        ▼            ▼            ▼               ▼           │
│                  Semantic Core                               │
│  ┌────────────┐  ┌──────────┐  ┌────────────┐               │
│  │ Symbol     │  │ Type     │  │ Workspace  │               │
│  │ Index      │  │ Intel    │  │ Manager    │               │
│  └─────┬──────┘  └────┬─────┘  └─────┬──────┘               │
└────────┼──────────────┼──────────────┼───────────────────────┘
         │              │              │
┌────────┼──────────────┼──────────────┼───────────────────────┐
│        ▼              ▼              ▼                       │
│             Ingestion Pipeline (14 Phases)                   │
│  ┌────────┐ ┌──────┐ ┌───────┐ ┌────────┐ ┌──────────────┐  │
│  │Structure│→│Parse │→│Import │→│TypeFlow│→│  Resolution  │  │
│  └────────┘ └──────┘ └───────┘ └────────┘ └──────┬───────┘  │
│  ┌──────────┐ ┌──────────┐ ┌─────────┐ ┌────────┴──┐       │
│  │Community │ │ Coupling │ │Process  │ │ Dead Code │       │
│  └──────────┘ └──────────┘ └─────────┘ └───────────┘       │
└──────────────────────────────────────────────────────────────┘
         │
┌────────┼─────────────────────────────────────────────────────┐
│        ▼          Storage Layer                              │
│  ┌────────────────┐  ┌─────────────┐  ┌──────────────────┐  │
│  │  KuzuDB Graph  │  │ Connection  │  │  Repo Router     │  │
│  │  Cypher Engine  │  │ Pool + R/W  │  │  (LRU Pool)     │  │
│  └────────────────┘  └─────────────┘  └──────────────────┘  │
└──────────────────────────────────────────────────────────────┘

Core Module Map

The system comprises 16 core modules, each with a single, well-defined responsibility. This modularity ensures that individual components can be tested, optimized, and replaced without cascading changes across the system.

LayerResponsibility
Data Model11 NodeLabels, 17 RelTypes, deterministic moniker IDs
In-Memory GraphThread-safe KnowledgeGraph with 6 secondary indexes
Persistent StorageKuzuDB integration, FTS, vector search, batch operations
Multi-Repo RouterLRU connection pool, registry, path normalization
Ingestion Pipeline14-phase DAG orchestrator with topological validation
Phase ModelDeclarative dependency DAG with Kahn's cycle detection
Call Resolution4-tier hierarchical resolver with confidence scoring
Type PropagationWorklist-based Type Flow Graph
Community DetectionAdaptive Leiden algorithm (full/chunked/sampled)
Dead Code AnalysisReachability analysis with false-positive reduction
Process FlowsEntry point detection + BFS flow tracing
Temporal CouplingGit co-change + structural coupling hybrid
Incremental EngineDependency-aware cascade reparse
Intelligence ServiceQuery planning, impact analysis, probabilistic ranking
MCP Serverstdio + HTTP transport, 17 tools, 3 resources
SecuritySupabase-backed secret validation, repo-level ACL

The Intelligence Pipeline

At the heart of AxonAtlas is a deterministic, multi-phase directed acyclic graph (DAG)with 14 declared phases. This is not a simple sequential script — it is a formally validated pipeline where each phase declares its dependencies, and the execution order is verified at startup using Kahn's algorithm for cycle detection.

This design ensures that phase ordering bugs are caught before any analysis runs. Phases within the same parallel group are verified to have no transitive dependencies among themselves, enabling safe concurrent execution. Optional phases — such as coupling analysis, duplicate detection, and embedding generation — can be skipped without breaking downstream phases, providing graceful degradation when certain data sources (like git history) are unavailable.

Pipeline DAG
                    structure
                       │
                    parsing
                       │
                    imports
                       │
                   type_flow
                   ╱   │   ╲
              calls heritage types    ← parallel_group: "resolution"
                   ╲   │   ╱
                   contracts
                 ╱  │  │  ╲  ╲
        communities │  │  coupling  duplicates  ← parallel_group: "semantic"
                processes │
                   dead_code
                        │
                    embeddings

Phase-by-Phase Breakdown

Each phase builds on the outputs of its predecessors. The early phases establish the structural foundation — file nodes, parsed symbols, import relationships. The middle phases add semantic depth — type propagation, call resolution, class hierarchies. The later phases perform higher-order analysis — community clustering, process flow detection, dead code identification. Finally, the embedding phase generates vector representations for semantic search.

#PhaseDescriptionKey Output
1StructureWalk filesystem, create FILE and FOLDER nodes with CONTAINS edgesRepository tree
2ParsingTree-sitter parsing: extract FUNCTION, CLASS, METHOD, INTERFACE, ENUM, VARIABLE, TYPE_ALIAS nodesSymbol table
3ImportsResolve IMPORTS (file→file) and IMPORTS_SYMBOL (file→symbol) edges across modulesImport graph
4Type FlowWorklist-based type propagation across assignments, parameters, return typesType annotations
5CallsMulti-tier call resolution (local → import → receiver → global fuzzy)Call graph
6HeritageResolve EXTENDS and IMPLEMENTS edges for class hierarchiesInheritance tree
7TypesResolve USES_TYPE edges from symbols to referenced typesType dependency graph
8ContractsStitch CONSUMES_API edges from combined import + call evidenceAPI surface map
9CommunitiesLeiden clustering with adaptive strategy (full/chunked/sampled)Community nodes
10ProcessesEntry point detection + BFS flow tracingProcess flow nodes
11Dead CodeReachability analysis from entry points with false-positive reductionis_dead flags
12CouplingGit co-change matrix + structural coupling hybridCOUPLED_WITH edges
13DuplicatesSemantic clone detection via structural similaritySEMANTIC_CLONE edges
14EmbeddingsVector embedding generation for symbol nodesEmbedding table

Pipeline Guarantees

Determinism

Given identical input, the pipeline produces bit-identical graph output. Moniker IDs are content-addressed, not order-dependent. This makes it possible to verify index correctness by comparing fingerprints across machines.

Topological Validation

Execution order is validated against the declared DAG at startup. If a developer adds a new phase with an incorrect dependency, the system catches it immediately — before any analysis runs.

Parallel Safety

Phases within the same parallel_group are verified to have no transitive dependencies among themselves. The resolution group (calls, heritage, types) runs concurrently because each operates on independent edge types.

Graceful Degradation

Optional phases can be skipped without breaking the pipeline. If git history is unavailable, coupling analysis is skipped. If no embedding model is configured, the embedding phase is bypassed.

The Software Knowledge Graph

The AxonAtlas knowledge graph is a labeled property graph with a precisely defined schema of 11 node types and 17 relationship types. Every entity in the codebase — from a single variable to an entire architectural community — is represented as a typed node with properties. Every connection between entities — a function call, an import, a type reference, a temporal coupling — is represented as a typed, directed edge.

This graph is not an afterthought bolted onto existing analysis. It is the analysis. Every intelligence capability AxonAtlas provides — from impact analysis to community detection to dead code identification — is implemented as a graph traversal or graph algorithm over this unified model.

Node Types

The graph schema distinguishes between structural nodes (representing physical entities in the codebase) and synthetic nodes (representing discovered patterns or computed groupings). Structural nodes are created during the parsing phase. Synthetic nodes emerge from later analysis phases.

LabelCategorySemanticsExample
FILEStructuralSource file on disksrc/auth/service.py
FOLDERStructuralDirectory containersrc/auth/
FUNCTIONStructuralStandalone functionvalidate_token()
CLASSStructuralClass definitionAuthService
METHODStructuralClass-bound methodAuthService.verify()
INTERFACEStructuralTypeScript interface / Python ProtocolIAuthProvider
TYPE_ALIASStructuralType alias or typedefUserId = string
ENUMStructuralEnumeration typeUserRole
VARIABLEStructuralModule-level variable or constantMAX_RETRIES
COMMUNITYSyntheticDetected architectural clusterauth-core
PROCESSSyntheticDiscovered execution flowlogin → verify → issue_token

Relationship Types

Relationships carry the semantic intelligence. Each edge type models a specific kind of connection with distinct semantics. Some relationships are directional (a function calls another function); others are symmetric (two files are coupledin git history). The diversity of edge types is what makes AxonAtlas queries so expressive — you can ask “who calls this function?” as easily as “which files are temporally coupled with this one?”

RelationshipDirectionSemantics
CONTAINSFolder → FileStructural containment
DEFINESFile → SymbolSymbol provenance — which file owns this symbol
CALLSSymbol → SymbolFunction or method invocation
IMPORTSFile → FileModule-level import dependency
IMPORTS_SYMBOLFile → SymbolNamed import binding (from X import Y)
EXTENDSClass → ClassClass inheritance
IMPLEMENTSClass → InterfaceInterface conformance or protocol implementation
USES_TYPESymbol → TypeType annotation reference
EXPORTSFile → SymbolPublic API surface exposure
MEMBER_OFSymbol → CommunityCluster membership assignment
STEP_IN_PROCESSSymbol → ProcessParticipation in an execution flow
COUPLED_WITHFile ↔ FileTemporal and structural co-change coupling
CONSUMES_APISymbol → SymbolCross-boundary API contract (e.g., frontend → backend)
SEMANTIC_CLONESymbol ↔ SymbolNear-duplicate code detection
CONVERGENT_FLOWProcess → ProcessOverlapping execution paths (>50% step overlap)
DYNAMICALLY_REFERENCESSymbol → SymbolRuntime-resolved reference (decorators, __getattr__)
CROSS_REPO_REFERENCESSymbol → SymbolCross-repository dependency across workspace

Deterministic Identity — The Moniker System

Every node in the AxonAtlas graph receives a deterministic, content-addressable ID called a moniker. Monikers are stable across re-indexes, portable across machines, and globally unique within a workspace. This is the foundational invariant that makes incremental indexing, cross-repository resolution, and graph fingerprinting possible.

moniker format
# Symbol moniker — encodes label, language, repo, module, and name
function:python:local:src/auth/service:validate_token
class:typescript:local:src/components/App:Dashboard

# Structural moniker — stable file identity
axon://file/src/auth/service.py

KuzuDB — The Storage Engine

The knowledge graph is persisted to KuzuDB, a high-performance embedded columnar graph database optimized for analytical queries. AxonAtlas wraps KuzuDB with a sophisticated concurrency layer: a read-many / write-exclusive lock enables concurrent read queries while serializing mutations, and a bounded connection pool prevents resource exhaustion under high query load.

The storage backend supports four distinct search modalities, each optimized for a different access pattern:

Search TypeAlgorithmUse Case
Full-Text SearchKuzuDB FTS extension with BM25 scoringSymbol name search, content search, documentation queries
Fuzzy SearchLevenshtein + Jaro-Winkler similarityTypo-tolerant symbol resolution for agent queries
Vector Searcharray_cosine_similarity() native in CypherSemantic code search via embeddings — find similar functions
Cypher QueriesRaw Cypher passthroughCustom agent queries, complex graph traversals, ad-hoc analysis

Semantic Intelligence Engine

Semantic intelligence is the capability that separates AxonAtlas from every other code analysis tool on the market. Where traditional tools stop at syntactic parsing — extracting function names and detecting import statements — AxonAtlas reconstructs the actual runtime behavior of the code through multi-tier reasoning about calls, types, and dependencies.

This is the layer that transforms a raw syntax tree into a semantic graph — where every function knows its callers and callees, every variable knows its type, and every class knows its place in the inheritance hierarchy. Without semantic intelligence, you would have a directory listing. With it, you have a navigable map of the entire system.

Multi-Tier Call Resolution

Call resolution is the most critical semantic operation in AxonAtlas. When the parser encounters a call expression like service.validate_token(), the resolver must determine which function is actually being invoked. Unlike a compiler, AxonAtlas operates without a symbol table — it must reconstruct call targets from incomplete information, context clues, and heuristic reasoning.

The resolver uses a hierarchical cascade with four tiers, each progressively broader in scope but lower in confidence. This design ensures that the highest-confidence resolution is always preferred, while still providing useful connections even when exact resolution is impossible.

Call Resolution Cascade
  ┌─────────────────────────────────────────────────────┐
  │  Tier 1: Local Scope Match      (confidence: 1.0)  │
  │  ─── Look for the symbol in the same file           │
  │  ─── Direct name match in local definitions         │
  └──────────────────────┬──────────────────────────────┘
            miss ↓
  ┌──────────────────────┴──────────────────────────────┐
  │  Tier 2: Import Resolution       (confidence: 0.9)  │
  │  ─── Follow import chain to target module           │
  │  ─── Resolve aliased imports (from X import Y as Z) │
  └──────────────────────┬──────────────────────────────┘
            miss ↓
  ┌──────────────────────┴──────────────────────────────┐
  │  Tier 3: Receiver-Method Binding (confidence: 0.7)  │
  │  ─── Trace variable type → class → method lookup    │
  │  ─── Uses type flow results for receiver typing     │
  └──────────────────────┬──────────────────────────────┘
            miss ↓
  ┌──────────────────────┴──────────────────────────────┐
  │  Tier 4: Global Fuzzy Match      (confidence: 0.5)  │
  │  ─── Search all known symbols for best-match name   │
  │  ─── Weighted by scope proximity and name distance  │
  └─────────────────────────────────────────────────────┘

Each resolved call edge carries its confidence score as a property. This is critical for downstream consumers — an AI agent querying callers can distinguish between high-confidence direct calls and lower-confidence fuzzy matches, adjusting its behavior accordingly. Impact analysis uses these scores to weight blast-radius calculations, ensuring that uncertain connections do not produce false alarms.

Worklist-Based Type Propagation

Many symbols in dynamic languages like Python lack explicit type annotations. AxonAtlas addresses this through a worklist-based type propagation algorithm modeled after compiler dataflow analysis. The algorithm constructs a Type Flow Graph (TFG) — a directed graph where nodes represent type-bearing positions (variables, parameters, return values) and edges represent type-flow relationships (assignments, function returns, parameter passing).

The worklist iterates until a fixed point is reached — when no more types can be propagated without violating the monotonicity constraint. This convergence guarantee means the algorithm always terminates and produces the maximally precise result given the available information. The propagated types are then used as input to the call resolution engine, significantly improving receiver-method binding accuracy.

Compiler-Grade Type Intelligence

For codebases that use TypeScript or typed Python, AxonAtlas can optionally invoke external type-checking engines — Pyright for Python and the TypeScript Compiler (tsc) for JavaScript/TypeScript. These engines provide ground-truth type information that supplements the heuristic type flow system.

EngineLanguageData ProvidedIntegration Method
PyrightPython 3.xResolved types, overloads, generics, Protocol conformanceJSON-RPC output parsing
tscTypeScript / JavaScriptFull type inference, module resolution, declaration mergingCLI output parsing
Heuristic TFGAll supportedAssignment-based propagation, return type inferenceInternal worklist algorithm

Advanced Analysis Capabilities

Beyond core call resolution and type inference, AxonAtlas provides four categories of higher-order analysis that transform the knowledge graph from a dependency map into a strategic intelligence layer for engineering decision-making.

Architectural Community Detection

Most codebases have an implicit architecture that diverges significantly from their folder structure. Functions in separate directories may be tightly coupled, while adjacent files may have no interaction at all. AxonAtlas uses the Leiden algorithm — the state-of-the-art in community detection — to discover the actual functional clusters in your code.

The implementation employs an adaptive strategy selector that automatically chooses the optimal algorithm configuration based on graph size. For small codebases (under 500 symbols), it runs the full Leiden algorithm. For medium graphs (500–5,000 symbols), it uses a chunked approach that processes subgraphs independently and merges results. For large graphs (5,000+ symbols), it uses a sampled approach that identifies communities from representative subsets and propagates membership to neighbors.

Graph SizeStrategyResolutionApproach
< 500 nodesFullAdaptive (0.5–2.0)Complete Leiden on full graph
500–5,000 nodesChunkedAdaptive per-chunkPartition → Leiden → Merge overlaps
5,000+ nodesSampledFixed 1.0Random sample → Leiden → Propagate to neighbors

Detected communities are materialized as COMMUNITY nodes in the graph, with MEMBER_OF edges connecting each symbol to its cluster. Community labels are automatically generated from the most prominent symbols within each group. This enables queries like: “Show me all symbols in the auth-core community” or “Which community does validate_token belong to?”

Dead Code Analysis

AxonAtlas identifies dead code through reachability analysis from detected entry points. The system traces all possible execution paths from known entry points (main functions, route handlers, CLI commands, test functions) through the call graph. Any symbol that is not reachable from at least one entry point is flagged as potentially dead.

Crucially, the analysis applies extensive false-positive reduction heuristics to avoid flagging symbols that are genuinely used but difficult to trace statically. Symbols referenced through decorators, dynamic dispatch, serialization frameworks, or reflection patterns are automatically excluded. Plugin registration patterns, event handler conventions, and protocol conformance are all recognized and respected.

Dead Code Detection Flow
  Entry Points (detected)    Known Alive
  ──────────────────────    ────────────
  ├─ main()                 ├─ Decorated functions
  ├─ @app.route(...)        ├─ __init__ methods
  ├─ @click.command(...)    ├─ Protocol implementations
  ├─ test_*()               ├─ __all__ exports
  ├─ layouts / pages        └─ Event handlers
  │
  └─→  BFS Reachability through CALLS edges
       │
       ├─ Reachable symbols → marked alive
       └─ Unreachable symbols → flagged is_dead=true
          │
          └─→ False-positive filter applied
              │
              └─→ Final dead code set returned

Execution Process Flow Detection

AxonAtlas automatically discovers end-to-end execution flows by tracing call chains from detected entry points. Each discovered flow is materialized as a PROCESS node in the graph, with STEP_IN_PROCESS edges connecting the symbols that participate in that flow.

This capability is particularly valuable for understanding the critical paths through a system. Rather than manually tracing from a route handler through middleware, service layers, data access, and back — the entire chain is pre-computed and queryable. When two processes share more than 50% of their steps, AxonAtlas automatically creates CONVERGENT_FLOW edges, highlighting architectural hotspots where multiple critical paths converge on shared infrastructure.

Temporal Coupling Analysis

Temporal coupling reveals hidden dependencies that are invisible in the import graph. Two files may have no syntactic relationship — no imports, no shared types — yet they always change together in git history. This pattern is a powerful signal: it typically indicates shared business logic, implicit contracts, or coordinated data structures.

AxonAtlas mines the git commit history to build a co-change matrix, computing the Jaccard similarity between file change sets. Files that appear together in commits above a configurable threshold (default: 30%) receive COUPLED_WITH edges. This is hybridized with structural coupling — shared call targets and type references — to produce a composite coupling score that captures both behavioral and structural dependency.

The practical impact is significant. When an engineer modifies a file, AxonAtlas can immediately surface: “Based on the last 500 commits, this file has a 78% coupling with validators.py— you may want to review it as well.” This kind of insight is impossible from code analysis alone; it requires historical behavioral data.

Multi-Repository Intelligence

Real-world software systems are rarely contained within a single repository. A typical engineering organization operates with a frontend application, a backend API, shared libraries, infrastructure-as-code, and potentially dozens of microservices — each in its own repository with its own dependency graph and release cycle.

AxonAtlas is architected from the ground up for multi-repository, workspace-level intelligence. The WorkspaceManager coordinates multiple repository indexes, the RepositoryRouter manages per-repo database connections, and the GlobalSymbolIndex provides a unified view across all indexed repositories — enabling queries that span repository boundaries.

Multi-Repository Architecture
  ┌──────────────────────────────────────────────────────────┐
  │                    Workspace Manager                      │
  │  ┌────────────┐  ┌────────────┐  ┌────────────────────┐  │
  │  │ Repo        │  │ Topological│  │ Global Symbol      │  │
  │  │ Registry    │  │ Sort Order │  │ Index (cross-repo) │  │
  │  └──────┬─────┘  └──────┬─────┘  └─────────┬──────────┘  │
  └─────────┼───────────────┼───────────────────┼─────────────┘
            │               │                   │
  ┌─────────┼───────────────┼───────────────────┼─────────────┐
  │         ▼               ▼                   ▼             │
  │                 Repository Router                         │
  │  ┌─────────────────────────────────────────────────────┐  │
  │  │  LRU Connection Pool (bounded, per-repo backends)   │  │
  │  └──────┬────────────┬────────────┬────────────────────┘  │
  │         │            │            │                       │
  │    ┌────▼────┐  ┌────▼────┐  ┌────▼────┐                  │
  │    │ Repo A  │  │ Repo B  │  │ Repo C  │  ← KuzuDB each  │
  │    │(backend)│  │(backend)│  │(backend)│                  │
  │    └─────────┘  └─────────┘  └─────────┘                  │
  └───────────────────────────────────────────────────────────┘

The Global Symbol Index

The GlobalSymbolIndex aggregates per-repository symbol indexes into a unified lookup layer. When a query references a symbol by name, the index searches across all registered repositories and returns candidates ranked by relevance. This enables cross-repository operations that would be impossible with single-repo tools.

Consider a common scenario: your frontend application imports types from a shared library, and your backend API consumes the same library. With AxonAtlas, you can ask: “Who consumes the User type across all repositories?” — and the GlobalSymbolIndex will return callers from both the frontend and backend, even though they live in separate repos with separate databases.

Repository Router & Connection Pooling

Each repository gets its own isolated KuzuDB database file, ensuring clean separation of data. The RepositoryRouter manages an LRU-bounded connection pool that keeps frequently accessed repositories warm while automatically evicting stale connections. This design scales efficiently from a single-repo developer setup to a workspace with dozens of indexed repositories.

The router also handles path normalization — resolving symlinks, canonicalizing case-insensitive paths on Windows, and mapping relative paths to absolute repository roots. This seemingly mundane responsibility is critical for ensuring that the same repository is never indexed twice under different path representations.

Cross-Repository Impact Analysis

When the WorkspaceManager detects CROSS_REPO_REFERENCES edges, impact analysis automatically spans repository boundaries. Modifying a utility function in a shared library triggers blast-radius calculation that includes consumers in every repository that depends on it. The topological sort order ensures that upstream repositories are always indexed before their downstream dependents, maintaining consistency across the entire workspace.

MCP Protocol & Agent Integration

AxonAtlas is a native Model Context Protocol (MCP) server — the emerging open standard for connecting AI agents to external tools and data sources. This means any MCP-compatible AI system — Claude, GPT, Cursor, Windsurf, or custom agents — can query the AxonAtlas knowledge graph in real time, receiving structured, typed responses that are immediately actionable.

The MCP integration is not a bolt-on API wrapper. The server is built directly into the AxonAtlas runtime, sharing the same graph instances, symbol indexes, and intelligence services as the core pipeline. This eliminates serialization overhead and ensures that agent queries operate on the most current graph state. The server supports both stdio transport (for local IDE integrations) and Streamable HTTP (for remote and team-level deployments)(Soon).

Tool Catalog — 17 Intelligence Tools

Every tool is designed with agent ergonomics in mind. Inputs accept natural identifiers (symbol names, file paths) rather than internal IDs. Outputs are structured with consistent schemas that agents can parse without ambiguity. Error responses include remediation guidance, helping agents self-correct.

ToolCategoryDescription
querySearchHybrid keyword + vector semantic search across the knowledge graph
contextInspection360-degree view of a symbol: callers, callees, types, community membership
explainNarrativeAI-friendly narrative explanation of a symbol's role and relationships
impactAnalysisBlast radius discovery: all affected symbols from a change, with depth control
call_pathAnalysisShortest path between two symbols via BFS over CALLS edges
communitiesArchitectureList or drill into detected communities with member symbols
couplingAnalysisShow temporally and structurally coupled files for a given file
dead_codeQualityList all unreachable symbols flagged as dead code
cyclesQualityDetect circular dependencies using strongly connected components
file_contextInspectionComprehensive file summary: symbols, imports, coupling, community
detect_changesCIMap git diff hunks to affected symbols in the graph
review_riskCIPR risk scoring: blast radius, missing co-changes, boundary crossings
test_impactCIFind tests affected by code changes — trace callers to test files
cypherAdvancedRaw Cypher query passthrough for custom graph exploration
graph_integrityOperationsStructural health check: invariant verification, trust metrics
helpMetaContextual operational guidance based on task type
list_reposMetaList indexed repositories with stats and graph reachability

Resource Endpoints

In addition to tools, AxonAtlas exposes three MCP resource endpoints that provide contextual data to agents during conversation initialization:

ResourceURI PatternData Provided
Repository Summaryaxon://repo/{name}/summaryNode counts, edge counts, graph metrics, indexing status
Ingestion Statusaxon://repo/{name}/ingestionPipeline state, phase completion, last run timestamp
Operational Manualaxon://help/playbookAgent-facing operational guidance and workflow recommendations

Security Architecture

The MCP server implements a multi-layer security architecture designed for team deployments where repository access must be controlled per-user and per-agent. Authentication is backed by a validated secret, providing JWT-based secret validation with per-repository access control lists (ACLs).

Secret Validation

Every MCP connection is authenticated via a validated secret. Unauthenticated connections receive a structured error with remediation instructions — not a silent failure.

Repository ACLs

Each authenticated user has a defined set of accessible repositories. Queries against unauthorized repositories return clear permission errors, preventing data leakage.

Structured Errors

All security failures produce typed, parseable error responses that agents can handle programmatically. No stack traces, no internal paths — clean error contracts.

Scalability & Performance Architecture

AxonAtlas is designed to scale from a single developer working on a personal project to an engineering organization with dozens of repositories containing millions of lines of code. Every layer of the system is engineered for predictable performance at scale.

The scalability architecture addresses three distinct dimensions: storage scalability (how the graph grows), query scalability (how fast intelligence is served), and operational scalability (how the system recovers and self-heals).

Concurrency & Thread Safety

The storage layer implements a read-many / write-exclusive lock (ReadWriteLock) that allows unlimited concurrent read queries while serializing write operations. This is the optimal concurrency model for a read-heavy workload — which is exactly the usage pattern for a code intelligence system, where graph mutations happen during indexing (infrequent) and queries happen continuously (frequent).

Connection Pooling

Each KuzuDB backend maintains a bounded connection pool. When all connections are in use, new queries queue rather than fail. Pool sizes are configurable per deployment — from 4 connections for a laptop to 32 for a team server.

LRU Repository Eviction

The RepositoryRouter uses an LRU cache to manage per-repo backends. When the pool reaches capacity, the least-recently-used repository's backend is gracefully closed and its resources released. Hot repositories stay warm automatically.

Batch Operations

Graph mutations (node creation, edge insertion) are batched into transactions. Instead of issuing thousands of individual Cypher statements, the backend constructs parameterized batch queries that insert hundreds of nodes in a single round-trip.

Secondary Indexes

The in-memory KnowledgeGraph maintains 6 secondary indexes: by-file, by-label, by-name, by-community, by-caller, and by-callee. These indexes provide O(1) access for the most common query patterns, eliminating the need for full graph scans.

Incremental Intelligence

Full re-indexing is expensive. For a large repository with thousands of files, a complete pipeline run can take minutes. AxonAtlas solves this with a dependency-aware incremental engine that re-analyzes only the files affected by a change — and their transitive importers.

When a file changes, the incremental engine identifies which symbols are impacted, traces the IMPORTS graph to find all transitive importers, schedules them for re-parsing, and runs only the affected pipeline phases. This cascade strategy ensures that downstream consumers of a modified symbol always receive updated information, while files unrelated to the change are left untouched.

Graph Integrity & Self-Healing

The axon_graph_integrity tool provides a comprehensive structural health check that verifies graph invariants, ingestion consistency, and trust metrics. This is the equivalent of a database CHECKDB — it ensures that the knowledge graph is internally consistent and trustworthy.

CheckWhat It VerifiesAction on Failure
Orphan nodesEvery non-FILE node has at least one incoming edgeFlag for re-indexing
Dangling edgesEvery edge connects two existing nodesRemove stale edges
Schema conformanceAll nodes have required properties (label, moniker, file_path)Report violations
Community coverageEvery non-structural node belongs to at least one communityRe-run community detection
Index consistencySecondary indexes match the canonical node setRebuild indexes

Real-World Use Cases

AxonAtlas is designed to serve multiple personas across an engineering organization. Each use case leverages the same underlying knowledge graph — the intelligence is universal, but the value is specific to each role.

AI Agent Infrastructure

AI Agent Developers & Platform Teams

AI coding agents make better decisions when they can see the full picture. By integrating AxonAtlas as an MCP server, agents gain real-time access to call graphs, community boundaries, blast radius data, and dead code maps — enabling them to propose changes that respect the existing architecture rather than blindly modifying files.

→Agents understand which functions are critical hotspots before modifying them
→PR descriptions include automatic impact analysis and risk scoring
→Architecture drift detection prevents agents from creating duplicate abstractions
→Test impact analysis tells agents exactly which tests need to run

Pull Request Risk Assessment

Engineering Leads & Code Reviewers

Every pull request is automatically assessed for architectural risk. AxonAtlas analyzes the diff, maps changed lines to affected symbols, calculates the downstream blast radius, identifies missing co-change files (files historically coupled but not included in the PR), and flags community boundary crossings that may indicate cross-team coordination needs.

→Risk scores prioritize which PRs need senior reviewer attention
→Missing coupled files are surfaced before they cause production incidents
→Community boundary crossings trigger appropriate team notifications
→Review time decreases because reviewers see the structural impact upfront

Developer Onboarding

New Team Members & DevEx Teams

Understanding a large codebase is the primary bottleneck for new engineers. AxonAtlas provides narrative symbol explanations, community maps, and process flow visualizations that transform the onboarding experience. Instead of reading through thousands of files, a new engineer can query: 'Explain the authentication flow' and receive a structured response tracing every function in the chain.

→Onboarding time reduced from weeks to days for complex systems
→Community maps reveal the actual architecture — not just the folder structure
→Process flow visualization shows critical paths end-to-end
→Narrative explanations provide context that code comments cannot

Technical Debt Quantification

Engineering Managers & CTOs

AxonAtlas transforms vague 'technical debt' discussions into data-driven conversations. Dead code analysis quantifies exactly how much unreachable code exists. Coupling analysis reveals which files are implicitly bound together. Community detection shows where the actual architecture diverges from the intended module boundaries.

→Dead code quantified in lines and symbols — prioritize cleanup by blast radius
→Coupling hotspots identified with historical confidence scores
→Architecture-vs-structure divergence visualized through community maps
→Refactoring ROI estimated by measuring blast radius reduction potential

CI/CD Intelligence

Platform Engineers & DevOps

Integrate AxonAtlas into your CI pipeline to add intelligence to every build. Test impact analysis identifies exactly which test suites need to run based on the changed symbols — eliminating unnecessary test execution. Change detection maps diff hunks to graph symbols, providing precise change metadata for downstream tooling.

→CI time reduced by running only affected test suites
→Change metadata enriches deployment dashboards and release notes
→Risk scoring gates prevent high-risk changes from auto-deploying
→Graph integrity checks run as post-index verification in CI

Security & Compliance Auditing

Security Engineers & Compliance Teams

The knowledge graph enables precise security auditing at the architectural level. Trace data flows from user input to database queries. Identify all callers of sensitive functions (authentication, authorization, encryption). Map which communities handle PII data and ensure they follow compliance patterns.

→Data flow tracing from input to storage through the call graph
→Sensitive function audit: who calls crypto, auth, and database APIs
→Community-level compliance: ensure PII-handling clusters follow patterns
→Change monitoring: alert when sensitive communities are modified

Strategic Positioning

AxonAtlas occupies a unique position in the developer tooling ecosystem. It is not a linter. It is not a code search engine. It is not a project management tool. AxonAtlas is a Code Intelligence Platform — a category that is emerging as the critical infrastructure layer for AI-assisted software development.

The platforms that will define the next decade of engineering productivity are those that serve as the intelligence backbone for both human engineers and AI agents. AxonAtlas is built to be that backbone — a continuously updated, semantically rich model of your entire software system that every tool, every agent, and every engineer can query.

Competitive Differentiation

The following comparison maps AxonAtlas against adjacent tool categories. AxonAtlas is not a replacement for any single tool — it is the intelligence layer that makes all other tools more effective.

CapabilityLinters (ESLint, Pylint)Code SearchAI AssistantsAxonAtlas
RepresentationAST-level warningsText index + regexPrompt context window11-type knowledge graph
Semantic DepthSingle-file rulesCross-file text searchNone (stateless)Multi-tier resolution + type flow
Architectural ViewNoneRepository navigationNoneLeiden communities + process flows
Change IntelligenceNoneBlame + historyNoneTemporal coupling + blast radius
Agent IntegrationCLI output parsingAPI accessBuilt-in (limited scope)Native MCP with 17 tools
Multi-RepositoryConfig per-repoCross-repo searchSingle-file contextWorkspace-level graph
Incremental UpdatesFile-level re-lintIndex rebuildN/ADependency-aware cascade

The Intelligence Stack

AxonAtlas fits into the modern developer toolchain as the intelligence layer between source code and the tools that operate on it. Consider the following layered view:

L5
AI Agents & IDEsConsumers of intelligence — Claude, Cursor, custom agents
L4
AxonAtlas IntelligenceKnowledge graph, semantic resolution, impact analysis, risk scoring
L3
Version ControlGit, GitHub, GitLab — history, branching, PRs
L2
Build & CICompilers, bundlers, test runners, deployment pipelines
L1
Source CodeThe raw files, modules, and packages that constitute the system

“AxonAtlas represents a new category of developer infrastructure — not a tool that replaces existing workflows, but an intelligence layer that makes every existing tool smarter. When your CI system knows the blast radius, when your agent knows the community structure, when your reviewer knows the coupling score — that is the value of a Code Intelligence Platform.”

— AxonAtlas Architecture Team