benchmarks

The benchmarks/ directory contains the performance evaluation harness that measures jcodemunch's token efficiency against real-world repositories.

Purpose

This module quantifies the core value proposition of jcodemunch: how many fewer tokens an AI agent consumes when it uses jcodemunch's structured index instead of reading raw source files. The harness automates this comparison across multiple repositories and queries.
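
As a rough illustration of the metric, the sketch below compares a token count for raw source text against a count for a much smaller structured response. The `count_tokens` helper is a whitespace-splitting stand-in, not the tokenizer the harness actually uses, and `token_reduction` is a hypothetical name chosen for this example.

```python
def count_tokens(text: str) -> int:
    """Crude stand-in for a real tokenizer: counts whitespace-separated words."""
    return len(text.split())


def token_reduction(raw_text: str, indexed_text: str) -> float:
    """Fraction of raw tokens saved; negative if the indexed text is larger."""
    raw = count_tokens(raw_text)
    if raw == 0:
        # Nothing to save when the baseline is empty.
        return 0.0
    indexed = count_tokens(indexed_text)
    return (raw - indexed) / raw


# Illustrative inputs only: a tiny source file versus a one-line index entry.
raw_source = "def add(a, b):\n    # adds two numbers\n    return a + b\n"
index_entry = "add(a, b)"

savings = token_reduction(raw_source, index_entry)
print(f"{savings:.0%} fewer tokens")
```

A real measurement would swap `count_tokens` for the tokenizer of the target model, since word counts only approximate token counts.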

Key Components

harness/run_benchmark.py

The main benchmark script. It:

tasks.json

A JSON corpus defining which repositories to benchmark and what queries to run against them. Default targets include popular frameworks like Express, FastAPI, and Gin.
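
To make the idea of a task corpus concrete, here is a minimal sketch of how such a file might be expanded into individual benchmark runs. The field names `repos`, `name`, and `queries`, the sample queries, and the `iter_tasks` helper are all assumptions for illustration; the actual tasks.json schema may differ.

```python
import json

# Hypothetical corpus: the field names and queries below are assumptions,
# not the actual contents or schema of tasks.json.
EXAMPLE_TASKS = """
{
  "repos": [
    {"name": "express", "queries": ["router middleware"]},
    {"name": "fastapi", "queries": ["dependency injection", "request validation"]}
  ]
}
"""


def iter_tasks(corpus_text: str):
    """Yield one (repository, query) pair per benchmark run."""
    corpus = json.loads(corpus_text)
    for repo in corpus["repos"]:
        for query in repo["queries"]:
            yield repo["name"], query


for repo_name, query in iter_tasks(EXAMPLE_TASKS):
    print(f"{repo_name}: {query}")
```

Flattening the corpus into (repository, query) pairs keeps each measurement independent, so a failure on one repository need not block the rest.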

Constants

Constant | Value | Description