Total Disclosure Benchmark Registry.
Complete benchmark suite. All results honest — including where Purgr loses.
100% needle found across 7 scales from 10K to 1M tokens. Lifecycle cadence: compress() called every 10 messages (PRODUCTION_BATCH_SIZE=10).
| Scale | TRR | Final Call Latency | Needle |
|---|---|---|---|
| 10k | 44.3% | 5.8ms | ✓ survived |
| 25k | 57.8% | 24.5ms | ✓ survived |
| 50k | 62.6% | 27.3ms | ✓ survived |
| 100k | 69.5% | 50.8ms | ✓ survived |
| 200k | 70.2% | 105.5ms | ✓ survived |
| 500k | 71.4% | 318.4ms | ✓ survived |
| 1M | 75.0% | 714.9ms | ✓ survived |
96% vs 75% NIAH. Zero dependencies vs transformer model required.
| Fixture | Purgr | LLMLingua-2 |
|---|---|---|
| Dollar Amount | 100% | 100% |
| Deadline | 60% | 60% |
| Person/Role | 100% | 50% |
| Version String | 100% | 0% |
| Negation | 100% | 100% |
| Compound | 100% | 100% |
| Coding Early | 100% | — |
| Document Middle | 100% | — |
| Planning Early | 100% | — |
| Planning Late | 100% | — |
| Mean | 96% | 75% |
| Tool | Mean NIAH | Avg TRR | Latency (10k tokens) | Dependencies |
|---|---|---|---|---|
| Purgr | 96% | 44.3% | 5.8ms | Zero |
| LLMLingua-2 | 75% | ~50% | 14,224ms | Transformer model |
Linear scaling confirmed 10k to 1M tokens. 714.9ms at 1M on local CPU, no GPU.
| Scale | TRR | Final Call Latency | Needle |
|---|---|---|---|
| 10k | 44.3% | 5.8ms | ✓ survived |
| 100k | 69.5% | 50.8ms | ✓ survived |
| 1M | 75.0% | 714.9ms | ✓ survived |
10 runs on identical input produce byte-identical compressed output and identical Merkle roots.
| Property | Result | Notes |
|---|---|---|
| Tokens | Identical | 1,206 tokens all 10 runs |
| Content | Byte-identical | Zero content variance |
| Merkle Root | Identical | Receipt chain is reproducible |
| Signatures | Unique | Cryptographically correct |
Post-compression deterministic verification. Every receipt shows how many critical facts survived.
| Session | Tokens In | TRR | Facts | Preserved | Fidelity |
|---|---|---|---|---|---|
| Developer session (bigcon.txt) | 242,304 | 53% | 143 | 143 | 100% |
| Elon Musk interview | 25,749 | 67% | 6 | 6 | 100% |
| Yann LeCun interview | 34,591 | 92% | 1 | 1 | 100% |
| Grok technical session | 44,214 | 17% | 16 | 16 | 100% |
NCD/LZ hash containment identifies messages Koopman would incorrectly compress. Tested on 242k-token developer session (bigcon.txt).
| Configuration | TRR | Rescued | Why |
|---|---|---|---|
| DMD only | 53% | 0 | Baseline |
| DMD + Liveness | 51% | 4 messages | NCD/LZ identifies load-bearing messages |
| DMD + Liveness + Co-occurrence | 49% | 10 messages | Semantic signal rescues 6 additional messages |
enableLiveness: true.
Standalone stateless function. Proves which specific numbers, dates, and identifiers in an LLM response exist in the source document.
| Category | Description |
|---|---|
| Grounded | Fact present verbatim in document |
| Derived | Fact mathematically computable from adjacent document facts |
| Ungrounded | ⚠ Not traceable to document — verify manually |
Purgr compresses any content type — not just developer conversations.
| Content Type | Format | Messages | Tokens | TRR | Facts |
|---|---|---|---|---|---|
| Developer architecture session | claude-web-export | 617 | 242,304 | 53% | 143/143 |
| Lex Fridman / Elon Musk interview | prose-transcript | 477 | 25,749 | 67% | 6/6 |
| Lex Fridman / Yann LeCun interview | prose-transcript | 202 | 34,591 | 92% | 1/1 |
| Grok technical conversation | prose-transcript | 560 | 44,214 | 17% | 16/16 |
Testing Integrity.
All benchmarks are evaluated against verbatim production builds with no benchmarking cheats.
Verbatim Builds.
Tests evaluated running raw output from Purgr against unmodified dependencies.
Known Limitations.
Pure synthetic vocabulary-drift NIAH scores 0% — no token overlap means no lexical signal. Co-occurrence matrix improves semantic survival on real multi-session content. Semantic paraphrase detection without embeddings remains an open research problem.
Source Included.
Fixture synthesis and NIAH run scripts are included identically in the deployed SDK source.