Evidence store
Kura
Stores what counts as real. SIRE-tagged evidence corpus with deterministic identity, retrieval boundaries, and provenance for every chunk.
Kura is the Prepare API — the write side of the evidence boundary that turns authoritative source material into governed retrieval context before the model ever sees it. Every request enters through the Prompt Sanitizer, which authenticates the caller and binds identity via OpenFGA/ReBAC before anything else fires. Kura ingests sources through a two-stage pipeline — GPU-accelerated parsing (Docling with DocLayNet layout analysis, TableFormer for tables, EasyOCR for images) followed by CPU-side enrichment that attaches SIRE identity (Subject, Included, Relevant, Excluded), clause IDs, normative markers, and provenance. Every chunk carries a SHA-256 source hash and HMAC-SHA-256 watermark for tamper-evident verification without database access. Retrieval is hybrid (pgvector + tsvector ranking) and enforces the SIRE exclusion gate before any chunk reaches the model. Tenant-scoped row-level security is live today; caller-specific OpenFGA/ReBAC retrieval enforcement is the next boundary. Corpus Explorer is the inspection surface where users verify the underlying text behind any cited clause, querying the same governed Elasticsearch alias the runtime retrieval layer uses. Without Kura, every downstream decision is an assertion without evidence: the Compiler cannot scope what the model sees, the Ledger has nothing to verify against, the Gate has no basis.
Without Kura, every downstream decision is an assertion without evidence. The Compiler cannot scope what the model sees. The Ledger has nothing to check claims against. The Gate has no basis. No evidence, no grounded answer.
How Kura becomes governed model context
Read this left to right. Source material enters the evidence boundary, becomes policy-bearing chunks, and only then becomes bounded retrieval context. Kura stops at that handoff. It does not generate the final answer or verify claims; it makes governed evidence and identity available for downstream orchestration.
Who this is for
Corpus engineers
data stewards who curate, version, and maintain authoritative source collections inside the evidence boundary. Responsible for ingestion, SIRE tagging, and evidence quality.
Every downstream system
Compiler draws evidence for zone mapping. Ledger checks claims against it. Gate relies on it for emission policy. Kadai returns answers bounded by what Kura contains.
Kura — the governed evidence store — is the evidence boundary. SIRE (Subject, Included, Relevant, Excluded) identity tags scope what each source covers and what it must never answer. Every chunk carries provenance chains, SHA-256 hashes, and HMAC-SHA-256 watermarks. Compiler — the prompt-assembly engine —, Ledger — the integrity-protected inference audit trail —, and Gate — the emission policy boundary — all depend on Kura as the source of governed evidence.
Go deeper
API Developer Guide
Full API reference — POST /v2/documents for ingestion, retrieval endpoints, attestation, and error handling.
Prompt Sanitizer
The entry point that establishes caller identity before Kura scopes evidence.
Compiler
The next stage — uses Kura's metadata to map evidence into CFPO zones.
Ledger
Checks claims against the evidence Kura provides.
Platform Architecture
See where Kura sits as the upstream foundation of the pipeline.
Pricing
Kura usage-based pricing — ingestion, storage, and retrieval.