Kenshiki Labs

AI Security

What is governed retrieval security?

Governed retrieval security is the practice of enforcing access control, evidence integrity, and policy compliance at the moment an AI system retrieves information — not after generation, not at output filtering, but at the retrieval boundary itself. It prevents unauthorized data access (the model never sees evidence the principal isn't authorized to retrieve), guarantees evidence integrity (the chunks reaching the model are byte-identical to authoritative source versions), and enforces policy compliance per query (each retrieval scope is computed deterministically from the SIRE-tagged corpus and the principal's authorization).

Why this matters

Traditional RAG systems treat retrieval as invisible. The model asks for information, the system returns it, and nobody audits who accessed what. In regulated industries, that’s a data-governance violation.

Governed retrieval means:

  • Every retrieval request is authenticated and authorized (REBAC).
  • Only documents matching the requester’s clearance are returned.
  • The retrieval is logged with cryptographic proof.
  • An auditor can later verify what was accessed and by whom.

How it works

When a model requests information from the knowledge base:

  1. Authentication: Verify the caller’s identity.
  2. Authorization: Check their REBAC profile against the document’s access tags.
  3. Retrieval: Return only documents they’re authorized to access.
  4. Integrity: Sign the retrieval response with a tamper-evident hash.
  5. Logging: Record the retrieval in the Claim Ledger — integrity-protected audit trail for every AI inference with the caller’s identity and clearance.

If a caller lacks authorization, they get nothing — not a redacted version, not an inference, nothing. The boundary is enforced before data leaves the database.

How Kenshiki Labs, the runtime AI governance control plane implements this

Kenshiki Labs’ Kura retrieval engine provides:

  1. REBAC integration: Caller identity and clearances are checked against evidence access tags.
  2. Evidence scoping: The retrieval pool is filtered based on authorization.
  3. Tamper-evident responses: Retrieval results are cryptographically signed.
  4. Claim Ledger integration: Every retrieval is logged with caller identity, authorization decision, and data returned.

This means cardholder data in your knowledge base stays protected even when the model has access to the knowledge base. The boundary is enforced at the retrieval layer, not the model layer.

Related concepts