Kenshiki

Pricing

Govern every AI response.

Usage-based pricing. No seat tax. No opaque packaging. Pay for what you govern. The contract is the same everywhere — what changes is where it runs and how strong the proof becomes. We show the model, the hardware, and the infrastructure cost without turning the page into an internal margin worksheet.

Pick your environment

Same API contract everywhere. Workshop is usage-only, Refinery adds private-deployment fees, and Clean Room moves to software licensing with unlimited in-boundary usage.

Start here

Workshop

Usage only

No platform fee. Start under $100/month.

Models: Kadai 16B, 42B, 14B + BYOK

Full governance pipeline on shared infrastructure. Bring your own model key or use Kadai.

  • Full bounded-synthesis pipeline
  • Kadai models on shared L4 GPUs
  • BYOK option (GPT, Claude, OpenRouter)
  • L1–L3 claim evaluation
  • Same API contract as Refinery and Clean Room

Private deployment

Refinery

From $3,500/mo

Platform fee + standard usage rates. Annual commitment.

Models: Kadai 70B Instruct, 235B

Same governance pipeline, private infrastructure. Managed by Kenshiki on your VPC or ours.

  • Everything in Workshop, plus:
  • Private inference — no public model API
  • L4 evaluation (hidden-state probes)
  • Chain of custody
  • Managed deployment and monitoring
  • 99.9% SLA (Kenshiki-managed)

Air-gapped

Clean Room

From $50K/year

Software license + hardware BOM published. Contract.

Models: Kadai 400B, 670B

Disconnected operation on customer premises. Hardware root of trust. Signed attestation on every output.

  • Everything in Refinery, plus:
  • Fully disconnected — no external dependency
  • Hardware root of trust
  • Signed attestation chain
  • Independently verifiable by third parties
  • Unlimited usage within deployment

Compare the proof boundary

The easiest way to buy Kenshiki is to choose the environment first. The API contract stays the same. What changes is where inference runs, where the evidence boundary lives, and how much you can prove about what happened inside the runtime.

Dimension Workshop Refinery Clean Room
Where model runs Shared Kadai or your existing public-model providerPrivate inference in your VPC or Kenshiki-managed private deploymentCustomer premises, disconnected if required
Where data lives Kenshiki-managed cloud boundary with governed retrieval and exportabilityPrivate deployment boundary with governed data sources under local controlCustomer premises only
Proof boundary Full pipeline audit with L1-L3 evaluationPrivate inference, local telemetry, L4 evaluation, and chain of custodySigned attestation chain anchored to hardware trust
Best for Pilots, teams evaluating governed synthesis, and BYOK onrampsEnterprise production workloads that need private deployment and managed operationsHigh-assurance or disconnected programs that cannot depend on external connectivity
Starting price Usage only, under $100/month to startFrom $3,500/month plus usageFrom $50K/year software license

Two products. One contract.

Use them together or independently. Workshop and Refinery meter Kura and Kadai directly. Clean Room keeps the same contract but moves to software licensing rather than API metering.

Reasoning API

Kadai

Governed inference. Question in, bounded answer out with claim-level evaluation.

$0.02 governance fee per request

Includes up to 8K governed input tokens after compilation. Additional governed context: $0.01 per extra 8K. Plus token pricing per Kadai model. BYOK: $0.015/request.

Every governed request includes

  • Prompt Compiler — CFPO rewrite, 5 passes
  • Claim Ledger — L1–L3 evaluation (L1–L4 in Refinery/Clean Room)
  • Boundary Gate — classification and output state assignment
  • Per-claim audit trail
  • Telemetry headers

Governed request boundary

  • One end-to-end Kadai call through Compiler, retrieval, Ledger evaluation, and Boundary Gate
  • Base governance fee includes up to 8K governed input tokens after compilation
  • Governed input means the compiled prompt plus admissible inline and retrieved evidence
  • Model output tokens are billed under model inference, not governance

BYOK support boundary

  • Kenshiki guarantees the governance pipeline around BYOK requests: compilation, retrieval scoping, Ledger evaluation, Gate decisions, and audit records
  • Kenshiki does not guarantee the uptime, latency, output stability, or semantic behavior of the upstream provider
  • Hosted Kadai is the primary path for stronger service commitments and more stable evaluation behavior

Evidence Store

Kura

Governed RAG. Ingest sources, preserve provenance, retrieve with access control.

Component Unit Rate
Ingestion per billable page $0.05
Storage per GB/mo $0.50
Retrieval per query $0.005
Embedding refresh per billable page $0.05

Includes

  • SIRE tagging and Crosswalk mapping
  • OpenFGA/ReBAC per-caller evidence scoping
  • SHA-256 hashing, HMAC-SHA-256 watermarks
  • REST API access
  • Data export guarantee

Working billing definitions

Billable page
One normalized page-equivalent processed by the ingestion pipeline. PDFs bill by rendered page, slide decks by slide, and structurally complex files normalize into page-equivalents before billing.
Billing preview
The dashboard should show the billable page-equivalent count before ingestion is finalized, so scans and oversized tables do not surprise the buyer.
Retrieval query
One logical governed retrieval call with bounded fan-out. Internal retries, reranking, and Ledger-side checks are not billed as separate queries.

What a real month looks like

Illustrative examples — your costs depend on corpus size, request volume, and model choice.

Pilot

Workshop

$99/mo

Mid-size team evaluating governed synthesis. 500 source pages ingested, 5K retrieval queries, 2K governed requests using Kadai 14B Instruct, average 2K input / 500 output tokens per request.

Component Cost
Kura ingestion $25
Kura storage $1
Kura retrieval $25
Governance fee $40
Kadai 14B tokens (in) $4
Kadai 14B tokens (out) $4
Total $99/mo

Illustrative Workshop workload that stays inside the included 8K governed-context boundary.

Enterprise

Refinery

$6,150/mo

Enterprise compliance team on private infrastructure. 5,000 source pages ingested, 50K retrieval queries, 20K governed requests using Kadai 70B Instruct.

Component Cost
Platform fee (annual) $5,000
Kura ingestion $250
Kura storage $10
Kura retrieval $250
Governance fee $400
Kadai 70B tokens (in) $120
Kadai 70B tokens (out) $120
Total $6,150/mo

Private deployment with platform fee plus standard Kura and Kadai usage. Clean Room shifts to software licensing and unlimited in-boundary usage instead of metered API billing.

Under the hood

Every Kadai model is fine-tuned by Kenshiki for bounded synthesis. We show the parameters, the hardware, and the infrastructure cost. Base architectures are disclosed in technical documentation.

Model Params Active Arch Hardware Infra/mo Input/1K Output/1K
Kadai 16B Instruct 16B 2.4B MoE 1×L4 ~$584 $0.0005 $0.002
Kadai 42B 42B 6.6B MoE 1×L4 ~$584 $0.0008 $0.003
Kadai 14B Instruct 14B 14B Dense 1×L4 ~$584 $0.001 $0.004
Kadai 70B Instruct 70B 70B Dense 4×L4 ~$3,358 $0.003 $0.012
Kadai 235B 235B 21B MoE 1×H100 ~$8,972 $0.004 $0.016
Kadai 400B 400B 17B MoE 8×H100 ~$40,150 $0.008 $0.032
Kadai 670B 670B 37B MoE 8×H100 ~$40,150 $0.010 $0.040

Why show infrastructure cost? Active parameters — not total parameters — drive inference compute. A 235B MoE model activates 21B parameters per pass and runs on a single H100. A dense 70B model fires all 70B and needs four L4s. The 235B is not the cheaper absolute line item here, but it shows why headline parameter count alone is a bad proxy for cost or runtime shape. We show runtime physics, not internal margin worksheets.

Questions

Can I use Kura without Kadai?
Yes. Kura is a standalone governed evidence store with its own API. Use it for ingestion, retrieval, and access-controlled search without calling Kadai.
Can I use my own model?
Yes — on Workshop. Bring your own GPT, Claude, or OpenRouter key. The governance pipeline still runs (Compiler, Ledger, Gate), but inference uses your provider. BYOK governance fee is $0.015/request and BYOK is governed best-effort rather than the stronger service path for hosted Kadai.
What base models are behind Kadai?
Every Kadai model is fine-tuned from an open-weight base architecture for bounded synthesis. Base models are disclosed in technical documentation and security reviews.
What counts as a page, query, and governed request?
A billable page is a normalized page-equivalent processed by ingestion. PDFs bill by rendered page, slide decks by slide, and complex files normalize into page-equivalents before billing. A retrieval query is one logical governed retrieval call with bounded fan-out. A governed request is one end-to-end Kadai call through Compiler, retrieval, Ledger, and Boundary Gate.
What happens on long-context governed requests?
The $0.02 governance fee includes up to 8K governed input tokens after compilation. Above that, additional governed context is billed at $0.01 per extra 8K. Governed input means the compiled prompt plus admissible inline and retrieved evidence. Output tokens are still billed under model inference.
Why compare a 235B MoE to a 70B dense model?
Active parameters explain why a 235B MoE belongs in the lineup even though the H100-backed deployment costs more in absolute dollars here. Kadai 235B activates 21B parameters per pass, while Kadai 70B Instruct is dense and fires all 70B. The point is efficiency per unit of capability, not that the 235B is always the cheaper line item.
What SLA applies to BYOK?
Kenshiki guarantees the governance pipeline around BYOK requests — compilation, retrieval scoping, Ledger evaluation, Gate decisions, and audit records. Kenshiki does not guarantee the uptime, latency, or semantic stability of the upstream provider. Material provider changes may require prompt or evaluation re-baselining.
Is there a free tier?
No. Workshop starts under $100/month for a real workload. If you can't justify that, you're not ready for governed inference.
What happens if I exceed my spending cap?
Soft gate with notification at 80% and 100% of your cap. No hard stop — governed inference should never silently break. You set the monthly spending limit in your dashboard.
Can I export my data?
Yes, always. Your evidence is exportable at any time.
What's the commitment?
Workshop: none. Refinery: annual (monthly available at 25% premium). Clean Room: contract.

Next Step

Pick the proof boundary that matches the consequence.

Start in Workshop if you want the fastest path to a governed response. Move to Refinery or Clean Room when the environment itself becomes part of what you have to prove.