Inferagate

One control plane for AI runtime, governance, and spend.

Product capabilities

Guardrails

Policy sets from categories, subcategories, and objects.

Build policies from controls for PII, PHI, secrets, prompt injection, harmful content, restricted advice (legal, medical, financial), illegal content, insults, violence, tool misuse, and data egress.

FinOps

Budget controls close to the traffic lane.

Set spend caps by tenant, project, environment, route, model, virtual key, or developer workflow, then use logs and route simulation to understand cost and latency tradeoffs.

Operator playbooks

Do the important work from guided flows, not tribal knowledge.

01

Connect a provider

Add OpenAI, Anthropic, Bedrock, OpenRouter, Hugging Face, vLLM, or Ollama with the right default URL, auth type, and capability profile.

  • Pick provider from quick connect
  • Save credentials securely
  • Run readiness and model scan
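
As a rough illustration of what "the right default URL and auth type" could look like, here is a minimal sketch. The table name `QUICK_CONNECT`, the `ProviderDefaults` type, and the auth labels are hypothetical and are not Inferagate's actual schema; the base URLs are the providers' publicly documented defaults, and only four providers are shown.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProviderDefaults:
    base_url: str
    auth_type: str  # hypothetical labels: "bearer", "x-api-key", "none"


# Assumed quick-connect table; not taken from Inferagate itself.
QUICK_CONNECT = {
    "openai": ProviderDefaults("https://api.openai.com/v1", "bearer"),
    "anthropic": ProviderDefaults("https://api.anthropic.com", "x-api-key"),
    "openrouter": ProviderDefaults("https://openrouter.ai/api/v1", "bearer"),
    "ollama": ProviderDefaults("http://localhost:11434", "none"),
}


def auth_headers(provider, api_key=None):
    """Build the auth header for a provider from its default auth type."""
    defaults = QUICK_CONNECT[provider]
    if defaults.auth_type == "none":
        return {}
    if not api_key:
        raise ValueError(f"{provider} requires an API key")
    if defaults.auth_type == "bearer":
        return {"Authorization": f"Bearer {api_key}"}
    # Anthropic's API also expects an anthropic-version header, omitted here.
    return {"x-api-key": api_key}
```

Keeping the defaults in one table means a readiness check can be run against `base_url` with the matching header before any model scan starts.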
02

Publish model access

Register approved models, classify them by use case, then expose only the right targets to each project or environment.

  • Choose discovered or manual model IDs
  • Tag chat, embedding, OCR, vision, code, or video
  • Assign primary, fallback, or restricted role
03

Route safely

Create a route draft, simulate provider selection, compare candidate order, then promote gradually when the lane is healthy.

  • Set default and fallback models
  • Preview latency, cost, and health
  • Approve before live promotion
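
The simulation step above can be sketched as a pure function over a route draft. This is a minimal sketch under stated assumptions: the `Candidate` fields, the role labels, and the ordering rule (default first, then fallbacks by cost and latency) are hypothetical and may differ from how Inferagate actually ranks candidates.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    model: str
    role: str  # "default" or "fallback" (hypothetical labels)
    healthy: bool
    p50_latency_ms: float
    cost_per_1k_tokens: float


def simulate_route(candidates):
    """Return healthy candidates in the order they would be tried."""
    healthy = [c for c in candidates if c.healthy]
    # Default model first; fallbacks ordered by cost, then by latency.
    return sorted(
        healthy,
        key=lambda c: (c.role != "default", c.cost_per_1k_tokens, c.p50_latency_ms),
    )


# Sample draft with made-up numbers, for illustration only.
draft = [
    Candidate("model-a", "default", True, 420.0, 0.50),
    Candidate("model-b", "fallback", True, 310.0, 0.20),
    Candidate("model-c", "fallback", False, 150.0, 0.10),
    Candidate("model-d", "fallback", True, 280.0, 0.20),
]

order = [c.model for c in simulate_route(draft)]
print(order)  # ['model-a', 'model-d', 'model-b']
```

Because the function has no side effects, the same draft can be re-simulated with different health or cost inputs to compare candidate order before anything is promoted.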
04

Apply guardrails

Build policy sets from granular categories like secrets, PII, PHI, jailbreaks, harmful content, legal/medical/financial advice, and data egress.

  • Select categories and subcategories
  • Choose block, redact, warn, or monitor
  • Review request/response diffs in logs
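
To make the four actions concrete, here is a minimal sketch of how block, redact, warn, and monitor might differ at runtime. The category names, the `DETECTORS` table, and `apply_policy` are hypothetical; the two regexes are illustrative only and are nowhere near production-grade detection.

```python
import re
from dataclasses import dataclass, field

# Illustrative detectors only; real PII and secret detection needs far more.
DETECTORS = {
    "pii.email": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "secrets.aws_access_key": re.compile(r"\bAKIA[0-9A-Z]{16}\b"),
}


@dataclass
class Decision:
    allowed: bool
    text: str
    findings: list = field(default_factory=list)


def apply_policy(text, policy):
    """policy maps a category name to 'block', 'redact', 'warn', or 'monitor'."""
    findings = []
    out = text
    allowed = True
    for category, action in policy.items():
        pattern = DETECTORS[category]
        if not pattern.search(out):
            continue
        findings.append((category, action))
        if action == "block":
            allowed = False
        elif action == "redact":
            out = pattern.sub(f"[{category}]", out)
        # 'warn' and 'monitor' leave the payload unchanged; the finding is only recorded.
    # A blocked request forwards nothing.
    return Decision(allowed, out if allowed else "", findings)


decision = apply_policy(
    "Contact jane@example.com, key AKIAIOSFODNN7EXAMPLE",
    {"pii.email": "redact", "secrets.aws_access_key": "warn"},
)
print(decision.text)  # Contact [pii.email], key AKIAIOSFODNN7EXAMPLE
```

Comparing the input string with `decision.text` is the kind of request diff a reviewer would expect to see in the logs.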
05

Control spend

Place budgets close to traffic: by tenant, project, environment, route, virtual key, model, or developer workflow.

  • Define caps and owner labels
  • Track cost, latency, and fallback usage
  • Use logs as the source of truth
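
A budget placed "close to traffic" can be pictured as a cap keyed by one of the request's scope dimensions. The sketch below is an assumption-laden illustration: `BudgetLedger`, its methods, and the dollar figures are hypothetical, and a real gateway would persist spend rather than hold it in memory.

```python
from collections import defaultdict


class BudgetLedger:
    def __init__(self, caps):
        # caps: {(dimension, value): cap in USD}, e.g. ("tenant", "acme"): 1.00
        self.caps = dict(caps)
        self.spent = defaultdict(float)

    def check(self, scopes, cost):
        """Return the scopes whose cap this request would exceed."""
        return [
            s for s in scopes.items()
            if s in self.caps and self.spent[s] + cost > self.caps[s]
        ]

    def record(self, scopes, cost):
        """Charge every capped scope, or charge nothing if any cap would be exceeded."""
        violations = self.check(scopes, cost)
        if violations:
            return False, violations
        for s in scopes.items():
            if s in self.caps:
                self.spent[s] += cost
        return True, []


# Sample caps and request scopes, for illustration only.
ledger = BudgetLedger({("tenant", "acme"): 1.00, ("route", "chat-default"): 0.25})
scopes = {"tenant": "acme", "project": "search", "route": "chat-default"}

first = ledger.record(scopes, 0.20)   # within both caps
second = ledger.record(scopes, 0.10)  # would push the route past 0.25
print(first, second)
```

The second request is rejected by the route cap even though the tenant cap still has headroom, which is the practical effect of placing the narrowest budget nearest the traffic.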
06

Investigate incidents

Use logs to see runtime events, guardrail decisions, blocked payloads, redactions, audit changes, and full conversation timelines.

  • Filter by event, source, status, route, and time
  • Inspect sanitized request/response payloads
  • Export evidence for review
07

Add retrieval context

Upload approved documents, preview chunks, run retrieval checks, and keep citations visible before enabling RAG on live traffic.

  • Import PDF, MD, TXT, or DOCX sources
  • Preview chunks and metadata
  • Validate retrieval quality before enabling routes
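
A chunk preview can be approximated with a fixed-size sliding window. The source does not say how Inferagate splits documents, so the character-based strategy, the `chunk_text` name, and the `size` and `overlap` parameters below are all assumptions.

```python
def chunk_text(text, size=200, overlap=40):
    """Split text into overlapping character windows with offset metadata."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must satisfy 0 <= overlap < size")
    step = size - overlap
    chunks = []
    for start in range(0, len(text), step):
        piece = text[start:start + size]
        chunks.append({
            "index": len(chunks),
            "start": start,
            "end": start + len(piece),
            "text": piece,
        })
        # Stop once a window has reached the end of the text.
        if start + size >= len(text):
            break
    return chunks


preview = chunk_text("abcdefghij", size=4, overlap=1)
print([c["text"] for c in preview])  # ['abcd', 'defg', 'ghij']
```

Keeping `start` and `end` offsets alongside each chunk is what lets a citation point back to the exact span of the source document.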
08

Prepare go-live

Use readiness, diagnostics, service health, and smoke tests to confirm the deployment is ready before teams switch traffic.

  • Check gateway, database, Redis, and guardrails
  • Run probes against selected models
  • Capture go-live evidence from logs
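
The readiness pass above can be sketched as a set of named checks that are run independently and then aggregated. The `run_readiness` helper and the stub checks are hypothetical; real checks would call the gateway, database, Redis, and guardrail services.

```python
def run_readiness(checks):
    """Run every check, capture failures, and report overall readiness."""
    results = {}
    for name, check in checks.items():
        try:
            ok = bool(check())
            detail = "ok" if ok else "check returned false"
        except Exception as exc:  # a failing dependency must not abort the other checks
            ok, detail = False, f"{type(exc).__name__}: {exc}"
        results[name] = {"ok": ok, "detail": detail}
    return all(r["ok"] for r in results.values()), results


def _redis_down():
    # Stub that simulates an unreachable Redis instance.
    raise ConnectionError("connection refused")


checks = {
    "gateway": lambda: True,
    "database": lambda: True,
    "redis": _redis_down,
    "guardrails": lambda: True,
}

ready, report = run_readiness(checks)
print(ready, report["redis"]["detail"])
```

Running every check even after one fails gives a complete report in a single pass, which is more useful as go-live evidence than stopping at the first error.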
Get started

Want help mapping your first AI gateway rollout?

Tell us what you are connecting and we will send the right setup path: open-source quickstart, enterprise rollout, provider migration, guardrails, FinOps, or observability.

  • Multi-tenant by design
  • Open-core friendly
  • Provider-neutral
