AI LLM Gateway -- Included in Every Tier

One Endpoint for Every LLM.
Cache Costs 50-70%.

Stop managing multiple SDKs. Route OpenAI, Anthropic Claude, and AWS Bedrock through a single internal endpoint. Built-in response caching cuts your API bill in half. Prompt audit logs for compliance.

3
LLM Providers
50-70%
Cache Hit Rate
$0
Extra Cost
0ms
Cached Response
One API for Every LLM Provider

Use the OpenAI SDK. Just change the model name. Zero code changes beyond base_url.

# One SDK for everything -- just change the model name from openai import OpenAI client = OpenAI(base_url="http://ai.gw.internal:8080/v1") # OpenAI client.chat.completions.create(model="gpt-4o", ...) # Anthropic Claude (auto-translated) client.chat.completions.create(model="claude-sonnet-4", ...) # AWS Bedrock (SigV4 auto-signed) client.chat.completions.create(model="bedrock/anthropic.claude-3-haiku", ...)
OpenAI
gpt-4o, gpt-4-turbo, gpt-3.5-turbo -- all GPT models pass through natively
Anthropic
claude-sonnet-4, claude-opus-4, claude-haiku -- auto-translated from OpenAI format
AWS Bedrock
bedrock/... prefix -- SigV4 signed automatically from instance role. No keys in code.
Built for AI-First Teams

Everything you need to operate LLM infrastructure at scale -- without adding operational complexity.

Response Caching

Identical prompts return cached responses at 0ms latency with zero API cost. Configurable TTL per model. Enterprise tier adds semantic caching with embedding similarity for near-duplicate prompts.

Gateway+
📈

Cost Attribution

Every request tagged by source IP, team, or service name. CloudWatch dashboard shows per-team spend broken down by model and provider. Export to JSON for finance and chargeback workflows.

Gateway+
📋

Prompt Audit Logging

Every API call logged with model, token count, estimated cost, and source IP. Prompt content is NEVER logged -- only metadata. Compliance-ready audit trail stored in your own CloudWatch log group.

Gateway+

Bedrock Support

AWS Bedrock models accessed with automatic SigV4 signing -- no credential management in application code. The gateway uses the instance IAM role. Just use the bedrock/... model name prefix.

Gateway+
🏛

Model Routing

Route different models to different providers by name. Define fallback rules and cost caps per route. All configured via the dashboard or Terraform -- no application changes required.

Gateway+
🔗

No Vendor Lock-in

Switch from OpenAI to Claude to Bedrock by changing one model string in your configuration. The gateway handles all protocol translation transparently. Your application code stays identical.

Gateway+
Your Prompts Never Leave Your VPC

The gateway is an EC2 appliance running in your own AWS account. Camphor has zero visibility into your prompt content -- by design.

  • The gateway runs as an EC2 instance inside your VPC, in your AWS account. No shared infrastructure.
  • Prompts flow directly from your application to the gateway, then from the gateway to the LLM provider -- entirely within your network boundary.
  • Camphor Networks never sees prompt content, response content, or API keys. We have no data plane access to your VPC.
  • Audit logs -- model name, token count, latency, source IP -- are stored in your own CloudWatch log group. Only you have access.
  • The control plane (Camphor dashboard) only reads configuration metadata: model routes, cache TTLs, cost caps. No prompt data, ever.
Request Flow
Your App
--->
Camphor Gateway
in YOUR VPC
--->
OpenAI / Claude
/ Bedrock
Gateway only sees:
- model name (e.g. gpt-4o)
- token count (from provider response)
- request latency
- source IP within your VPC
✓ No prompt content ever stored Audit logs contain only metadata -- never the text of your prompts or model responses. Compliant with SOC 2, HIPAA logging requirements. ✗ Camphor control plane has zero data plane access

Included in Every Tier

The Camphor AI LLM proxy is not an add-on. Basic caching ships in the Gateway tier at no extra charge. Upgrade to Enterprise for semantic caching powered by embedding similarity.

Gateway

LLM Proxy + Basic Cache

OpenAI / Claude / Bedrock routing Exact-match response caching Configurable TTL per model Prompt audit log (metadata only) Cost attribution by source IP
Security

LLM Proxy + IDS/IPS

All Gateway features Network IDS/IPS (ET Open ruleset) Egress firewall for LLM domains Prompt traffic isolation Compliance-ready logging
Enterprise

Semantic Cache + Advanced

All Security features Semantic caching (embedding similarity) Per-team spend dashboards Model-level cost caps + alerts Terraform provider support
Open Pricing Calculator

Cut your LLM API bill in half.
Deploy in 10 minutes.

One CloudFormation stack. No agents. No sidecars. Works with the OpenAI SDK you already use.