ChatInfer

Documentation

Build with the ChatInfer API

Explore the developer concepts, API patterns, and integration flow for building AI chat applications with reliable inference.

Quick start

Quick start

A simple four-step flow for integrating ChatInfer into your application.

Integration flow

1

Join early access

Request access and tell us about your AI application or inference workflow.

2

Create an API key

Generate a project API key once your account is enabled.

3

Send a chat completion request

Use a familiar chat completions interface to send messages through ChatInfer.

4

Monitor and iterate

Track latency, cost, errors, usage, and user conversations as your application scales.

Example

Chat completions example

ChatInfer is designed around a familiar request pattern for developer-friendly integration.

Request

POST /v1/chat/completions
{
  "model": "chatinfer-auto",
  "messages": [
    {
      "role": "user",
      "content": "Explain vector search in simple terms."
    }
  ]
}

Response

200 OK
{
  "id": "chatcmpl_demo",
  "object": "chat.completion",
  "model": "chatinfer-auto",
  "choices": [
    {
      "message": {
        "role": "assistant",
        "content": "Vector search finds information by comparing meaning, not just keywords."
      }
    }
  ]
}

Documentation

Explore the docs

The full documentation set is being prepared for early access users.

Getting Started

Preview

Learn the basics of integrating ChatInfer into your application.

Authentication

Preview

How to authenticate your API requests securely.

Chat Completions

Preview

Send and receive chat completion requests.

Knowledge Base

Coming soon

Connect your knowledge sources for grounded AI answers.

Model Routing

Coming soon

Route requests across models based on cost and latency.

Webhooks

Coming soon

Receive real-time events from the ChatInfer platform.

SDKs

Coming soon

Official client libraries for popular programming languages.

Rate Limits

Coming soon

Understanding API rate limits and best practices.

Documentation is in early access

The ChatInfer API is currently available to selected early access users. Early access users will receive integration guidance, API examples, and onboarding support as features become available.

Need access to the API?

Join the waitlist and tell us what you're building. We'll prioritize early users based on use case, scale, and integration needs.