Skip to content

Chat Completions

POST /v1/chat/completions —— OpenAI Chat Completions 兼容接口,支持同步与流式。

请求字段

  • model(必填):模型标识,即 model_code
  • messages(必填,非空):消息数组。
  • 其余字段(temperaturetop_pmax_tokensstoptools 等)原样透传到上游。

非流式

bash
API_BASE=https://xingsuan.cloud
curl "$API_BASE/v1/chat/completions" \
  -H "Authorization: Bearer $XSK_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o-mini",
    "messages": [{"role": "user", "content": "用一句话介绍你自己"}]
  }'
python
from openai import OpenAI

client = OpenAI(base_url="https://xingsuan.cloud/v1", api_key="xsk_...")

resp = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "用一句话介绍你自己"}],
)
print(resp.choices[0].message.content)
js
import OpenAI from "openai"

const client = new OpenAI({ baseURL: "https://xingsuan.cloud/v1", apiKey: process.env.XSK_KEY })

const resp = await client.chat.completions.create({
  model: "gpt-4o-mini",
  messages: [{ role: "user", content: "用一句话介绍你自己" }],
})
console.log(resp.choices[0].message.content)

流式

设置 stream: true,响应为 SSE 流,逐 token 推送:

python
stream = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "讲一个 30 字以内的小故事"}],
    stream=True,
)
for chunk in stream:
    delta = chunk.choices[0].delta.content or ""
    print(delta, end="", flush=True)
js
const stream = await client.chat.completions.create({
  model: "gpt-4o-mini",
  messages: [{ role: "user", content: "讲一个 30 字以内的小故事" }],
  stream: true,
})
for await (const chunk of stream) {
  process.stdout.write(chunk.choices[0]?.delta?.content ?? "")
}

响应

非流式响应与 OpenAI 官方格式一致(idobjectchoicesusage 等字段),由网关原样透传。具体字段含义请参考 OpenAI 官方文档。

错误

常见错误:模型不存在 / 禁用 / 不支持该端点 → 404;额度超限 → 429;余额不足 → 402。错误信封为 OpenAI 形状,详见 错误码