主题
Chat Completions
POST /v1/chat/completions —— OpenAI Chat Completions 兼容接口,支持同步与流式。
请求字段
model(必填):模型标识,即 model_code。messages(必填,非空):消息数组。- 其余字段(
temperature、top_p、max_tokens、stop、tools等)原样透传到上游。
非流式
bash
API_BASE=https://xingsuan.cloud
curl "$API_BASE/v1/chat/completions" \
-H "Authorization: Bearer $XSK_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "gpt-4o-mini",
"messages": [{"role": "user", "content": "用一句话介绍你自己"}]
}'python
from openai import OpenAI
client = OpenAI(base_url="https://xingsuan.cloud/v1", api_key="xsk_...")
resp = client.chat.completions.create(
model="gpt-4o-mini",
messages=[{"role": "user", "content": "用一句话介绍你自己"}],
)
print(resp.choices[0].message.content)js
import OpenAI from "openai"
const client = new OpenAI({ baseURL: "https://xingsuan.cloud/v1", apiKey: process.env.XSK_KEY })
const resp = await client.chat.completions.create({
model: "gpt-4o-mini",
messages: [{ role: "user", content: "用一句话介绍你自己" }],
})
console.log(resp.choices[0].message.content)流式
设置 stream: true,响应为 SSE 流,逐 token 推送:
python
stream = client.chat.completions.create(
model="gpt-4o-mini",
messages=[{"role": "user", "content": "讲一个 30 字以内的小故事"}],
stream=True,
)
for chunk in stream:
delta = chunk.choices[0].delta.content or ""
print(delta, end="", flush=True)js
const stream = await client.chat.completions.create({
model: "gpt-4o-mini",
messages: [{ role: "user", content: "讲一个 30 字以内的小故事" }],
stream: true,
})
for await (const chunk of stream) {
process.stdout.write(chunk.choices[0]?.delta?.content ?? "")
}响应
非流式响应与 OpenAI 官方格式一致(id、object、choices、usage 等字段),由网关原样透传。具体字段含义请参考 OpenAI 官方文档。
错误
常见错误:模型不存在 / 禁用 / 不支持该端点 → 404;额度超限 → 429;余额不足 → 402。错误信封为 OpenAI 形状,详见 错误码。