推理强度

推理强度参数用于控制模型在响应时投入多少推理 token，方便用户自行权衡输出质量和输出效率。该参数目前仅适用于 Ring-2.6-1T。

等级

Ring-2.6-1T 支持两种推理强度的等级：

级别	描述	典型使用场景
`high`	默认，面向 Agent 工作流，具备更低的 token 开销与更快的多步执行能力。	多轮交互、工具协作、任务拆解
`xhigh`	面向高难任务，为复杂推理提供更充分的思考空间，token 消耗更高。	数学竞赛、科研分析、复杂逻辑推理、多路径探索

选择指南：

默认（不设置参数）等同于 high，适合大多数场景。
当任务对推理深度要求极高，且对成本和延迟不敏感时，选择 xhigh。
xhigh 模式下 token 消耗更高，建议先用 high 评估，仅在效果不满足时升级。

基本用法

Ring-2.6-1T 支持两种接口风格，包括 Anthropic 兼容接口 和 OpenAI 兼容接口，参数字段略有差异。

Anthropic 兼容接口

使用 /anthropic/v1/messages 端点，通过 output_config.effort 传入推理强度：

cURL


curl https://api.ant-ling.com/anthropic/v1/messages \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "Ring-2.6-1T",
    "output_config": {"effort": "xhigh"},
    "messages": [
      {"role": "user", "content": "分析微服务架构与单体架构的权衡"}
    ]
  }'

Node.js


import Anthropic from '@anthropic-ai/sdk';
 
const client = new Anthropic({
  baseURL: 'https://api.ant-ling.com',
  apiKey: 'YOUR_API_KEY',
});
 
const response = await client.messages.create({
  model: 'Ring-2.6-1T',
  messages: [
    { role: 'user', content: '分析微服务架构与单体架构的权衡' }
  ],
  // @ts-ignore
  output_config: { effort: 'xhigh' },
});
 
console.log(response.content[0].text);

Python


import anthropic
 
client = anthropic.Anthropic(
    base_url="https://api.ant-ling.com",
    api_key="YOUR_API_KEY"
)
 
response = client.messages.create(
    model="Ring-2.6-1T",
    messages=[
        {"role": "user", "content": "分析微服务架构与单体架构的权衡"}
    ],
    extra_body={
        "output_config": {"effort": "xhigh"}
    }
)
 
print(response.content[0].text)

OpenAI 兼容接口

使用 /v1/chat/completions 端点，通过 reasoning.effort 传入推理强度：

Python


from openai import OpenAI
 
client = OpenAI(
    base_url="https://api.ant-ling.com/v1/",
    api_key="YOUR_API_KEY"
)
 
response = client.chat.completions.create(
    model="Ring-2.6-1T",
    messages=[
        {"role": "user", "content": "分析微服务架构与单体架构的权衡"}
    ],
    extra_body={
        "reasoning": {"effort": "xhigh"}
    }
)
 
print(response.choices[0].message.content)

Node.js


import OpenAI from 'openai';
 
const client = new OpenAI({
  baseURL: 'https://api.ant-ling.com/v1/',
  apiKey: 'YOUR_API_KEY',
});
 
const response = await client.chat.completions.create({
  model: 'Ring-2.6-1T',
  messages: [
    { role: 'user', content: '分析微服务架构与单体架构的权衡' }
  ],
  // @ts-ignore
  reasoning: { effort: 'xhigh' },
});
 
console.log(response.choices[0].message.content);

cURL


curl https://api.ant-ling.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "Ring-2.6-1T",
    "reasoning": {"effort": "xhigh"},
    "messages": [
      {"role": "user", "content": "分析微服务架构与单体架构的权衡"}
    ]
  }'

何时调整推理强度

使用 high（默认）：高频 Agent 工作流、多轮工具协作、任务拆解与生产级部署。速度与成本优先，适合大多数场景。
使用 xhigh：数学竞赛解题、科研论文分析、复杂逻辑推理、多路径探索等对推理深度要求最高的任务。

推理强度与工具调用

推理强度会影响工具调用的行为。在 xhigh 模式下，Ring-2.6-1T 倾向于：

更仔细地规划工具调用路径
在多步任务中进行更充分的中间推理
对工具返回结果进行更深入的分析

在 high 模式下，模型会以更高效的方式完成工具协作，减少不必要的推理开销，适合高频自动化流水线。

最佳实践

从默认值开始：大多数场景下 high（默认）已能提供出色效果，先评估默认行为再考虑升级。
按任务类型选择：代码生成、工具调用等 Agent 任务首选 high；数学推理、科研分析等高难任务选 xhigh。
监控 token 消耗：xhigh 模式 token 消耗显著更高，建议在生产环境中持续监控用量。
动态调整：在复杂流水线中，可根据子任务复杂度动态切换，简单子任务用 high、关键推理步骤用 xhigh。

推理强度

等级

基本用法

Anthropic 兼容接口

cURL

Node.js

Python

OpenAI 兼容接口

Python

Node.js

cURL

何时调整推理强度

推理强度与工具调用

最佳实践

相关资源