Reasoning Effort
The reasoning effort parameter lets you control how much reasoning depth Ring-2.6-1T applies when responding, balancing response quality against token efficiency. This parameter is currently only available for Ring-2.6-1T.
Effort Levels
Ring-2.6-1T supports two reasoning effort levels:
| Level | Description | Typical Use Cases |
|---|---|---|
high | Default. Designed for high-frequency Agent workflows with lower token cost and faster multi-step execution. | Multi-turn interaction, tool collaboration, task decomposition, production defaults |
xhigh | Designed for high-difficulty tasks. Provides more thorough reasoning space at higher token cost. | Math competitions, research analysis, complex logical reasoning, multi-path exploration |
Choosing a level:
- Omitting the parameter is equivalent to
high— suitable for most Agent and production scenarios. - Use
xhighwhen the task demands maximum reasoning depth and you’re less sensitive to cost or latency. xhighconsumes significantly more tokens. Start withhighand upgrade only when results don’t meet your quality bar.
Basic Usage
Ant Ling supports both the Anthropic API and the OpenAI-compatible API. The effort parameter field differs between the two styles.
Anthropic API
Use the /anthropic/v1/messages endpoint with output_config.effort:
Python
import anthropic
client = anthropic.Anthropic(
base_url="https://api.ant-ling.com",
api_key="YOUR_API_KEY"
)
response = client.messages.create(
model="Ring-2.6-1T",
messages=[
{"role": "user", "content": "Analyze the trade-offs between microservices and monolithic architectures"}
],
extra_body={
"output_config": {"effort": "xhigh"}
}
)
print(response.content[0].text)OpenAI-compatible API
Use the /v1/chat/completions endpoint with reasoning.effort:
Python
from openai import OpenAI
client = OpenAI(
base_url="https://api.ant-ling.com/v1/",
api_key="YOUR_API_KEY"
)
response = client.chat.completions.create(
model="Ring-2.6-1T",
messages=[
{"role": "user", "content": "Analyze the trade-offs between microservices and monolithic architectures"}
],
extra_body={
"reasoning": {"effort": "xhigh"}
}
)
print(response.choices[0].message.content)When to Adjust Reasoning Effort
- Use
high(default): High-frequency Agent workflows, multi-turn tool collaboration, task decomposition, and production deployments. Prioritizes speed and cost for the majority of use cases. - Use
xhigh: Math competition problems, research paper analysis, complex logical reasoning, and multi-path exploration — tasks that demand the deepest possible reasoning.
Reasoning Effort and Tool Use
The effort level affects tool-calling behavior. At xhigh, Ring-2.6-1T tends to:
- Plan tool call sequences more carefully
- Reason more thoroughly between steps in multi-tool tasks
- Perform deeper analysis on tool responses
At high, the model completes tool collaboration more efficiently with less intermediate reasoning overhead — well suited for high-frequency automated pipelines.
Best Practices
- Start with the default:
highdelivers excellent results for most scenarios. Evaluate the default before upgrading. - Match effort to task type: Use
highfor coding agents, tool-use workflows, and task decomposition; usexhighfor math reasoning, research analysis, and complex decision-making. - Monitor token usage:
xhighconsumes significantly more tokens. Track usage in production to control costs. - Adjust dynamically: In complex pipelines, switch effort per subtask —
highfor routine steps,xhighfor critical reasoning steps.
Related Resources
- Ring Model Overview - Learn about Ring-2.6-1T’s capabilities and use cases
- Streaming - Use streaming responses to reduce perceived latency in reasoning tasks
- Quick Start - Make your first API call in 5 minutes