Xfyun: Xfyun Realtime ASR LLM
科大讯飞实时语音转写大模型,来自 new_rta 服务,用于 FinClaw 语音输入
Overview for Xfyun Realtime ASR LLM
A compact summary of this model's best-fit workloads, routing status and production integration shape.
Model design signal
Recent public usage volume
Composite coding and reasoning signal
Routing snapshot
Gateway reports this model ready for traffic
Context and limits
Pricing snapshot
Integration checklist
Capability profile
Xfyun policies apply for retention, training and regional handling.
Parameter support
max_tokenstemperaturetop_pProviders for Xfyun Realtime ASR LLM
EtherGate routes requests to configured providers that can handle your prompt size and parameters, with fallbacks to maximize uptime. uptime.
Performance for Xfyun Realtime ASR LLM
Compare different providers across EtherGate
Throughput
30dLatency
30dE2E Latency
30dTool Call Error Rate
30dStructured Output Error Rate
30dEffective Pricing for Xfyun Realtime ASR LLM
Actual cost per million tokens across providers over the past hour
Weighted Average
| Provider | Input Price | Output Price | Cache Hit Rate |
|---|---|---|---|
| Xfyun | Provider priced | Provider priced | 42% |
Input Price / 1M tokens
7dOutput Price / 1M tokens
7dBenchmarks for Xfyun Realtime ASR LLM
Performance metrics and benchmarks
Ranking Distribution
Category Performance
Composite coding and reasoning signal
Models Arena
3D
53.0% Win · 170.0s Avg
ASCII
50.0% Win · 152.0s Avg
Code Categories
53.4% Win · 254.7s Avg
Data Visualization
53.3% Win · 236.9s Avg
Game Development
54.8% Win · 270.4s Avg
Image to HTML
46.7% Win · 116.4s Avg
SVG
55.2% Win · 133.4s Avg
UI Component
54.5% Win · 252.7s Avg
Website
51.8% Win · 306.2s Avg
Apps using Xfyun Realtime ASR LLM
Top public apps this month
Available: Gateway reports this model ready for traffic
Open-source agent workspace for model routing
AI coding agent connected to EtherGate
Applied research lab for frontier models
Developer tool using OpenAI-compatible APIs
VS Code extension for model-assisted coding
Monthly token usage
Recent activity on Xfyun Realtime ASR LLM
Total usage per day on EtherGate
Uptime stats for Xfyun Realtime ASR LLM
Uptime stats for Xfyun Realtime ASR LLM across configured providers
Sample code and API for Xfyun Realtime ASR LLM
EtherGate normalizes requests and responses across providers for you.
EtherGate supports reasoning-enabled models that can return step-by-step reasoning metadata. Use the reasoning parameter when a provider supports it, and preserve reasoning_details when continuing a conversation. Learn more about reasoning tokens.
Request
Use the OpenAI-compatible Chat Completions endpoint. Keep your SDK, change the base URL, API key and model ID.
import OpenAI from "openai";
const client = new OpenAI({
apiKey: process.env.ETHERGATE_API_KEY,
baseURL: "https://ethergate.cn/v1",
});
const completion = await client.chat.completions.create({
model: "xfyun/rtasr-llm",
messages: [{ role: "user", content: "Explain your best use case." }],
});Tool configuration examples
Use this model ID directly in agent tools that accept OpenAI-compatible endpoints.
curl -fsSL 'https://ethergate.cn/console/skills/ethergate-provider/install.sh' | bash -s -- --target qclaw --api-key '<your-ethergate-key>' --locale 'en'Required headers
Bearer $ETHERGATE_API_KEYContent-Type: application/jsonhttps://ethergate.cn/v1Optional headers
Optional attribution headers help you identify traffic by app or website when debugging usage.