Leading open-source models
wylon Token Factory serves the leading open-source model families — MiniMax, Kimi, GLM, Qwen, DeepSeek — all running on wylon's GPU super-node architecture behind a single unified API.
Frontiers
MiniMax
Long-context, productivity-grade workloads — document processing, summarization, multi-turn business conversations.
Kimi
Solid on multimodal, ultra-long context, and code — a popular foundation for agents and developer tooling.
GLM
Strong on Chinese corpora — covers general chat, tool use, and agentic workflows.
Qwen
Full lineup from on-device tiny models to flagship MoEs, balancing performance and cost.
DeepSeek
Reputation for reasoning, code, and math — its MoE line offers excellent price-performance.
And more
See the full list, context lengths, and pricing in the model matrix below and on the pricing page.
View pricing →Other
Refer to the live service for the authoritative model list. See the pricing page or your console for current pricing.
Single wylon API
All models share the same endpoint — just switch the model field to compare families.
FAQ
Do I need to change my code when switching models?
Just swap the model field — the request shape stays the same. If you rely on tool calls or structured output, we recommend running a regression pass after switching.
Will I be notified when new models go live?
Yes. wylon posts release announcements in the dashboard, and major changes are sent ahead of time.
Can enterprises request private access?
Please contact us — our solutions team can help scope a plan with you.