Pricing

Two ways to pay, your choice

wylon Token Factory will offer both subscription plans and usage-based pricing.

Subscription plans

Pick the plan that fits your scale

Pro

For prototyping and personal projects.

??
  • Basic shared inference quota.
  • Standard rate limits.
  • Try out the full range of models.
  • Priority technical support.
Subscribe
Most popular

Max

For application builders and serious developers.

??
  • Everything in Pro.
  • 6× the 5-hour usage of Pro.
  • Higher rate limits.
  • Early access to new models and features.
Subscribe

Team

For teams and tailored deployments.

Custom
  • Shared token quota across the team.
  • Higher rate limits.
  • Priority onboarding and flexible scaling.
  • Commercial support and SLA guarantees.
Contact sales
Usage-based pricing

Per-million-token pricing

Family Model Tier Cached input (per 1M) Input (per 1M) Output (per 1M)
Kimi Kimi-K2.5 General ¥0.69 ¥3.99 ¥20.99
Kimi-K2.6 General ¥1.09 ¥6.49 ¥26.99
MiniMax MiniMax-M2.5 General ¥0.20 ¥2.09 ¥8.39
MiniMax-M2.7 General ¥0.41 ¥2.05 ¥8.22
GLM GLM-5.1 Input [0, 32k] ¥1.29 ¥5.99 ¥23.99
GLM-5.1 Input [32k, +∞) ¥1.99 ¥7.99 ¥27.99
Qwen Qwen3.6-35B-A3B Input [0, 128k] ¥0.08 ¥0.39 ¥3.19
Qwen3.6-35B-A3B Input [128k, 256k] ¥0.32 ¥1.59 ¥12.79
Qwen3.6-27B Input [0, 128k] ¥0.12 ¥0.59 ¥4.79
Qwen3.6-27B Input [128k, 256k] ¥0.36 ¥1.79 ¥14.39
DeepSeek DeepSeek-V4-Flash General ¥0.20 ¥0.99 ¥1.99

The table above is illustrative; the published rate card in your dashboard or sales quote is authoritative.

FAQ

When does my quota reset?

Rolling-window mechanism:

  • 5-hour window: starts at your first request and resets every 5 hours.
  • Weekly limit: starts at your first request and resets every 7 days.

You can see live remaining quota and reset times in the dashboard.

How is token usage billed?

Chat completions are billed based on the input and output token counts reported in the response's usage field. Input tokens served from the system-level cache are itemized separately; the published discount on the pricing page is authoritative.

Subscription or usage-based — which should I pick?

Subscriptions are a good fit for prototyping, fixed monthly budgets, and small teams. Usage-based pricing suits workloads with volatile traffic, per-project billing, or fine-grained cost attribution.

How do I upgrade my subscription?

In your dashboard, go to Benefits → Subscription, pick the plan you want, and complete checkout. Upgrades are prorated — the unused portion of your current plan is refunded, and the new plan's benefits take effect immediately.

沪ICP备2026010432号-1 沪公网安备31010402336632号