MiniMax launches all-modality Token Plan days after M2.7 release

Following last week’s M2.7 release, MiniMax is now introducing a flat-rate subscription that covers text, speech, music, video, and image generation under one bill.

Starting at $10 a month.

minimax featured image

Hot on the heels of last week’s M2.7 launch, MiniMax is now introducing the Token Plan — a flat-rate API subscription covering all of its models under a single key and a single bill. Text, speech, music, video, and image generation, previously billed separately, start at $10 per month.

M2.7 is a text-only model. The multimodal side of the Token Plan comes from four separate models bundled alongside it — Image 01, Speech 2.8, Music 2.5, and Hailuo 2.3 for video. Each has its own daily quota depending on the tier.

  • Starter — $10/month: 1,500 M2.7 requests per 5 hours. Text only.
  • Plus — $20/month: 4,500 M2.7 requests. Adds Image 01 (50 images/day) and Speech 2.8 (4,000 chars/day).
  • Plus Highspeed — $40/month: Same as Plus with dedicated highspeed throughput and double the image quota.
  • Max — $50/month: 15,000 M2.7 requests. Adds Music 2.5 (4 songs/day) and Hailuo 2.3 video (2 videos/day).
  • Max Highspeed — $80/month: Same as Max with higher quotas across all models.
  • Ultra Highspeed — $150/month: 30,000 M2.7 requests. Image 01 at 800/day, Speech 2.8 at 50,000 chars/day, Music 2.5 at 15 songs/day, Hailuo 2.3 at 5 videos/day.
  • The appeal: One key and one bill across five model types instead of managing separate accounts and unpredictable usage costs per modality.
  • The catch: Music and video are locked to Max tier and above. The $10 Starter gets M2.7 requests only.
MiniMax Token Plan tiers

The Bottom Line: MiniMax is not the only one going down this road. Stepfun launched a flat-rate coding plan for $6.99 yesterday. At this pace, predictable subscription pricing may simply become the standard expectation for AI API access — and pay-as-you-go the exception.

Check out MiniMax’s coding plans.

RunPod
RunPod

If you need on-demand GPUs for training, fine-tuning, inference, or running open-source models, give RunPod a try.

  • Available hardware: H100, H200, A100, L40S, RTX 4090, RTX 5090, and 30+ more
  • Cost: significantly cheaper than AWS or GCP, billed per second, no contracts
  • Setup: spins up in under a minute, 30+ regions worldwide
Try RunPod →
Affiliate disclosure: We may earn a commission if you sign up via our link, at no extra cost to you.
Efficienist Newsletter

Get the core business tech news delivered straight to your inbox. We track AI, automation, SaaS, and cybersecurity so you don't have to.

Just read what you want, and be done with it.

Read Next