chore(pricing): Update fireworks-ai pricing #696

Closed
siddharthsambharia-portkey wants to merge 2 commits into main from pricing-update/fireworks-ai-24373741724
Conversation

@siddharthsambharia-portkey
Collaborator

🔄 Pricing Update: fireworks-ai

📊 Summary (complete_diff mode)

| Change Type | Count |
|---|---|
| ➕ Models added | 16 |
| 🔄 Models updated (merged) | 8 |

➕ New Models

  • deepseek-v3p1
  • deepseek-v3p2
  • glm-4p7
  • glm-5p1
  • gpt-oss-120b
  • gpt-oss-20b
  • qwen3-vl-30b-a3b-instruct
  • qwen3-vl-30b-a3b-thinking
  • llama-v3p3-70b-instruct
  • qwen3-8b
  • qwen3p6-plus
  • flux-1-dev-fp8
  • flux-1-schnell-fp8
  • flux-kontext-pro
  • flux-kontext-max
  • qwen3-embedding-8b

🔄 Updated Models

  • glm-5
  • kimi-k2-instruct-0905
  • kimi-k2-thinking
  • kimi-k2p5
  • minimax-m2p1
  • minimax-m2p5
  • minimax-m2p7
  • mixtral-8x22b-instruct

Model → Pricing Category Mapping

Named Families (exact page values)

| Model ID | Input | Cache Read | Output | Notes |
|---|---|---|---|---|
| deepseek-v3p1, deepseek-v3p2 | $0.56 | $0.28 (50%) | $1.68 | DeepSeek V3 family |
| glm-4p7 | $0.60 | $0.30 (50%) | $2.20 | GLM-4.7 |
| glm-5 | $1.00 | $0.20 (explicit) | $3.20 | GLM-5 |
| glm-5p1 | $1.40 | $0.26 (explicit) | $4.40 | GLM-5.1 |
| gpt-oss-120b | $0.15 | $0.075 (50%) | $0.60 | OpenAI GPT-OSS-120B |
| gpt-oss-20b | $0.07 | $0.035 (50%) | $0.30 | OpenAI GPT-OSS-20B |
| kimi-k2-instruct-0905, kimi-k2-thinking | $0.60 | $0.30 (50%) | $2.50 | Kimi K2 |
| kimi-k2p5 | $0.60 | $0.10 (explicit) | $3.00 | Kimi K2.5 |
| qwen3-vl-30b-a3b-instruct, qwen3-vl-30b-a3b-thinking | $0.15 | $0.075 (50%) | $0.60 | Qwen3 VL 30B A3B |
| minimax-m2p1 | $0.30 | $0.15 (50%) | $1.20 | MiniMax M2.1 (tier-based) |
| minimax-m2p5 | $0.30 | $0.03 (explicit) | $1.20 | MiniMax 2.5 |
| minimax-m2p7 | $0.30 | $0.06 (explicit) | $1.20 | MiniMax 2.7 |
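As a rough illustration of how the three price columns combine, here is a minimal sketch of a per-request cost estimate. It assumes the prices are USD per 1M tokens (the PR states that unit only for the embedding row) and that cached input tokens are billed at the Cache Read price instead of the Input price. The function name `request_cost` and its parameters are hypothetical and not part of this PR.

```python
from decimal import Decimal

TOKENS_PER_UNIT = 1_000_000  # assumption: prices are quoted per 1M tokens


def request_cost(
    input_price: Decimal,
    cache_read_price: Decimal,
    output_price: Decimal,
    uncached_input_tokens: int,
    cached_input_tokens: int,
    output_tokens: int,
) -> Decimal:
    """Estimate the USD cost of one request from per-1M-token prices (hypothetical helper)."""
    total = (
        input_price * uncached_input_tokens
        + cache_read_price * cached_input_tokens
        + output_price * output_tokens
    )
    return total / TOKENS_PER_UNIT


# Example using the kimi-k2p5 row: $0.60 input, $0.10 cache read, $3.00 output.
kimi_k2p5_cost = request_cost(
    Decimal("0.60"), Decimal("0.10"), Decimal("3.00"),
    uncached_input_tokens=1_000_000,
    cached_input_tokens=1_000_000,
    output_tokens=1_000_000,
)
```

`Decimal` is used rather than `float` so that small per-token prices do not pick up binary rounding noise.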

Tier-Based Text/Vision

| Model ID | Tier | Input/Output |
|---|---|---|
| llama-v3p3-70b-instruct | >16B | $0.90 |
| mixtral-8x22b-instruct | MoE 56.1–176B | $1.20 |
| qwen3-8b | 4B–16B | $0.20 |
| qwen3p6-plus | MoE 0–56B (est.) | $0.50 |

Image Models

| Model ID | Pricing |
|---|---|
| flux-1-dev-fp8 | $0.0005/step |
| flux-1-schnell-fp8 | $0.00035/step |
| flux-kontext-pro | $0.04/image |
| flux-kontext-max | $0.08/image |

Embedding

| Model ID | Input |
|---|---|
| qwen3-embedding-8b | $0.10/1M tokens |

Skipped

  • qwen3-reranker-8b (reranker, excluded per skill rules)

Cache rule: cache read is 50% of the input price (read only, no write charge), unless the named-family row specifies a value explicitly.
Batch rule: batch pricing is 50% of the serverless price for both input and output.
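The two rules above can be sketched as a pair of small functions. This is a minimal illustration, not the Pricing Agent's actual code: the names `cache_read_price` and `batch_prices` are hypothetical, and the example values come from the Named Families table.

```python
from decimal import Decimal
from typing import Optional, Tuple


def cache_read_price(input_price: Decimal, explicit: Optional[Decimal] = None) -> Decimal:
    """Cache rule: use the explicit named-family value if given, else 50% of input."""
    if explicit is not None:
        return explicit
    return input_price / 2


def batch_prices(input_price: Decimal, output_price: Decimal) -> Tuple[Decimal, Decimal]:
    """Batch rule: 50% of the serverless input and output prices."""
    return input_price / 2, output_price / 2


# deepseek-v3p1: no explicit cache price, so the 50% default applies.
deepseek_cache = cache_read_price(Decimal("0.56"))
# glm-5: the named-family row gives an explicit cache-read price of $0.20.
glm5_cache = cache_read_price(Decimal("1.00"), explicit=Decimal("0.20"))
# Batch prices for deepseek-v3p1 derived from $0.56 input and $1.68 output.
deepseek_batch = batch_prices(Decimal("0.56"), Decimal("1.68"))
```

Under these rules, rows marked "(50%)" in the table should match the default branch, while rows marked "(explicit)" override it.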


Generated by Pricing Agent on 2026-04-14
