chore(pricing): Update fireworks-ai pricing by siddharthsambharia-portkey · Pull Request #696 · Portkey-AI/models

siddharthsambharia-portkey · 2026-04-14T00:26:50Z

🔄 Pricing Update: fireworks-ai

📊 Summary (complete_diff mode)

Change Type	Count
➕ Models added	16
🔄 Models updated (merged)	8

➕ New Models

deepseek-v3p1
deepseek-v3p2
glm-4p7
glm-5p1
gpt-oss-120b
gpt-oss-20b
qwen3-vl-30b-a3b-instruct
qwen3-vl-30b-a3b-thinking
llama-v3p3-70b-instruct
qwen3-8b
qwen3p6-plus
flux-1-dev-fp8
flux-1-schnell-fp8
flux-kontext-pro
flux-kontext-max
qwen3-embedding-8b

🔄 Updated Models

glm-5
kimi-k2-instruct-0905
kimi-k2-thinking
kimi-k2p5
minimax-m2p1
minimax-m2p5
minimax-m2p7
mixtral-8x22b-instruct

Model → Pricing Category Mapping

Named Families (exact page values)

Model ID	Input	Cache Read	Output	Notes
deepseek-v3p1, deepseek-v3p2	$0.56	$0.28 (50%)	$1.68	DeepSeek V3 family
glm-4p7	$0.60	$0.30 (50%)	$2.20	GLM-4.7
glm-5	$1.00	$0.20 (explicit)	$3.20	GLM-5
glm-5p1	$1.40	$0.26 (explicit)	$4.40	GLM-5.1
gpt-oss-120b	$0.15	$0.075 (50%)	$0.60	OpenAI GPT-OSS-120B
gpt-oss-20b	$0.07	$0.035 (50%)	$0.30	OpenAI GPT-OSS-20B
kimi-k2-instruct-0905, kimi-k2-thinking	$0.60	$0.30 (50%)	$2.50	Kimi K2
kimi-k2p5	$0.60	$0.10 (explicit)	$3.00	Kimi K2.5
qwen3-vl-30b-a3b-instruct, qwen3-vl-30b-a3b-thinking	$0.15	$0.075 (50%)	$0.60	Qwen3 VL 30B A3B
minimax-m2p1	$0.30	$0.15 (50%)	$1.20	MiniMax M2.1 (tier-based)
minimax-m2p5	$0.30	$0.03 (explicit)	$1.20	MiniMax 2.5
minimax-m2p7	$0.30	$0.06 (explicit)	$1.20	MiniMax 2.7

Tier-Based Text/Vision

Model ID	Tier	Input/Output
llama-v3p3-70b-instruct	>16B	$0.90
mixtral-8x22b-instruct	MoE 56.1–176B	$1.20
qwen3-8b	4B–16B	$0.20
qwen3p6-plus	MoE 0–56B (est.)	$0.50

Image Models

Model ID	Pricing
flux-1-dev-fp8	$0.0005/step
flux-1-schnell-fp8	$0.00035/step
flux-kontext-pro	$0.04/image
flux-kontext-max	$0.08/image

Embedding

Model ID	Input
qwen3-embedding-8b	$0.10/1M tokens

Skipped

qwen3-reranker-8b (reranker, excluded per skill rules)

Cache rule: 50% of input price (read only, no write charge), unless named-family row specifies explicitly.
Batch rule: 50% of serverless input AND output.

Generated by Pricing Agent on 2026-04-14

siddharthsambharia-portkey added 2 commits April 14, 2026 05:56

chore(pricing): Update fireworks-ai pricing

9a77a5e

chore(general): Add 16 new fireworks-ai model configs

102f560

siddharthsambharia-portkey closed this Apr 15, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore(pricing): Update fireworks-ai pricing#696

chore(pricing): Update fireworks-ai pricing#696
siddharthsambharia-portkey wants to merge 2 commits intomainfrom
pricing-update/fireworks-ai-24373741724

siddharthsambharia-portkey commented Apr 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

siddharthsambharia-portkey commented Apr 14, 2026

🔄 Pricing Update: fireworks-ai

📊 Summary (complete_diff mode)

➕ New Models

🔄 Updated Models

Model → Pricing Category Mapping

Named Families (exact page values)

Tier-Based Text/Vision

Image Models

Embedding

Skipped

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant