chore(pricing): Update fireworks-ai pricing #549

Open
siddharthsambharia-portkey wants to merge 36 commits into main from pricing-update/fireworks-ai
siddharthsambharia-portkey (Collaborator) commented Mar 17, 2026

🔄 Pricing Update: fireworks-ai

📊 Summary (complete_diff mode)

| Change Type | Count |
| --- | --- |
| ➕ Models added | 17 |
| 🔄 Models updated (merged) | 4 |

➕ New Models

  • deepseek-v3p1
  • deepseek-v3p2
  • glm-4p7
  • glm-5
  • qwen3-vl-30b-a3b-instruct
  • qwen3-vl-30b-a3b-thinking
  • gpt-oss-120b
  • gpt-oss-20b
  • minimax-m2p1
  • minimax-m2p5
  • llama-v3p3-70b-instruct
  • qwen3-8b
  • flux-1-dev-fp8
  • flux-1-schnell-fp8
  • flux-kontext-pro
  • flux-kontext-max
  • qwen3-embedding-8b

🔄 Updated Models

  • kimi-k2-instruct-0905
  • kimi-k2-thinking
  • kimi-k2p5
  • mixtral-8x22b-instruct

Model → Pricing Category Mapping

Named Families (exact values from pricing page)

| Model ID | Pricing Row | Input ($/1M) | Output ($/1M) | Cache Read ($/1M) |
| --- | --- | --- | --- | --- |
| deepseek-v3p1 | DeepSeek V3 family | $0.56 | $1.68 | $0.28 (50% of input) |
| deepseek-v3p2 | DeepSeek V3 family | $0.56 | $1.68 | $0.28 (50% of input) |
| glm-4p7 | GLM-4.7 | $0.60 | $2.20 | $0.30 (50% of input) |
| glm-5 | GLM-5 | $1.00 | $3.20 | $0.20 (page-specified) |
| qwen3-vl-30b-a3b-instruct | Qwen3 VL 30B A3B | $0.15 | $0.60 | $0.075 (50% of input) |
| qwen3-vl-30b-a3b-thinking | Qwen3 VL 30B A3B | $0.15 | $0.60 | $0.075 (50% of input) |
| kimi-k2-instruct-0905 | Kimi K2 Instruct | $0.60 | $2.50 | $0.30 (50% of input) |
| kimi-k2-thinking | Kimi K2 Thinking | $0.60 | $2.50 | $0.30 (50% of input) |
| kimi-k2p5 | Kimi K2.5 | $0.60 | $3.00 | $0.10 (page-specified) |
| gpt-oss-120b | OpenAI gpt-oss-120b | $0.15 | $0.60 | $0.075 (50% of input) |
| gpt-oss-20b | OpenAI gpt-oss-20b | $0.07 | $0.30 | $0.035 (50% of input) |
| minimax-m2p1 | MiniMax M2 family | $0.30 | $1.20 | $0.03 (page-specified) |
| minimax-m2p5 | MiniMax M2 family | $0.30 | $1.20 | $0.03 (page-specified) |
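
The Cache Read column appears to follow one rule: use the value given on the pricing page when there is one, otherwise take 50% of the input price. A minimal sketch of that rule follows. The names `PAGE_SPECIFIED_CACHE_READ` and `cache_read_price` are hypothetical and are not taken from the Pricing Agent's code.

```python
from decimal import Decimal

# Cache-read prices marked "page-specified" in the table above.
# Hypothetical lookup table for illustration only.
PAGE_SPECIFIED_CACHE_READ = {
    "glm-5": Decimal("0.20"),
    "kimi-k2p5": Decimal("0.10"),
    "minimax-m2p1": Decimal("0.03"),
    "minimax-m2p5": Decimal("0.03"),
}


def cache_read_price(model_id: str, input_price: Decimal) -> Decimal:
    """Return the page-specified cache-read price, else 50% of the input price."""
    if model_id in PAGE_SPECIFIED_CACHE_READ:
        return PAGE_SPECIFIED_CACHE_READ[model_id]
    return input_price * Decimal("0.5")
```

For example, `cache_read_price("deepseek-v3p1", Decimal("0.56"))` gives the $0.28 shown for the DeepSeek V3 family. `Decimal` is used so that prices such as $0.035 are not subject to binary floating-point rounding.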

Tier-Based

| Model ID | Tier | Input ($/1M) | Output ($/1M) |
| --- | --- | --- | --- |
| llama-v3p3-70b-instruct | >16B parameters | $0.90 | $0.90 |
| mixtral-8x22b-instruct | MoE 56.1B–176B | $1.20 | $1.20 |
| qwen3-8b | 4B–16B parameters | $0.20 | $0.20 |
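
To sanity-check the tier-based rows, a request cost can be estimated as below. This is only a sketch: it assumes the prices are USD per 1M tokens (the PR states the unit explicitly only for embeddings), and `TIER_PRICES` and `request_cost` are hypothetical names.

```python
from decimal import Decimal

# (input, output) prices from the tier-based table.
# Assumption: USD per 1M tokens.
TIER_PRICES = {
    "llama-v3p3-70b-instruct": (Decimal("0.90"), Decimal("0.90")),
    "mixtral-8x22b-instruct": (Decimal("1.20"), Decimal("1.20")),
    "qwen3-8b": (Decimal("0.20"), Decimal("0.20")),
}
TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def request_cost(model_id: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts."""
    input_price, output_price = TIER_PRICES[model_id]
    total = input_tokens * input_price + output_tokens * output_price
    return total / TOKENS_PER_PRICE_UNIT
```

Because each tier charges the same rate for input and output, the result depends only on the total token count for these three models.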

Image Generation

| Model ID | Pricing Type | Price |
| --- | --- | --- |
| flux-1-dev-fp8 | Per step | $0.0005/step |
| flux-1-schnell-fp8 | Per step | $0.00035/step |
| flux-kontext-pro | Per image | $0.04/image |
| flux-kontext-max | Per image | $0.08/image |
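
The two image pricing types scale differently: per-step models cost more as the number of inference steps grows, while per-image models cost the same regardless of steps. The sketch below illustrates the difference. `IMAGE_PRICES` and `image_cost` are hypothetical names, and the step count is a caller-supplied assumption, not a value from this PR.

```python
from decimal import Decimal

# (billing unit, USD price per unit) from the image generation table.
IMAGE_PRICES = {
    "flux-1-dev-fp8": ("step", Decimal("0.0005")),
    "flux-1-schnell-fp8": ("step", Decimal("0.00035")),
    "flux-kontext-pro": ("image", Decimal("0.04")),
    "flux-kontext-max": ("image", Decimal("0.08")),
}


def image_cost(model_id: str, images: int = 1, steps_per_image: int = 1) -> Decimal:
    """Estimate the USD cost of generating `images` images."""
    unit, price = IMAGE_PRICES[model_id]
    if unit == "step":
        # Per-step models are billed for every inference step of every image.
        return price * steps_per_image * images
    # Per-image models ignore the step count.
    return price * images
```

With an illustrative 28 steps, `image_cost("flux-1-dev-fp8", images=1, steps_per_image=28)` comes to $0.014, while `flux-kontext-pro` stays at $0.04 per image whatever the step count.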

Embeddings

| Model ID | Input $/1M |
| --- | --- |
| qwen3-embedding-8b | $0.10 |

Skipped

  • qwen3-reranker-8b — Reranker model (excluded per rules)

Generated by Pricing Agent on 2026-03-26
