DeepSeek R1 0528, an updated version of the state-of-the-art DeepSeek R1 model, is now available. Try it now!

Build with our Developer Toolkit

Build Prototypes. Evaluate Fast. Scale with Confidence.

Fireworks Build SDK
Rapid Deployment

Experiment instantly with serverless. Scale to production with on-demand.

Go from idea to output in seconds—with just a prompt. Run the latest open models on Fireworks serverless, with no GPU setup or cold starts. Move to production with on-demand GPUs that auto-scale as you grow.

Agentic pipelines
Fine Tuning

Evaluate 100× faster with Multi-LoRA

Run hundreds of fine-tuned model variants in parallel on a single deployment. Fireworks' Multi-LoRA architecture reduces iteration costs and time by 100×. Easily collaborate across teams with shared deployments in unified workspaces.

Model Library
Tool Calling

Build powerful agents with tool use and memory

Build agents that reason, plan, and act reliably. Use structured tool calls (JSON, grammar mode) to trigger APIs, fetch data, and integrate business logic. Fireworks also supports memory primitives—so your models can retain and reuse context across interactions.

Modalities
Model Library

Leverage 1000s of models across multiple modalities

Use preloaded, optimized models or bring your own text, vision, audio, speech, image, and video modles. Build rich, multimedia experiences, from image understanding and generation, speech transcription, and voice agents without infrastructure overhead.