Deploy San Francisco

The Conference for the Inference Era

Golden Gate Bridge at night

DigitalOcean Deploy 26

The Conference for the Inference Era

April 28, 2026
San Francisco - Convene 100 Stockton
12:00pm - 8:00pm PT
Mainstage keynote streamed live
Real companies. Real inference workloads. Running today.

Join the engineering leaders building AI products at scale—and see how they operate inference systems with predictable performance and economics.

The hardest problem in AI today is no longer just training models. It’s operating inference in production: latency, throughput, system reliability, and cost efficiency.

Deploy is where you will see how companies running AI in production solve these challenges by operating inference systems with performance, reliability, and predictable economics at scale.

Deploy conference presentation
Deploy conference presentation
  • The Conference for the Inference Era

  • The Conference for the Inference Era

Deploy conference attendees
Deploy conference speaker
Why Attend Deploy
  • See How Real AI Companies Run Inference at Scale Hear from companies already running inference at scale and learn how they handle spiking traffic, latency constraints, and real cost pressure.
  • Learn the Architecture Behind Full-for-Scale AI Products Routing, batching, caching, autoscaling, quantization, and GPU orchestration. Learn how these systems fit together effectively in real production stacks.

    As workloads shift toward multimodal and agentic loops, the teams that win will be the ones who design the right systems around their models.
  • Control the Economics of AI Products Walk away with practical frameworks to improve TTFT, tail latency, throughput per GPU, and cost per outcome without getting surprised by volatility, egress, or runaway retries.

    If you can't deliver high inference performance with predictable unit costs at scale, you don't have a product. You have a burn rate.
  • Meet the Teams Building the Next Generation of AI Products Connect with founders, CTOs, and engineering leaders running AI systems in production today. Compare approaches, share lessons learned, and connect with peers solving the same challenges of scaling inference reliably and cost-effectively.

Meet the Speakers

Paddy Srinivasan

Paddy Srinivasan

CEO, DigitalOcean

Vinay Kumar

Vinay Kumar

CPTO, DigitalOcean

Matt Makai

Matt Makai

VP of Developer Relations, DigitalOcean

Meghan Grady

Meghan Grady

Senior Director, Marketing & Communications, DigitalOcean

Agenda at a Glance

Smarter Routes Through the AI Model Maze

Scale your AI features without scaling your bill. Learn how the Intelligent Router automatically optimizes model paths to protect your unit economics.

Register now

The Shortcut to Shipping AI

Don't let infrastructure stall your roadmap. Learn how to integrate and ship production-ready AI features into your existing applications in record time.

Register now

Out of the Silo and Into Your Production Stack

AI belongs in your stack, not in a silo. See how a unified platform handles the security, data, and observability requirements of a production-grade AI feature.

Register now

The Machinery Behind the Magic

A technical deep-dive into the core inference engine. Master the transition from Serverless to Dedicated GPUs and the logic behind the Intelligent Router.

Register now

The Deploy 2026 agenda is taking shape - check back in for some more exciting updates!

Secure your seat in San Francisco

April 28, 2026 • 12:00pm – 8:00pm PT
📍 Convene 100 Stockton

Join the technical leaders and executives building the next generation of AI-native companies at Deploy, the Conference for the Inference Era.

Select a country
Are you attending the event in-person in San Francisco or virtually?*
Are you a DigitalOcean customer?

FAQ

When and where is Deploy?

Deploy 2026 will be hosted in person at Convene 100 Stockton, 40 O'Farrell St, San Francisco. The mainstage keynote will also be streamed live to registrants.

Who should attend Deploy?

Deploy is designed for teams responsible for managing or building AI workloads in production at scale.

Why should I attend Deploy?

This is a special Deploy that represents an evolution of cloud infrastructure that will change the way companies with AI in production conceive of their businesses. DigitalOcean's vertically integrated agentic inference cloud delivers radically simple operations and predictable unit economics that will set AI-natives on a path to success and growth.

Is there a cost to attend?

No. Deploy is free to attend. See you in San Francisco.

Is there a code of conduct?