2026 Schedule
Preliminary Schedule. The schedule is subject to change.
Monday, June 1st, 2026
Badge Pick-up (Stone Lobby) & Breakfast (Ziskind Lounge) - 1st Floor
Conference Introduction and Keynote:
Keeping ChatGPT Fast in the Agentic Era
Break - Snacks & Coffee Available in Ziskind Lounge
SafeChat: Building AI-Powered Safety Systems at Scale in a Real-Time Marketplace
Trust and safety challenges become exponentially harder at marketplace scale, especially when interactions happen in real time.
Beyond Prompting: Context Engineering for Production-Grade AI
If you're building production-grade AI applications, you may know an uncomfortable truth: reliable LLM outputs require far more than clever prompting.
Zero Trust Agent Systems that Pass Audits and Still Ship
Most agentic AI demos assume a greenfield environment. In a real enterprise, agents run inside strict boundaries where security, compliance, and incident response are non-negotiable.
Navigating Global Compliance After the First Major AI Regulations
Details coming soon.
Break
Context Engineering at LinkedIn: How We Built an Organizational Context Layer for AI Agents with MCP
AI coding agents are powerful out of the box, but they don't know your company. They can't navigate your services, understand your frameworks, query your data systems, or follow your organizational processes.
Serving LLMs at Scale: The Hidden KV Cache Advantage
KV cache is the hidden lever behind inference cost and performance. It directly impacts GPU utilization, throughput, and Time to First Token.
From Natural Language to Trusted AI: A Hybrid Architecture for Safe, Accurate, and Context-Aware AI Query Generation
Modern enterprises want everyone, not just data engineers, to be able to ask questions of their data. But turning natural language into reliable, secure, and business-aware queries is far from simple.
From Consumers to Builders: Turning 200 of our Team into Agent Creators in 2 Weeks
Most organizations approach "AI democratization" by giving coding assistants to engineers and simplistic chatbots to business users. Yet, building sophisticated, high-value AI agents remains a bottleneck restricted to a few specialists.
From Prompt Hacking to Architectural Determinism: Engineering Reliable GenAI Systems
As generative AI moves from experimental novelties to mission-critical enterprise applications, the limitations of prompt engineering have become a significant bottleneck.
Inferencing for Enterprises
This presentation will cover what areas enterprises like JPMC consider to be most important when running inferencing at scale.
From AI Agent Demo to Production: Automated Testing and Evaluation
90% of the enterprise agents are stuck in POC. We don't know when the agents are good enough to deploy without a systematic testing and evaluation process.
There's a Mod for That: Opening Up AI Agent Development
AI agents are moving amazingly fast — but they could be moving even faster. Today's agents are monolithic: one vendor controls the model, the tools, and the context pipeline. That works, but it leaves an enormous amount of potential on the table.
Vibecoding your own Multi-agent Workstation
I miss spending hours tuning my IDE, my dotfiles, my neovim setup, my PS1 prompt, my tmux config. In 2023, my workstation was hyper-customized to my preferences and needs.Now that agents write all my code, that work has gone in the bin.
Leverage AI to Protect Your Data Assets... from AI
Most PII leaks happen before your prompt ever hits the model — here's how to catch them at the pipeline level with deterministic ETLs and synthetic stand-ins.
Break
Beyond the Prototype: Scaling Frame Agnostic AI Agent Infrastructure with Ray
Building an AI agent is easier than ever. However, moving from a local notebook to a production-grade "Agent Engine" that serves large scale web services poses complex problems.
Architecting the Data Layer for AI Agents: From Transactional Systems to MCP and Semantic Models
Most conversations about production AI agents focus on the agent itself — the prompts, the orchestration, the framework. But the moment you put an agent in front of real enterprise data, a different problem dominates: the data layer wasn't designed for this consumer.
Opening the Black Box: Critical Explainability for AI Agent Tool Selection with Kiji Inspector
AI agents that autonomously select and invoke tools are becoming ubiquitous—yet their decision-making remains a black box. When an agent chooses to query a database instead of searching the web, what internal representations drive that choice?
Agentspan: The Durable Runtime Your Agents Deserve
Most frameworks—like OpenAI Agents SDK, Google Agent Development Kit, and LangChain—run agents in memory. When the process dies, so does the agent. Fine for demos; fragile in production.
Break - Snacks & Coffee Available in Ziskind Lounge
From Models to Agents: Building Context-Aware Consumer AI at Scale at DoorDash
Consumer AI is rapidly evolving beyond one-shot predictions toward systems that reason, plan, and adapt in real time.
Decision-Driven Evaluation for Generative AI
This talk starts with an example of an evaluation suite which looked good: it was comprehensive and detailed, but we discovered that it did not provide the clarity we needed to make a launch decision.
Prompt to Prod: Engineering an Autonomous SDLC at Scale
Engineering organizations are hitting a Productivity Paradox. AI tools are generating more code than ever, yet the time it takes to ship that code safely to production in large-scale environments remains stagnant.
The Trust Lifecycle for Safe, Secure and Reliable AI Agents
Shipping an AI agent is easy; keeping it trustworthy in production is not. Point-in-time audits fall short as failures emerge and attacks evolve. Teams must assess agent behavior across risks before deployment—and learn from production.
Unconference Session: Brainstorming with Peers
What is Unconference?Unconference sessions are a simple way to run productive meetings for 5 to 2000 or more people, and a powerful way to lead any kind of organization in everyday practice and extraordinary change.
Terrace Lounge
Attendee Social - Snacks & Beverages - Located in Ziskind Lounge
Tuesday, June 2nd, 2026
Badge Pick-up (Stone Lobby) & Breakfast (Ziskind Lounge) - 1st Floor
Conference Introduction and Keynote:
The Five Stages of AI Maturity in Engineering Organizations — Where and Why Teams Get Stuck
Break - Snacks & Coffee Available in Ziskind Lounge
From Fab To Token - The State Of The Market
This talk delivers a data-driven summary on the physical and economic bottlenecks in the AI infrastructure market today. We will cover the diverging strategies between traditional hyperscalers and specialized "Neoclouds," supported by deep supply chain data.
Building GenAI Platform at DoorDash
When we started adopting LLMs across DoorDash, every team was implementing the same infrastructure: retry logic, fallback mechanisms, cost tracking, prompt versioning, and batch processing pipelines. Engineering time was wasted on repetitive plumbing work instead of building features.
Scaling LLM-Based Ranking Systems for Latency-Critical Search & Recommendation Workloads
Large Language Models are powerful — but deploying them in latency-critical ranking systems is a fundamentally different problem than building chat applications.
AI First, Quality Always: Agentic SDLC Adoption Case Study
Every engineering organization is under pressure to go "AI First" — fast. But the rush to show AI-driven productivity creates a dangerous incentive: measure speed and volume instead of quality and trust.
From Demo to Production: Why Agentic AI Systems Fail—and How to Fix Them
Most agentic AI systems look compelling in a demo. The agent reasons, calls tools, generates output. Then you try to run it in production, and the non-deterministic behavior of LLMs hits you fast.
Break
Building Reusable Evaluation Frameworks for Agentic AI Products
This talk covers methods of evaluating AI Agents, with an example of how we built evaluation frameworks for a user-facing AI Agent system that has been in production for almost two years.
Multi-Agent Patterns from Spotify’s AI Powered Advertising Platform
Spotify's advertising platform spans audience targeting, ad creative generation, campaign goal resolution, budget optimization, and ad campaign recommendations -- workflows that touch different data sources, APIs, and business rules.
From Hype to Habit: Conquering AI Adoption at Zoox
Most organizations have purchased AI developer tools. Most are disappointed with adoption. The problem usually isn't the tool, it's that we treat AI adoption like a software rollout when it's actually a behavior change problem.
The Agent Harness: Control Planes, Invariants, and Approval Boundaries for Production AI Agents
AI agents can look autonomous, but what keeps them reliable in production is the harness around the model.
Deterministic Context at Scale: Architecting AI for Reliable SDLC Delivery
Most enterprise AI initiatives stall not because of the model, but because of missing context.
Adaptive Recommenders in the Real World: Inference, Evals, and System Design
Modern personalization systems are shifting from hand-tuned heuristics to AI-native architectures, but building an adaptive recommendation engine in production—one that continuously learns, evolves, and delivers measurable business value—requires far more than deploying a model.
Keeping Engineers Engaged When the Machine Does the Heavy Lifting
Details coming soon.
Actors for Agents: Patterns for Production AI
We started with the obvious stack: a prompt-chain framework, some glue code, a queue for the slow parts. It demoed beautifully.
Solving Context Bloat: Semantic Tool Routing in Multi-Server MCP Environments
Agentic systems adopting MCP face a scalability hurdle: managing interactions with numerous servers exposing dozens or hundreds of tools.
Break
What 25 Trillion Tokens Reveal About How Developers Actually Adopt AI Agents
Most AI coding tools are benchmarked on capability. But capability doesn't predict adoption. At Kilo Code, we've processed over 25 trillion tokens across 1.5M+ developers, giving us one of the clearest real-world datasets on how engineering teams actually integrate AI into their workflows.
Finetuning Your Embedding Model for Better Search: Learning from Agentic Search at Dell
Embedding models are a foundational component of modern ML systems, enabling clustering, ranking, and large-scale information retrieval.
Decision Models in Agentic Architectures: From Production to Agent Skills
In agentic architectures, not every step can be probabilistic. When an agent needs to decide whether to approve a loan, which treatment protocol to apply to a patient, or whether to pay an insurance claim, the output needs to be deterministic and explainable.
Building the Context Engine AI Agents Need
Every AI coding tool can generate code. Very few can generate the right code for your organization, because they're missing context.
Break - Snacks & Coffee Available in Ziskind Lounge
From Retrieval to Reasoning: Building Production-Ready Agentic AI Systems with Knowledge Graphs
The evolution of AI systems is shifting from simple retrieval to autonomous reasoning.
Batch Intelligence at Scale: Cost-Efficient Multi-Agent LLM Workflows with Built-In Resilience
Most teams building production LLM systems face a hard tradeoff: premium models with low latency or cost-efficient inference with unpredictable quality. This talk presents a third path, one validated at Walmart, the largest retailers in the world.
Unconference Session: Brainstorming with Peers
What is Unconference?Unconference sessions are a simple way to run productive meetings for 5 to 2000 or more people, and a powerful way to lead any kind of organization in everyday practice and extraordinary change.
Terrace Lounge
Closing Reception - Snacks & Beverages - Located in Ziskind Lounge