AlpaSim. E2E Closed Loop Challenge
2026 · NVIDIA AV Research · KE:SAI

AlpaSim E2E Closed Loop Challenge

A closed-loop benchmark for evaluating autonomous driving policies.

01 Overview

Welcome to the AlpaSim End to End Closed Loop Challenge. This competition invites teams to build autonomous driving policies and compare them head-to-head in realistic closed-loop simulation, where each policy's decisions shape the future scene it must handle.

Autonomous driving research has made major progress, but it remains hard to compare policies across labs and companies in a realistic, reproducible way. Open-loop evaluation is useful, but it misses the compounding effects that make driving hard: a small planning error can change future observations, interactions, and risk.

AlpaSim provides a shared simulator, public development data, starter tools, baseline policies, and a common containerized submission interface. Organizer-managed evaluation workers run submissions on private held-out scenarios and publish leaderboard results with both a policy capability score and a safety metric.

Across both tracks, the goal is not just to crown a winner. We want to learn which policy families are robust under distribution shift, where they fail, and how the community can make AV evaluation more trustworthy.

The challenge is currently in a soft-open period. Registration and leaderboard access are available now so teams can begin exercising the submission workflow, validating containers, and calibrating against the current evaluation service while final data updates are prepared.

02 Challenge Tracks

The competition has two complementary tracks, covering both large-scale geographically diverse driving data and a lower-barrier entry point for teams working with established nuPlan-style workflows.

Track 1 - PAI-AV

Physical AI AV Track

The larger-scale setting for testing whether promising policies hold up as scenario diversity and long-tail coverage increase.

The current test set includes a subset of the final scenes. Additional scenes will be added later in the competition.

Track 2 - nuPlan

nuPlan Track

A lower-barrier track for teams building on the widely used nuPlan ecosystem or NAVSIM-style development workflows.

The nuPlan leaderboard is currently backed by a provisional dataset. Teams are encouraged to submit during this period to test their workflow and understand the evaluation process. When the updated nuPlan data is available, the nuPlan leaderboard will be cleared and each team's best submission will be resubmitted by the organizers.

03 Timeline

  1. 2026-06-15 Competition goes live Registration, public data, starter tools, and submission instructions open.
  2. 2026-08-15 Mid-competition checkpoint FAQ updates, leaderboard-health notes, and non-breaking clarifications.
  3. 2026-09-15 Rules and submission format freeze Final rules, metric implementation, Docker base image, and public submission format freeze except for critical fixes.
  4. 2026-10-31 Public leaderboard closes Teams select one final valid container per track. Technical reports are due.
  5. 2026-11-15 Final results released Final results and award decisions are released to participants.
  6. NeurIPS 2026 Competition track workshop Winners and selected participants present results; organizers share final analysis and lessons learned.

04 Prizes

Each track will award two NVIDIA DGX Spark prizes: one for first place and one for an innovative solution.

DGX

PAI-AV Track

Two NVIDIA DGX Spark prizes awarded.

DGX

nuPlan Track

Two NVIDIA DGX Spark prizes awarded.

05 Documentation Links

06 Organizers

This competition is hosted by NVIDIA's Autonomous Vehicle Research Group, KE:SAI, and HKU.

07 Leaderboard