# MoE Competition Documentation

This site gives competition participants the technical background needed to reason about
Mixture-of-Experts (MoE) load balancing, with a focus on DeepSeek-style sparse expert models.

The competition simulator evaluates dynamic expert placement policies using trace-derived expert
hotness. These notes explain why expert routing creates load imbalance, how DeepSeek-style MoE
models route tokens, and what the simulator metrics are intended to capture.

```{toctree}
:maxdepth: 2
:caption: Contents

deepseek-overview
transformers-intro
mixture-of-experts
deepseek-moe-routing
competition-context
simulator-and-submissions
references
```