MoE
ZAYA1-8B: MoE Model Trained on AMD Beats GPT-5 at Math
ZAYA1-8B, a Mixture-of-Experts model with just 760M active parameters that was trained entirely on AMD GPUs, reportedly outperforms GPT-5-High on math benchmarks, challenging NVIDIA's dominance in frontier AI training.