In the last year or two, the most important trend in modern AI came to an end. The scaling-up of computational resources used to train ever-larger AI models through next-token prediction ( pre-training ) stalled out. Since late 2024, we’ve seen a new trend of using reinforcement learning (RL) in the