Test-time compute is eating training-time compute
A new research consensus: spending more compute at inference time often beats spending more during training. What this means for the next generation of models.
A new research consensus: spending more compute at inference time often beats spending more during training. What this means for the next generation of models.