Scaling out training clusters another 10X maybe no longer being financially worth it doesn't mean that the bitter lesson no longer applies. The bitter lesson points to ideas that pay off more effectively with the scaled compute we have today. The bitter lesson is here to stay.
7,19K