most people who mention ‘continual learning’ as a problem are really just talking about sample efficiency. clearly, you should ‘continually learn’ by continually training trajectories back into the model! there’s no mystery: this just doesn’t work when sample efficiency is low.