CLIMB: Clustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Publication
Advances in Neural Information Processing Systems (NeurIPS)
Jan Kautz
Team Leader
