Scaling Laws for Neural Language Models
Scaling Data-Constrained Language Models Scaling Data-Constrained Language Models-Video
Chinchilla Scaling Laws for Large Language Models (LLMs)
Training Compute-Optimal Large Language Models