publications

* denotes equal contribution

preprints

  1. arXiv
    The Effect of SGD Batch Size on Autoencoder Learning: Sparsity, Sharpness, and Feature Learning
    Nikhil GhoshSpencer FreiWooseok Ha, and Bin Yu
    arXiv preprint, 2023

conference & journal articles

2024

  1. ICLR
    More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory
    James B. SimonDhruva KarkadaNikhil Ghosh, and Mikhail Belkin
    2024

2023

  1. NeuRIPS
    spotlight
    Alternating Updates for Efficient Transformers
    In Advances in Neural Information Processing Systems (NeurIPS), 2023
  2. SIMODS
    A Universal Trade-off Between the Model Size, Test Loss, and Training Loss of Linear Predictors
    Nikhil Ghosh, and Mikhail Belkin
    SIAM Journal on Mathematics of Data Science (SIMODS), 2023
  3. ICLR
    Deconstructing Distributions: A Pointwise Framework of Learning
    In International Conference on Learning Representations (ICLR), 2023

2022

  1. ICLR
    The Three Stages of Learning Dynamics in High-Dimensional Kernel Methods
    Nikhil GhoshSong Mei, and Bin Yu
    In International Conference on Learning Representations (ICLR), 2022

2019

  1. NeuRIPS
    Landmark Ordinal Embedding
    Nikhil GhoshYuxin Chen, and Yisong Yue
    In Advances in Neural Information Processing Systems (NeurIPS), 2019