Research

Interests

My current research interests regard the theory of deep learning. I am interested in understanding the optimization and statistical properties of various architectures, ranging from shallow networks to generative models. I am also starting to work on AI for maths.

Previously, I worked on Natural Language Processing and on Quasi-Monte Carlo methods.

Publications

See also my Google Scholar. Stars denote first co-authors.

Preprints

J. Wu*, P. Marion*, P. Bartlett, Large Stepsizes Accelerate Gradient Descent for Regularized Logistic Regression, arXiv preprint, June 2025. Online access
R. Maulen-Soto, P. Marion, C. Boyer, Attention-based clustering, arXiv preprint, May 2025. Online access

Peer-reviewed Publications

Y.-H. Wu, P. Marion, G. Biau, C. Boyer, Taking a Big Step: Large Learning Rates in Denoising Score Matching Prevent Memorization, COLT 2025, July 2025. Online access
P. Marion, A. Korba, P. Bartlett, M. Blondel, V. De Bortoli, A. Doucet, F. Llinares-López, C. Paquette, Q. Berthet, Implicit Diffusion: Efficient Optimization through Stochastic Sampling, AISTATS 2025, oral presentation, May 2025. Online access
P. Marion, R. Berthier, G. Biau, C. Boyer, Attention layers provably solve single-location regression, ICLR 2025, April 2025. Online access
P. Marion, A.Fermanian, G. Biau, J.P. Vert, Scaling ResNets in the Large-depth Regime, Journal of Machine Learning Research, February 2025. Online access
P. Marion, L. Chizat, Deep linear networks for regression are implicitly regularized towards flat minima, NeurIPS 2024, December 2024. Online access
P. Marion*, Y.-H. Wu*, M. Sander, G. Biau, Implicit regularization of deep residual networks towards neural ODEs , ICLR 2024, spotlight presentation, May 2024. Online access
P. Marion, Generalization bounds for neural ordinary differential equations and deep residual networks, NeurIPS 2023, December 2023. Online access
P. Marion, R. Berthier, Leveraging the two-timescale regime to demonstrate convergence of neural networks, NeurIPS 2023, December 2023. Online access
A. Fermanian*, P. Marion*, J.P. Vert, G. Biau, Framing RNN as a kernel method: A neural ODE approach, NeurIPS 2021, oral presentation, December 2021. Online access
P. Marion, P. Nowak, F. Piccinno, Structured Context and High-Coverage Grammar for Conversational Question Answering over Knowledge Graphs, EMNLP 2021, November 2021. Online access
P. L’Ecuyer, P. Marion, M. Godin, and F. Puchhammer, A Tool for Custom Construction of QMC and RQMC Point Sets, Monte Carlo and Quasi-Monte Carlo Methods 2020, August 2020. Online access.
P. Marion, M. Godin, and P. L’Ecuyer, An algorithm to compute the t-value of a digital net and of its projections, Journal of Computational and Applied Mathematics, June 2020. Online access

Technical Reports

L. Phan et al., Humanity's Last Exam, arXiv:2501.14249, January 2025. Online access

PhD Thesis

P. Marion, Mathematics of deep learning: generalization, optimization, continuous-time models, November 2023. Download my thesis.