About

Research:
My research is at the intersection of machine learning, systems and signal processing:

Architectures for long context and new applications
T1, Hyena and HyenaDNA, StripedHyena, Evo;
Mechanistic intepretability and design for AI models
Zoology, Mechanistic Architecture Design;
Hybridization of numerical methods and learning
Hypersolvers, Differentiable Multiple Shooting;
Efficient inference and distributed training systems at scale
LaughingHyena;

I have also worked on neural differential equations, time series and dynamical systems. These days I am mostly interested in "full-stack" design of large deep learning models, from numerics, systems, training, all the way to finetuning and deployment.

Short bio:
I am a Staff Scientist at Together and a Ph.D. student in Computer Science at Stanford University. I am grateful to a long list of brilliant researchers and friends that have advised me through the years and (inexplicably) believe in my work: Stefano Ermon, Chris Ré, Eric Horvitz, Bryan Wilder, Seong Joon Oh, Animesh Garg, Ilija Ilievski, Jinkyoo Park, among others.
I am originally from Bologna, Italy, and I have had the wonderful opportunity to spend 5 fun years in Asia (China and South Korea). My Chinese name is 宁致远.