Recent Interests: LLM, Deep Learning, PyTorch
A student's breakdown of Adaptive Branching Tree Search for smarter AI inference.
Exploring how diffusion models can generate language without autoregression.
What GitHub Copilot taught me about the future of consciousness.
Exploring the geometric intuition behind gradient descent.
A deep dive into Sparse Mixture of Experts models and why training big, then pruning, actually makes sense.
An introduction to this blog and what you can expect to find here.