LLM Inference: From Black Box to Production
A ground-up explanation of how LLM inference works, from a black box to production optimizations: tokenization, attention, the KV cache, memory bottlenecks, batching, PagedAttention, quantization, and more. No code, just diagrams, with TinyLlama as our running example.