A deep dive into building a WaveNet-inspired MLP architecture, exploring hierarchical structures and improvements to the model.
August 28, 2025A deep dive into the inner workings of the backward pass in neural networks, implementing manual backpropagation at the tensor level to match PyTorch's autograd.
August 18, 2025A deep dive into Batch Normalization, a technique to stabilize and accelerate the training of deep neural networks.
July 22, 2025A character-level language model using a multi-layer perceptron to predict the next character in a sequence.
July 14, 2025An exploration of building a bigram character-level language model using both frequency counting and a neural network approach.
July 4, 2025A deep dive into building a simple autograd engine for neural networks, explaining automatic differentiation and backpropagation.
July 3, 2025A quick introduction to the Hugging Face Transformers library and how to get started with state-of-the-art NLP models.
June 8, 2025