A character-level language model using a multi-layer perceptron to predict the next character in a sequence.
July 14, 2025An exploration of building a bigram character-level language model using both frequency counting and a neural network approach.
July 4, 2025A deep dive into building a simple autograd engine for neural networks, explaining automatic differentiation and backpropagation.
July 3, 2025A quick introduction to the Hugging Face Transformers library and how to get started with state-of-the-art NLP models.
June 8, 2025