-
Teaching LLMs to see
A medium article I wrote on building multimodal llms from scratch in colab
Posted on 2024-10-17
read more
-
Fast byte pair encoding in rust
An O(log n) optimization I found to speed up tokenizer throughput
Posted on 2024-05-10
read more
-
Training LLaVA on GPT-4 annotated data
Boosting OCR performances of open-source vision language models with synthetic datasets
Posted on 2024-03-20
read more
-
Sparse variational autoencoders
Playing around with bayesian priors and deep learning
Posted on 2023-04-10
read more