Greg Schoeninger

Greg Schoeninger

Dec
20
Practical ML Dive - How to train Mamba for Question Answering

Practical ML Dive - How to train Mamba for Question Answering

What is Mamba šŸ? There is a lot of hype about Mamba being a fast alternative to the Transformer architecture. The
22 min read
Dec
15
Mamba: Linear-Time Sequence Modeling with Selective State Spaces - Arxiv Dives

Mamba: Linear-Time Sequence Modeling with Selective State Spaces - Arxiv Dives

What is Mamba šŸ? Mamba at it's core is a recurrent neural network architecture, that outperforms Transformers with faster
15 min read
Dec
13
Practical ML Dive - How to customize a Vision Transformer on your own data

Practical ML Dive - How to customize a Vision Transformer on your own data

Welcome to Practical ML Dives, a series spin off of Arxiv Dives. In Arxiv Dives, we cover state of the
20 min read
Dec
08
Arxiv Dives - Zero-shot Image Classification with CLIP

Arxiv Dives - Zero-shot Image Classification with CLIP

CLIP explores the efficacy of learning image representations from scratch with 400 million image-text pairs, showcasing zero-shot transfer capabilities across
14 min read
Dec
07
How NOT to store unstructured machine learning datasets

How NOT to store unstructured machine learning datasets

Training data is typically the most valuable part of any machine learning project. As we converge on model architectures like
6 min read
Dec
07
šŸ§¼ SUDS - A Guide to Structuring Unstructured Data

šŸ§¼ SUDS - A Guide to Structuring Unstructured Data

At Oxen.ai we value high quality datasets. We have many years of experience training and evaluating models, and have
12 min read
Dec
01
Arxiv Dives - Vision Transformers (ViT)

Arxiv Dives - Vision Transformers (ViT)

With all of the hype around Transformers for natural language processing and text, the authors of this paper beg the
13 min read
Nov
26
Reading List For Andrej Karpathyā€™s ā€œIntro to Large Language Modelsā€ Video

Reading List For Andrej Karpathyā€™s ā€œIntro to Large Language Modelsā€ Video

Andrej Karpathy recently released an hour long talk on ā€œThe busy personā€™s intro to large language modelsā€ that had
11 min read
Nov
21
Arxiv Dives - A Mathematical Framework for Transformer Circuits - Part 2

Arxiv Dives - A Mathematical Framework for Transformer Circuits - Part 2

Every Friday at Oxen.ai we host a paper club called "Arxiv Dives" to make us smarter Oxen
16 min read
Nov
11
Arxiv Dives - A Mathematical Framework for Transformer Circuits - Part 1

Arxiv Dives - A Mathematical Framework for Transformer Circuits - Part 1

Every Friday at Oxen.ai we host a paper club called "Arxiv Dives" to make us smarter Oxen
13 min read