Greg Schoeninger

Dec
07
How NOT to store unstructured machine learning datasets

Training data is typically the most valuable part of any machine learning project. As we converge on model architectures like
6 min read
Dec
07
🧼 SUDS - A Guide to Structuring Unstructured Data

At Oxen.ai we value high quality datasets. We have many years of experience training and evaluating models, and have
12 min read
Dec
01
Arxiv Dives - Vision Transformers (ViT)

With all of the hype around Transformers for natural language processing and text, the authors of this paper beg the
13 min read
Nov
26
Reading List For Andrej Karpathy’s “Intro to Large Language Models” Video

Andrej Karpathy recently released an hour-long talk on “The busy person’s intro to large language models” that had
11 min read
Nov
21
Arxiv Dives - A Mathematical Framework for Transformer Circuits - Part 2

Every Friday at Oxen.ai we host a paper club called "Arxiv Dives" to make us smarter Oxen
16 min read
Nov
11
Arxiv Dives - A Mathematical Framework for Transformer Circuits - Part 1

Every Friday at Oxen.ai we host a paper club called "Arxiv Dives" to make us smarter Oxen
13 min read
Nov
09
Data Version Control 101 with Oxen

This intro tutorial from Oxen.ai shows how Oxen can make versioning your data as easy as versioning your code.
12 min read
Nov
05
Arxiv Dive Manifesto

Every Friday the team at Oxen.ai gets together and goes over research papers, blog posts, or books that help
4 min read
Nov
04
Arxiv Dives - Attention Is All You Need

Every Friday at Oxen.ai we host a paper club called "Arxiv Dives" to make us smarter Oxen
17 min read
Oct
27
Arxiv Dives - How LoRA fine-tuning works

Every Friday at Oxen.ai we host a paper club called "Arxiv Dives" to make us smarter Oxen
10 min read