Oxen.ai (Page 2)

Mar

05

Training a Rust 1.5B Coder LM with Reinforcement Learning (GRPO)

Group Relative Policy Optimization (GRPO) has proven to be a useful algorithm for training LLMs to reason and improve on

Mar 5, 2025

17 min read

Feb

11

Why GRPO is Important and How it Works

Last week on Arxiv Dives we dug into research behind DeepSeek-R1, and uncovered that one of the techniques they use

Feb 11, 2025

12 min read

Feb

05

🧠 GRPO VRAM Requirements For the GPU Poor

Since the release of DeepSeek-R1, Group Relative Policy Optimization (GRPO) has become the talk of the town for Reinforcement Learning

Feb 5, 2025

9 min read

Feb

04

How DeepSeek R1, GRPO, and Previous DeepSeek Models Work

In January 2025, DeepSeek took a shot directly at OpenAI by releasing a suite of models that “Rival OpenAI’s

Feb 4, 2025

15 min read

Jan

29

No Hype DeepSeek-R1 Reading List

DeepSeek-R1 is a big step forward in the open model ecosystem for AI with their latest model competing with OpenAI&

Jan 29, 2025

27 min read

Jan

27

Oxen v0.25.0 Migration

Today we released oxen v0.25.0 🎉 which comes with a few performance optimizations, including how we traverse the Merkle

Jan 27, 2025

3 min read

Jan

27

🌲 Merkle Tree VNodes

In this post we peel back some of the layers of Oxen.ai’s Merkle Tree and show how we

Jan 27, 2025

8 min read

Jan

27

🌲 Merkle Tree 101

Intro Merkle Trees are important data structures for ensuring integrity, deduplication, and verification of data at scale. They are used

Jan 27, 2025

9 min read

Jan

21

arXiv Dive: RAGAS - Retrieval Augmented Generation Assessment

RAGAS is an evaluation framework for Retrieval Augmented Generation (RAG). A paper released by Exploding Gradients, AMPLYFI, and CardiffNLP. RAGAS

Jan 21, 2025

13 min read

Dec

26

The Best AI Data Version Control Tools [2025]

Data is often seen as static. It's common to just dump your data into S3 buckets in tarballs

Dec 26, 2024

6 min read