Blog, Articles, and discussions

Jupyter Agents: training LLMs to reason with notebooks

By September 10, 2025 • 44

Community Articles

Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips

•

3 days ago

• 7

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 74

mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL

and 1 other •

16 days ago

• 22

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 679

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 223

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 71

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

and 5 others •

17 days ago

• 100

Ground-up efforts to build large datasets for effective and accurate translation of Modi-Script documents into modern Marathi

and 1 other •

2 days ago

• 5

Code a simple RAG from scratch

•

Oct 29, 2024

• 201

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

•

Feb 11

• 68

AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models

and 4 others •

11 days ago

• 14

Unleashing the Full Potential of ERNIE4.5 using FastDeploy

and 3 others •

8 days ago

• 11

PrediBench: Testing AI models on prediction markets

and 1 other •

3 days ago

• 4

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 5 others •

Jun 11

• 93

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

•

Jun 26

• 46

mmBERT: ModernBERT goes Multilingual

By September 9, 2025 • 99

MCP for Research: How to Connect AI to Research Tools

By August 18, 2025 • 56

TextQuests: How Good are LLMs at Text-Based Video Games?

By August 12, 2025 guest • 35

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

By July 29, 2025 • 179

Back to The Future: Evaluating AI Agents on Predicting Future Events

By July 17, 2025 guest • 41

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

By July 16, 2025 • 69

SmolLM3: smol, multilingual, long-context reasoner

By July 8, 2025 • 685

Efficient MultiModal Data Pipeline

By July 8, 2025 • 56

Gemma 3n fully available in the open-source ecosystem!

By June 26, 2025 • 118

nanoVLM: The simplest repository to train your VLM in pure PyTorch

By May 21, 2025 • 218

Vision Language Models (Better, Faster, Stronger)

By May 12, 2025 • 533

Introducing HELMET

By April 16, 2025 • 37

Arabic Leaderboards: Introducing Arabic Instruction Following, Updating AraGen, and More

By April 8, 2025 guest • 19

Open R1: How to use OlympicCoder locally for coding?

By March 20, 2025 • 63

Community Articles

There is no such thing as a tokenizer-free lunch

•

3 days ago

• 52

RexBERT: Encoders for a brave new world of E-Commerce

and 1 other •

7 days ago

• 42

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

and 6 others •

4 days ago

• 20

SyGra: The One-Stop Framework for Building Data for LLMs and SLMs

and 3 others •

5 days ago

• 9

Model Quality: Hugging Face Is All You Need

•

1 day ago

• 9
