Blog, Articles, and discussions

`LeRobotDataset`: Bringing large-scale datasets to lerobot

By September 16, 2025 • 28

Community Articles

view all

Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips

•

1 day ago

• 7

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 222

AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models

and 4 others •

9 days ago

• 14

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

and 5 others •

15 days ago

• 99

🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎

and 1 other •

8 days ago

• 10

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 678

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

•

Feb 11

• 68

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 70

From GRPO to DAPO and GSPO: What, Why, and How

•

Aug 9

• 30

🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders

and 1 other •

25 days ago

• 13

Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!

and 1 other •

14 days ago

• 58

How to Train an Antibody Developability Model

and 1 other •

8 days ago

• 9

PrediBench: Testing AI models on prediction markets

and 1 other •

1 day ago

• 4

Asynchronous Robot Inference: Decoupling Action Prediction and Execution

By July 10, 2025 • 42

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

By June 3, 2025 • 254

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

By May 11, 2025 • 78

LeRobot goes to driving school: World’s largest open-source self-driving dataset

By March 11, 2025 • 99

Community Articles

RexBERT: Encoders for a brave new world of E-Commerce

and 1 other •

5 days ago

• 40

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

and 6 others •

2 days ago

• 14

Unleashing the Full Potential of ERNIE4.5 using FastDeploy

and 3 others •

7 days ago

• 10

SyGra: The One-Stop Framework for Building Data for LLMs and SLMs

and 3 others •

4 days ago

• 9

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 73

How to Choose the Best Open Source LLM for Your Project in 2025

•

16 days ago

• 71

mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL

and 1 other •

14 days ago

• 21

Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips

•

1 day ago

• 7

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 222

AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models

and 4 others •

9 days ago

• 14

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

and 5 others •

15 days ago

• 99

🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎

and 1 other •

8 days ago

• 10

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 678

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

•

Feb 11

• 68

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 70

From GRPO to DAPO and GSPO: What, Why, and How

•

Aug 9

• 30

🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders

and 1 other •

25 days ago

• 13

Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!

and 1 other •

14 days ago

• 58

How to Train an Antibody Developability Model

and 1 other •

8 days ago

• 9

PrediBench: Testing AI models on prediction markets

and 1 other •

1 day ago

• 4

View all