Blog, Articles, and discussions

mmBERT: ModernBERT goes Multilingual

By September 9, 2025 • 96

Community Articles

view all

Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips

•

about 15 hours ago

• 7

AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models

and 4 others •

9 days ago

• 14

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 221

Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!

and 1 other •

13 days ago

• 58

How to Train an Antibody Developability Model

and 1 other •

8 days ago

• 9

🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎

and 1 other •

8 days ago

• 10

Finegrain Product Placement LoRA (experiment)

•

7 days ago

• 5

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 678

Code a simple RAG from scratch

•

Oct 29, 2024

• 199

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 70

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 5 others •

Jun 11

• 92

From GRPO to DAPO and GSPO: What, Why, and How

•

Aug 9

• 30

🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders

and 1 other •

25 days ago

• 13

Federated Learning using Hugging Face and Flower

By March 27, 2023 guest

Welcome PaddlePaddle to the Hugging Face Hub

By January 17, 2023 guest • 3

From GPT2 to Stable Diffusion: Hugging Face arrives to the Elixir community

By December 9, 2022

From PyTorch DDP to 🤗 Accelerate to 🤗 Trainer, mastery of distributed training with ease

By October 21, 2022 • 38

Optimization story: Bloom inference

By October 12, 2022 • 7

How 🤗 Accelerate runs very large models thanks to PyTorch

By September 27, 2022 • 14

Introducing Skops

By August 12, 2022 • 1

Introducing The World's Largest Open Multilingual Language Model: BLOOM

By July 12, 2022 • 5

Gradio 3.0 is Out!

By May 16, 2022

Welcome fastai to the Hugging Face Hub

By May 6, 2022 • 2

Introducing Decision Transformers on Hugging Face 🤗

By March 28, 2022 • 7

Welcome Stable-baselines3 to the Hugging Face Hub 🤗

By January 21, 2022

Gradio joins Hugging Face!

By December 21, 2021 • 6

Welcome spaCy to the 🤗 Hub

By July 13, 2021 • 1

Community Articles

RexBERT: Encoders for a brave new world of E-Commerce

and 1 other •

4 days ago

• 35

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

and 6 others •

1 day ago

• 12

Unleashing the Full Potential of ERNIE4.5 using FastDeploy

and 3 others •

6 days ago

• 10

How to Choose the Best Open Source LLM for Your Project in 2025

•

16 days ago

• 71

mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL

and 1 other •

14 days ago

• 21

SyGra: The One-Stop Framework for Building Data for LLMs and SLMs

and 3 others •

3 days ago

• 9

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 73

Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips

•

about 15 hours ago

• 7

AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models

and 4 others •

9 days ago

• 14

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 221

Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!

and 1 other •

13 days ago

• 58

How to Train an Antibody Developability Model

and 1 other •

8 days ago

• 9

🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎

and 1 other •

8 days ago

• 10

Finegrain Product Placement LoRA (experiment)

•

7 days ago

• 5

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 678

Code a simple RAG from scratch

•

Oct 29, 2024

• 199

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 70

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 5 others •

Jun 11

• 92

From GRPO to DAPO and GSPO: What, Why, and How

•

Aug 9

• 30

🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders

and 1 other •

25 days ago

• 13

View all