
Yinxu Pan

cppowboy
https://github.com/Cppowboy
  • pnynx3
  • Cppowboy

AI & ML interests

RL for LLM, Code&Math Reasoning, Function Calling, Code Interpreter, Vision-Language Pretraining

Recent Activity

upvoted a paper 1 day ago
A Survey of Reinforcement Learning for Large Reasoning Models
upvoted a paper 1 day ago
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
upvoted a paper 4 days ago
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Organizations

  • Diffusers Pipelines Library for Stable Diffusion
  • OpenBMB
  • XAgentCommunity

spaces 2

Viscpm Chat

Runtime error • Oct 4, 2023 • 1

Viscpm Paint

Runtime error • Jul 19, 2023

models 2

cppowboy/XAgentLLaMa-7B-preview

Text Generation • Updated Nov 21, 2023 • 9

cppowboy/XAgentLLaMa-34B-preview

Updated Nov 20, 2023

datasets 2

cppowboy/ktodata

Updated Jul 5, 2024 • 5

cppowboy/llava_zh

Viewer • Updated Jul 15, 2023 • 158k • 17 • 2