Yinxu Pan's picture

Yinxu Pan

cppowboy

·

https://github.com/Cppowboy

AI & ML interests

RL for LLM, Code&Math Reasoning, Function Calling, Code Interpreter, Vision-Language Pretraining

Recent Activity

upvoted a paper 1 day ago

A Survey of Reinforcement Learning for Large Reasoning Models

upvoted a paper 1 day ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

upvoted a paper 4 days ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

View all activity

Organizations

spaces 2

Viscpm Chat

Viscpm Paint

models 2

cppowboy/XAgentLLaMa-7B-preview

Text Generation • Updated Nov 21, 2023 • 9

cppowboy/XAgentLLaMa-34B-preview

Updated Nov 20, 2023

datasets 2

cppowboy/ktodata

Updated Jul 5, 2024 • 5

cppowboy/llava_zh

Viewer • Updated Jul 15, 2023 • 158k • 17 • 2