Song Jiang's picture

1 2 4

Song Jiang

songjiang

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

upvoted a paper 2 months ago

Large Reasoning Models Learn Better Alignment from Flawed Thinking

authored a paper 9 months ago

SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

Paper • 2510.09541 • Published Oct 10 • 14

upvoted a paper 2 months ago

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Paper • 2510.00938 • Published Oct 1 • 58

authored a paper 9 months ago

SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks

Paper • 2503.15478 • Published Mar 19 • 13

liked a model almost 2 years ago

liuhaotian/llava-v1.5-mlp2x-336px-pretrain-vicuna-7b-v1.5

Text Generation • Updated Oct 5, 2023 • 76 • 22

authored a paper over 2 years ago

LLM-Rec: Personalized Recommendation via Prompting Large Language Models

Paper • 2307.15780 • Published Jul 24, 2023 • 27

liked 3 models over 2 years ago

meta-llama/Llama-2-70b-hf

Text Generation • 69B • Updated Apr 17, 2024 • 21.5k • • 853

allenai/tulu-7b

Text Generation • Updated Jun 20, 2023 • 78 • 9

allenai/tulu-65b

Text Generation • Updated Jun 29, 2023 • 74 • 21