Peng

pennlio

pennlio111

AI & ML interests

None yet

Recent Activity

liked a model 14 days ago

InstantX/CSGO

Text-to-Image • Updated Sep 18, 2024 • 272 • 38

liked a model 17 days ago

qth/DEADiff

Updated Apr 3, 2024 • 9

upvoted a paper 4 months ago

Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models

Paper • 2508.21365 • Published Aug 29 • 29

upvoted 2 papers 7 months ago

LLark: A Multimodal Foundation Model for Music

Paper • 2310.07160 • Published Oct 11, 2023 • 2

TALKPLAY: Multimodal Music Recommendation with Large Language Models

Paper • 2502.13713 • Published Feb 19 • 4

liked 2 models over 1 year ago

xai-org/grok-1

Text Generation • Updated Mar 28, 2024 • 1.4k • 2.37k

gradientai/Llama-3-8B-Instruct-Gradient-1048k

Text Generation • 8B • Updated Oct 29, 2024 • 8.87k • 679

liked a dataset over 1 year ago

m-a-p/COIG-CQIA

Viewer • Updated Apr 18, 2024 • 44.7k • 7.27k • 660

upvoted an article over 1 year ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Article • Dec 9, 2022 • 385

liked 2 models over 1 year ago

meta-llama/Meta-Llama-3-8B

Text Generation • 8B • Updated Sep 27, 2024 • 2.24M • 6.41k

unsloth/llama-3-8b-bnb-4bit

Text Generation • 8B • Updated Jan 7 • 57.4k • 202

liked 2 models about 2 years ago

stabilityai/stable-diffusion-x4-upscaler

Updated Jul 5, 2023 • 47.2k • 718

stabilityai/stable-diffusion-xl-base-1.0

Text-to-Image • Updated Oct 30, 2023 • 1.92M • 7.26k

liked a model over 2 years ago

Vision-CAIR/MiniGPT-4

Updated Apr 19, 2023 • 428

liked a dataset over 2 years ago

fka/awesome-chatgpt-prompts

Viewer • Updated about 15 hours ago • 664 • 23.2k • 9.52k

updated a model over 2 years ago

pennlio/test

Updated May 22, 2023
