Jake De

goforit123

AI & ML interests

None yet

Recent Activity

updated a collection about 1 month ago

LLM

updated a model about 1 month ago

goforit123/poca-SoccerTwos

Organizations

None yet

Collections 3

models 15

datasets 0

None public yet

Collections 3

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Large Language Models for Scientific Idea Generation: A Creativity-Centered Survey

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

One RL to See Them All: Visual Triple Unified Reinforcement Learning

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Performance Trade-offs of Optimizing Small Language Models for E-Commerce

models 15

goforit123/poca-SoccerTwos

goforit123/rl_course_vizdoom_health_gathering_supreme

goforit123/custom-ppo-LunarLander-v2

goforit123/goforit123

goforit123/ppo-Pyramids

goforit123/Pixelcopter-PLE-v0

goforit123/ppo-SnowballTarget

goforit123/CartPole-v1

goforit123/dqn-SpaceInvadersNoFrameskip-v4

goforit123/a2c-PandaReachDense-v3
