3 73 296

Kristoffer Rolf Deinoff

gatepoet

AI & ML interests

None yet

Recent Activity

liked a model about 9 hours ago

thesby/Qwen3-VL-8B-NSFW-Caption-V4

liked a model about 9 hours ago

ByteDance-Seed/M3-Agent-Control

liked a model about 9 hours ago

Danau5tin/Orca-Agent-v0.1

View all activity

Organizations

None yet

upvoted an article 7 days ago

Article

An Edge-First Generalized LLM LoRA Fine-Tuning Framework for Heterogeneous GPUs

9 days ago

•

upvoted a paper 7 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 9 days ago • 196

upvoted a paper 14 days ago

GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms

Paper • 2511.17592 • Published 24 days ago • 118

upvoted a paper 28 days ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9 • 129

upvoted a paper 30 days ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6 • 208

upvoted a paper about 1 month ago

Video Reasoning without Training

Paper • 2510.17045 • Published Oct 19 • 7

upvoted 2 papers 2 months ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26 • 136

Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

Paper • 2509.25161 • Published Sep 29 • 24

upvoted a paper 4 months ago

SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens

Paper • 2508.05305 • Published Aug 7 • 46

upvoted 3 papers 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 315

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Paper • 2507.07104 • Published Jul 9 • 45

High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning

Paper • 2507.05920 • Published Jul 8 • 11

upvoted a paper 6 months ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17 • 44

upvoted 2 papers 7 months ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 82

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Paper • 2505.02625 • Published May 5 • 22

upvoted a collection 7 months ago

DeepSeek-Prover

Collection

DeepSeek-Prover-Series • 10 items • Updated 14 days ago • 59

upvoted 2 papers 8 months ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15 • 63

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Paper • 2504.08837 • Published Apr 10 • 43

upvoted an article 9 months ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

Mar 26

•

176

upvoted a paper 9 months ago

UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning

Paper • 2503.21620 • Published Mar 27 • 62

Kristoffer Rolf Deinoff

AI & ML interests

Recent Activity

Organizations

gatepoet's activity

**An Edge-First Generalized LLM LoRA Fine-Tuning Framework for Heterogeneous GPUs**

Training and Finetuning Reranker Models with Sentence Transformers v4

An Edge-First Generalized LLM LoRA Fine-Tuning Framework for Heterogeneous GPUs