JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-1epoch500steps Text Generation • 8B • Updated Nov 25 • 5
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-1epoch500steps Text Generation • 8B • Updated Nov 25 • 5
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-ispass-400steps Text Generation • 8B • Updated Oct 7 • 11
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-ispass-400steps Text Generation • 8B • Updated Oct 7 • 11
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-200steps Text Generation • 8B • Updated Oct 6 • 8
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-200steps Text Generation • 8B • Updated Oct 6 • 8
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-200steps Text Generation • 8B • Updated Oct 6 • 6
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-200steps Text Generation • 8B • Updated Oct 6 • 6
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-100steps Text Generation • 8B • Updated Oct 6 • 9
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-100steps Text Generation • 8B • Updated Oct 6 • 9
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-100steps Text Generation • 8B • Updated Oct 6 • 8
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-100steps Text Generation • 8B • Updated Oct 6 • 8