Good job

by Trilogix1 - opened 23 days ago

23 days ago

Hi, your work is interesting and it actually works. It is quite expensive in tokens and is not zero-shot, but it works, meaning that your method brings the model very close to competing with much bigger models, and even beating them.
Can you try it with the Instruct and Thinking Qwen models?
Specifically, Qwen3 Coder 30B A3B Instruct and Qwen3 Thinking 2507 (30B A3B).

Thanks in advance.

davidanugraha

rubricreward org 23 days ago (edited 23 days ago)

Thank you for your interest! The gpt-oss-20b models are actually experimental (not an official release), as there are still some issues with the model and training is quite expensive. We are still trying to figure this out; apologies for the confusion!

We do have Qwen3 thinking models (though not yet the newer Instruct/Thinking models; we are working on those), and we recommend you try them! :)

Trilogix1 changed discussion status to closed 23 days ago
