By Krea
thanks to the Thinking Machines team, we used Tinker to prototype our reward models and train the prompt expander via RL. for more information, read the full technical report on the data, architecture, and training behind Krea 2 👇