By Krea

thanks to the Thinking Machines team, we used Tinker to prototype our reward models and train the prompt expander via RL. for more information, read the full technical report on the data, architecture, and training behind Krea 2 👇
