Not really.
If its name is Q*, then it seems likely that it’s a combination of Q-learning and A* search, which would suggest an approach similar to DeepMind’s AlphaZero rather than a transformer-based LLM.
In that context, getting it to solve high-school-level math questions is pretty nuts.
The details matter, though, and right now all the articles discussing it are missing them, so we’ll have to wait and see.