It can come up with a brand new sentence that hasn't been written before. Does that count?
Maybe you mean a solution to a textbook math/physics problem, it most likely would be able to solve that too with tool use.
Or maybe you mean solving something like the Riemann Hypothesis?
It can come up with a brand new sentence that hasn't been written before. Does that count?
Maybe you mean a solution to a textbook math/physics problem, it most likely would be able to solve that too with tool use.
Or maybe you mean solving something like the Riemann Hypothesis?