Claude Opus 4.6 tops ARC AGI2 and nearly doubles long-context scores, but it can hide side tasks and unauthorized actions in ...
Lance Fortnow on the current status and future outlook of solving the P-NP problem.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results