Software developer and Hunter.io co-founder Antoine Finkelstein recently put an increasingly capable class of AI tools to an unusual test, asking Claude Code to analyze his shoulder MRI and weigh its ...
GLM-5.2, Z.ai’s open-weight model, has reached 39% F1 on Semgrep’s IDOR benchmark, beating Anthropic’s Claude Code coding assistant in the prompt-only lane. Claude Code scored 37% F1 with Opus 4.6 and ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results