My Tiny Feed

themarker.com

🌐 85% Global Worthiness

Feb 3, 19:12

New AI Test Reveals Advanced Models' Significant Shortcomings

A new AI benchmark, the "Last Human Test," comprises 3,000 complex questions in fields such as philosophy and engineering. Leading AI models achieve only 8.3% accuracy on it, a result that calls into question current methods of assessing AI intelligence.

56% Bias Score

Article

Quality Education

Tag #AI Benchmark

How was the "Last Human Test" designed, and what specific knowledge domains are evaluated to assess AI capabilities?
