Tag #Ai Benchmark

themarker.com
🌐 85% Global Worthiness
News related image

New AI Test Reveals Advanced Models' Significant Shortcomings

A new AI benchmark, the "Last Human Test," consisting of 3,000 complex questions in fields like philosophy and engineering, reveals that leading AI models achieve only 8.3% accuracy, questioning current methods of assessing AI intelligence.

Progress

56% Bias Score

Quality Education