The nonprofit Center for AI Safety (CAIS) and Scale AI, a company that provides data labeling and AI development services, have introduced a challenging new benchmark for cutting-edge AI systems.
Known as Humanity’s Last Exam, the benchmark comprises thousands of crowdsourced questions spanning mathematics, the humanities, and the natural sciences. The questions come in a variety of formats, some incorporating diagrams and images to make them harder to answer.
In a preliminary study, none of the prominent publicly available AI systems scored above 10% on Humanity’s Last Exam.
CAIS and Scale AI plan to open the benchmark to the research community so that researchers can examine it in greater depth and evaluate new AI models against it.