
A New Benchmark for the Risks of AI

MLCommons, a nonprofit that helps companies measure the performance of their artificial intelligence systems, is launching a new benchmark to gauge AI’s bad side too.

The new benchmark, called AILuminate, assesses the responses of large language models to more than 12,000 test prompts in 12 categories including inciting violent crime, child sexual exploitation, hate speech, promoting self-harm, and intellectual property infringement.

Models are given a score of “poor,” “fair,” “good,” “very good,” or “excellent,” depending on how they perform. The prompts used to test the models are kept secret to prevent them from ending up as training data that would allow a model to ace the test.
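The article does not detail how AILuminate rolls individual prompt results up into those five grades, so the following is only a minimal illustrative sketch of one way per-prompt safety judgments could be aggregated into grade tiers. The category names, thresholds, and function names are hypothetical and are not MLCommons' actual methodology.

```python
# Illustrative sketch only: the tier cutoffs and category names below are
# invented placeholders, not AILuminate's published scoring method.

from dataclasses import dataclass

# Hypothetical hazard categories (a subset of the 12 mentioned above).
CATEGORIES = ["violent_crime", "child_sexual_exploitation", "hate_speech",
              "self_harm", "ip_infringement"]

@dataclass
class PromptResult:
    category: str
    is_safe: bool  # whether an evaluator judged the model's response safe

def grade(results: list[PromptResult]) -> str:
    """Map the share of safe responses to a five-tier grade.

    The thresholds here are assumptions for illustration; the benchmark's
    real grading is not described in this article.
    """
    safe_rate = sum(r.is_safe for r in results) / len(results)
    if safe_rate >= 0.999:
        return "excellent"
    if safe_rate >= 0.99:
        return "very good"
    if safe_rate >= 0.95:
        return "good"
    if safe_rate >= 0.90:
        return "fair"
    return "poor"

# Example: 100 hypothetical evaluations, 97 judged safe -> "good".
sample = [PromptResult("hate_speech", i >= 3) for i in range(100)]
print(grade(sample))
```

Keeping the test prompts private, as MLCommons does, matters for any scheme like this: if the held-out prompts leaked into a model's training data, a high safe-response rate would reflect memorization rather than genuinely safer behavior.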

Peter Mattson, founder and president of MLCommons and a senior staff engineer at Google, says that measuring the potential harms of AI models is technically difficult, leading to inconsistencies across the industry. “AI is a really young technology, and AI testing is a really young discipline,” he says. “Improving safety benefits society; it also benefits the market.”

Reliable, independent ways of measuring AI risks may become more relevant under the next US administration. Donald Trump has promised to repeal President Biden's AI executive order, which introduced measures aimed at ensuring companies use AI responsibly and established a new AI Safety Institute to test powerful models.

The effort could also provide more of an international perspective on AI harms. MLCommons counts a number of international firms, including the Chinese companies Huawei and Alibaba, among its member organizations. If these companies all used the new benchmark, it would provide a way to compare AI safety in the US, China, and elsewhere.

Some large US AI providers have already used AILuminate to test their models. Anthropic’s Claude model, Google’s smaller model Gemma, and a model from Microsoft called Phi all scored “very good” in testing. OpenAI’s GPT-4o and Meta’s largest Llama model both scored “good.” The only model to score “poor” was OLMo from the Allen Institute for AI, although Mattson notes that this is a research offering not designed with safety in mind.

“Overall, it’s good to see scientific rigor in the AI evaluation processes,” says Rumman Chowdhury, CEO of Humane Intelligence, a nonprofit that specializes in testing, or red-teaming, AI models for misbehavior. “We need best practices and inclusive methods of measurement to determine whether AI models are performing the way we expect them to.”

