Close Menu
The Politics
    What's Hot

    How Two Hardy North Dakotans Were Almost Thwarted by an Ice Storm

    January 24, 2026

    Beatriz González, Who Chronicled Colombia’s Turmoil in Paint, Dies at 93

    January 24, 2026

    Syrian and Kurdish Troops in Standoff as Truce Deadline Passes

    January 24, 2026
    Facebook X (Twitter) Instagram
    • Demos
    • Politics
    • Buy Now
    Facebook X (Twitter) Instagram
    The Politics
    Subscribe
    Saturday, January 24
    • Home
    • Breaking
    • World
      • Africa
      • Americas
      • Asia Pacific
      • Europe
    • Sports
    • Politics
    • Business
    • Entertainment
    • Health
    • Tech
    • Weather
    The Politics
    Home»Breaking»AI models need more standards and tests, say researchers
    Breaking

    AI models need more standards and tests, say researchers

    Justin M. LarsonBy Justin M. LarsonJune 22, 2025No Comments4 Mins Read
    Share Facebook Twitter Pinterest Copy Link LinkedIn Tumblr Email VKontakte Telegram
    Share
    Facebook Twitter Pinterest Email Copy Link


    As the usage of artificial intelligence — benign and adversarial — increases at breakneck speed, more cases of potentially harmful responses are being uncovered.

    Pixdeluxe | E+ | Getty Images

    As the usage of artificial intelligence — benign and adversarial — increases at breakneck speed, more cases of potentially harmful responses are being uncovered. These include hate speech, copyright infringements or sexual content.

    The emergence of these undesirable behaviors is compounded by a lack of regulations and insufficient testing of AI models, researchers told CNBC.

    Getting machine learning models to behave the way it was intended to do so is also a tall order, said Javier Rando, a researcher in AI.

    “The answer, after almost 15 years of research, is, no, we don’t know how to do this, and it doesn’t look like we are getting better,” Rando, who focuses on adversarial machine learning, told CNBC.

    However, there are some ways to evaluate risks in AI, such as red teaming. The practice involves individuals testing and probing artificial intelligence systems to uncover and identify any potential harm — a modus operandi common in cybersecurity circles.

    Shayne Longpre, a researcher in AI and policy and lead of the Data Provenance Initiative, noted that there are currently insufficient people working in red teams.

    While AI startups are now using first-party evaluators or contracted second parties to test their models, opening the testing to third parties such as normal users, journalists, researchers, and ethical hackers would lead to a more robust evaluation, according to a paper published by Longpre and researchers.

    “Some of the flaws in the systems that people were finding required lawyers, medical doctors to actually vet, actual scientists who are specialized subject matter experts to figure out if this was a flaw or not, because the common person probably couldn’t or wouldn’t have sufficient expertise,” Longpre said.

    Adopting standardized ‘AI flaw’ reports, incentives and ways to disseminate information on these ‘flaws’ in AI systems are some of the recommendations put forth in the paper.

    With this practice having been successfully adopted in other sectors such as software security, “we need that in AI now,” Longpre added.

    Marrying this user-centred practice with governance, policy and other tools would ensure a better understanding of the risks posed by AI tools and users, said Rando.

    We're pursing a path of AI development that's extremely harmful to a lot of people, says Karen Hao

    No longer a moonshot

    Project Moonshot is one such approach, combining technical solutions with policy mechanisms. Launched by Singapore’s Infocomm Media Development Authority, Project Moonshot is a large language model evaluation toolkit developed with industry players such as IBM and Boston-based DataRobot.

    The toolkit integrates benchmarking, red teaming and testing baselines. There is also an evaluation mechanism which allows AI startups to ensure that their models can be trusted and do no harm to users, Anup Kumar, head of client engineering for data and AI at IBM Asia Pacific, told CNBC.

    Evaluation is a continuous process that should be done both prior to and following the deployment of models, said Kumar, who noted that the response to the toolkit has been mixed.

    “A lot of startups took this as a platform because it was open source, and they started leveraging that. But I think, you know, we can do a lot more.”

    Moving forward, Project Moonshot aims to include customization for specific industry use cases and enable multilingual and multicultural red teaming.

    Higher standards

    Pierre Alquier, Professor of Statistics at the ESSEC Business School, Asia-Pacific, said that tech companies are currently rushing to release their latest AI models without proper evaluation.

    “When a pharmaceutical company designs a new drug, they need months of tests and very serious proof that it is useful and not harmful before they get approved by the government,” he noted, adding that a similar process is in place in the aviation sector.

    AI models need to meet a strict set of conditions before they are approved, Alquier added. A shift away from broad AI tools to developing ones that are designed for more specific tasks would make it easier to anticipate and control their misuse, said Alquier.

    “LLMs can do too many things, but they are not targeted at tasks that are specific enough,” he said. As a result, “the number of possible misuses is too big for the developers to anticipate all of them.”

    Such broad models make defining what counts as safe and secure difficult, according to a research that Rando was involved in.

    Tech companies should therefore avoid overclaiming that “their defenses are better than they are,” said Rando.



    Source link

    Related

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Telegram Copy Link
    Justin M. Larson
    • Website

    Related Posts

    Breaking

    Syria: UNICEF calls for safe access to children in Sweida as needs mount

    August 13, 2025
    Breaking

    Gaza Plan Stokes Tension Between Israel’s Military Chief and Government

    August 13, 2025
    Breaking

    Israel Hasn’t Prosecuted a Single Suspect for the Oct. 7 Attack

    August 13, 2025
    Breaking

    Ronaldo Moves From Unwedded Bliss to Engagement in Conservative Kingdom

    August 13, 2025
    Breaking

    Record starvation and malnutrition in Gaza; more West Bank displacement

    August 12, 2025
    Breaking

    Gaza: UNESCO condemns ‘unacceptable’ killing of journalists

    August 12, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    • Africa
    • Americas
    • Asia Pacific
    • Breaking
    • Business
    • Economy
    • Entertainment
    • Europe
    • Health
    • Politics
    • Politics
    • Sports
    • Tech
    • Top Featured
    • Trending Posts
    • Weather
    • World
    Economy News

    How Two Hardy North Dakotans Were Almost Thwarted by an Ice Storm

    Justin M. LarsonJanuary 24, 20260

    Jackie Gaddie and Craig Pietruszewski had been anticipating the trip of a lifetime, to Antarctica…

    Beatriz González, Who Chronicled Colombia’s Turmoil in Paint, Dies at 93

    January 24, 2026

    Syrian and Kurdish Troops in Standoff as Truce Deadline Passes

    January 24, 2026
    Top Trending

    How Two Hardy North Dakotans Were Almost Thwarted by an Ice Storm

    Justin M. LarsonJanuary 24, 20260

    Jackie Gaddie and Craig Pietruszewski had been anticipating the trip of a…

    Beatriz González, Who Chronicled Colombia’s Turmoil in Paint, Dies at 93

    Justin M. LarsonJanuary 24, 20260

    Often drawing from reproduced images or newspaper photos, she made work that…

    Syrian and Kurdish Troops in Standoff as Truce Deadline Passes

    Justin M. LarsonJanuary 24, 20260

    Syria’s government and Kurdish-led forces in the country’s northeast have clashed as…

    Subscribe to News

    Get the latest sports news from NewsSite about world, sports and politics.

    Advertisement
    Demo
    Editors Picks

    Review: Record Shares of Voters Turned Out for 2020 election

    January 11, 2021

    EU: ‘Addiction’ to Social Media Causing Conspiracy Theories

    January 11, 2021

    World’s Most Advanced Oil Rig Commissioned at ONGC Well

    January 11, 2021

    Melbourne: All Refugees Held in Hotel Detention to be Released

    January 11, 2021
    Latest Posts

    Review: Russia’s Putin Sets Out Conditions for Peace Talks with Ukraine

    January 20, 2021

    Review: Implications of San Francisco Govts’ Green-Light Nation’s First City-Run Public Bank

    January 20, 2021

    Queen Elizabeth the Last! Monarchy Faces Fresh Demand to be Axed

    January 20, 2021
    Advertisement
    Demo
    Editors Picks

    How Two Hardy North Dakotans Were Almost Thwarted by an Ice Storm

    January 24, 2026

    Beatriz González, Who Chronicled Colombia’s Turmoil in Paint, Dies at 93

    January 24, 2026

    Syrian and Kurdish Troops in Standoff as Truce Deadline Passes

    January 24, 2026

    Germany arrests suspected Hamas member over alleged attack plot

    January 24, 2026
    Latest Posts

    Review: Russia’s Putin Sets Out Conditions for Peace Talks with Ukraine

    January 20, 2021

    Review: Implications of San Francisco Govts’ Green-Light Nation’s First City-Run Public Bank

    January 20, 2021

    Queen Elizabeth the Last! Monarchy Faces Fresh Demand to be Axed

    January 20, 2021
    Advertisement
    Demo
    Facebook X (Twitter) Pinterest Vimeo WhatsApp TikTok Instagram

    News

    • World
    • US Politics
    • EU Politics
    • Business
    • Opinions
    • Connections
    • Science

    Company

    • Information
    • Advertising
    • Classified Ads
    • Contact Info
    • Do Not Sell Data
    • GDPR Policy
    • Media Kits

    Services

    • Subscriptions
    • Customer Support
    • Bulk Packages
    • Newsletters
    • Sponsored News
    • Work With Us

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    © 2026 The Politics Designed by The Politics.
    • Privacy Policy
    • Terms
    • Accessibility

    Type above and press Enter to search. Press Esc to cancel.