• Microsoft’s new ASSERT framework lets developers test AI behavior using plain English
  • Gold Consolidation Narrows as Bearish Technical Signal Emerges: Scotiabank
  • Trump Signs Executive Order Granting Early Government Access to Advanced AI Models
  • USD/CHF Price Forecast: Bulls Clear 50-Day SMA, Set Sights on 0.7900
  • AMP Price Prediction 2025, 2026 – 2030: Can the Flexa Collateral Token Reach $0.050?
2026-06-03
Coins by Cryptorank
  • Crypto News
  • AI News
  • Forex News
  • Sponsored
  • Press Release
  • Media Kit
  • Advertisement
  • More
    • About Us
    • Learn
    • Exclusive Article
    • Reviews
    • Events
    • Contact Us
    • Privacy Policy
  • Crypto News
  • AI News
  • Forex News
  • Sponsored
  • Press Release
  • Media Kit
  • Advertisement
  • More
    • About Us
    • Learn
    • Exclusive Article
    • Reviews
    • Events
    • Contact Us
    • Privacy Policy
Skip to content
Home AI News Microsoft’s new ASSERT framework lets developers test AI behavior using plain English
AI News

Microsoft’s new ASSERT framework lets developers test AI behavior using plain English

  • by Keshav Aggarwal
  • 2026-06-03
  • 0 Comments
  • 2 minutes read
  • 0 Views
  • 17 seconds ago
Facebook Twitter Pinterest Whatsapp
Developer using Microsoft ASSERT to test AI behavior with natural language descriptions on a monitor

Microsoft has released a new open-source framework called ASSERT that aims to simplify how developers test whether their AI systems behave as intended. Instead of writing complex code for each evaluation scenario, the tool allows engineers to describe desired behaviors in plain English, and then automatically generates test cases, runs them, and scores the results.

What ASSERT does differently

ASSERT, short for Adaptive Spec-driven Scoring for Evaluation and Regression Testing, is designed to fill a gap that broader, more general AI benchmarks cannot address. While industry-wide evaluations like Stanford’s HELM or MLCommons’ AILuminate measure model capabilities at scale, they often miss application-specific nuances. For example, a document research AI agent might need to follow company-specific policies about emailing external contacts or sharing confidential data with executives. ASSERT lets developers define those rules in natural language, and the framework generates targeted tests to check compliance.

Sarah Bird, chief product officer of Responsible AI at Microsoft, said that evaluations are critical for making informed decisions about AI deployment. “If you don’t understand the behavior of the AI system, it’s really hard to know if it’s meeting your organization’s bar,” Bird said. She noted that ASSERT can be used during development, after deployment, and for continuous monitoring, making it a practical tool for production environments.

How the framework works

The framework takes a plain-language description of expected behavior and policies, then converts it into a structured set of acceptable and unacceptable actions. From there, it generates problem scenarios and test cases, runs them against the target system, and scores the results. Developers can also inspect the intermediate steps and tool calls the AI system made, which helps pinpoint where failures occur.

For instance, a developer might specify that an AI assistant should not send emails to people outside the company, should limit confidential information to C-level executives, and should provide concise summaries that account for prior context. ASSERT would then create test cases to verify each of those rules on an ongoing basis.

Why this matters for AI safety

The release comes at a time when the AI industry is increasingly focused on repeatable testing and regression checks. As models become more capable, ensuring they behave reliably in specific contexts has become a priority. Tools like ASSERT help bridge the gap between general model evaluation and the real-world constraints of a product or service. This is especially relevant for enterprises deploying AI in regulated industries, where compliance and safety are non-negotiable.

Conclusion

Microsoft’s ASSERT framework represents a practical step toward making AI behavior testing more accessible and thorough. By allowing developers to define expectations in natural language and automating the evaluation process, it addresses a growing need for application-specific testing that goes beyond generic benchmarks. As AI adoption accelerates, tools that simplify safety and compliance checks will become increasingly valuable.

FAQs

Q1: What does ASSERT stand for?
A: ASSERT stands for Adaptive Spec-driven Scoring for Evaluation and Regression Testing. It is an open-source framework from Microsoft.

Q2: Can ASSERT be used for continuous monitoring?
A: Yes, Microsoft says ASSERT can be used during development, after deployment, and for continuous monitoring of AI systems.

Q3: How does ASSERT differ from other AI evaluation tools?
A: ASSERT focuses on application-specific behavior testing using natural language descriptions, while broader benchmarks like HELM or AILuminate measure general model capabilities. ASSERT fills the gap for context-specific, policy-driven evaluations.

Disclaimer: The information provided is not trading advice, Bitcoinworld.co.in holds no liability for any investments made based on the information provided on this page. We strongly recommend independent research and/or consultation with a qualified professional before making any investment decisions.

Tags:

AIevaluationMicrosoftopen source.testing

Share This Post:

Facebook Twitter Pinterest Whatsapp
Avatar photo

Keshav Aggarwal

Co- Founder
Keshav Aggarwal is the Co-Founder & CEO of BitcoinWorld, a Google News - indexed publication covering crypto, AI, and forex markets since 2020. A blockchain investor and trader with over six years in the digital-asset space, he built one of India's most active crypto investor communities and has guided thousands of retail participants through their first investments in the asset class. At BitcoinWorld, he sets editorial direction across the newsroom and reports on the business of crypto, AI, and Web3 - tracking the funding rounds, product launches, and regulatory shifts shaping the future of finance and frontier technology.
Next Post

Gold Consolidation Narrows as Bearish Technical Signal Emerges: Scotiabank

Categories

92

AI News

Crypto News

Bitcoin Treasury Ambition: The Blockchain Group Seeks Staggering €10 Billion

Events

97

Forex News

33

Learn

Press Release

Reviews

Google NewsGoogle News TwitterTwitter LinkedinLinkedin coinmarketcapcoinmarketcap BinanceBinance YouTubeYouTubes

Copyright © 2026 BitcoinWorld | Powered by BitcoinWorld