News, Trends, and Insights for IT & Managed Services Providers
News, Trends, and Insights for IT & Managed Services Providers
person in blue shirt writing on white paper

Artificial Analysis has revamped its AI Intelligence Index, replacing outdated benchmarks with evaluations based on real-world tasks. This significant overhaul includes ten assessments across various categories, such as coding and scientific reasoning, aiming to better reflect the capabilities of AI systems in practical applications. The new benchmark methodology has recalibrated scoring, with top models now scoring below 50 points, down from an average of 73, a move designed to restore competitive differentiation among AI technologies. Notably, the introduction of GDPval-AA measures AI productivity across 44 occupations, shifting the focus from abstract problem-solving to tangible workplace outputs. Current leading models, including OpenAI’s GPT-5.2 and Anthropic’s Claude Opus 4.5, demonstrate this shift, as they are evaluated on their ability to produce relevant work deliverables rather than simply passing traditional tests. Artificial Analysis emphasizes that its independent evaluations are standardized to ensure fairness and applicability, highlighting a crucial transition in how AI’s effectiveness is assessed. This reflects a growing recognition in the industry that practical, economically valuable performance is now the priority for enterprises considering AI adoption.

Why do we care?

This is one of the most important AI stories you probably won’t hear hyped.

For the last two years, AI benchmarks have told a comforting lie: that capability was racing ahead and adoption just needed courage. Artificial Analysis just pulled the curtain back. When you measure AI by the work it can actually produce—across real jobs—the scores drop fast.

That’s not a failure. That’s honesty.

Here’s where MSPs get into trouble. If you sell AI based on how “advanced” a model is, you inherit disappointment when it doesn’t replace labor the way the chart implied. Clients don’t care that a model aced a reasoning test. They care whether it drafts contracts correctly, triages tickets reliably, or produces analysis they can trust.

GDPval-style benchmarking finally aligns AI evaluation with business reality. But it also raises the bar on you. You now have to choose where AI fits, where it doesn’t, and where the risk of being wrong outweighs the productivity gain.  That choice needs to show up in scoped use cases, exclusion clauses, and fees tied to outcomes—not model names.

The harmful behavior is continuing to pitch AI as a general upgrade instead of a targeted tool. That’s how you burn credibility, margins, and client trust.

This matters because enterprises are done experimenting. They want proof of value. Better benchmarks won’t save bad strategy—but they remove the excuse for it.

Choose your upgrade:

Get the full benefits of Business of Tech Plus

Insider Access

$12/month

Perfect for MSPs and ITSPs that want full interviews, early access, and ad-free listening

  • Programmatic Ad-free private podcast feedSame show, little interruptions
  • Channel Chatter previews1–2 topics with light insights
  • Early access to interview episodesHear it days before public release
  • Monthly Insider BriefTighter analysis you can share internally
  • Extra audio segmentsCut interviews, behind-the-scenes commentary, quick competitive notes
  • Become an Insider for $12/month

    Leadership Access

    $149/month

    Perfect for MSPs and Vendors that run a team and need the extended tactics, executive summaries, and weekly alignment brief

  • All Insider Access benefits plus . . .
  • Invite your teamIncludes access for 5 team members with option to add more
  • Vendor Strategy BriefsThe entire library, plus new analysis every month
  • Channel ChatterAll topics, full insights, complete vendor discussion + sentiment list
  • Quarterly State of the Channel Briefing
  • Monthly AMA submission priorityAsk Dave direct questions, and skip the line
  • Get the Leadership Edge for $149/month

    Vendor Partner

    $500/month

    Perfect for channel companies or vendors looking to deepen their engagement with the show.

  • All Leadership Access benefits plus . . .
  • Get highlighted as a show sponsor You'll get placement in the show notes, throughout the website, and on our dedicated sponsors page.
  • Enjoy regular shout outs You'll be featured in a rotating format during the show
  • Become a show sponsor for $500/month

    Search all stories