21 place 0

881 Frontier models are failing one in three production attempts — and getting harder to audit

VentureBeat
VentureBeat · 04/15/2026 15:35 EDT

AI agents are now embedded in real enterprise workflows, and they're still failing roughly one in three attempts on structured benchmarks. That gap between capability and reliability is the defining operational challenge for IT leaders in 2026, according to Stanford HAI's ninth annual AI Index report.This uneven, unpredictable performance is what the AI Index calls the "jagged frontier," a term coined by AI researcher Ethan Mollick to describe the boundary where AI excels and then suddenly fails.“AI models.

To see detailed statistics for the news please log in »

Read the original

Add your comment
You must be logged in with Facebook to read and write comments.

A newsletter a day!

You may get 10 most important news around midday in daily newsletter. Press the button and we will send you the most important news only, no spam attached.

or register

LIKE us on Facebook so you won't miss the most important news of the day!

News from the same source
VentureBeat VentureBeat
Silicon Valley
George Avalos @ Silicon Valley 1 place · 02/07/2106 01:28 EDT

Newark apartment complex bought for much less than prior value

An East Bay apartment complex has been bought at a price that's well below its prior value. Read more

0

🔮
26.05.2026 ♎︎ Horoscope for the Libra Zodiac Sign Today: Today will bring you interesting impressions and unexpected... Read more ›
Silicon Valley
George Avalos @ Silicon Valley 2 place · 02/07/2106 01:28 EDT

PG&E buys San Jose building to bolster South Bay operations

A PG&E Corp. unit has bought a San Jose building in a move to bolster the utility's South Bay operations. Read more

0

Skift
Dennis Schaal @ Skift 1 place · today 09:11 EDT

$11.4 Billion and Zero Churn: What SpaceX Said About Starlink’s Travel Industry Grip

The competitors to Musk's Starlink will eventually up their game. It's never healthy for customers and end-users to be dependent on a single vendor, regardless of the Starlink's competitive advantage at this moment. Read more

0 newcommer

SlashGear
SlashGear 1 place · today 09:10 EDT

The 2026 Porsche Cayenne Coupe Electric Is An Impressive All-Rounder That's Missing One Thing

Porsche buyers love SUV coupes, but can the Cayenne Coupe Electric deliver soul along with its undeniable EV speed? Read more

0 newcommer

Engadget
Engadget 1 place · today 09:10 EDT

GoPro Mission 1 Pro review: The best action cam video quality comes at a high price

GoPro's Mission 1 Pro action cam has a big 1-inch sensor that offers sharp, color accurate 8K 60 fps and 4K 240 fps video. Read more

0 newcommer

Eurogamer.net
Robert Purchese @ Eurogamer.net 1 place · today 09:08 EDT

Nearly 500 Marvel Rivals players have been named, shamed and banned in "a targeted purge" by NetEase following their exploits after a recent game update. Read more Read more

0 newcommer

Habr
O-Rogova (MIND Software) @ Habr 1 place · today 09:08 EDT

Архитектурный тупик корпоративного хранения: почему смена модели не снимает ограничений и что с этим делать

История корпоративных систем хранения данных – это путь от жестко специализированных «черных ящиков» к гибким программным платформам. Каждый шаг этой эволюции решал проблемы прошлого, но неизбежно порождал новые противоречия. Сегодня, столкнувшись с радикальным усложнением инфраструктур (от классических ЦОД до частных облаков и объектов КИИ), – отрасль оказалась в точке, где наследие прошлых архитектурных решений стало главным ограничением для будущего. Современная корпоративная инфраструктура перестала быт Read more

0 newcommer

Business Insider
Tom Carter @ Business Insider 1 place · today 09:04 EDT

Sam Altman says he's 'delighted to be wrong' about AI destroying white-collar jobs

The OpenAI boss told a tech conference that his predictions about AI wiping out entry-level white-collar jobs had been wide of the mark. Read more

0 newcommer

Habr
kmordanov (Финтех-группа «Свой») @ Habr 2 place · today 09:04 EDT

Атаки через подрядчиков, дефицит кадров и квест с импортозамещением: главные вызовы ИБ в 2026 году

Привет, Хабр! На связи Кирилл Морданов, PR-менеджер Своего Банка.Наша DevRel-команда регулярно вытаскивает на свет интересные кейсы и инсайды от технических специалистов СВОЙ Тех, чтобы вытащить наружу самые сочные инсайды. В этот раз с чашкой кофе я дошел до Алексея Ахмеева, начальника управления информационной безопасности банка.Тема ИБ сейчас максимально горячая: атаки эволюционируют, регуляторы закручивают гайки, а дедлайны по импортозамещению КИИ уже дышат в спину. Я позадавал Алексею вопросы о том, ка Read more

0 newcommer

MacRumors
Mitchel Broussard @ MacRumors 1 place · today 09:02 EDT

Apple's M5 MacBook Air Hits New Low Price of $899.99

Amazon today has introduced a new record low price on the 512GB 13-inch M5 MacBook Air, available for $899.99, down from $1,099.00. This deal is available in all colors and as of writing only Amazon has the discount. Note: MacRumors is an affiliate partner with Amazon. When you click a link and make a purchase, we may receive a small payment, which helps us keep the site running. This new... Read more

0 fresh

Habr
jakut_bmstu @ Habr 3 place · today 09:02 EDT

Я не оставлю детям наследства

Кто я такойМеня зовут Евгений, мне 31 год. Муж, отец дочки полутора лет, почти 10 лет в IT - финтех, стартапы, удалёнка. Объездил кучу стран, жил в Азии, Европе. Именно поэтому к вопросу наследства подошёл не с позиции «как принято», а с позиции — а зачем вообще?Вы вообще видели наследство? Я — нетМне 31, и я ни разу в жизни не видел наследства. Не получал, не делил, не ждал. Бабушке... Read more

0 newcommer

Habr
SrvTrantor (RUVDS.com) @ Habr · today 09:01 EDT

Почему порты стали «дверями» в сервер, и кто решил, что SSH будет 22

В 1995 году Тату Илонен написал письмо длиной с пост на Хабре и бесплатно получил номер ssh -p 22 user@host, который теперь знает каждый сисадмин. Но до этого порты были однонаправленными, чётные номера считались ненужными, а половина слотов вообще пустовала. О том, как порты стали «дверями» в сервер и что останется от них через десять лет, рассказал в статье.  Читать Read more

0 newcommer

Engadget
Engadget 2 place · today 09:00 EDT

Fitbit Air review: Health tracking for the AI generation

The Fitbit Air is a serious rival to Whoop and other screenless wearable trackers thanks to its solid hardware, comprehensive software and competitive pricing. Read more

0 fresh

Wired
Boutayna Chokrane @ Wired 1 place · today 09:00 EDT

Google Fitbit Air Review: Barely There, Always Running

Google’s latest Fitbit strips away the screen without sacrificing features, delivering the most approachable and affordable wearable yet. Read more

0 fresh

Tom's Hardware
Tom's Hardware 1 place · today 09:00 EDT

Arctic Freezer 36-S Review: Small size, effective performance, low price

We tested Arctic’s entry-level Freezer 36-S with AMD’s Ryzen 9 9950X3D, and it did better than you might expect for a single-tower air cooler. Read more

0 fresh

Habr
t3chnowolf (МТС) @ Habr · today 09:00 EDT

Почему зарубежные разработчики чипов возвращаются на китайские фабрики

Последние несколько лет полупроводниковая отрасль во многом живет за счет бума искусственного интеллекта. Спрос на чипы для обучения и работы нейросетей растет так быстро, что крупнейшие производственные компании все чаще отдают этим заказам приоритет, оставляя меньше места для выпуска более простых и привычных микросхем. На этом фоне неожиданно вырос интерес к китайским фабрикам. Компании, разрабатывающие чипы для автомобилей, бытовой техники, промышленного оборудования и другой обычной электроники, все ча Read more

0 fresh

TechRadar
TechRadar 3 place · today 09:00 EDT

What is the release date for The Four Seasons season 2 on Netflix?

Steve Carrell might not be returning, but that's not true for the rest of this Netflix show's cast. But when does The Four Seasons season 2 arrive on Netflix? Read more

0 fresh

The most popular news from the same source for the last week
VentureBeat VentureBeat
VentureBeat
VentureBeat · 05/19/2026 12:20 EDT

Andrej Karpathy, the influential 39-year-old Slovak-Canadian AI researcher and one of the original 11 co-founders of OpenAI, and former head of Tesla's AI division, announced on Tuesday, May 19 that he's joining rival lab Anthropic.As Karpathy posted from his account on the social network X: "Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to... Read more

0

VentureBeat
VentureBeat · 05/19/2026 13:37 EDT

Although it was already discovered by intrepid AI power users weeks ahead of the official unveiling today at Google's annual I/O developer conference, the company's new Gemini Omni model marks a significantly new paradigm in the wider AI and tech marketplace.That's because as its "omni" (from the Latin omne — meaning "all") prefix would suggest, this is Google's first truly native, multimodal model, that is "a model that can create... Read more

0

VentureBeat
VentureBeat · 05/19/2026 13:45 EDT

Google unveiled Gemini 3.5 Flash at its annual I/O developer conference on Tuesday, a new artificial intelligence model that the company says shatters what had become a seemingly iron law of the AI industry: that the smartest models must also be the slowest and most expensive to run.The model sits at the center of a sweeping set of announcements — from a video-generating "world model" called Gemini Omni to a... Read more

0

VentureBeat
VentureBeat · 05/19/2026 13:45 EDT

Google on Tuesday unveiled Gemini Spark, a personal AI agent designed to work around the clock — drafting emails, assembling documents, monitoring inboxes, and eventually making purchases — even when a user's laptop is closed and their phone is locked.The announcement, made at Google I/O 2026, is the company's most ambitious attempt yet to transform its AI assistant from a tool that answers questions into one that autonomously completes tasks.... Read more

0

VentureBeat
VentureBeat · 05/19/2026 13:45 EDT

For a quarter century, the Google search box has been one of the most recognizable interfaces in computing: a thin white rectangle, a blinking cursor, a few typed words, and a list of blue links. On Tuesday, Google will formally retire that paradigm.At its annual I/O developer conference, Google announced a sweeping redesign of the search box itself — the literal text field where billions of queries begin every day... Read more

0

VentureBeat
VentureBeat · 05/19/2026 15:45 EDT

The reason enterprises have been slow to connect AI agents to internal APIs and databases isn't the models — it's the credentials. In most production deployments, the agent carries authentication tokens with it as it executes tool calls, which means a compromised or misbehaving agent takes the keys with it.Anthropic is addressing that problem with two new capabilities for Claude Managed Agents: self-hosted sandboxes, which let teams run tool execution... Read more

0

VentureBeat
VentureBeat · 05/19/2026 20:06 EDT

Generative AI’s rapid transition from text-based chatbots to high-fidelity media—spanning images, video, spatial 3D, and audio—has exposed a glaring bottleneck in the modern tech stack: infrastructure. Rendering pixels in real-time requires a staggering amount of compute, and developers are increasingly struggling to manage fragmented GPU clusters just to keep their applications online.Enter fal, a generative media creation platform that has quietly become the connective tissue for 2.5 million developers ac Read more

0

VentureBeat
VentureBeat · 05/20/2026 06:00 EDT

Today, Copenhagen-based healthcare AI Corti is launching Symphony for Speech-to-Text, a new generation of clinical-grade speech recognition models engineered specifically for real-time dictation, conversational transcription, and batch audio processing — and their accuracy rate is the highest for this specific use case yet recorded."We are focused on ensuring our AI scribes can be trusted by physicians, medical practitioners and patients...the entire healthcare system," said Andreas Cleve, co-founder and CE Read more

0

VentureBeat
VentureBeat · 05/20/2026 10:12 EDT

The creators of NanoClaw — the hit open source, enterprise-friendly variant of autonomous AI agent harness OpenClaw — are moving towards commercializing their technology for enterprises at scale, aiming to provide them with secure AI agents, and an ever-updating library of workplace context, for each human employee the enterprise has approved.The duo, including former Wix.com engineer Gavriel Cohen and his brother Lazer Cohen, also founder of tech public relations firm... Read more

0

VentureBeat
VentureBeat · 05/20/2026 13:21 EDT

GitHub confirmed on May 20 that a poisoned VS Code extension installed on an employee’s device gave attackers access to roughly 3,800 internal repositories at the Microsoft-owned code storage and authorship platform. The threat group TeamPCP, formally tracked by Google Threat Intelligence Group as UNC6780, claimed responsibility and is advertising the stolen repositories for sale starting at $50,000. GitHub’s assessment: the attacker’s claim is “directionally consistent” with the investigation so... Read more

0

Most popular sources

  • You see 668 news out of 668.
  • Sources 61 out of 61.
Financial Times 0%
CNET 0%
Mobile ID World 0%
Slashdot 0%
AlleyWatch 0%
View sources »

LIKE us on Facebook so you won't miss the most important news of the day!

26.05.2026 09:25
Last update: 09:21 EDT.
News rating updated: 16:21.

What is Times42?

Times42 brings you the most popular news from tech news portals in real-time chart.
Read about us in FAQ section.


Times42 © 2026