21 place 0

627 Monitoring LLM behavior: Drift, retries, and refusal patterns

VentureBeat
VentureBeat 2 place · 04/26/2026 11:13 EDT

The stochastic challengeTraditional software is predictable: Input A plus function B always equals output C. This determinism allows engineers to develop robust tests. On the other hand, generative AI is stochastic and unpredictable. The exact same prompt often yields different results on Monday versus Tuesday, breaking the traditional unit testing that engineers know and love.To ship enterprise-ready AI, engineers cannot rely on mere “vibe checks” that pass today but fail when customers use the product. Pr

To see detailed statistics for the news please log in »

Read the original

Add your comment
You must be logged in with Facebook to read and write comments.

A newsletter a day!

You may get 10 most important news around midday in daily newsletter. Press the button and we will send you the most important news only, no spam attached.

or register

LIKE us on Facebook so you won't miss the most important news of the day!

News from the same source
VentureBeat VentureBeat
Silicon Valley
George Avalos @ Silicon Valley 1 place · 02/07/2106 01:28 EDT

Newark apartment complex bought for much less than prior value

An East Bay apartment complex has been bought at a price that's well below its prior value. Read more

0

🔮
09.06.2026 ♌︎ Dear Lev, today your day is filled with a variety of impressions and opportunities, despite... Read more ›
Silicon Valley
George Avalos @ Silicon Valley 2 place · 02/07/2106 01:28 EDT

PG&E buys San Jose building to bolster South Bay operations

A PG&E Corp. unit has bought a San Jose building in a move to bolster the utility's South Bay operations. Read more

0

GSMArena.com
GSMArena.com 1 place · today 21:31 EDT

Opera for Android updated with a new start page and football hub

Opera has updated its mobile browser on Android with a new design as well as a specialized hub for the upcoming 2026 FIFA World Cup matches. Starting with the design, the new Opera for Android features a design that is very much based on Chrome for Android. This includes a search bar, an AI mode button, as well as a private browsing mode button. You can also customize your pinned... Read more

0 newcommer

TechRadar
TechRadar 2 place · today 21:30 EDT

Congratulations, Apple Intelligence can now effectively generate fake images just like all the other AI and I hope you're happy

We got an up-close look at Apple's super-charged generative image tools in Apple Intelligence, and they change the game for Apple images and the photos you take and create. Read more

0 fresh

TechRadar
TechRadar 3 place · today 21:28 EDT

Don't wait for Prime Day — there are already record-low prices on top-rated headphones, air fryers, vacuums and more

Some of the discounts available right now on Amazon are far better than the recently concluded Mid-Year Sale, dropping some prices to all-time lows, including on premium Bose headphones, Ninja air fryers, Shark vacuums and more. Read more

0 fresh

Habr
rukhi7 @ Habr 1 place · today 21:17 EDT

Динамический полиморфизм против std::variant на указателях: Разрушаем мифы о скорости std::visit

В экосистеме современного C++ прочно укоренилось мнение: классический динамический полиморфизм через виртуальные функции (vtable) — это устаревший, медленный и недружелюбный к кэшу процессора механизм. В качестве «серебряной пули» модно предлагать связку std::variant и std::visit. По интернету кочуют статьи, утверждающие, что std::visit выполняет диспетчеризацию за фиксированное время O(1) и полностью уничтожает старый добрый ООП-подход.Но в таких сравнениях авторы часто совершают методологическую ошибку: о Read more

0 fresh

ScienceDaily
ScienceDaily 1 place · today 21:16 EDT

“Chemo brain” affects up to 80% of people receiving chemotherapy, making everyday tasks harder. In a new trial, cancer patients who followed a home-based exercise program showed better attention and fewer noticeable cognitive problems than those who received a placebo. Low-dose ibuprofen also improved some cognitive measures, though its effects were less consistent. Read more

0 fresh

Habr
Rikkster @ Habr 2 place · today 21:04 EDT

Многолетний опыт создания MVP сейчас на рынке ИТ-труда не востребован, или AI убил HR найм?

Как быстро можно найти работу в 2026, если за плечами – серьёзный опыт fullstack-разработки?На фоне блокировки Telegram у фаундеров проекта над которым я трудился упали доходы на ±60%, и они не смогли поддерживать дальнейшую разработку. Я обнаружил, что рынок РФ фриланса можно сказать что умер, а выход на собес через ИТ-вакансии практически... Читать далее Read more

0 fresh

Business Insider
Alistair Barr @ Business Insider 1 place · today 20:46 EDT

Anthropic purposely made its new Mythos-based models bad at AI research, and developers are fuming

Anthropic faces backlash as Mythos-based models intentionally limit help for AI research, raising transparency and ethical concerns. Read more

0 fresh

SlashGear
SlashGear 1 place · today 20:45 EDT

This Chinese Company Is Making Brand-New Bodies For A Whole List Of Classic Cars

Restomods may just become the hot new thing, with car prices being what they are. If so, this Chinese company is ready with newly minted bodies for classics. Read more

0 fresh

Inc42 Media
Gaurav Bagur @ Inc42 Media 1 place · today 20:30 EDT

EV Charging Startup Exponent Energy Nets ₹200 Cr To Accelerate R&D

EV rapid-charging startup Exponent Energy has raised ₹200 Cr ($21.1 Mn) in a funding round co-led by 360 ONE Asset… Read more

0 fresh

SlashGear
SlashGear 2 place · today 20:30 EDT

10 Milwaukee M12 Products You Probably Didn't Realize Existed

The Milwaukee M12 lineup includes surprising tools you may not expect, from infrared temp guns to drain cameras, tackling a wide range of jobs. Read more

0 fresh

Inc42 Media
Anne Florentyna @ Inc42 Media 2 place · today 20:30 EDT

Mygate Secures ₹225 Cr To Fuel Footprint Expansion

Community management and security startup Mygate has raised ₹225 Cr ($23.6 Mn) in a fresh funding round from Dharana Capital… Read more

0 fresh

CNET
David Carnoy @ CNET 1 place · today 20:08 EDT

Best Bluetooth Speakers of 2026

Portable Bluetooth speakers keep getting better with each passing year. As CNET's mobile audio expert, I've tested hundreds of wireless speakers. Here are my current top picks for every budget. Read more

0 fresh

GSMArena.com
GSMArena.com 2 place · today 20:02 EDT

Samsung Galaxy S27 surfaces for the first time

The upcoming Samsung Galaxy S27 has now been spotted in the GSM Association's IMEI database. This confirms its name and its model number, that is SM-S952 with a letter affixed to that, which signifies which market(s) it's intended for. The specific model in the IMEI database is the SM-S952U, as you can see in the screenshot below. This one is going to US carriers. Of course, having this confirmation that... Read more

0 fresh

The Information
Martin Peers @ The Information 1 place · today 20:00 EDT

Can OpenAI do anything simply? Many companies, when publicly acknowledging the possibility of an IPO, emphasize that the final decision will depend in part on “market conditions.” That’s what OpenAI’s rival Anthropic did last week. But the ChatGPT firm chose to be more cryptic. Revealing its confidential IPO filing on Monday, OpenAI said it might hold off on its debut for a while “because there are things we want to... Read more

0 fresh

The most popular news from the same source for the last week
VentureBeat VentureBeat
VentureBeat
VentureBeat 2 place · 06/02/2026 21:55 EDT

Every new AI agent your team deploys starts from scratch: no memory of how the business works, where data lives, or what rules apply. And as agentic coding tools spin up applications faster than anyone can govern them, each one risks becoming another silo outside your data layer entirely. Microsoft is addressing both problems directly at Build 2026.According to VentureBeat's VB Pulse's Q1 2026 RAG Infrastructure Market Tracker, hybrid retrieval... Read more

0

VentureBeat
VentureBeat 1 place · 06/03/2026 14:49 EDT

While many AI open source model providers are pursuing larger and more powerful models, Google is still giving attention to the smaller, more local side of the market. Today, the tech giant released Gemma 4 12B, an 11.95-billion-parameter open-weights model with permissive Apache 2.0 license optimized to execute locally on a standard enterprise laptop using just 16GB of VRAM or unified memory.That means those enterprise users looking to keep working... Read more

0

VentureBeat
VentureBeat · 06/04/2026 16:25 EDT

Anthropic co-founder and CEO Dario Amodei said it was coming, but it still feels like a milestone: More than 80% of the code merged into Anthropic’s production codebase in May wasn't authored by humans, but by its own AI model, Claude, according to a new report shared by the record-breaking AI startup today.This transformation has triggered an 8x increase in the volume of code shipped per engineer per quarter compared... Read more

0

VentureBeat
VentureBeat · 06/05/2026 12:42 EDT

Meta's AI support agent bound recovery emails to accounts for whoever asked, and SOCs never saw an alert. An authorized agent writes a log of legitimate transactions, so nothing in the detection stack fired. Attackers asked the bot to make the change, took the one-time code it sent, and ran the password reset, 404 Media reported.No malware, no stolen credentials, and no prompt injection in the sense most security teams... Read more

0

VentureBeat
VentureBeat 3 place · 06/05/2026 13:51 EDT

When someone on a team corrects an AI agent — better prompts, better feedback, better context — that improvement disappears the moment a colleague opens the same tool. The correction doesn't transfer, and the next person starts from zero.The problem compounds in multi-agent workflows, where teams expect agents to share context across users and tasks. Without a shared memory layer, every team member effectively trains a different version of the... Read more

0

VentureBeat
VentureBeat 3 place · 06/05/2026 15:31 EDT

Microsoft used its Build 2026 conference this week to push a clear message: agents are rapidly moving into production throughout enterprise systems, and the winning platform will be the one that gives them reliable context, governance, identity, memory — and secure access to enterprise data. The company announced Microsoft IQ as a context layer across GitHub Copilot, Microsoft Foundry and Copilot Studio; Work IQ APIs coming June 16; Fabric IQ... Read more

0

VentureBeat
VentureBeat 2 place · 06/05/2026 18:55 EDT

For three years, Microsoft's artificial intelligence story has been inseparable from OpenAI. The partnership — cemented by a cumulative investment exceeding $13 billion — gave Microsoft early access to the most advanced AI models on the planet, catapulting its Copilot products into the enterprise mainstream and adding hundreds of billions of dollars to its market capitalization. To the outside world, Microsoft's AI strategy was OpenAI.Mustafa Suleyman wants to change that... Read more

0

VentureBeat
VentureBeat 1 place · 06/06/2026 00:00 EDT

Our system did one thing, and it did it well: It turned natural-language questions into API calls.The users were analysts, account managers, and operations leads. They knew what data they needed, but assembling it manually meant pulling from four dashboards, two BI tools, and a Salesforce report builder. With our system, they typed the request in plain English. A request like "Compile a report on sales volume for January through... Read more

0

VentureBeat
VentureBeat 2 place · 06/07/2026 12:00 EDT

Agentic AI is now a core part of the engineering process, driving massive execution leverage and helping us generate more code than ever before. Yet, a difficult question I’ve increasingly heard from business leaders is: if we’re shipping code faster than ever, why aren’t our products improving at the same rate?The reason is that writing code was never the rate limiter. Defining the right requirements, integrating with complex systems, and... Read more

0

VentureBeat
VentureBeat 3 place · 06/07/2026 21:02 EDT

Our system did one thing, and it did it well: It turned natural-language questions into API calls.The users were analysts, account managers, and operations leads. They knew what data they needed, but assembling it manually meant pulling from four dashboards, two BI tools, and a Salesforce report builder. With our system, they typed the request in plain English. A request like "Compile a report on sales volume for January through... Read more

0

Most popular sources

  • You see 936 news out of 936.
  • Sources 61 out of 61.
ScienceDaily 0%
Tech Wire Asia 0%
Tech.eu 0%
ReadWrite 0%
ArcticStartup 0%
View sources »

LIKE us on Facebook so you won't miss the most important news of the day!

09.06.2026 21:54
Last update: 21:40 EDT.
News rating updated: 04:40.

What is Times42?

Times42 brings you the most popular news from tech news portals in real-time chart.
Read about us in FAQ section.


Times42 © 2026