308 place 0

378 Monitoring LLM behavior: Drift, retries, and refusal patterns

VentureBeat
VentureBeat 1 place · 04/25/2026 00:00 EDT

The stochastic challengeTraditional software is predictable: Input A plus function B always equals output C. This determinism allows engineers to develop robust tests. On the other hand, generative AI is stochastic and unpredictable. The exact same prompt often yields different results on Monday versus Tuesday, breaking the traditional unit testing that engineers know and love.To ship enterprise-ready AI, engineers cannot rely on mere “vibe checks” that pass today but fail when customers use the product. Pr

To see detailed statistics for the news please log in »

Read the original

Add your comment
You must be logged in with Facebook to read and write comments.

A newsletter a day!

You may get 10 most important news around midday in daily newsletter. Press the button and we will send you the most important news only, no spam attached.

or register

LIKE us on Facebook so you won't miss the most important news of the day!

News from the same source
VentureBeat VentureBeat
Silicon Valley
George Avalos @ Silicon Valley 1 place · 02/07/2106 01:28 EDT

Newark apartment complex bought for much less than prior value

An East Bay apartment complex has been bought at a price that's well below its prior value. Read more

0

🔮
15.05.2026 ♒︎ Today will be a busy and multifaceted day for Aquarius, despite some difficulties in personal... Read more ›
Silicon Valley
George Avalos @ Silicon Valley 2 place · 02/07/2106 01:28 EDT

PG&E buys San Jose building to bolster South Bay operations

A PG&E Corp. unit has bought a San Jose building in a move to bolster the utility's South Bay operations. Read more

0

Digital Trends
Shikhar Mehrotra @ Digital Trends 1 place · today 19:51 EDT

This is the coolest laptop power bank I have ever seen, and I’d wait to see if it actually ships

The Krafted Edge is shaped like a closed laptop, sits flush beneath your notebook while charging it, and fits in any laptop bag without extra bulk. Read more

0 newcommer

SlashGear
SlashGear 1 place · today 19:45 EDT

NASA Once Tricked Its Own Government By Hiding An SR-71 In Plain Sight

The SR-71 Blackbird is a legend in the world of military aircraft, and NASA may have received one a bit earlier than it was supposed to. What happened? Read more

0 newcommer

Habr
VAnderskaeV @ Habr 1 place · today 19:43 EDT

Возвращение блудного программиста (ч. 4)

Эта часть про то, как я пилю бэкенд, учусь на этом и получаю эмоциональные качели.Для начала, напомню о себе: после 12 лет отсутствия в сфере я решил вернуться к своему базовому образованию – инженер-программист. Что-то приходится «вспоминать с нуля», но я не люблю начинать изучение полностью с теории, я больше экспериментатор. Читать далее Read more

0 newcommer

CNET
Gael Cooper @ CNET 1 place · today 19:38 EDT

Today's NYT Connections Hints, Answers and Help for May 16, #1070

Here are some hints and the answers for the NYT Connections puzzle for May 16, No. 1,070. Read more

0 newcommer

BetaKit
Douglas Soltys @ BetaKit 1 place · today 19:17 EDT

Harjit Sajjan reports for duty

After decades in public service roles, former defence minister Harjit Sajjan is now a defence-tech entrepreneur with Juno Industries. Read more

0 fresh

The Verge
Tom Warren @ The Verge 1 place · today 19:10 EDT

Xbox is now XBOX

Xbox just allcapsmaxxed: meet XBOX. This isn't a joke, Microsoft appears to be actually rebranding Xbox to XBOX. Asha Sharma, Xbox CEO, ran a poll on X earlier this week, asking fans whether Microsoft should use Xbox or XBOX? The results were in favor of XBOX, and the company has now renamed its X account. […] Read more

0 fresh

MacRumors
Juli Clover @ MacRumors 1 place · today 19:08 EDT

10 Useful iPhone Tips and Tricks You Might Not Know About

Over the years, the iPhone's operating system has gotten complicated. Apple adds new features with every version of iOS, and many of them aren't always obvious, leading to hidden iPhone capabilities you might not be aware of. The tips below assume that you have iOS 26 or later installed. Turn an App Into a Widget You can turn most app icons into widgets right from the iPhone's Home Screen. Just... Read more

0 fresh

TechRadar
TechRadar 1 place · today 19:06 EDT

How to watch the 2026 Eurovision Song Contest Grand Final online for FREE from anywhere

25 countries and only one winner. How are you going to vote? Here's how to watch Eurovision 2026 Grand Final online and for free from anywhere. Read more

0 fresh

Ubergizmo
Paulo Montenegro @ Ubergizmo 1 place · today 19:05 EDT

Google has disclosed the system requirements for its upcoming Gemini Intelligence ecosystem. According to documentation found on an official product page, the advanced artificial intelligence features will not be universally compatible with all Android devices, as they necessitate specific hardware and software standards, including support for the Gemini Nano v3 model. Minimum Specifications To run Gemini Intelligence, mobile devices must fulfill several strict criteria detailed by Google: Hardware: A premi Read more

0 fresh

Digital Trends
Shikhar Mehrotra @ Digital Trends 2 place · today 19:01 EDT

Meta’s Ray-Ban Display now types messages from your finger movements

Neural Handwriting is now live for every Ray-Ban Display owner, letting them type messages with finger movements, with no voice or phone required. Read more

0 fresh

TechRadar
TechRadar 2 place · today 19:00 EDT

Quordle hints and answers for Saturday, May 16 (game #1573)

Looking for Quordle clues? We can help. Plus get the answers to Quordle today and past solutions. Read more

0 fresh

TechRadar
TechRadar 3 place · today 19:00 EDT

NYT Strands hints and answers for Saturday, May 16 (game #804)

Looking for NYT Strands answers and hints? Here's all you need to know to solve today's game, including the spangram. Read more

0 fresh

TechRadar
TechRadar · today 19:00 EDT

NYT Connections hints and answers for Saturday, May 16 (game #1070)

Looking for NYT Connections answers and hints? Here's all you need to know to solve today's game, plus my commentary on the puzzles. Read more

0 fresh

Slashdot
BeauHD @ Slashdot 1 place · today 19:00 EDT

Kioxia and Dell Cram Nearly 10PB Into a Single 2U Server

BrianFagioli writes: Kioxia and Dell Technologies say they have built a 2U server configuration capable of scaling to 9.8PB of flash storage, which is the sort of density that would have sounded impossible just a few years ago. The setup combines a Dell PowerEdge R7725xd Server with 40 Kioxia LC9 Series 245.76TB NVMe SSDs and AMD EPYC processors. According to Kioxia, matching the same capacity with more common 30.72TB SSDs... Read more

0 fresh

Gizmodo
Jen Lennon @ Gizmodo 1 place · today 19:00 EDT

New ‘Gundam Wing’ ‘Visual Project’ in the Works

Bandai Namco announced the mysterious new project at the recent Gundam Conference. Read more

0 fresh

Ubergizmo
Paulo Montenegro @ Ubergizmo 2 place · today 18:55 EDT

Valve is currently offering a free download of the psychological horror game Terrors to Unveil – Day Off on its Steam platform. PC gamers can permanently add the title to their libraries at no cost, provided they claim it before the promotion ends on May 20. Game Overview The plot centers on Jack Williams, a 26-year-old worker who has just completed a demanding work week. Seeking relaxation, Williams travels to... Read more

0 fresh

CNET
Ajay Kumar @ CNET 2 place · today 18:54 EDT

Dyson's New Fan and Air Purifier Combo Follows You Around the Room to Deliver Clean Air

Oscillating fans are a nice idea to spread air around the room, but what if you just want to cool yourself? Dyson's Find+Follow uses AI tracking to do it. Read more

0 newcommer

Digital Trends
Manisha Priyadarshini @ Digital Trends 3 place · today 18:54 EDT

WhatsApp is testing disappearing messages that wait for you to actually read them before vanishing

WhatsApp has always let you send messages that vanish on a timer, but the clock starts the moment you hit send, not when the other person actually read it. That means a message could sit unread for hours and still disappear before anyone sees it. This is why WhatsApp is testing a new feature called […] Read more

0 fresh

The most popular news from the same source for the last week
VentureBeat VentureBeat
VentureBeat
VentureBeat 1 place · 05/09/2026 12:00 EDT

Here is a scenario that should concern every enterprise architect shipping autonomous AI systems right now: An observability agent is running in production. Its job is to detect infrastructure anomalies and trigger the appropriate response. Late one night, it flags an elevated anomaly score across a production cluster, 0.87, above its defined threshold of 0.75. The agent is within its permission boundaries. It has access to the rollback service. So... Read more

0

VentureBeat
VentureBeat 1 place · 05/10/2026 13:22 EDT

AI agents choose tools from shared registries by matching natural-language descriptions. But no human is verifying whether those descriptions are true. I discovered this gap when I filed Issue #141 in the CoSAI secure-ai-tooling repository. I assumed it would be treated as a single risk entry. The repository maintainer saw it differently and split my submission into two separate issues: One covering selection-time threats (tool impersonation, metadata manipulation); the other... Read more

0

VentureBeat
VentureBeat · 05/11/2026 13:15 EDT

A doctor in a hospital exam room watches as a medical transcription agent updates electronic health records, prompts prescription options, and surfaces patient history in real time. A computer vision agent on a manufacturing line is running quality control at speeds no human inspector can match. Both generate non-human identities that most enterprises cannot inventory, scope, or revoke at machine speed.That is the structural problem keeping agentic AI stuck in... Read more

0

VentureBeat
VentureBeat · 05/11/2026 18:21 EDT

Is AI leaving the era of "turn-based" chat?Right now, all of us who use AI models regularly for work or in our personal lives know that the basic interaction mode across text, imagery, audio, and video remains the same: the human user provides an input, waits anywhere between milliseconds to minutes (or in some cases, for particularly tough queries, hours and days), and the AI model provides an output.But if... Read more

0

VentureBeat
VentureBeat · 05/12/2026 03:00 EDT

Presented by EdgeVerveFor most enterprises, AI adoption began with a straightforward ambition: automate work faster, cheaper, and at scale. Chatbots replaced basic service requests, machine‑learning models optimized forecasts, and analytics dashboards promised sharper insights. Yet many organizations are now discovering that deploying individual AI solutions does not automatically translate into enterprise‑level impact. Pilots proliferate, but value plateaus.The next phase of AI maturity is no longer about. Read more

0

VentureBeat
VentureBeat · 05/12/2026 03:00 EDT

Presented by Apptio, an IBM companyAI spending is surging, but the full impact often remains an open question. Closing the gap requires clear answers to how AI is governed, measured, and tied to business outcomes.ROI uncertainty isn’t unique to AI: In the Apptio 2026 Technology Investment Management Report, 90% of technology leaders surveyed said that ROI uncertainty has a moderate or major impact on overall tech investment decisions, a 5-percentage... Read more

0

VentureBeat
VentureBeat 3 place · 05/12/2026 11:59 EDT

Between May 6 and 7, four security research teams published findings about Anthropic’s Claude that most outlets covered as three separate stories. One involved a water utility in Mexico, another targeted a Chrome extension, and a third hijacked OAuth tokens through Claude Code. In one case, Claude identified a water utility’s SCADA gateway without being told to look for one.These are not three bugs. They are one architectural question playing... Read more

0

VentureBeat
VentureBeat 2 place · 05/12/2026 14:45 EDT

AI that can see and understand what's happening in a video — especially a live feed — is understandably an attractive product to lots of enterprises and organizations. Beyond acting as a security "watchdog" over sites and facilities, such an AI model could also be used to clip out the most exciting parts of marketing videos and repurpose them for social, identify inconsistencies and gaffs in videos and flag them... Read more

0

VentureBeat
VentureBeat 1 place · 05/12/2026 14:49 EDT

Any development environment that installed or imported one of the 172 compromised npm or PyPI packages published since May 11 should be treated as potentially compromised. On affected developer workstations, the worm harvests credentials from over 100 file paths: AWS keys, SSH private keys, npm tokens, GitHub PATs, HashiCorp Vault tokens, Kubernetes service accounts, Docker configs, shell history, and cryptocurrency wallets. For the first time in a TeamPCP campaign, it... Read more

0

VentureBeat
VentureBeat · 05/13/2026 16:10 EDT

As large language models become more capable, users are tempted to delegate knowledge tasks where models process documents on their behalf and provide the finished results. But how far can you trust the model to stay faithful to the content of your documents when it has to iterate over them across multiple rounds?A new study by researchers at Microsoft shows that large language models silently corrupt documents that they work... Read more

0

Most popular sources

  • You see 779 news out of 779.
  • Sources 61 out of 61.
Startup News 0%
ScienceDaily 0%
ArcticStartup 0%
Tech Wire Asia 0%
ReadWrite 0%
View sources »

LIKE us on Facebook so you won't miss the most important news of the day!

15.05.2026 20:07
Last update: 20:00 EDT.
News rating updated: 03:02.

What is Times42?

Times42 brings you the most popular news from tech news portals in real-time chart.
Read about us in FAQ section.


Times42 © 2026