24 place 54

249 Nvidia’s new technique cuts LLM reasoning costs by 8x without losing accuracy

VentureBeat
VentureBeat 2 place · 02/12/2026 17:00 EDT

Researchers at Nvidia have developed a technique that can reduce the memory costs of large language model reasoning by up to eight times. Their technique, called dynamic memory sparsification (DMS), compresses the key value (KV) cache, the temporary memory LLMs generate and store as they process prompts and reason through problems and documents.While researchers have proposed various methods to compress this cache before, most struggle to do so without degrading the model's intelligence. Nvidia's approach m

To see detailed statistics for the news please log in »

Read the original

Add your comment
You must be logged in with Facebook to read and write comments.

A newsletter a day!

You may get 10 most important news around midday in daily newsletter. Press the button and we will send you the most important news only, no spam attached.

or register

LIKE us on Facebook so you won't miss the most important news of the day!

News from the same source
VentureBeat VentureBeat
Silicon Valley
George Avalos @ Silicon Valley 1 place · 02/07/2106 01:28 EDT

Newark apartment complex bought for much less than prior value

An East Bay apartment complex has been bought at a price that's well below its prior value. Read more

0

🔮
19.04.2026 ♑︎ Dear Capricorn, today marks an interesting and eventful day for you, filled with both pleasant... Read more ›
Silicon Valley
George Avalos @ Silicon Valley 2 place · 02/07/2106 01:28 EDT

PG&E buys San Jose building to bolster South Bay operations

A PG&E Corp. unit has bought a San Jose building in a move to bolster the utility's South Bay operations. Read more

0

Irish Tech News
Marc-Roger Gagné MAPP @ Irish Tech News 1 place · today 09:05 EDT

The Digital God Doesn’t Run on Faith; It Runs on Water

AI is not running in the cloud. It is running on infrastructure, and that infrastructure has a cost. Nobody talks about what it costs to think at scale. That silence is doing work. AI is being sold as a technology story. It isn’t. It is a resource story; and the ledger is missing. Every response […] Read more

0 newcommer

The Information
Qianer Liu @ The Information 1 place · today 09:00 EDT

Google in Talks With Marvell to Build New AI Chips for Inference

Google is in talks with Marvell Technology to develop two new chips aimed at running AI models more efficiently, according to two people with direct knowledge of the discussions. One is a memory processing unit designed to work alongside Google’s tensor processing unit. The other is a new TPU built specifically for running AI models. The moves underscore surging demand for inference chips that run AI powering commercial products such... Read more

0 newcommer

TechRadar
TechRadar 1 place · today 09:00 EDT

4 unmissable new movies and TV shows to get excited by on Netflix this week (April 19-26)

From Stranger Things' return to a new survival thriller movie, you won't want to miss this film and TV show quartet. Read more

0 newcommer

TechRadar
TechRadar 2 place · today 09:00 EDT

What is the release date for The Testaments episode 5 on Hulu and Disney+?

The grisly truth about Dr Grove has finally come to light. But when does The Testaments episode 5 drop on Hulu and Disney+? Read more

0 newcommer

CoinDesk
Jamie Crawley @ CoinDesk 1 place · today 09:00 EDT

Nomura study says 65% of institutional investors see crypto as a vital portfolio diversifier

A new survey from Nomura and Laser Digital shows improving sentiment among institutional investors, as regulatory clarity and new products drive deeper engagement with digital assets. Read more

0 newcommer

Business Insider
Melissa Noble @ Business Insider 1 place · today 08:54 EDT

I had my first mammogram at age 40. After dreading it for years, I discovered it wasn't as bad as I thought.

When my doctor suggested I get a mammogram at 40, I was filled with anxiety. It wasn't painful at all and gave me peace of mind. Read more

0 newcommer

Habr
Combinator_30 @ Habr 1 place · today 08:48 EDT

Происхождение жизни — водный парадокс

В прошлой статье я попытался кратко изложить свои мысли, почему жизнь, вероятно, произошла задолго до появления Солнечной системы. В этой постараюсь вкратце обсудить, насколько логично искать "под фонарем" ту среду, в которой это произошло. Погнали! Read more

0 fresh

SlashGear
SlashGear 1 place · today 08:45 EDT

The Safest Large Pickup In 2026 Isn't A Toyota Or Ford, According To The IIHS

Looking for a safe large pickup truck? According to the IIHS, there's one excellent performer worth considering - it's just not the one you may have expected. Read more

0 fresh

Habr
desai @ Habr 2 place · today 08:45 EDT

Лицензии уходят, музыка остаётся: как я превратил тему для музыкального клиента в runtime-аддон с блекджеком и WASM

Около года назад моё желание кастомизировать десктопный клиент популярного музыкального сервиса привело меня в некое сообщество. Всё началось с попытки восстановить заброшенную тему «Blurity» после очередного обновления Electron-хоста, которое сломало все селекторы. Но проект быстро перерос рамки обычных правок CSS.В этой статье я расскажу, как ChromaSync эволюционировал из простого визуального патча в полноценную инженерную систему — runtime-аддон со сложной архитектурой. Мы разберем «анатомию» плеера, соз Read more

0 fresh

TechRadar
TechRadar 3 place · today 08:30 EDT

Man City vs Arsenal Live Streams: How to watch Premier League 2025/26 from anywhere in the world

All the ways to watch Man City vs Arsenal live streams online and from anywhere, in a prospective Premier League 2025/26 title decider at the Etihad. Read more

0 fresh

SlashGear
SlashGear 2 place · today 08:30 EDT

5 Pickup Trucks That Are Infamously Quick To Rust

These pickup trucks might be tough off-road, but they have a rusty reputation for corrosion, body rot, and long term structural concerns. Read more

0 fresh

Silicon Canals
James Brennan @ Silicon Canals 1 place · today 08:25 EDT

The real enemy of high performance isn’t laziness, it’s low-grade busyness

For most of the last year of my second startup, I worked constantly and produced almost nothing. That sounds like a contradiction. It isn’t. My calendar was full. My inbox stayed at near zero. I was in calls, on Slack, rearranging roadmaps, scheduling “alignment meetings” (with people who didn’t need to be aligned). I closed ... Read more Read more

0 fresh

Habr
Miller83 @ Habr 3 place · today 08:15 EDT

Design by Contract в эпоху AI: как контракты Мейера защищают криптографию там, где тесты молчат

Design by Contract Мейера не взлетел в 1986 из-за двойной работы. AI-агент убирает вторую половину. Я построил PKI-систему с аппаратным TRNG, формальными контрактами на криптографию и открытым репозиторием, чтобы это доказать. Читать далее Read more

0 fresh

SlashGear
SlashGear 3 place · today 08:15 EDT

Can You Turn Right On Red In California? Not Everywhere

In much of the U.S., safely making a right turn on red is perfectly legal and acceptable. However, this has recently changed in more than one California city. Read more

0 fresh

The most popular news from the same source for the last week
VentureBeat VentureBeat
VentureBeat
VentureBeat · 04/12/2026 12:00 EDT

For the last 18 months, the CISO playbook for generative AI has been relatively simple: Control the browser.Security teams tightened cloud access security broker (CASB) policies, blocked or monitored traffic to well-known AI endpoints, and routed usage through sanctioned gateways. The operating model was clear: If sensitive data leaves the network for an external API call, we can observe it, log it, and stop it. But that model is starting... Read more

0

VentureBeat
VentureBeat 3 place · 04/12/2026 15:00 EDT

Data drift happens when the statistical properties of a machine learning (ML) model's input data change over time, eventually rendering its predictions less accurate. Cybersecurity professionals who rely on ML for tasks like malware detection and network threat analysis find that undetected data drift can create vulnerabilities. A model trained on old attack patterns may fail to see today's sophisticated threats. Recognizing the early signs of data drift is the... Read more

0

VentureBeat
VentureBeat 3 place · 04/13/2026 00:00 EDT

Presented by EdgeverveSmart, semi‑autonomous AI agents handling complex, real‑time business work is a compelling vision. But moving from impressive pilots to production‑grade impact requires more than clever prompts or proof‑of‑concept demos. It takes clear goals, data‑driven workflows, and an enterprise platform that balances autonomy, governance, observability, and flexibility with hard guardrails from day one. From pilots to the “operational grey zones”The next wave of value sits in the connective tissue Read more

0

VentureBeat
VentureBeat 3 place · 04/13/2026 03:00 EDT

Presented by EdgeverveSmart, semi‑autonomous AI agents handling complex, real‑time business work is a compelling vision. But moving from impressive pilots to production‑grade impact requires more than clever prompts or proof‑of‑concept demos. It takes clear goals, data‑driven workflows, and an enterprise platform that balances autonomy, governance, observability, and flexibility with hard guardrails from day one. From pilots to the “operational grey zones”The next wave of value sits in the connective tissue Read more

0

VentureBeat
VentureBeat · 04/13/2026 13:11 EDT

A growing number of developers and AI power users are taking to social media to accuse Anthropic of degrading the performance of Claude Opus 4.6 and Claude Code — intentionally or as an outcome of compute limits — arguing that the company’s flagship coding model feels less capable, less reliable and more wasteful with tokens than it did just weeks ago. The complaints have spread quickly on Github, X and... Read more

0

VentureBeat
VentureBeat · 04/14/2026 00:00 EDT

Presented by AWSAutonomous agents are compressing software delivery timelines from weeks to days. The enterprises that scale agents safely will be the ones that build using spec-driven development.There’s a moment in every technology shift where the early adopters stop being outliers and start being the baseline. We’re at that moment in software development, and most teams don’t realize it yet.A year ago, vibe coding went viral. Non-developers and junior developers... Read more

0

VentureBeat
VentureBeat · 04/14/2026 09:00 EDT

The software industry is racing to write code with artificial intelligence. It is struggling, badly, to make sure that code holds up once it ships.A survey of 200 senior site-reliability and DevOps leaders at large enterprises across the United States, United Kingdom, and European Union paints a stark picture of the hidden costs embedded in the AI coding boom. According to Lightrun's 2026 State of AI-Powered Engineering Report, shared exclusively... Read more

0

VentureBeat
VentureBeat · 04/14/2026 11:00 EDT

Data teams building AI agents keep running into the same failure mode. Questions that require joining structured data with unstructured content, sales figures alongside customer reviews or citation counts alongside academic papers, break single-turn RAG systems. New research from Databricks puts a number on that failure gap. The company's AI research team tested a multi-step agentic approach against state-of-the-art single-turn RAG baselines across nine enterprise knowledge tasks and reported gains... Read more

0

VentureBeat
VentureBeat · 04/14/2026 12:00 EDT

Microsoft today launched MAI-Image-2-Efficient, a lower-cost, higher-speed variant of its flagship text-to-image model that the company says delivers production-ready quality at nearly half the price. The release, available immediately in Microsoft Foundry and MAI Playground with no waitlist, marks the fastest turnaround yet from Microsoft's in-house AI superintelligence team — and the clearest signal that Redmond is serious about building a self-sufficient AI stack that doesn't depend on OpenAI.The new... Read more

0

VentureBeat
VentureBeat · 04/14/2026 12:51 EDT

A viral post on X from veteran programmer and former Google engineer Steve Yegge set off a rhetorical firestorm this week, drawing sharp public rebuttals from some of Google’s most prominent AI leaders and reopening a sensitive question for the company: how deeply are its own engineers really using the latest generation of AI coding tools? The debate began after Yegge summarized what he said was the view of his... Read more

0

Most popular sources

  • You see 356 news out of 356.
  • Sources 61 out of 61.
MacRumors 0%
Ubergizmo 0%
Eurogamer.net 0%
AlleyWatch 0%
Engadget 0%
View sources »

LIKE us on Facebook so you won't miss the most important news of the day!

19.04.2026 09:12
Last update: 09:05 EDT.
News rating updated: 16:03.

What is Times42?

Times42 brings you the most popular news from tech news portals in real-time chart.
Read about us in FAQ section.


Times42 © 2026