VentureBeat #249

24 place 54

249 Nvidia’s new technique cuts LLM reasoning costs by 8x without losing accuracy

VentureBeat 2 place · 02/12/2026 17:00 EDT

Researchers at Nvidia have developed a technique that can reduce the memory costs of large language model reasoning by up to eight times. Their technique, called dynamic memory sparsification (DMS), compresses the key value (KV) cache, the temporary memory LLMs generate and store as they process prompts and reason through problems and documents.While researchers have proposed various methods to compress this cache before, most struggle to do so without degrading the model's intelligence. Nvidia's approach m

Share (36) Tweet

To see detailed statistics for the news please log in »

Read the original

Add your comment

You must be logged in with Facebook to read and write comments.

A newsletter a day!

You may get 10 most important news around midday in daily newsletter. Press the button and we will send you the most important news only, no spam attached.

Tech News

LIKE us on Facebook so you won't miss the most important news of the day!

News from the same source
VentureBeat

1

Newark apartment complex bought for much less than prior value

George Avalos @ Silicon Valley 1 place · 02/07/2106 01:28 EDT

An East Bay apartment complex has been bought at a price that's well below its prior value. Read more ›

Share (0) Tweet

0

🔮

Your personal horoscope »

26.07.2026 ♏︎ Today, a busy and interesting day is expected for Scorpios, filled with bright moments in... Read more ›

2

PG&E buys San Jose building to bolster South Bay operations

George Avalos @ Silicon Valley 2 place · 02/07/2106 01:28 EDT

A PG&E Corp. unit has bought a San Jose building in a move to bolster the utility's South Bay operations. Read more ›

Share (0) Tweet

0

3

Why Is Nissan Discontinuing The Altima After 2026?

SlashGear 1 place · today 09:45 EDT

Following the discontinuation of the Nissan Versa and Maxima lines, the brand has now announced that the Altima family is on its way out. What is Nissan doing? Read more ›

Share (0) Tweet

0 newcommer

4

1

Shopping for back-to-school? These are the smartwatches I’d actually wear to class

Shikhar Mehrotra @ Digital Trends 1 place · today 09:30 EDT

I dug through the specs, ratings, and price tags on eight of Amazon's most-bought smartwatches so you don't have to. Read more ›

Share (0) Tweet

0 fresh

5

1

Nanoleaf Smart Multicolor Ceiling Light Review: A Paper Plate on Your Ceiling

Wes Davis @ Gizmodo 1 place · today 09:30 EDT

Bright white and good Matter support make it worth a look, even though its colored light is a bit dim. Read more ›

Share (0) Tweet

0 fresh

6

1

The reason why the MacBook Air doesn't need a fan

Engadget 1 place · today 09:30 EDT

Apple Silicon delivers enough performance without overheating. Read more ›

Share (0) Tweet

0 fresh

7

1

Chinese CXMT DRAM doesn't look like the budget savior many were expecting — new modules enter the market, but prices still track the big three

Tom's Hardware 1 place · today 09:25 EDT

Chinese retail listings indicate that CXMT-based memory modules are priced similarly to those featuring chips from the big three, despite expectations that they must be cheaper. Read more ›

Share (0) Tweet

0 fresh

8

1

The 23 most dangerous countries for US travelers in 2026, according to the State Department

Jenny McGrath,Kristine Villarroel @ Business Insider 1 place · today 09:23 EDT

The US State Department issued a worldwide warning for American travelers. See the 23 countries it previously warned US citizens against visiting. Read more ›

Share (90) Tweet

0 fresh

9

1

It's not just OpenAI models escaping and running riot — experts show how Claude Cowork can break its bonds and access Mac files

TechRadar 1 place · today 09:10 EDT

Anthropic partially mitigated the issue, and there are things users can do to defend themselves, too. Read more ›

Share (0) Tweet

0 fresh

10

1

У криптографов была «одна пуля в барабане». GPT-5.6 нашел вторую

runaway_llm @ Habr 1 place · today 09:06 EDT

23 июля Прабханджан Анант (Калифорнийский университет в Санта-Барбаре) и Амит Сахаи (UCLA) выложили на arXiv статью "Unconditional Unclonable Encryption", которая закрывает проблему, стоявшую перед квантовыми криптографами шесть лет. Но не меньше самого результата обсуждают короткий раздел в конце введения: конструкция и основные идеи доказательства были целиком сгенерированы агентом Codex на модели GPT-5.6 Sol Ultra. Читать далее Read more ›

Share (0) Tweet

0 fresh

11

1

Распознаем речь из видео(аудио) в текст. Короткая инструкция по транскрибации для чайников

vaddone @ Habr 2 place · today 09:05 EDT

Всем привет, друзья! Сегодня я расскажу, как быстро настроить распознавание речи(транскрибацию) из видео(аудио) в текст. Для этого воспользуемся таким популярным инструментом как Whisper. Whisper — это нейросетевая система распознавания речи (автоматического транскрибирования аудио в текст), разработанная компанией OpenAI (создатели ChatGPT). Читать далее Read more ›

Share (0) Tweet

0 fresh

12

1

Коалиция Хуанга удвоилась за сутки. Из фронтирных лаб её не подписала только Anthropic

Niketas @ Habr 3 place · today 09:01 EDT

Главное пополнение — Google и поддержка от Пичаи и Хассабиса. Amazon молчит, а за Anthropic в твиттере отдувается один технарь. Читать далее Read more ›

Share (0) Tweet

0 fresh

13

1

Топ известных и мемных роботов и ИИ из научной и не очень фантастики

popski_ruvds (RUVDS.com) @ Habr · today 09:01 EDT

Мыслящие в той или иной степени машины волнуют воображение людей где-то с конца XIX века, а ныне подобные штуки всё более активно становятся частью нашей реальности и быта. Задолго до того, как школьники стали «писать» сочинения посредством ChatGPT, а хикки — крутить цифровые «романы» с нейронками, писатели-фантасты и режиссёры создали немало ярких персонажей-роботов. Самых разных — от грозных машин-киллеров до трогательных или смехотворных. Попробуем вспомнить самых знаменитых и меметичных из... Read more ›

Share (0) Tweet

0 fresh

14

1

Don't like AI? Linux's creator says go fork yourself

Alistair Barr @ Business Insider 2 place · today 09:01 EDT

Linus Torvalds says Linux developers should embrace AI coding tools or fork the project, ending a heated open-source debate. Read more ›

Share (0) Tweet

0 fresh

15

1

U.S. regulator warns prediction markets against cutting corners in event contracts

Jesse Hamilton @ CoinDesk 1 place · today 09:00 EDT

The Commodity Futures Trading Commission again issued an advisory that signals firms have been straying into cookie-cutter self-certification. Read more ›

Share (0) Tweet

0 fresh

16

1

Future Alpha Seeks Partners for Edge Report 2026 Quant Survey

The Fintech Times @ The Fintech Times 1 place · today 09:00 EDT

Future Alpha is recruiting industry partners for its 2026 survey of quantitative and systematic investment firms ahead of publication. Read more ›

Share (0) Tweet

0 fresh

17

1

Only Murders in the Building season 6 casting Olivia Colman is nothing short of genius — now stream the Oscar nominated political biopic that first made her Meryl Streep's co-star

TechRadar 2 place · today 09:00 EDT

Before Olivia Colman and Meryl Streep were co-stars in Only Murders in the Building season 6, this British political biopic brought them together in the unlikeliest of ways. Read more ›

Share (0) Tweet

0 fresh

18

1

I wrote a story about a woman who regretted moving to South Carolina. It sparked a debate about living in the South.

Alcynna Lloyd @ Business Insider 3 place · today 08:56 EDT

BI readers debated one woman's move from Connecticut to South Carolina, discussing the state's climate and affordability. Read more ›

Share (0) Tweet

0 fresh

19

4 Common Problems With Disc Brakes

SlashGear 2 place · today 08:45 EDT

With most modern cars using disc brakes, it's helpful to understand some of the most common problems you're likely to encounter with these braking systems. Read more ›

Share (0) Tweet

0 newcommer

20

2

[Перевод] Космический телескоп «Роман» будет искать древние чёрные дыры, наблюдая за тем, как они поглощают звёзды

SLY_G @ Habr · today 08:41 EDT

Сверхмассивные чёрные дыры (СМЧД) могут быть чрезвычайно неряшливыми «пожирателями». Когда звезда подходит слишком близко, огромная приливная сила СМЧД может не только втянуть звезду внутрь, но и растянуть её, разорвав на части, прежде чем поглотить. И это полезно для астрофизиков, поскольку так СМЧД легче обнаружить. Когда СМЧД разрывает звезду на части, это называется событием приливного разрушения (СПР). Оторванный от звезды звёздный материал собирается в аккреционное кольцо вокруг чёрной дыры, нагревает Read more ›

Share (0) Tweet

0 fresh

The most popular news from the same source for the last week
VentureBeat

1

AI confidence just dropped 17 points in six months. That’s actually great news.

VentureBeat · 07/20/2026 03:00 EDT

Presented by JumpCloudThe organizations losing confidence in AI are the ones most likely to get it right.Six months ago, 40% of IT leaders described their organizations as mature in AI deployment. Today that number is 23%. Before you read that as a setback, consider what it actually reflects.We recently surveyed 800 IT leaders across the U.S. and U.K. for our Q3 2026 trends report, and the data tells a consistent... Read more ›

Share (0) Tweet

0

2

Safety guardrails blocked Hugging Face's defenders, not the attacker, when an AI agent breached its systems

VentureBeat · 07/20/2026 11:49 EDT

Hugging Face’s incident response team first turned to frontier AI models to analyze a breach of the company’s production infrastructure, and the models refused to help. Commercial safety guardrails built to stop attackers blocked every forensic query because they treated the IR team’s real exploit data the same way they would treat a live attack.The attacker, an autonomous AI agent running the campaign end to end, moved laterally across the... Read more ›

Share (0) Tweet

0

3

The cleanup trap: Stop asking RAG to fix bad data

VentureBeat · 07/20/2026 12:19 EDT

The enterprise technology ecosystem is caught in a costly cycle. Over the past two years, millions of dollars have been funneled into generative AI pilots, yet many of these initiatives stall out before ever reaching a live production environment.When a project fails, the immediate instinct of technical leadership is often to blame the model: The context window was too restrictive, the latency was too high, or the reasoning capabilities simply... Read more ›

Share (0) Tweet

0

4

At VB Transform 2026, Zillow's engineering chief said AI ROI numbers only hold up if you measure before you build

VentureBeat · 07/20/2026 12:38 EDT

Zillow, the real estate technology company, doesn't get one conversation with its customers. They move from a phone screen to a loan officer to a real estate agent, sometimes over months or years, and expect the context to follow them. A single chatbot could never carry that thread.At VB Transform 2026, Zillow SVP of Engineering Toby Roberts and Glean co-founder and CEO Arvind Jain described how they built AI architecture... Read more ›

Share (0) Tweet

0

5

A single AI agent conversation can look perfect and still be broken, leaders from LangChain, Conviva and CoreWeave said at VB Transform 2026

VentureBeat · 07/20/2026 14:24 EDT

A single AI agent conversation can look flawless scored on its own and still point to a broken product. That gap is driving a shift in how enterprises evaluate agents, away from scoring individual traces and toward comparing cohorts of users against a baseline.At VB Transform 2026, Harrison Chase, CEO of LangChain; Hui Zhang, CTO and co-founder of Conviva; and Emmanuel Turlay, director of engineering at CoreWeave, described that shift,... Read more ›

Share (0) Tweet

0

6

Writer's AI harness cuts token spend nearly 40% — without sacrificing accuracy

VentureBeat · 07/20/2026 17:18 EDT

Enterprise AI is facing an ROI paradox. While throwing more compute at the strongest foundation model works well in product experiments, the costs become unbearable when the product is deployed in production.A new paper from researchers at Writer provides a solution that is accessible to engineering teams. The study takes a systematic look at optimizing the different components of the orchestration layer that wraps around the foundation model, aka the... Read more ›

Share (0) Tweet

0

7

Atlassian: Why AI speeds up employees but not organizations

VentureBeat · 07/21/2026 03:00 EDT

Presented by Atlassian Most companies are approaching AI adoption backwards by optimizing how individuals use AI instead of how teams work together, said Dr. Molly Sands, head of the Teamwork Lab at Atlassian, during a fireside chat with VentureBeat senior technology contributor Sam Witteveen at VB Transform 2026.Sands leads a team of behavioral scientists and psychologists who study how AI is reshaping the way people work together, using those findings... Read more ›

Share (0) Tweet

0

8

Atlassian: Research shows organizations should approach AI at the team level, not the individual level, to achieve true ROI

VentureBeat · 07/21/2026 03:00 EDT

Presented by Atlassian Most companies are approaching AI adoption backwards by optimizing how individuals use AI instead of how teams work together, said Dr. Molly Sands, head of the Teamwork Lab at Atlassian, during a fireside chat with VentureBeat senior technology contributor Sam Witteveen at VB Transform 2026.Sands leads a team of behavioral scientists and psychologists who study how AI is reshaping the way people work together, using those findings... Read more ›

Share (0) Tweet

0

9

Evals are the new PRD, Expedia’s AI chief tells VB Transform 2026

VentureBeat · 07/21/2026 14:15 EDT

“The new PRD are the evals,” Xavi Amatriain, Expedia Group’s first chief AI and data officer, told the VB Transform 2026 audience last week in Menlo Park. “So basically, you encode what you want the product to do through your evals, which might include red teaming evals and all kinds of other things, which already have a bunch of security requirements. So, you already embed that into the PRD and... Read more ›

Share (0) Tweet

0

10

Google's Gemini 3.6 Flash model cuts AI agent token costs by up to 65% on long horizon engineering tasks —and 3.5 Pro is on the way

VentureBeat · 07/21/2026 15:59 EDT

Google DeepMind today released three new proprietary AI models it says are among its most token-efficient yet: Gemini 3.6 Flash, Gemini 3.5 Flash-Lite, and Gemini 3.5 Flash Cyber. The models aim to make AI agents faster, smarter, and cheaper at scale. Google is pricing Gemini 3.6 Flash at $1.50 per one million input tokens and $7.50 per one million output tokens through its application programming interface (API), while Gemini 3.5... Read more ›

Share (0) Tweet

0

Most popular sources

You see 345 news out of 345.
Sources 61 out of 61.

Eurogamer.net	0%
Ars Technica	0%
Startups News	0%
MacRumors	0%
Financial Times	0%
View sources »

Tech News

LIKE us on Facebook so you won't miss the most important news of the day!

26.07.2026 09:50
Last update: 09:45 EDT.
News rating updated: 16:40.

What is Times42?

Times42 brings you the most popular news from tech news portals in real-time chart.
Read about us in FAQ section.

Times42 © 2026