Google's new TurboQuant algorithm speeds up AI memory 8x, cutting costs by 50% or more

VentureBeat · 03/25/2026 15:27 EDT

As Large Language Models (LLMs) expand their context windows to process massive documents and intricate conversations, they encounter a brutal hardware reality known as the "Key-Value (KV) cache bottleneck." Every word a model processes must be stored as a high-dimensional vector in high-speed memory. For long-form tasks, this "digital cheat sheet" swells rapidly, devouring the graphics processing unit (GPU) video random access memory (VRAM) used during inference and slowing the model's performance down...
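To make the scale of that bottleneck concrete, here is a rough back-of-the-envelope sketch of how a KV cache grows with context length. It uses the standard sizing rule (one key vector and one value vector per token, per layer, per KV head). The model shape below is hypothetical and chosen only for illustration, and the lower-bit case shows generic storage arithmetic, not the TurboQuant algorithm itself.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len,
                   bits_per_value=16, batch_size=1):
    """Approximate KV cache size in bytes.

    Each token stores one key and one value vector (hence the factor 2)
    for every layer and every KV head.
    """
    values = 2 * num_layers * num_kv_heads * head_dim * seq_len * batch_size
    return values * bits_per_value // 8


# Hypothetical model shape (an assumption, not taken from the article).
cfg = dict(num_layers=32, num_kv_heads=8, head_dim=128)

for seq_len in (8_192, 131_072):
    for bits in (16, 4):
        size = kv_cache_bytes(seq_len=seq_len, bits_per_value=bits, **cfg)
        print(f"{seq_len:>7} tokens at {bits:>2} bits: {size / 2**30:.2f} GiB")
```

With these assumed numbers, the cache costs 128 KiB per token at 16 bits, so it grows linearly from 1 GiB at 8,192 tokens to 16 GiB at 131,072 tokens. Storing each value in fewer bits shrinks the footprint proportionally, which is why compressing the cache is an obvious lever for long-context inference.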



