938 place 0

940 Nvidia says it can shrink LLM memory 20x without changing model weights

VentureBeat
VentureBeat · 03/16/2026 20:00 EDT

Nvidia researchers have introduced a new technique that dramatically reduces how much memory large language models need to track conversation history — by as much as 20x — without modifying the model itself. The method, called KV Cache Transform Coding (KVTC), applies ideas from media compression formats like JPEG to shrink the key-value cache behind multi-turn AI systems, lowering GPU memory demands and speeding up time-to-first-token by up to 8x.For enterprise AI applications that rely on agents and long.

To see detailed statistics for the news please log in »

Read the original

Add your comment
You must be logged in with Facebook to read and write comments.

A newsletter a day!

You may get 10 most important news around midday in daily newsletter. Press the button and we will send you the most important news only, no spam attached.

or register

LIKE us on Facebook so you won't miss the most important news of the day!

News from the same source
VentureBeat VentureBeat
Silicon Valley
George Avalos @ Silicon Valley 1 place · 02/07/2106 01:28 EDT

Newark apartment complex bought for much less than prior value

An East Bay apartment complex has been bought at a price that's well below its prior value. Read more

0

🔮
27.04.2026 ♉︎ Dear Taurus! Today, the stars favor you in many areas of life, bringing harmony and... Read more ›
Silicon Valley
George Avalos @ Silicon Valley 2 place · 02/07/2106 01:28 EDT

PG&E buys San Jose building to bolster South Bay operations

A PG&E Corp. unit has bought a San Jose building in a move to bolster the utility's South Bay operations. Read more

0

Engadget
Steve Dent @ Engadget 1 place · today 07:22 EDT

Ford's Mustang Cobra Jet sets a new EV quarter mile record at 6.87 seconds

Ford Racing's Mustang Cobra Jet 2200 just ran a quarter mile in 6.87 seconds at 221 mph at an NHRA event in Charlotte, setting a new world record for an EV. The run smashed Ford's own previous EV record of 7.62 seconds, set by the Cobra Jet 1800 last September, by an impressive 0.75 seconds.   As the name suggests, Ford's Cobra Jet 2200 puts a massive 2,200 horsepower to the... Read more

0 newcommer

Business Insider
Lorraine C. Ladish @ Business Insider 1 place · today 07:17 EDT

My 87-year-old father still works, and I'm 62 with no plans to retire. We believe working gives us purpose.

My superager father is still writing and teaching. Like him, I have no retirement plans. Our jobs give us so much more than just a paycheck. Read more

0 newcommer

Tech.eu
John Reynolds @ Tech.eu 1 place · today 07:15 EDT

Spacetech investor Seraphim Space targets £350M raise

Spacetech investor Seraphim Space is looking to raise up to £350m, it said today, as it seeks to tap into the current fervour around the spacetech sector.The London-listed investment trust heralded th... Read more

0 fresh

Vox
Alex Abad-Santos @ Vox 1 place · today 07:15 EDT

The great 2028 Olympic ticket crashout, explained

Buying tickets to the 2028 Los Angeles Olympics is kind of like having a megawealthy friend talk to you about the hobbies that they enjoy.  Do you fence? Do you like cricket? Badminton’s fun, right?  Like a diabolically rich friend, the Olympics are also, at the same time, a test of financial responsibility.  How much […] Read more

0 fresh

Slashdot
EditorDavid @ Slashdot 1 place · today 07:14 EDT

America Now Has 70% More Bookstores Than in 2020, Says Bookshop.org Founder

"There are about 70% more bookstores now than there were six years ago in the United States," says Andy Hunter, the founder/CEO of Bookshop.org. Fast Company checks in on his site, which gives over 80% of its profit margin to independent bookstores, structuring itself as a B Corporation (a for-profit company certified for its social-impact) while providing an alternative to Amazon and other online booksellers: Hunter created Bookshop.org in January... Read more

0 newcommer

Eurogamer.net
Connor Makar @ Eurogamer.net 1 place · today 07:13 EDT

South Korean prime minister praises Crimson Desert and Pearl Abyss, believes it has opened "new chapter in K-content"

Crimson Desert has proven a smash hit since its launch last month, quickly shooting past four million worldwide sales roughly two weeks after its release. It has proven so fruitful that the prime minister of South Korea has heaped praise on the game in recent days. Read more Read more

0 fresh

Business Insider
Hugh Langley @ Business Insider 2 place · today 07:13 EDT

What's at stake for Meta if China kills its Manus deal

The Chinese government has ordered Meta to unwind its acquisition of Manus, potentially hurting Meta's push into building AI agents. Read more

0 newcommer

Habr
aomsk55 @ Habr 1 place · today 07:10 EDT

Интерславик. Он же  Interslavic  или Medžuslovjansky. Искусственный «Усредненный» между славянский язык общения

Почему то для большинства моих друзей и знакомых новость о том что есть общеславянский "синтетический, обобщенный" язык общения на котором можно говорить не зная конкретного языка с любым из 13 братских (ну или почто братских) народов , высказанная мной по приколу под банку чешского, стала новостью. Между тем проект действует с 2011 года. 13 языков (или может даже и 16, точно я не уверен) это соответственно 13 признанных славянских государств,... Read more

0 newcommer

Habr
vQFd4 (Ростелеком) @ Habr 2 place · today 07:10 EDT

Hooks в LLM-агентах: детерминизм, инъекция контекста и контроль над жизненным циклом

Хуки — это детерминированный код, который выполняется в строго заданных точках жизненного цикла LLM-агента: до и после tool call, на старте сессии, перед компактификацией контекста и т. д. Они превращают недетерминированного агента в систему, обязанную пройти ваши gates — lint, typecheck, secrets-scan — прежде чем что-либо записать. В статье разбираем модель жизненного цикла агента, каталог событий и matcher’ов Claude Code, контракт stdin/stdout JSON, ключевой паттерн PreToolUse gate на ESLint, вопросы... Read more

0 newcommer

Wired
Louryn Strampe @ Wired 1 place · today 07:08 EDT

Best iPhone Charger: Cable, Wireless, MagSafe, and More

Whether you’re a Screen Time champion or you’re constantly on Low Power Mode, we found an iPhone charger perfect for you. Read more

0 fresh

Habr
DimaIam (StudyAI) @ Habr 3 place · today 07:06 EDT

Древний “нейрослоп” из 70-х, о котором все забыли

Удивительно, но “нейрохудожники” появилась не вчера. Оказывается, компьютеры умели генерировать целые картины еще когда в мире бушевал глэм-рок, молодежь страны Советов находила романтику в поездке на БАМ, а жесткий диск со 100 мегабайтами считался эпохальным прорывом технической мысли… Читать далее Read more

0 fresh

Habr
Balansse @ Habr · today 07:06 EDT

Как снизить стресс в эпоху информационной перегрузки

По данным аналитической компании DSM Group, с 2019 года объем продаж антидепрессантов в России увеличился в четыре раза. Данный тревожный тренд может быть связан в том числе и с ростом уровня стресса из‑за информационной перегрузки. Кто из нас не грешит тем, что постоянно заглядывает в телефон, чтобы проверить мессенджеры и не упустить что-то важное. Постоянный поток новостей, уведомлений и сообщений держит нашу нервную систему в постоянном напряжении, увеличивая уровень тревожности.... Read more

0 fresh

Habr
Livadies @ Habr · today 07:05 EDT

Запускаем DeepSeek-V4 (1.6T) на «калькуляторе»: SVD-трансмутация, Identity Theft и гаражный MLOps

Что делать, если у вас есть 1.6-триллионная модель и видеокарта из прошлого десятилетия? Пока корпорации покупают H100 фурами, мы используем SVD-трансмутацию и архитектурный Identity Theft, чтобы запустить DeepSeek-V4 на бесплатном инстансе Kaggle. Инструкция по сборке Мутанта внутри. Читать далее Read more

0 fresh

Habr
Medox @ Habr · today 07:03 EDT

Алиса в вашем умном доме. Или Маруся. Или Салют

Универсальный шлюз для работы с разными голосовыми помощниками и разными умными домами и умными устройствами. Читать далее Read more

0 fresh

TechRadar
TechRadar 2 place · today 07:02 EDT

Exclusive: Piaggio Fast Forward is back with another Star Wars cargo robot — and this time it gets a Grogu makeover

Piaggio Fast Forward is back with a new Star Wars cargo robot, turning its gitamini into a Grogu-themed follow-me companion ahead of May the 4th and The Mandalorian and Grogu. Read more

0 fresh

The most popular news from the same source for the last week
VentureBeat VentureBeat
VentureBeat
VentureBeat · 04/21/2026 08:05 EDT

Adversaries injected malicious prompts into legitimate AI tools at more than 90 organizations in 2025, stealing credentials and cryptocurrency. Every one of those compromised tools could read data, and none of them could rewrite a firewall rule.The autonomous SOC agents shipping now can. That escalation, from compromised tools that read data to autonomous agents that rewrite infrastructure, has not been exploited in production at scale yet. But the architectural conditions... Read more

0

VentureBeat
VentureBeat · 04/21/2026 10:55 EDT

Looking at enterprise AI adoption, VentureBeat has anecdotally observed a fairly wide divergence when it comes to specific roles: For those who build—engineers and developers—the arrival of AI has been transformative, moving through the workflow with the speed of tools like Claude Code and Cursor to automate the heavy lifting of syntax and architecture. Yet, for those who sell, the "revenue stack" has remained a fragmented collection of data silos,... Read more

0

VentureBeat
VentureBeat · 04/21/2026 10:51 EDT

A security researcher, working with colleagues at Johns Hopkins University, opened a GitHub pull request, typed a malicious instruction into the PR title, and watched Anthropic’s Claude Code Security Review action post its own API key as a comment. The same prompt injection worked on Google’s Gemini CLI Action and GitHub’s Copilot Agent (Microsoft). No external infrastructure required.Aonan Guan, the researcher who discovered the vulnerability, alongside Johns Hopkins colleagues Zhengyu... Read more

0

VentureBeat
VentureBeat · 04/21/2026 12:55 EDT

Most orchestration frameworks were built for agents that run for seconds or minutes. Now that agents are running for hours — and in some cases days — those frameworks are starting to crack.Several model providers, such as Anthropic with Claude Code and OpenAI with Codex, introduced early support for long-horizon agents through multi-session tasks, subagents and background execution. However, these systems sometimes assume agents are still operating within bounded-time workflows... Read more

0

VentureBeat
VentureBeat · 04/21/2026 15:00 EDT

It's been only a few months since OpenAI released its last big improvement to AI image generations in ChatGPT and through its application programming interface (API) — namely, a new image generation model known as GPT-Image-1.5, released in December 2025, which brought about improved instruction following, colors, and lighting.Now, after weeks of testing, the company that kicked off the generative AI boom is unveiling a far more dramatic and even... Read more

0

VentureBeat
VentureBeat · 04/21/2026 15:04 EDT

Decision makers at 72% of organizations claim to have two or more AI platforms that they identify as their "primary" layer, according to a survey of 40 enterprise companies conducted by VentureBeat last month, revealing real gaps in security and control. For enterprise management and technical leaders, and especially security leaders, these multiple AI platforms extend the attack surfaces of most enterprises at a time when AI-driven attacks have become... Read more

0

VentureBeat
VentureBeat · 04/21/2026 16:07 EDT

One employee at Vercel adopted an AI tool. One employee at that AI vendor got hit with an infostealer. That combination created a walk-in path to Vercel’s production environments through an OAuth grant that nobody had reviewed.Vercel, the cloud platform behind Next.js and its millions of weekly npm downloads, confirmed on Sunday that attackers gained unauthorized access to internal systems. Mandiant was brought in. Law enforcement was notified. Investigations remain... Read more

0

VentureBeat
VentureBeat · 04/21/2026 16:43 EDT

Google on Monday unveiled the most significant upgrade to its autonomous research agent capabilities since the product's debut, launching two new agents — Deep Research and Deep Research Max — that for the first time allow developers to fuse open web data with proprietary enterprise information through a single API call, produce native charts and infographics inside research reports, and connect to arbitrary third-party data sources through the Model Context... Read more

0

VentureBeat
VentureBeat · 04/22/2026 08:00 EDT

Enterprise data stacks were built for humans running scheduled queries. As AI agents increasingly act autonomously on behalf of businesses around the clock, that architecture is breaking down — and vendors are racing to rebuild it. Google's answer, announced at Cloud Next on Wednesday, is the Agentic Data Cloud.The architecture has three pillars:Knowledge Catalog. Automates semantic metadata curation, inferring business logic from query logs without manual data steward interventionCross-cloud lakehouse.... Read more

0

VentureBeat
VentureBeat · 04/22/2026 08:00 EDT

Cirrascale Cloud Services today announced it has expanded its partnership with Google Cloud to deliver the Gemini model on-premises through Google Distributed Cloud, making it the first neocloud provider to offer Google's most advanced AI model as a fully private, disconnected appliance. The announcement, timed to coincide with Google Cloud Next 2026 in Las Vegas, addresses a stubborn problem that has plagued regulated industries since the generative AI boom began:... Read more

0

Most popular sources

  • You see 531 news out of 531.
  • Sources 61 out of 61.
VentureBeat 0%
Skift 0%
The Information 0%
Financial Times 0%
Mashable 0%
View sources »

LIKE us on Facebook so you won't miss the most important news of the day!

27.04.2026 07:31
Last update: 07:25 EDT.
News rating updated: 14:20.

What is Times42?

Times42 brings you the most popular news from tech news portals in real-time chart.
Read about us in FAQ section.


Times42 © 2026