32 place 0

448 New KV cache compaction technique cuts LLM memory 50x without accuracy loss

VentureBeat
VentureBeat · 03/06/2026 16:00 EDT

Enterprise AI applications that handle large documents or long-horizon tasks face a severe memory bottleneck. As the context grows longer, so does the KV cache, the area where the model’s working memory is stored.A new technique developed by researchers at MIT addresses this challenge with a fast compression method for the KV cache. The technique, called Attention Matching, manages to compact the context by up to 50x with very little loss in quality.While it is not the only memory compaction technique avail

To see detailed statistics for the news please log in »

Read the original

Add your comment
You must be logged in with Facebook to read and write comments.

A newsletter a day!

You may get 10 most important news around midday in daily newsletter. Press the button and we will send you the most important news only, no spam attached.

or register

LIKE us on Facebook so you won't miss the most important news of the day!

News from the same source
VentureBeat VentureBeat
Silicon Valley
George Avalos @ Silicon Valley 1 place · 02/07/2106 01:28 EDT

Newark apartment complex bought for much less than prior value

An East Bay apartment complex has been bought at a price that's well below its prior value. Read more

0

🔮
07.05.2026 ♋︎ Dear Cancer, today is a day when your heart is filled with love and romantic... Read more ›
Silicon Valley
George Avalos @ Silicon Valley 2 place · 02/07/2106 01:28 EDT

PG&E buys San Jose building to bolster South Bay operations

A PG&E Corp. unit has bought a San Jose building in a move to bolster the utility's South Bay operations. Read more

0

EU-Startups
Rahul Raj @ EU-Startups 1 place · today 06:13 EDT

Tallinn’s Skeleton Technologies announces €33 million first close of pre-IPO round ahead of planned 2027 US IPO

Skeleton Technologies, a Tallinn-based AI infrastructure and grid power systems provider, today announced the first close of a larger funding round at €33 million. This brings its total venture capital funding to €392 million in preparation for its planned initial public offering (IPO) in the United States in 2027. The new round expands Skeleton’s investor ... Read more

0 newcommer

Business Insider
Dan DeFrancesco @ Business Insider 1 place · today 06:11 EDT

Confusion over student debt comes as people are already rethinking college

Borrowers told their loans had been paid off, later learned they had actually been transferred to a third party and were being sued for the balance. Read more

0 newcommer

Digital Trends
Paulo Vargas @ Digital Trends 1 place · today 06:11 EDT

Mortal Kombat isn’t done ripping spines out yet

Ed Boon says NetherRealm is pursuing another Mortal Kombat game after Mortal Kombat 1, but the studio still hasn’t revealed a title, release window, platforms, or roster details. Read more

0 newcommer

CoinDesk
Francisco Rodrigues @ CoinDesk 1 place · today 06:08 EDT

Core Scientific sold $208 million of bitcoin in Q1 as AI pivot continues

The firm's AI pivot relies on a 590 MW contract expansion with CoreWeave, projected for $10.2 billion in revenue over 12 years. Read more

0 newcommer

Tech Wire Asia
Dashveenjit Kaur @ Tech Wire Asia 1 place · today 06:00 EDT

Malaysia wants 7% of the global advanced packaging market by 2035. Right now, it has zero.

Five local companies form consortium to build what Malaysia has never had. No funding secured, no anchor customer, and the clock is running. SEMICON SEA 2026: local players still don’t own an advanced packaging process. The number that stopped the room at MITEC on Wednesday afternoon was not the target. It was the starting point. ... Read more

0 fresh

Wired
David Gilbert @ Wired 1 place · today 06:00 EDT

There Is No Evidence the Trump Assassination Attempts Were Staged. People Still Believe They Were

Elements of the right and the left are united in the baseless belief that two attempts on Donald Trump's life were staged. Read more

0 fresh

Habr
strannik96 @ Habr 1 place · today 06:00 EDT

Душат ли на самом деле бизнес: мнение бывшего сотрудника ФНС

Сейчас многие говорят о том, что бизнес находится в упадке. Собираемость падает, а возросший административный контроль и налоговая нагрузка буквально лишают предпринимателей воздуха. И как говорят, если так пойдет и дальше, то в скором времени нас ждет массовая волна закрытий малого и среднего бизнеса. Со службы в ФНС я ушел совсем недавно. Во мне, можно сказать, еще теплится «дух» налогового инспектора, да и профессиональное мышление вряд ли изменится. В налоговой... Read more

0 newcommer

Business Insider
Alice Tecotzky @ Business Insider 2 place · today 06:00 EDT

Investment banks are back on top as private equity loses its grip on Wall Street's biggest paydays

Investment and commercial bankers are projected to see a 10% bump in bonuses compared to last year, compensation consultant Johnson Associates says. Read more

0 fresh

Vox
Christian Paz @ Vox 1 place · today 06:00 EDT

The next redistricting war will be even harder for Democrats

Just as the redistricting wars were coming to a close, the Supreme Court blew up the entire landscape with a decision that all but gutted the Voting Rights Act.  And since that decision last week, Republicans around the country have been moving quickly to see how they can take advantage of the new redistricting rules. […] Read more

0 fresh

Tom's Hardware
Tom's Hardware 2 place · today 06:00 EDT

Startup successfully tests 3D-printed rocket fuel that could enable lighter missiles and faster production rates — new additive manufacturing process tested at 1,800 PSI

Chromatic 3D Materials has successfully tested 3D-printed rocket propellant capable of withstanding 1,800 PSI combustion pressures, potentially paving the way for faster rocket production, more advanced thrust geometries, and resilient distributed defense manufacturing. Read more

0 fresh

UK Tech News
Russell Brown @ UK Tech News 1 place · today 06:00 EDT

Artificial intelligence is often positioned as the biggest opportunity for business, that is true, but it is also changing the nature of cyber risk far faster than most organisations are prepared for. The conversation needs to catch up. This is no longer just about productivity or efficiency. It is about the speed at which systems ... Read more

0 fresh

Ubergizmo
Paulo Montenegro @ Ubergizmo 1 place · today 05:59 EDT

Qualcomm Debuts Snapdragon 6 And 4 Gen 5: Faster Performance And Better Battery

Qualcomm has officially announced its latest mobile platforms, the Snapdragon 6 Gen 5 and Snapdragon 4 Gen 5. These new chipsets are designed to enhance mid-range and entry-level smartphones by prioritizing battery efficiency, smoother user interfaces, and robust connectivity. A key introduction across both platforms is the Snapdragon Smooth Motion interface. This technology is specifically engineered to improve device interactions, reducing screen stutters and increasing the speed at which applications... Read more

0 fresh

The most popular news from the same source for the last week
VentureBeat VentureBeat
VentureBeat
VentureBeat · 04/30/2026 07:00 EDT

Netomi, the San Francisco-based startup building AI systems for enterprise customer service, said Thursday that it has raised $110 million in new funding in a round led by Accenture Ventures, with participation from Adobe Ventures, WndrCo, Silver Lake Waterman, NAVER Ventures, Metis Strategy and Fin Capital. Jeffrey Katzenberg, managing partner of WndrCo and co-founder of DreamWorks, has joined the company's board. The round builds on early backing from a roster... Read more

0

VentureBeat
VentureBeat · 04/30/2026 12:00 EDT

Writer, the enterprise AI agent platform backed by Salesforce Ventures, Adobe Ventures, and Insight Partners, today launched event-based triggers for its Writer Agent platform, enabling AI agents to autonomously detect business signals across Gmail, Gong, Google Calendar, Google Drive, Microsoft SharePoint, and Slack — and execute complex multi-step workflows without any human initiating the process.The release, which also includes a new Adobe Experience Manager connector and a suite of enhanced... Read more

0

VentureBeat
VentureBeat · 04/30/2026 12:30 EDT

On March 30, BeyondTrust proved that a crafted GitHub branch name could steal Codex’s OAuth token in cleartext. OpenAI classified it Critical P1. Two days later, Anthropic’s Claude Code source code spilled onto the public npm registry, and within hours, Adversa found Claude Code silently ignored its own deny rules once a command exceeded 50 subcommands. These were not isolated bugs. They were the latest in a nine-month run: six... Read more

0

VentureBeat
VentureBeat · 04/30/2026 13:11 EDT

AI is more than a technology — it's magic.Don't believe me? Why, then, is one of the leading companies in the space, OpenAI, publishing entire official, corporate blog posts about goblins?To understand, we first have to go back to earlier this week, on Monday, April 27, 2026, when a developer under the handle @arb8020 on the social network X posted a snippet from the OpenAI open source Codex GitHub repository,... Read more

0

VentureBeat
VentureBeat · 04/30/2026 14:31 EDT

Runpod, the high-performance cloud computing and GPU platform designed specifically for AI development, today launched a new open source, MIT licensed, enterprise-friendly Python programming tool called Runpod Flash — and it is poised to make creation, iteration and deployment of AI systems inside and outside of foundation model labs much faster. The tool aims to eliminate some of the biggest barriers and hurdles to training and using AI models today,... Read more

0

VentureBeat
VentureBeat · 04/30/2026 16:51 EDT

One of the key challenges of building effective AI agents is teaching them to choose between using external tools or relying on their internal knowledge. But large language models are often trained to blindly invoke tools, which causes latency bottlenecks, unnecessary API costs, and degraded reasoning caused by environmental noise. To overcome this challenge, researchers at Alibaba introduced Hierarchical Decoupled Policy Optimization (HDPO), a reinforcement learning framework that trains agents... Read more

0

VentureBeat
VentureBeat · 05/01/2026 09:03 EDT

Presented by TeamViewerEnterprise technology failures are largely invisible. Research from TeamViewer, based on a global survey of 4,200 managers and employees, finds that the majority of digital dysfunction never reaches the IT help desk. Employees work around slow applications, failed logins, and intermittent glitches rather than reporting them, leaving organizations without an accurate picture of how their technology is performing. The cumulative cost is significant: employees lose an average of... Read more

0

VentureBeat
VentureBeat · 05/01/2026 13:49 EDT

While Elon Musk faces off against his former colleague and OpenAI co-founder Sam Altman in court, Musk's rival firm xAI, founded to take on OpenAI, isn't slowing down on launching competitive new products and services.Last night, xAI shipped a new, proprietary base large language model (LLM), Grok 4.3, and a new voice cloning suite on the web. The new products arrive after months of tumult from xAI that saw all... Read more

0

VentureBeat
VentureBeat 3 place · 05/01/2026 14:01 EDT

The scaffolding layer that developers once needed to ship LLM applications — indexing layers, query engines, retrieval pipelines, carefully orchestrated agent loops — is collapsing. And according to Jerry Liu, co-founder and CEO of LlamaIndex, that's not a problem. It's the point.“As a result, there's less of a need for frameworks to actually help users compose these deterministic workflows in a light and shallow manner,” Jerry Liu, co-founder and CEO... Read more

0

VentureBeat
VentureBeat 2 place · 05/01/2026 16:35 EDT

Anthropic created the Model Context Protocol as the open standard for AI agent-to-tool communication. OpenAI adopted it in March 2025. Google DeepMind followed. Anthropic donated MCP to the Linux Foundation in December 2025. Downloads crossed 150 million. Then four researchers at OX Security found an architectural problem that affects all of them.MCP's STDIO transport, the default for connecting an AI agent to a local tool, executes any operating system command... Read more

0

Most popular sources

  • You see 883 news out of 886.
  • Sources 61 out of 61.
AlleyWatch 0%
VentureBeat 0%
StartupNation 0%
ReadWrite 0%
Droid Life 0%
View sources »

LIKE us on Facebook so you won't miss the most important news of the day!

07.05.2026 06:29
Last update: 06:20 EDT.
News rating updated: 13:20.

What is Times42?

Times42 brings you the most popular news from tech news portals in real-time chart.
Read about us in FAQ section.


Times42 © 2026