716 place 0

841 IndexCache, a new sparse attention optimizer, delivers 1.82x faster inference on long-context AI models

VentureBeat
VentureBeat 1 place · 03/26/2026 20:00 EDT

Processing 200,000 tokens through a large language model is expensive and slow: the longer the context, the faster the costs spiral. Researchers at Tsinghua University and Z.ai have built a technique called IndexCache that cuts up to 75% of the redundant computation in sparse attention models, delivering up to 1.82x faster time-to-first-token and 1.48x faster generation throughput at that context length. The technique applies to models using the DeepSeek Sparse Attention architecture, including the latest De


News from the same source
VentureBeat
Silicon Valley
George Avalos @ Silicon Valley 1 place · 02/07/2106 01:28 EDT

Newark apartment complex bought for much less than prior value

An East Bay apartment complex has been bought at a price that's well below its prior value. Read more

0

Silicon Valley
George Avalos @ Silicon Valley 2 place · 02/07/2106 01:28 EDT

PG&E buys San Jose building to bolster South Bay operations

A PG&E Corp. unit has bought a San Jose building in a move to bolster the utility's South Bay operations. Read more

0

Inc42 Media
Akshit Pushkarna @ Inc42 Media 1 place · today 08:23 EDT

PhysicsWallah Q4: Loss Declines 76% YoY To ₹69 Cr, Revenue Up 51%

Edtech major PhysicsWallah narrowed its consolidated net loss for the March quarter (Q4 FY26) by 76% to ₹69.1 Cr from… Read more

0 newcomer

Tech.eu
Tamara Djurickovic @ Tech.eu 1 place · today 08:22 EDT

Mykor lands £4M to scale waste-based construction materials

Mykor, a UK biotechnology company developing low-carbon construction materials from industrial and agricultural waste, has secured £4 million in funding to accelerate the scale-up of its industrial biofa... Read more

0 newcomer

Inc42 Media
Akshit Pushkarna @ Inc42 Media 2 place · today 08:20 EDT

Zappfresh FY26 PAT Surges 59% YoY To ₹14 Cr

Online meat delivery company Zappfresh saw its consolidated net profit for FY26 surge 59% to ₹14.3 Cr from ₹9 Cr… Read more

0 newcomer

Habr
zeron0de (LANSOFT) @ Habr 1 place · today 08:18 EDT

Hot and cold: how to gauge the temperature of your business with the BPMSoft heat map

No matter how much you optimize business processes, there is always some bottleneck left that can stall all the work. Sound familiar? The worst part is that this "clog" is sometimes very hard to find. Update 1.9 of the BPMSoft platform, which we covered here, introduced a heat map of business processes (BP). It is a visual analytics tool that lets you evaluate how efficiently BPs are executed using color coding: from "cold" to "hot". With it you can analyze the entire branch ... Read more

0 newcomer

Business Insider
Jodie Hughes @ Business Insider 1 place · today 08:14 EDT

I took a solo trip to Japan in my 20s. My experience traveling there alone as an English-speaking woman astonished me.

As an English-speaking woman in my 20s, my solo trip to Japan was surprising. The language barrier wasn't an issue and I felt so safe traveling alone. Read more

0 newcomer

Startups News
Daniel Levi @ Startups News 2 place · today 08:12 EDT

Pace raises $46M from Sequoia and Thrive to bring AI agents to the insurance industry

Insurance has long been one of the biggest targets for AI automation. The industry still runs on mountains of paperwork, manual data entry, phone calls, policy reviews, and claims processing that can take days or weeks to complete. Investors are ... Read more

0 fresh

Habr
RChotchaev (YADRO) @ Habr 2 place · today 08:11 EDT

"How much control do you have over what your product is made of?" How and why to conduct Open Source Analysis

Hi! My name is Ruslan, and I am an engineer in the security process development department at YADRO. Today we will talk about open source code. In modern software development it is used in practically every application: open source libraries, frameworks, and components help speed up development and make it much more convenient. But there is a problem: every dependency brings not only perks but also additional risks. If a vulnerability appears in the open source you use, you will have to urgently... Read more

0 newcomer

Habr
GRADDATA (VK Tech) @ Habr 3 place · today 08:09 EDT

[Translation] Disaggregated LLM inference in Kubernetes: prefill, decode, and pod scheduling

As large language model (LLM) inference workloads grow more complex, a single monolithic serving process hits its limits. Prefill and decode have fundamentally different compute profiles, yet traditional deployments force them to run on the same hardware. As a result, GPUs are underutilized and scaling is inflexible. Disaggregated inference solves this problem by splitting the pipeline into separate stages: prefill, decode, and routing. Each stage runs as an independent ... Read more

0 fresh

Business Insider
Dan DeFrancesco @ Business Insider 2 place · today 08:07 EDT

Small-business owners could look to Wolfgang Puck for help with their biggest problem

We spoke to world-famous chef Wolfgang Puck about his son's growing role in his restaurant empire as he thinks through succession planning. Read more

0 fresh

TechRadar
TechRadar 1 place · today 08:05 EDT

Smeg's iconic drip coffee maker just got a makeover to make your breakfast routine 'feel calmer and more intentional' — and I think I'm in love

Moonlight is a creamy color with a matte finish, which Smeg says "transforms everyday routines, from morning coffee to slower moments in the evening." Read more

0 fresh

The Verge
Jess Weatherbed @ The Verge 1 place · today 08:05 EDT

Xreal’s budget AR glasses feature anti-shake tech and swappable frames

Augmented reality wearables provider Xreal has launched a new "X By Xreal" (XBX) subbrand, with its first customizable, lightweight smart glasses coming to the US in July. The new a01 AR glasses will be available starting at $299, featuring a "highly stable anti-shake mode" and interchangeable front frames. While the a01 lacks the degrees-of-freedom (DoF) […] Read more

0 fresh

The Verge
Dominic Preston @ The Verge 2 place · today 08:04 EDT

Redmagic’s liquid-cooled gaming phone arrives with overclocked Snapdragon chip

Nubia has announced the international launch of the Redmagic 11S Pro, its new flagship Android gaming phone. It's not a significant change from the 11 Pro, which launched internationally last November, but has been upgraded to the overclocked Snapdragon 8 Elite Gen 5 Leading Version. Otherwise things look similar. There's a large 7,500mAh battery, fast […] Read more

0 fresh

CNET
Mike Sorrentino @ CNET 1 place · today 08:01 EDT

RedMagic 11S Pro Shows Off Liquid Cooling on Every Model, but With a Price Bump

The slightly revamped gaming phone gets a $100 price bump over the RedMagic 11 Pro from last fall. Read more

0 newcomer

The Verge
Hayden Field @ The Verge 3 place · today 08:00 EDT

The Pope isn’t AGI-pilled

On Monday, Pope Leo XIV unveiled an encyclical letter addressing the societal implications of artificial intelligence. The letter, titled Magnifica Humanitas, warned that the "use of AI is never a purely technical matter: when it enters processes that affect people's lives, it touches on rights, opportunities, status and freedom." Alongside him was Anthropic cofounder and […] Read more

0 fresh

The Verge
Allison Johnson @ The Verge · today 08:00 EDT

The new Razr Ultra isn’t your average phone — for better and worse

I had one ask for friends, colleagues, the lady checking me in for a meeting at a large software company's headquarters, and everyone else who stopped to admire the phone I've been carrying around. "Pet it." The Razr Ultra is not your average phone. I got the orient blue color option to test, which has […] Read more

0 fresh

The most popular news from the same source for the last week
VentureBeat
VentureBeat
VentureBeat · 05/20/2026 10:12 EDT

The creators of NanoClaw — the hit open source, enterprise-friendly variant of autonomous AI agent harness OpenClaw — are moving towards commercializing their technology for enterprises at scale, aiming to provide them with secure AI agents, and an ever-updating library of workplace context, for each human employee the enterprise has approved. The duo, including former Wix.com engineer Gavriel Cohen and his brother Lazer Cohen, also founder of tech public relations firm... Read more

0

VentureBeat
VentureBeat · 05/20/2026 13:21 EDT

GitHub confirmed on May 20 that a poisoned VS Code extension installed on an employee’s device gave attackers access to roughly 3,800 internal repositories at the Microsoft-owned code storage and authorship platform. The threat group TeamPCP, formally tracked by Google Threat Intelligence Group as UNC6780, claimed responsibility and is advertising the stolen repositories for sale starting at $50,000. GitHub’s assessment: the attacker’s claim is “directionally consistent” with the investigation so... Read more

0

VentureBeat
VentureBeat · 05/20/2026 14:43 EDT

RAG architectures are good at one thing: surfacing semantically relevant documents. That's also where they stop. A framework called a decision context graph addresses that gap by giving agents structured memory, time-aware reasoning, and explicit decision logic. Rippletide, a startup in the Neo4j ecosystem, has built one. The key capability: agents that are non-regressive, able to freeze validated sequences of actions and compound on them over time. “The key point you want... Read more

0

VentureBeat
VentureBeat · 05/20/2026 15:59 EDT

Less than a week after completing the largest tech IPO of 2026, Cerebras Systems is making its most aggressive play yet to dominate the fast-growing AI inference market. On Monday, the Sunnyvale-based chipmaker announced that it is now running Kimi K2.6 — a trillion-parameter open-weight model developed by Beijing-based Moonshot AI — for enterprise customers at nearly 1,000 tokens per second, a speed no GPU-based provider has come close to... Read more

0

VentureBeat
VentureBeat · 05/20/2026 17:16 EDT

Canadian AI lab Cohere made waves recently by announcing a merger with German AI startup Aleph Alpha, but now it has even more in store for enterprise builders around the globe: today, the firm co-founded by former Googler and "Attention Is All You Need" co-author Aidan Gomez unveiled Command A+, a highly optimized, 218-billion-parameter language model engineered specifically for complex reasoning, multimodal document processing, and agentic workflows. The most significant aspect... Read more

0

VentureBeat
VentureBeat · 05/20/2026 18:26 EDT

At Google I/O, the company unveiled Managed Agents in its Gemini API — a service that promises to collapse weeks of agent deployment work into a single API call. It's also a sign that Google believes its ecosystem, including the newly launched Antigravity CLI, is ready to own the execution layer end-to-end. Before a single agent is written, teams are already spending days on the unglamorous work: standing up execution environments,... Read more

0

VentureBeat
VentureBeat · 05/21/2026 08:48 EDT

Presented by Design.com. Generative AI has made design radically more accessible. A founder can now create a logo, launch a website, build social campaigns, generate presentations, and produce marketing collateral in a single afternoon — work that once required agencies, freelancers, or internal creative teams. But as design generation becomes easier, maintaining a recognizable identity becomes harder. The problem is no longer whether businesses can create content. It’s whether all of that content... Read more

0

VentureBeat
VentureBeat · 05/21/2026 09:00 EDT

Resolve AI, the production-operations startup backed by Greylock and Lightspeed Venture Partners, today announced a sweeping expansion of its platform that introduces always-on background agents, a redesigned investigation architecture, and a shared workspace where engineers and AI agents collaborate in real time on live incidents. The centerpiece of the release is a new multi-agent investigation system developed by Resolve AI's in-house research lab. Instead of deploying a single AI agent to... Read more

0

VentureBeat
VentureBeat · 05/21/2026 09:00 EDT

Kore.ai on Wednesday launched what amounts to a ground-up reinvention of its core technology: the Artemis edition of its Agent Platform, a system designed to let enterprises build, govern, and optimize AI agents using AI itself — compressing what has traditionally been months of engineering work into days. The platform arrives at a moment when every major technology vendor — from Microsoft and Salesforce to Google and ServiceNow — is racing... Read more

0

VentureBeat
VentureBeat · 05/21/2026 12:30 EDT

Presented by Veriff. Americans can’t reliably distinguish real from AI-generated content, and that’s not just a media literacy problem; it’s a direct threat to how businesses verify identity online. New research finds that while many people are aware of deepfakes, their ability to distinguish them from reality is barely better than a coin flip. A 2026 survey conducted by Veriff and Kantar among 3,000 respondents in the United States, the United... Read more

0



27.05.2026 08:39
Last update: 08:31 EDT.
News rating updated: 15:32.

What is Times42?

Times42 brings you the most popular news from tech news portals in real-time chart.
Read about us in FAQ section.


Times42 © 2026