41 place 0

969 Apple Researchers Challenge AI Reasoning Claims With Controlled Puzzle Tests

Slashdot
msmash @ Slashdot · 06/09/2025 10:00 EDT

Apple Researchers Challenge AI Reasoning Claims With Controlled Puzzle Tests

Apple researchers have found that state-of-the-art "reasoning" AI models like OpenAI's o3-mini, Gemini (with thinking mode-enabled), Claude 3.7, DeepSeek-R1 face complete performance collapse [PDF] beyond certain complexity thresholds when tested on controllable puzzle environments. The finding raises questions about the true reasoning capabilities of large language models.

The study, which examined models using Tower of Hanoi, checker jumping, river crossing, and blocks world puzzles rather than standard

To see detailed statistics for the news please log in »

Read the original

Add your comment
You must be logged in with Facebook to read and write comments.

A newsletter a day!

You may get 10 most important news around midday in daily newsletter. Press the button and we will send you the most important news only, no spam attached.

or register

LIKE us on Facebook so you won't miss the most important news of the day!

News from the same source
Slashdot Slashdot
Vox
Noel King @ Vox 1 place · today 07:00 EDT

Trump killed affirmative action. His base might not like what comes next.

President Donald Trump’s administration is scrutinizing higher education. Last week, the White House issued a memorandum requiring all universities receiving federal funds to submit admissions data on all applicants to the Department of Education. The goal is to enforce the 2023 Supreme Court decision that ended race-based affirmative action.  Days before the memo was released, […] Read more ›

1,131 fresh

🔮
17.08.2025 ♎︎ Horoscope for Libra Today Dear Libras! Today promises you an overall positive mood, especially in... Read more ›
Tom's Hardware
Tom's Hardware 1 place · today 08:30 EDT

Nvidia's midrange GPUs through the years revisited — pitting the RTX 5070 versus the 4070, 3070 and 2070 in an all-encompassing gaming showdown

From the RTX 2070 to the 5070, Nvidia’s midrange GPUs show impressive leaps: 1080p ray tracing climbs from 56 to 130 FPS, while 4K performance grows from 36 to 107 FPS. The 4070 strikes the best balance of speed and efficiency, while the 5070 delivers raw power—though at a higher cost. Read more ›

724 fresh

Android Authority
Dhruv Bhutani @ Android Authority 1 place · today 08:00 EDT

5 reasons I swapped Google Keep for this open-source app

This self-hosted, open-source app offers speed, markdown support and full data ownership. What's not to like? Read more ›

530 fresh

CNET
Jesse Orrall @ CNET 1 place · today 08:00 EDT

Watch Figure 02 Humanoid Fold Laundry in New AI Demo

Can I compete with the Figure's humanoid robot at household chores like folding laundry? Let's see who's faster and better. Read more ›

514 fresh

Wired
Sophie Charara @ Wired 1 place · today 08:00 EDT

Pebblebee Is Getting Serious About Personal Safety Tracking

The Bluetooth tracker maker is adding free and paid SOS features to its products, including emergency contact alerts, silent alarms, and real-time location sharing. Read more ›

454 fresh

The Verge
Jay Peters @ The Verge 1 place · today 08:30 EDT

Why the former editor of Polygon is making a podcast for old gamers

In a recent episode of Post Games, host Chris Plante explores how video games can help players understand death. He's interviewing Kaitlin Tremblay, who is working on Ambrosia Sky, a game about death. "What is it about games that is so useful for exploring the topic?" Plante asks. "I think there's something really lovely about […] Read more ›

449 fresh

Tom's Hardware
Tom's Hardware 2 place · today 08:15 EDT

Is Wi-Fi bad for the environment? Eye-catching London ad suggests our digital habits are ‘damaging the climate’

The University of East London just put out an ad that seemingly says that Wi-Fi is bad for the environment, but it just wants you to look closer. Read more ›

397 fresh

Business Insider
Sarah Jackson @ Business Insider 1 place · today 05:31 EDT

3 tips for employees to manage their wellbeing in a challenging job environment, from EY's Chief Wellbeing Officer

EY's Chief Wellbeing Officer shared 3 tips for employees to take charge of their well-being in a tough job environment. Read more ›

395 fresh

CNET
Corin Cesaric @ CNET 2 place · today 07:50 EDT

AI Data Centers Are Coming for Your Land, Water and Power

Think of them as AI factories, churning out your responses from ChatGPT, Gemini, Claude and all the other generative AI tools. The costs are staggering. Read more ›

387 fresh

The Verge
Jay Peters @ The Verge 2 place · today 08:00 EDT

Teenage Engineering did it again

Hi, friends! Welcome to Installer No. 94, your guide to the best and Verge-iest stuff in the world. (If you're new here, welcome, did you hydrate today, and also you can read all the old editions at the Installer homepage.) This week, I'm visiting LinkedIn way too much because of Mini Sudoku, looking at the […] Read more ›

297 fresh

Business Insider
Polly Thompson @ Business Insider 2 place · today 05:11 EDT

Consulting is changing. Here are 4 unlikely ways the Big Four are reinventing themselves to seem less 'stodgy.'

A job at the Big Four doesn't always mean consulting and accounting. Here are some of the unexpected projects they are developing. Read more ›

234 fresh

The Verge
Allison Johnson @ The Verge 3 place · today 08:00 EDT

The one feature that keeps me from recommending flip phones

How it started I carry a lot of different phones around, and I rarely get questions about them because most people stopped talking about which phone they own around 2017. I could be using an unreleased iPhone 18 Pro Max Air Ultra to pay for my coffee and nobody would raise an eyebrow (present company […] Read more ›

223 fresh

TechRadar
TechRadar 1 place · today 08:04 EDT

These scientists have a unique way of tackling video deepfakes - and all it takes is a burst of light

Cornell scientists developed noise-coded illumination, embedding invisible watermarks in light patterns to detect video tampering across varied conditions. Read more ›

214 fresh

Business Insider
James Faris @ Business Insider 3 place · today 05:15 EDT

Disney adults say judgment from their own community hurts more than jeers from the haters

A new best-selling book on Disney adults says enthusiasts get lots of judgment from their own community, based on their perceived passion level. Read more ›

213 fresh

Business Insider
Joey Hadden @ Business Insider · today 07:26 EDT

I went to Italy for the first time and left with 5 big regrets

I spent six days exploring three cities in Italy — Venice, Rome, and Milan. It was my first time visiting the country, and I left with some regrets. Read more ›

203 fresh

Business Insider
Erika Ebsworth-Goold @ Business Insider · today 07:17 EDT

My son brought home his college girlfriend for a week. We had to set ground rules, but yes, they slept in the same room.

I was nervous when my son wanted to bring home his college girlfriend this summer. But we treated them like adults, giving them the space they needed. Read more ›

188 fresh

Business Insider
Pete Syme @ Business Insider · today 04:00 EDT

See inside a JetBlue Airbus A220, the versatile jet now running its longest route yet for the airline

The Airbus A220 is ideal for serving less popular routes, but with more seats and range than a regional jet. JetBlue's planes are modern and spacious. Read more ›

179 fresh

SlashGear
SlashGear 1 place · today 07:45 EDT

What Would Happen If China Sinks A US Aircraft Carrier?

Foreign nations rarely attack aircraft carriers unprovoked, but it's always a possibility officials have to keep in mind. Here's what would likely happen. Read more ›

175 fresh

CNET
Kevin Lynch @ CNET 3 place · today 06:00 EDT

Premier League Soccer: Stream Chelsea vs. Crystal Palace Live From Anywhere

London derby sees the new FIFA Club World Cup winners host Oliver Glasner's FA Cup holders. Read more ›

157 fresh

The most popular news from the same source for the last week
Slashdot Slashdot
Slashdot
BeauHD @ Slashdot · 08/13/2025 23:30 EDT

First Antidote For Carbon Monoxide Poisoning 'Cleans' Blood In Minutes

An anonymous reader New Atlas: An engineered protein that acts like a molecular sponge has the potential to change how carbon monoxide poisoning is treated, chasing down CO molecules in the bloodstream and helping the body flush them out in just minutes, without the risk of short- or long-term health issues that come with the current frontline treatment, pure oxygen. Researchers at the University of Maryland School of Medicine (UMSOM)... Read more ›

90

Slashdot
msmash @ Slashdot · 08/15/2025 11:20 EDT

China is About To Launch SSDs So Small You Insert Them Like a SIM Card

A Chinese storage manufacturer has developed a solid-state drive smaller than a U.S. penny that delivers sequential read speeds of 3,700 megabytes per second, according to The Verge. The "Mini SSD" by Biwin measures 15mm x 17mm x 1.4mm thick and connects via PCIe 4x2, offering 512GB to 2TB capacities. The drive inserts into devices using a SIM card-style tray mechanism and claims IP68 water resistance plus three-meter drop protection.... Read more ›

81

Slashdot
BeauHD @ Slashdot · 08/11/2025 23:30 EDT

LLMs' 'Simulated Reasoning' Abilities Are a 'Brittle Mirage,' Researchers Find

An anonymous reader quotes a report from Ars Technica: In recent months, the AI industry has started moving toward so-called simulated reasoning models that use a "chain of thought" process to work through tricky problems in multiple logical steps. At the same time, recent research has cast doubt on whether those models have even a basic understanding of general logical concepts or an accurate grasp of their own "thought process."... Read more ›

80

Slashdot
BeauHD @ Slashdot · 08/13/2025 19:30 EDT

Kodak Warns It May Go Out of Business

After over 130 years in business, Kodak has warned it may not survive. From a report: The Rochester, New York-based Eastman Kodak Co. offered a bleak picture of its financials in earnings reports and filings, tracking a second quarter loss and sending shares tumbling in early trading Tuesday, Aug. 12. The iconic brand said in Monday, Aug. 11 government filings that there is "substantial doubt" about the company's ability to... Read more ›

77

Slashdot
msmash @ Slashdot · 08/12/2025 18:41 EDT

Google and IBM Believe First Workable Quantum Computer is in Sight

IBM and Google report they will build industrial-scale quantum computers containing one million or more qubits by 2030, following IBM's June publication of a quantum computer blueprint addressing previous design gaps and Google's late-2023 breakthrough in scaling error correction. Current experimental systems contain fewer than 200 qubits. IBM encountered crosstalk interference when scaling its Condor chip to 433 qubits and subsequently adopted low-density parity-check code requiring 90% fewer qubits than Read more ›

70

Slashdot
BeauHD @ Slashdot · 08/14/2025 19:20 EDT

Impoverished Streaming Services Are Driving Viewers Back to Piracy

Rising subscription costs, shrinking content libraries, and regional restrictions are pushing viewers back toward piracy. Once seen as nearly dead, piracy has resurged through illicit streaming platforms as the fractured, ad-laden streaming market struggles to deliver convenience and value. The Guardian reports: According to London-based piracy monitoring and content-protection firm MUSO, unlicensed streaming is the predominant source of TV and film piracy, accounting for 96% in 2023 (PDF). Piracy reached... Read more ›

69

Slashdot
BeauHD @ Slashdot · 08/16/2025 03:00 EDT

Arctic Glaciers Face 'Terminal' Decline As Microbes Accelerate Ice Melt

Scientists in Svalbard warn Arctic glaciers are in "terminal" decline, with microbe-driven biological darkening accelerating ice melt and potentially triggering major climate feedback loops. The Guardian reports: Recent research implicates snow and ice-dwelling microbes in positive feedback loops that can accelerate melting. With more than 70% of the planet's freshwater stored in ice and snow -- and billions of lives sustained by glacier-fed rivers -- this has profound implications everywhere.... Read more ›

66

Slashdot
msmash @ Slashdot · 08/15/2025 14:00 EDT

Proton Begins Shifting Infrastructure Outside of Switzerland Ahead of Surveillance Legislation

Proton has begun relocating infrastructure outside Switzerland ahead of proposed surveillance legislation requiring VPNs and messaging services with over 5,000 users to identify customers and retain data for six months. The company's AI chatbot Lumo became the first product hosted on German servers rather than Swiss infrastructure. CEO Andy Yen confirmed the decision and a spokesperson told TechRadar that the company isn't fully exiting Switzerland. In a blog post about... Read more ›

64

Slashdot
EditorDavid @ Slashdot · 08/10/2025 14:48 EDT

As Electric Bills Rise, Evidence Mounts That U.S. Data Centers Share Blame

"Amid rising electric bills, states are under pressure to insulate regular household and business ratepayers from the costs of feeding Big Tech's energy-hungry data centers..." reports the Associated Press. "Some critics question whether states have the spine to take a hard line against tech behemoths like Microsoft, Google, Amazon and Meta." [T]he Data Center Coalition, which represents Big Tech firms and data center developers, has said its members are committed... Read more ›

55

Slashdot
EditorDavid @ Slashdot · 08/11/2025 07:34 EDT

It's Steve Wozniak's 75th Birthday.  Whatever Happened to His YouTube Lawsuit?

In 2020 a YouTube video used video footage of Steve Wozniak in a scam to steal bitcoin. "Some people said they lost their life savings," Wozniak tells CBS News, explaining why he sued YouTube in 2020 — and where his case stands now: Wozniak's lawsuit against YouTube has been tied up in court now for five years, stalled by federal legislation known as Section 230. Attorney Brian Danitz said, "Section... Read more ›

53

Most popular sources

  • You see 286 news out of 286.
  • Sources 61 out of 61.
Business Insider 19% 49
Gizmodo 10% 5
CNET 9% 7
Tom's Hardware 9% 7
Eurogamer.net 8% 8
View sources »

LIKE us on Facebook so you won't miss the most important news of the day!

17.08.2025 09:58
Last update: 09:50 EDT.
News rating updated: 16:50.

What is Times42?

Times42 brings you the most popular news from tech news portals in real-time chart.
Read about us in FAQ section.


Times42 © 2025