misk@sopuli.xyz to Technology@lemmy.worldEnglish · 25 days agoApple study exposes deep cracks in LLMs’ “reasoning” capabilitiesarstechnica.comexternal-linkmessage-square103fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkApple study exposes deep cracks in LLMs’ “reasoning” capabilitiesarstechnica.commisk@sopuli.xyz to Technology@lemmy.worldEnglish · 25 days agomessage-square103fedilink
minus-squaremisk@sopuli.xyzOPlinkfedilinkEnglisharrow-up0·24 days agoGiven the use cases they were benchmarking I would be very surprised if they were any better.
Given the use cases they were benchmarking I would be very surprised if they were any better.