misk to Technology@lemmy.worldEnglish · 1 year agoApple study exposes deep cracks in LLMs’ “reasoning” capabilitiesarstechnica.comexternal-linkmessage-square106linkfedilinkarrow-up1508arrow-down120cross-posted to: apple_enthusiast@lemmy.worldarstechnica_index@rss.ponder.cat
arrow-up1488arrow-down1external-linkApple study exposes deep cracks in LLMs’ “reasoning” capabilitiesarstechnica.commisk to Technology@lemmy.worldEnglish · 1 year agomessage-square106linkfedilinkcross-posted to: apple_enthusiast@lemmy.worldarstechnica_index@rss.ponder.cat
minus-squaremiskOPlinkfedilinkEnglisharrow-up11·1 year agoGiven the use cases they were benchmarking I would be very surprised if they were any better.
Given the use cases they were benchmarking I would be very surprised if they were any better.