sanitation@lemmy.today to Technology@lemmy.worldEnglish · 18 hours agoNvidia’s CEO Jensen Huang says electricians and plumbers will be needed by the hundreds of thousands in the new working worldfortune.comexternal-linkmessage-square62linkfedilinkarrow-up1165arrow-down119
arrow-up1146arrow-down1external-linkNvidia’s CEO Jensen Huang says electricians and plumbers will be needed by the hundreds of thousands in the new working worldfortune.comsanitation@lemmy.today to Technology@lemmy.worldEnglish · 18 hours agomessage-square62linkfedilink
minus-squareboonhetlinkfedilinkEnglisharrow-up3arrow-down1·7 hours agoThe carwash thing applies to low end models and older models. Here’s Claude from lowest to highest model, ignoring the banned Fable
minus-squarereplicat@lemmy.worldlinkfedilinkEnglisharrow-up3·6 hours agoThey altered the training data to address this challenge. The underlying issue wasn’t solved in any way. Don’t be naive.
minus-squareboonhetlinkfedilinkEnglisharrow-up1·4 hours agoTakes months to train a model, there were already models that got it right when the question was popular, as long as thinking was enabled. Also if they were optimising for this question, why not update their lower end model (Haiku) as well? The interesting question would be what percent of humans get it wrong. Smaller than LLMs for sure, but I somehow doubt it’s 0.
The carwash thing applies to low end models and older models. Here’s Claude from lowest to highest model, ignoring the banned Fable
They altered the training data to address this challenge. The underlying issue wasn’t solved in any way. Don’t be naive.
Takes months to train a model, there were already models that got it right when the question was popular, as long as thinking was enabled.
Also if they were optimising for this question, why not update their lower end model (Haiku) as well?
The interesting question would be what percent of humans get it wrong. Smaller than LLMs for sure, but I somehow doubt it’s 0.