Binette@lemmy.ml to Programmer Humor@lemmy.ml • ChatGPT apparently got rewarded for using its built-in calculator during training, and so it would covertly open its calculator, add 1+1, and do nothing with the result, on 5% of all user queries · 9 days ago
Kinda why I like reinforcement learning. You end up with silly stuff like this.
Binette@lemmy.ml to Fediverse@lemmy.world • r/Silksong joins lemmy! (And a new lemmy instance) · 22 days ago
This is amazing news!