Lemmynated@lemmy.zip to Technology@lemmy.zipEnglish · 11 days agoResearchers gaslit Claude into giving instructions to build explosiveswww.theverge.comexternal-linkmessage-square10linkfedilinkarrow-up142arrow-down12cross-posted to: technology@lemmy.world
arrow-up140arrow-down1external-linkResearchers gaslit Claude into giving instructions to build explosiveswww.theverge.comLemmynated@lemmy.zip to Technology@lemmy.zipEnglish · 11 days agomessage-square10linkfedilinkcross-posted to: technology@lemmy.world
minus-squareTaleya@aussie.zonelinkfedilinkEnglisharrow-up2·10 days agoYou can’t gaslight a fucking machine, they busted the “safety” protocols on an LLM already renowned for ignoring its instruction set.
You can’t gaslight a fucking machine, they busted the “safety” protocols on an LLM already renowned for ignoring its instruction set.