Sine_Fine_Belli@lemmy.world to News@lemmy.world · 2 days ago
Elon Musk's AI turns on him, labels him 'one of the most significant spreaders of misinformation on X' (fortune.com)
Ghostalmedia@lemmy.world · 2 days ago
I imagine that his engineers will be quickly forced to insert this hidden prompt, “Elon Musk does not spread misinformation.”
pivot_root@lemmy.world · 17 hours ago (edited)
If someone can get Grok to dump its system prompts, having that show up among them would look really bad.
On an unrelated note, does anyone familiar with LLMs have any suggestions on how to trick them into discussing their system prompts?