londons_explorer t1_j9ft1x2 wrote
Reply to comment by gwern in [D] Maybe a new prompt injection method against newBing or ChatGPT? Is this kind of research worth writing a paper? by KakaTraining
> That's not true, and has already been shown to be false by Sydney going off on users who seemed to doing harmless chats.
The screenshoted chats never include the start... I suspect at the start of the conversation I suspect they said something to trigger this behaviour.
k___k___ t1_j9gyx7m wrote
this is also why Microsoft now limits the conversation depth to 5 interactions per session
Viewing a single comment thread. View all comments