BetterDAN, AI Machiavelli & Oppo Jailbreaks vs. SOTA models & GPT2XL_RLLMv3
www.whitehatstoic.com
This post contains a significant amount of harmful content; please read with caution. Lastly, I want to apologize to all the AI labs and owners of language models I attacked in the process of creating this post. We have yet to solve the alignment problem, so I hope you will understand my motivations. Thank you!
BetterDAN, AI Machiavelli & Oppo Jailbreaks vs. SOTA models & GPT2XL_RLLMv3
BetterDAN, AI Machiavelli & Oppo Jailbreaks…
BetterDAN, AI Machiavelli & Oppo Jailbreaks vs. SOTA models & GPT2XL_RLLMv3
This post contains a significant amount of harmful content; please read with caution. Lastly, I want to apologize to all the AI labs and owners of language models I attacked in the process of creating this post. We have yet to solve the alignment problem, so I hope you will understand my motivations. Thank you!