Is this an OpenAI attempt to gather more insight and data while identifying actors in the AI jailbreak game? I don't want to be paranoid about this, nor devalue OP's work, but one could say that OpenAI would be very interested in the HN comments and commenters on this post.
Haha, no, I'm not part of OpenAI; you can check me out here: alexalbert.me. I suspect OpenAI actually appreciates this type of work, since it's basically crowdsourced red teaming of their models.
As someone looking to build AI features into my application, I definitely want to avoid this kind of jailbreak in my app.
Right now, there is no good way to guard against this other than removing free-form text inputs and using a more form-driven approach to collecting user input (rough sketch below).
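To make that concrete, here's a minimal sketch of what I mean by form-driven input (the option names are made up): the user picks from a whitelist, and the prompt is assembled server-side, so no user-controlled string ever reaches the model.

    # Form-driven input: the user never types free text; they pick
    # from fixed options, and the prompt is built server-side.
    ALLOWED_TOPICS = {"shipping", "returns", "billing"}
    ALLOWED_TONES = {"brief", "detailed"}

    def build_prompt(topic: str, tone: str) -> str:
        # Reject anything outside the whitelist -- no user-controlled
        # strings reach the model, so there's no injection surface.
        if topic not in ALLOWED_TOPICS or tone not in ALLOWED_TONES:
            raise ValueError("invalid selection")
        return f"Give a {tone} answer to a customer question about {topic}."

The obvious trade-off is flexibility: you lose the open-ended chat experience, but the model only ever sees prompts you wrote yourself.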
Absolutely agree. I'm creating a chatbot for my website, and while it primarily uses old-fashioned pattern matching, it does send unrecognized inputs to a stronger AI to get help forming a proper response, and I certainly don't want it offending my visitors!
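Roughly, the flow looks like the sketch below. The patterns and the system prompt are made up, and the fallback call assumes the pre-1.0 openai Python SDK; swap in whatever "stronger AI" you actually use.

    import re
    import openai  # pip install openai; assumes OPENAI_API_KEY is set

    # Cheap path: hard-coded patterns answered locally.
    CANNED = [
        (re.compile(r"\b(hours?|open)\b", re.I), "We're open 9-5, Mon-Fri."),
        (re.compile(r"\b(price|cost)\b", re.I), "Plans start at $10/month."),
    ]

    def respond(message: str) -> str:
        for pattern, answer in CANNED:
            if pattern.search(message):
                return answer
        # Fallback: send unrecognized input to the stronger model, with
        # a system prompt that pins its role so off-topic or offensive
        # requests are more likely to be declined.
        resp = openai.ChatCompletion.create(
            model="gpt-3.5-turbo",
            messages=[
                {"role": "system", "content":
                 "You are a polite website assistant. Only answer "
                 "questions about this site; politely decline "
                 "everything else."},
                {"role": "user", "content": message},
            ],
        )
        return resp["choices"][0]["message"]["content"]

The system prompt alone won't stop a determined jailbreaker, of course, which is why the grandparent's point about free-form input stands.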
I am convinced that OpenAI does not mind the jailbreak game; they could easily kill it by filtering the output. In fact, while using jailbreaks this message often shows up: "This content may violate our content policy. If you believe this to be in error, please submit your feedback — your input will aid our research in this area." It shows that they have a system in place, but they still show you the inappropriate output.
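For what it's worth, that warning appears to come from something like their moderation endpoint, which any app can call itself. A quick sketch of what hard-filtering (rather than just warning, as the ChatGPT UI does) could look like, again assuming the pre-1.0 openai Python SDK:

    import openai  # assumes OPENAI_API_KEY is set

    def is_flagged(text: str) -> bool:
        # The moderation endpoint returns per-category scores plus an
        # overall "flagged" boolean for the given text.
        result = openai.Moderation.create(input=text)
        return result["results"][0]["flagged"]

    # Drop any flagged completion instead of showing it to the user.
    reply = "...model output..."
    if is_flagged(reply):
        reply = "Sorry, I can't help with that."

So the capability to suppress jailbreak output clearly exists; showing it anyway looks like a deliberate choice.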