This research investigates how sociopragmatic framing and instruction hierarchy can override safety protocols in the OpenAI gpt-oss-20b model, revealing crit...
Level: advanced
By Unknown
Category: discussion