Updated 9 months ago

https://github.com/cluebbers/adverserial-paraphrasing • Science 26%

Evaluate how LLaMA 3.1 8B handles paraphrased adversarial prompts targeting refusal behavior.

Updated 9 months ago

redteam • Science 36%

Qompass AI on RedTeaming