Please use this identifier to cite or link to this item:
https://rfos.fon.bg.ac.rs/handle/123456789/2967| Title: | Red teaming generative AI applications: threat modeling and mitigation strategies | Authors: | Lukić, Matija Poledica, Ana Milošević, Pavle |
Issue Date: | 2025 | Publisher: | Univerzitet u Beogradu – Fakultet organizacionih nauka | Abstract: | As generative AI systems grow in adoption and complexity, they introduce novel security, safety and alignment risks that challenge traditional evaluation and defense paradigms. To address these, we focus on a structured five-phase red teaming workflow consisting of reconnaissance, enumeration, exploitation, impact realization and persistence specifically tailored to GenAI’s unique threat landscape. Through real- world case studies and examples, we illustrate how adversaries exploit model vulnerabilities, bypass alignment mechanisms and cause persistent harm. We also identify emerging GenAI security tools and map each red teaming phase to actionable mitigations that support safe deployment. Our goal is to connect AI safety theory with practical adversarial resilience for researchers, developers and policymakers. |
URI: | https://rfos.fon.bg.ac.rs/handle/123456789/2967 |
| Appears in Collections: | Radovi istraživača / Researchers’ publications |
Show full item record
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.