Please use this identifier to cite or link to this item: https://rfos.fon.bg.ac.rs/handle/123456789/2967
Title: Red teaming generative AI applications: threat modeling and mitigation strategies
Authors: Lukić, Matija
Poledica, Ana 
Milošević, Pavle 
Issue Date: 2025
Publisher: Univerzitet u Beogradu – Fakultet organizacionih nauka
Abstract: As generative AI systems grow in adoption and complexity, they introduce novel security, safety
and alignment risks that challenge traditional evaluation and defense paradigms. To address these, we focus
on a structured five-phase red teaming workflow consisting of reconnaissance, enumeration, exploitation,
impact realization and persistence specifically tailored to GenAI’s unique threat landscape. Through real-
world case studies and examples, we illustrate how adversaries exploit model vulnerabilities, bypass
alignment mechanisms and cause persistent harm. We also identify emerging GenAI security tools and map
each red teaming phase to actionable mitigations that support safe deployment. Our goal is to connect AI
safety theory with practical adversarial resilience for researchers, developers and policymakers.
URI: https://rfos.fon.bg.ac.rs/handle/123456789/2967
Appears in Collections:Radovi istraživača / Researchers’ publications

Show full item record

Page view(s)

24
checked on Dec 14, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.