Please use this identifier to cite or link to this item: https://rfos.fon.bg.ac.rs/handle/123456789/2959
Title: Evaluating Generative Models for Synthetic Tabular Data: A Comparative Analysis of Fidelity, Diversity, and Generalization
Authors: Mahovac, Zoran
Petrović, Andrija
Radovanović, Sandro 
Delibašić, Boris 
Keywords: Generative models;synthetic data;Fidelity;Diversity;Generalization;tabular data generation
Issue Date: 2025
Publisher: Univerzitet u Beogradu – Fakultet organizacionih nauka
Abstract: Tabular data, such as relational tables, Web tables and CSV files, is among the most primitive and essential forms of data in machine learning, characterized by excellent structural properties, readability, and interpretability. However, acquiring substantial amounts of high-quality tabular data for ML model training remains a persistent challenge. This study evaluates the performance of six generative models — TVAE, RTVAE, CTGAN, ADSGAN, BNN, and Marginal Distributions on synthetic data generation. The evaluation is based on three key metrics: Fidelity, Diversity, and Generalization. Fidelity measures the quality of synthetic data, Diversity assesses how well the samples cover the variability of the real dataset, and Generalization quantifies the risk of overfitting. The research applies these metrics to four datasets: Abalone, Acute Inflammation, Census Income, and Pittsburgh Bridges. Results show that CTGAN consistently outperforms other models measured by IPα and IRβ metrics, while RTVAE excels in the Census Income dataset in terms of Generalization. Marginal Distributions stands out in preserving data authenticity. This study offers a refined method of evaluating generative models, emphasizing precision–recall analysis grounded in minimum volume sets, thus providing a deeper understanding of model performance across multiple dimensions.
URI: https://rfos.fon.bg.ac.rs/handle/123456789/2959
ISBN: 978-86-7680-484-9
Appears in Collections:Radovi istraživača / Researchers’ publications

Files in This Item:
File Description SizeFormat 
133_mahovac_et_al_icdsst_2025.pdf1.38 MBAdobe PDFView/Open
Show full item record

Page view(s)

18
checked on Dec 14, 2025

Download(s)

38
checked on Dec 14, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.