Please use this identifier to cite or link to this item:
https://rfos.fon.bg.ac.rs/handle/123456789/2959| Title: | Evaluating Generative Models for Synthetic Tabular Data: A Comparative Analysis of Fidelity, Diversity, and Generalization | Authors: | Mahovac, Zoran Petrović, Andrija Radovanović, Sandro Delibašić, Boris |
Keywords: | Generative models;synthetic data;Fidelity;Diversity;Generalization;tabular data generation | Issue Date: | 2025 | Publisher: | Univerzitet u Beogradu – Fakultet organizacionih nauka | Abstract: | Tabular data, such as relational tables, Web tables and CSV files, is among the most primitive and essential forms of data in machine learning, characterized by excellent structural properties, readability, and interpretability. However, acquiring substantial amounts of high-quality tabular data for ML model training remains a persistent challenge. This study evaluates the performance of six generative models — TVAE, RTVAE, CTGAN, ADSGAN, BNN, and Marginal Distributions on synthetic data generation. The evaluation is based on three key metrics: Fidelity, Diversity, and Generalization. Fidelity measures the quality of synthetic data, Diversity assesses how well the samples cover the variability of the real dataset, and Generalization quantifies the risk of overfitting. The research applies these metrics to four datasets: Abalone, Acute Inflammation, Census Income, and Pittsburgh Bridges. Results show that CTGAN consistently outperforms other models measured by IPα and IRβ metrics, while RTVAE excels in the Census Income dataset in terms of Generalization. Marginal Distributions stands out in preserving data authenticity. This study offers a refined method of evaluating generative models, emphasizing precision–recall analysis grounded in minimum volume sets, thus providing a deeper understanding of model performance across multiple dimensions. | URI: | https://rfos.fon.bg.ac.rs/handle/123456789/2959 | ISBN: | 978-86-7680-484-9 |
| Appears in Collections: | Radovi istraživača / Researchers’ publications |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| 133_mahovac_et_al_icdsst_2025.pdf | 1.38 MB | Adobe PDF | View/Open |
Page view(s)
18
checked on Dec 14, 2025
Download(s)
38
checked on Dec 14, 2025
Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.