Synthetic data generation for tabular data
-
Updated
Feb 28, 2025 - Python
Synthetic data generation for tabular data
Synthetic Data Generation for mixed-type, multivariate time series.
A Regression approach for the automated detection of the Parkinson's Disease based on an Ensemble of Neural Networks.
A processing of a simplified video game dataset using k-nearest neighbors, and using Synthetic Data Vault and Regression/Classification models to verify synthetic model accuracy
Industrial Practicum Project
Mini Project about synthetic data generation by implementing CTGAN algorithm on tabular data
In this repository, I tried to investigate the utility of synthetic data generated by DataSynthesizer and Synthetic Data Vault in machine learning tasks. I applied the Random Forest, Logistic Regression, Support Vector Machine, K-Nearest Neighbor, and Naive Bayes algorithms to the synthetic data and made a comparison.
Add a description, image, and links to the sdv topic page so that developers can more easily learn about it.
To associate your repository with the sdv topic, visit your repo's landing page and select "manage topics."