2024 (English). In: ICT systems security and privacy protection: 39th IFIP international conference, SEC 2024, Edinburgh, UK, June 12–14, 2024, proceedings / [ed] Nikolaos Pitropakis; Sokratis Katsikas; Steven Furnell; Konstantinos Markantonakis, Cham: Springer, 2024, p. 134-147. Chapter in book (Refereed)
Abstract [en]
Machine learning has shown remarkable performance in modeling large datasets with complex patterns. As the amount of data grows, so often does the dimensionality of the feature space. Such data may contain confidential information that must be safeguarded against disclosure. One way to make the data accessible is anonymization. An alternative is to use synthetic data that mimics the behavior of the original data. Generative adversarial networks (GANs) are a prominent approach for generating synthetic samples that faithfully replicate the distributional characteristics of the original data. In scenarios involving high-dimensional data, preserving the geometric properties, structural integrity, and relative positioning of data points is paramount, as neglecting such information may compromise utility. This research investigates the manifold properties of synthetically generated data and introduces a novel framework for producing privacy-preserving synthetic data while upholding the manifold structure of the original data. While existing studies predominantly focus on privacy preservation within GANs, the critical aspect of preserving the manifold structure of data remains unaddressed. Our approach addresses both privacy concerns and manifold structure preservation, distinguishing it from prior research. The framework is compared against baseline models using metrics such as Maximum Mean Discrepancy (MMD), Fréchet Inception Distance (FID), and F1-score. Additionally, the privacy risk posed by the models is evaluated through data reconstruction attacks. Results demonstrate that the proposed framework is less vulnerable to privacy breaches while more effectively preserving the intrinsic structure of the data.
Place, publisher, year, edition, pages
Cham: Springer, 2024
Series
IFIP Advances in Information and Communication Technology, ISSN 1868-4238, E-ISSN 1868-422X ; 710
Keywords
Generative Adversarial Network, k-Anonymity, Manifold Learning, Synthetic Data
National Category
Computational Mathematics; Computer Sciences
Identifiers
urn:nbn:se:umu:diva-228477 (URN); 10.1007/978-3-031-65175-5_10 (DOI); 2-s2.0-85200774719 (Scopus ID); 9783031651748 (ISBN); 9783031651779 (ISBN); 9783031651755 (ISBN)
Conference
39th IFIP International Conference on ICT Systems Security and Privacy Protection, SEC 2024
Note
Revised papers from the 39th IFIP International Conference on ICT Systems Security and Privacy Protection, SEC 2024, Edinburgh, UK, June 12-14, 2024.
2024-08-15; 2024-08-15; 2024-08-15. Bibliographically approved