Synthcity is an open-source software package for innovative use cases of synthetic data in ML fairness, privacy and augmentation across diverse tabular data modalities, including static data, regular and irregular time series, data with censoring, multi-source data, composite data, and more. Synthcity provides the practitioners with a single access point to cutting edge research and tools in synthetic data. It also offers the community a playground for rapid experimentation and prototyping, a one-stop-shop for SOTA benchmarks, and an opportunity for extending research impact. The library can be accessed on GitHub (https://github.com/vanderschaarlab/synthcity) and pip (https://pypi.org/project/synthcity/). We warmly invite the community to join the development effort by providing feedback, reporting bugs, and contributing code.
翻译:合成城市是一个开放源码软件包,用于以多种表格数据模式,包括静态数据、定期和不定期的时间序列、带有审查数据的数据、多来源数据、综合数据及其他数据,在ML公平性、隐私和扩增方面的合成数据创新使用案例;合成城市为从业者提供了一个进入尖端研究和合成数据工具的单一接入点;它还为社区提供了一个快速实验和原型操场,一个SOTA基准一站式服务站,以及扩大研究影响的机会;图书馆可访问GitHub(https://github.com/vanderchaarlab/synthcity)和Pip(https://pip.org/project/synthcity/)。我们热情地邀请社区通过提供反馈、报告错误和贡献代码加入发展努力。