Name	Name	Last commit message	Last commit date
Latest commit fabclmnt chore: update github actions and docs to remove support for python 3.… Jan 30, 2025 abc6198 · Jan 30, 2025 History 146 Commits
.github	.github	chore: update github actions and docs to remove support for python 3.…	Jan 30, 2025
docs	docs	chore: update github actions and docs to remove support for python 3.…	Jan 30, 2025
examples	examples	docs: fix examples for time-series synthetic data generation (#137 )	Nov 18, 2024
src/ydata	src/ydata	feat: update multitable interface & datasources information (#136 )	Jan 29, 2025
tests	tests	feat: first version of the SDK (#3 )	Feb 28, 2023
.editorconfig	.editorconfig	fix(synthesizer): conform with recent changes to the metadata (#66 )	Sep 7, 2023
.flake8	.flake8	feat: first version of the SDK (#3 )	Feb 28, 2023
.gitignore	.gitignore	fix(packaging): include sutbs in compiled wheel (#29 )	Mar 13, 2023
.pre-commit-config.yaml	.pre-commit-config.yaml	chore(deps): update linting dependencies (#141 )	Jan 20, 2025
.python-version	.python-version	chore: wrap up project	Jan 19, 2023
.releaserc.json	.releaserc.json	feat: first version of the SDK (#3 )	Feb 28, 2023
CHANGELOG.md	CHANGELOG.md	chore: wrap up project	Jan 19, 2023
CODE_OF_CONDUCT.md	CODE_OF_CONDUCT.md	chore(setup): add python package basic structure (#1 )	Feb 2, 2023
LICENSE	LICENSE	chore: wrap up project	Jan 19, 2023
Makefile	Makefile	chore(actions): fix rename for wheel and upload files (#142 )	Jan 20, 2025
README.md	README.md	chore: update github actions and docs to remove support for python 3.…	Jan 30, 2025
environment.yml	environment.yml	fix: rename to ydata-fabric-sdk (#140 )	Jan 20, 2025
mkdocs.yml	mkdocs.yml	fix: rename to ydata-fabric-sdk (#140 )	Jan 20, 2025
pyproject.toml	pyproject.toml	chore: update github actions and docs to remove support for python 3.…	Jan 30, 2025
renovate.json	renovate.json	chore: add renovatebot	Mar 9, 2023

Name

Last commit message

Last commit date

fabclmnt

chore: update github actions and docs to remove support for python 3.…

Jan 30, 2025

abc6198 · Jan 30, 2025

146 Commits

.github

chore: update github actions and docs to remove support for python 3.…

Jan 30, 2025

docs

chore: update github actions and docs to remove support for python 3.…

Jan 30, 2025

examples

docs: fix examples for time-series synthetic data generation (#137 )

Nov 18, 2024

src/ydata

feat: update multitable interface & datasources information (#136 )

Jan 29, 2025

tests

feat: first version of the SDK (#3 )

Feb 28, 2023

.editorconfig

fix(synthesizer): conform with recent changes to the metadata (#66 )

Sep 7, 2023

.flake8

feat: first version of the SDK (#3 )

Feb 28, 2023

.gitignore

fix(packaging): include sutbs in compiled wheel (#29 )

Mar 13, 2023

.pre-commit-config.yaml

chore(deps): update linting dependencies (#141 )

Jan 20, 2025

.python-version

chore: wrap up project

Jan 19, 2023

.releaserc.json

feat: first version of the SDK (#3 )

Feb 28, 2023

CHANGELOG.md

chore: wrap up project

Jan 19, 2023

CODE_OF_CONDUCT.md

chore(setup): add python package basic structure (#1 )

Feb 2, 2023

LICENSE

chore: wrap up project

Jan 19, 2023

Makefile

chore(actions): fix rename for wheel and upload files (#142 )

Jan 20, 2025

README.md

chore: update github actions and docs to remove support for python 3.…

Jan 30, 2025

environment.yml

fix: rename to ydata-fabric-sdk (#140 )

Jan 20, 2025

mkdocs.yml

fix: rename to ydata-fabric-sdk (#140 )

Jan 20, 2025

pyproject.toml

chore: update github actions and docs to remove support for python 3.…

Jan 30, 2025

renovate.json

chore: add renovatebot

Mar 9, 2023

YData Fabric SDK

🚀 YData Fabric SDK 🎉 Fabric's platform capabilities at the distance of a Python command!

ydata-fabric-sdk is here! Create a YData Fabric account so you can start using today!

YData Fabric SDK empowers developers with easy access to state-of-the-art data quality tools and generative AI capabilities. Stay tuned for more updates and new features!

Documentation | More on YData

Overview

The Fabric SDK is an ecosystem of methods that allows users to, through a python interface, adopt a Data-Centric approach towards the AI development. The solution includes a set of integrated components for data ingestion, standardized data quality evaluation and data improvement, such as synthetic data generation, allowing an iterative improvement of the datasets used in high-impact business applications.

Synthetic data can be used as Machine Learning performance enhancer, to augment or mitigate the presence of bias in real data. Furthermore, it can be used as a Privacy Enhancing Technology, to enable data-sharing initiatives or even to fuel testing environments.

Under the Fabric SDK hood, you can find a set of algorithms and metrics based on statistics and deep learning based techniques, that will help you to accelerate your data preparation.

What you can expect:

Fabric SDK is composed by the following main modules:

Datasources
- Fabric’s SDK includes several connectors for easy integration with existing data sources. It supports several storage types, like filesystems and RDBMS. Check the list of connectors.
- Fabric SDK’s Datasources run on top of Dask, which allows it to deal with not only small workloads but also larger volumes of data.
Synthesizers
- Simplified interface to train a generative model and learn in a data-driven manner the behavior, the patterns and original data distribution. Optimize your model for privacy or utility use-cases.
- From a trained synthesizer, you can generate synthetic samples as needed and parametrise the number of records needed.
Synthetic data quality report Coming soon
- An extensive synthetic data quality report that measures 3 dimensions: privacy, utility and fidelity of the generated data. The report can be downloaded in PDF format for ease of sharing and compliance purposes or as a JSON to enable the integration in data flows.
Profiling Coming soon
- A set of metrics and algorithms summarizes datasets quality in three main dimensions: warnings, univariate analysis and a multivariate perspective.

Supported data formats

Tabular The RegularSynthesizer is perfect to synthesize high-dimensional data, that is time-independent with high quality results.
Time-Series The TimeSeriesSynthesizer is perfect to synthesize both regularly and not evenly spaced time-series, from smart-sensors to stock.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

YData Fabric SDK

Overview

What you can expect:

Supported data formats

About

Releases 125

Packages

Contributors 10

Languages

License

ydataai/ydata-fabric-sdk

Folders and files

Latest commit

History

Repository files navigation

YData Fabric SDK

Overview

What you can expect:

Supported data formats

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 125

Packages 0

Contributors 10

Languages

Packages