Skip to content
@ydataai

YData

Accelerating AI with improved data

banner_ydata

YData.ai Medium LinkedIn Twitter Youtube Data-Centric AI Discord YData Profiling YData Synthetic YData Academy

Welcome to YData

Our mission is to help data science teams access and understand their data assets, and produce quality data to sucessfully deploy machine learning models.

We're the creators of YData Fabric, the first data-centric platform for data quality. We're also strong advocates of open source software and we're actively developing ydata-profiling, ydata-synthetic, and ydata-quality, three open source projects focused on producing high-quality data for machine learning applications.

You can stay up to date with the latest developments on our News or follow our Medium blog for hands-on tutorials on our open source packages.

We have a growing community of data scientists on our Discord Server, where we discuss emergent topics on Data Profiling, Data Labeling, and Synthetic Data. Join us to share feedback and discuss feature requests!

You can also find all about our montly events and data initiatives on our newsletter or reach us at [email protected].

footer_ydata

Pinned Loading

  1. ydata-profiling ydata-profiling Public

    1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

    Python 12.6k 1.7k

  2. ydata-sdk ydata-sdk Public

    Public SDK to interact with the platform, either public or private

    Python 16 4

  3. ydata-synthetic ydata-synthetic Public

    Synthetic data generators for tabular and time-series data

    Jupyter Notebook 1.5k 238

  4. academy academy Public

    Tutorials for YData's Fabric platform

    Jupyter Notebook 32 7

  5. ydata-talkdatatome ydata-talkdatatome Public

    Make your dataset talk to you. The AI assistant for data preparation.

    Python 9 1

  6. sd-metrics sd-metrics Public

    A repository that collects different metrics evaluate the quality of synthetic data under the scope data democratization. The metrics evaluate the quality of the synthetic data under the following …

    2

Repositories

Showing 10 of 71 repositories
  • go-core Public

    Core and shared code for our go projects

    ydataai/go-core’s past year of commit activity
    Go 4 MIT 0 1 6 Updated Dec 12, 2024
  • sketch-dask-extension Public

    Extension to support Sketch working with Dask Dataframes

    ydataai/sketch-dask-extension’s past year of commit activity
    Python 0 MIT 0 1 11 Updated Dec 11, 2024
  • ydata-sdk Public

    Public SDK to interact with the platform, either public or private

    ydataai/ydata-sdk’s past year of commit activity
    Python 16 MIT 4 1 11 Updated Dec 10, 2024
  • ydata-synthetic Public

    Synthetic data generators for tabular and time-series data

    ydataai/ydata-synthetic’s past year of commit activity
    Jupyter Notebook 1,457 MIT 238 42 11 Updated Dec 10, 2024
  • ydata-profiling Public

    1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

    ydataai/ydata-profiling’s past year of commit activity
    Python 12,591 MIT 1,690 236 (39 issues need help) 23 Updated Dec 10, 2024
  • python-core Public

    Core functionality for all python packages at YData

    ydataai/python-core’s past year of commit activity
    Python 0 MIT 0 1 8 Updated Dec 9, 2024
  • ydata-quality Public

    Data Quality assessment with one line of code

    ydataai/ydata-quality’s past year of commit activity
    Jupyter Notebook 429 MIT 55 19 (6 issues need help) 15 Updated Dec 9, 2024
  • create-tag Public
    ydataai/create-tag’s past year of commit activity
    JavaScript 0 MIT 0 1 9 Updated Dec 9, 2024
  • authentication-service Public

    Handles authentication using OIDC flow

    ydataai/authentication-service’s past year of commit activity
    Go 2 MIT 0 1 9 Updated Dec 3, 2024
  • ydataai/update-notion-page’s past year of commit activity
    JavaScript 2 MIT 0 1 7 Updated Dec 2, 2024