Skip to content

grooviter/tablesaw

Repository files navigation

Tablesaw

Apache 2.0 main Maven Central

Overview

Tablesaw is a dataframe and visualization library that supports loading, cleaning, transforming, filtering, and summarizing data.

Features

Data processing & transformation

  • Import and export data: RDBMS, Excel, CSV, TSV, JSON, HTML, Fixed Width text files.
  • Combine tables by appending or joining.
  • Add and remove columns or rows.
  • Sort, Group, Filter, Edit, Transpose, etc.
  • Map/Reduce operations.
  • Handle missing values.

Statistics

Descriptive stats such as mean, min, max, median, sum, product, standard deviation, variance, percentiles, geometric mean, skewness, kurtosis, etc.

Visualization

Tablesaw supports data visualization by providing a wrapper for the Plot.ly JavaScript plotting library. Here are a few examples of the new library in action.

chart chart chart chart
chart chart chart chart
chart chart chart chart
chart chart chart

Getting started

Add tablesaw-core to your maven project. You can find the version number for the latest release in the release notes page:

<dependency>
    <groupId>com.github.grooviter</groupId>
    <artifactId>tablesaw-core</artifactId>
    <version>VERSION_NUMBER_GOES_HERE</version>
</dependency>

Or as a gradle dependency:

implementation "com.github.grooviter:tablesaw-core:VERSION_NUMBER_GOES_HERE"

You may also add supporting projects:

  • tablesaw-beakerx - for using Tablesaw inside BeakerX
  • tablesaw-excel - for using Excel workbooks
  • tablesaw-html - for using HTML
  • tablesaw-json - for using JSON
  • tablesaw-jsplot - for creating charts

External supporting projects - outside of this organization:

Documentation and support

Integrations

Jupyter Notebooks

Other integrations

Credits

This repository was born as a fork of the original Tablesaw project.

License

This project is licensed under the Apache 2.0