diff --git a/DESCRIPTION.rst b/DESCRIPTION.rst index 34767b1b2..1daa6f410 100644 --- a/DESCRIPTION.rst +++ b/DESCRIPTION.rst @@ -30,6 +30,7 @@ Feature Visualization - **Parallel Coordinates**: horizontal visualization of instances - **Radial Visualization**: separation of instances around a circular plot - **PCA Projection**: projection of instances based on principal components +- **Manifold Visualization**: high dimensional visualization with manifold learning - **Feature Importances**: rank features based on their in-model performance - **Recursive Feature Elimination**: find the best subset of features by importance - **Scatter and Joint Plots**: direct data visualization with feature selection @@ -67,7 +68,7 @@ Text Visualization ~~~~~~~~~~~~~~~~~~ - **Term Frequency**: visualize the frequency distribution of terms in the corpus -- **TSNE**: use stochastic neighbor embedding to project documents. +- **t-SNE Corpus Visualization**: use stochastic neighbor embedding to project documents. ... and more! Visualizers are being added all the time; be sure to check the examples_ (or even the develop_ branch) and feel free to contribute your ideas for new Visualizers! diff --git a/README.md b/README.md index 206607ab4..bcc298c41 100644 --- a/README.md +++ b/README.md @@ -29,10 +29,11 @@ Visualizers are estimators (objects that learn from data) whose primary objectiv #### Feature Visualization +- **Rank Features**: single or pairwise ranking of features to detect relationships - **Parallel Coordinates**: horizontal visualization of instances +- **Radial Visualization**: separation of instances around a circular plot - **PCA Projection**: projection of instances based on principal components -- **RadViz**: separation of instances around a circular plot -- **Rank Features**: single or pairwise ranking of features to detect relationships +- **Manifold Visualization**: high dimensional visualization with manifold learning - **Feature Importances**: rank features based on their in-model performance - **Recursive Feature Elimination**: find the best subset of features by importance - **Scatter and Joint Plots**: direct data visualization with feature selection @@ -42,15 +43,15 @@ Visualizers are estimators (objects that learn from data) whose primary objectiv - **Class Balance**: see how the distribution of classes affects the model - **Class Prediction Error**: shows error and support in classification - **Classification Report**: visual representation of precision, recall, and F1 -- **Confusion Matrices**: visual description of class decision making - **ROC/AUC Curves**: receiver operator characteristics and area under the curve +- **Confusion Matrices**: visual description of class decision making - **Discrimination Threshold**: find a threshold that best separates binary classes #### Regression Visualization -- **Alpha Selection**: show how the choice of alpha influences regularization - **Prediction Error Plots**: find model breakdowns along the domain of the target - **Residuals Plot**: show the difference in residuals of training and test data +- **Alpha Selection**: show how the choice of alpha influences regularization #### Clustering Visualization @@ -59,13 +60,13 @@ Visualizers are estimators (objects that learn from data) whose primary objectiv #### Model Selection Visualization -- **Validation Curve**: tune a model with respect to a single hyperparameter -- **Learning Curve**: show if a model might benefit from more data or less complexity +- **Validation Curve**: tune a model with respect to a single hyperparameter +- **Learning Curve**: show if a model might benefit from more data or less complexity #### Text Visualization - **Term Frequency**: visualize the frequency distribution of terms in the corpus -- **TSNE**: use stochastic neighbor embedding to project documents. +- **t-SNE Corpus Visualization**: use stochastic neighbor embedding to project documents. And more! Visualizers are being added all the time, so be sure to check the examples (or even the develop branch) and feel free to contribute your ideas for Visualizers! diff --git a/tests/test_text/test_freqdist.py b/tests/test_text/test_freqdist.py index d7ecc84c1..a7364f670 100644 --- a/tests/test_text/test_freqdist.py +++ b/tests/test_text/test_freqdist.py @@ -1,13 +1,14 @@ # tests.test_text.test_freqdist # Tests for the frequency distribution visualization # -# Author: Rebecca Bilbro +# Author: Rebecca Bilbro +# Github: @rebeccabilbro # Created: 2017-03-22 15:27 # -# Copyright (C) 2017 District Data Labs +# Copyright (C) 2018 # For license information, see LICENSE.txt # -# ID: test_freqdist.py [bd9cbb9] rebecca.bilbro@bytecubed.com $ +# ID: test_freqdist.py [bd9cbb9] rbilbro@districtdatalabs.com $ """ Tests for the frequency distribution text visualization