Skip to content

Commit

Permalink
Merge branch 'develop' of github.com:DistrictDataLabs/yellowbrick int…
Browse files Browse the repository at this point in the history
…o release-0.7
  • Loading branch information
bbengfort committed May 18, 2018
2 parents 96abe31 + 39402b2 commit 55b2485
Show file tree
Hide file tree
Showing 3 changed files with 14 additions and 11 deletions.
3 changes: 2 additions & 1 deletion DESCRIPTION.rst
Original file line number Diff line number Diff line change
Expand Up @@ -30,6 +30,7 @@ Feature Visualization
- **Parallel Coordinates**: horizontal visualization of instances
- **Radial Visualization**: separation of instances around a circular plot
- **PCA Projection**: projection of instances based on principal components
- **Manifold Visualization**: high dimensional visualization with manifold learning
- **Feature Importances**: rank features based on their in-model performance
- **Recursive Feature Elimination**: find the best subset of features by importance
- **Scatter and Joint Plots**: direct data visualization with feature selection
Expand Down Expand Up @@ -67,7 +68,7 @@ Text Visualization
~~~~~~~~~~~~~~~~~~

- **Term Frequency**: visualize the frequency distribution of terms in the corpus
- **TSNE**: use stochastic neighbor embedding to project documents.
- **t-SNE Corpus Visualization**: use stochastic neighbor embedding to project documents.

... and more! Visualizers are being added all the time; be sure to check the examples_ (or even the develop_ branch) and feel free to contribute your ideas for new Visualizers!

Expand Down
15 changes: 8 additions & 7 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -29,10 +29,11 @@ Visualizers are estimators (objects that learn from data) whose primary objectiv

#### Feature Visualization

- **Rank Features**: single or pairwise ranking of features to detect relationships
- **Parallel Coordinates**: horizontal visualization of instances
- **Radial Visualization**: separation of instances around a circular plot
- **PCA Projection**: projection of instances based on principal components
- **RadViz**: separation of instances around a circular plot
- **Rank Features**: single or pairwise ranking of features to detect relationships
- **Manifold Visualization**: high dimensional visualization with manifold learning
- **Feature Importances**: rank features based on their in-model performance
- **Recursive Feature Elimination**: find the best subset of features by importance
- **Scatter and Joint Plots**: direct data visualization with feature selection
Expand All @@ -42,15 +43,15 @@ Visualizers are estimators (objects that learn from data) whose primary objectiv
- **Class Balance**: see how the distribution of classes affects the model
- **Class Prediction Error**: shows error and support in classification
- **Classification Report**: visual representation of precision, recall, and F1
- **Confusion Matrices**: visual description of class decision making
- **ROC/AUC Curves**: receiver operator characteristics and area under the curve
- **Confusion Matrices**: visual description of class decision making
- **Discrimination Threshold**: find a threshold that best separates binary classes

#### Regression Visualization

- **Alpha Selection**: show how the choice of alpha influences regularization
- **Prediction Error Plots**: find model breakdowns along the domain of the target
- **Residuals Plot**: show the difference in residuals of training and test data
- **Alpha Selection**: show how the choice of alpha influences regularization

#### Clustering Visualization

Expand All @@ -59,13 +60,13 @@ Visualizers are estimators (objects that learn from data) whose primary objectiv

#### Model Selection Visualization

- **Validation Curve**: tune a model with respect to a single hyperparameter
- **Learning Curve**: show if a model might benefit from more data or less complexity
- **Validation Curve**: tune a model with respect to a single hyperparameter
- **Learning Curve**: show if a model might benefit from more data or less complexity

#### Text Visualization

- **Term Frequency**: visualize the frequency distribution of terms in the corpus
- **TSNE**: use stochastic neighbor embedding to project documents.
- **t-SNE Corpus Visualization**: use stochastic neighbor embedding to project documents.

And more! Visualizers are being added all the time, so be sure to check the examples (or even the develop branch) and feel free to contribute your ideas for Visualizers!

Expand Down
7 changes: 4 additions & 3 deletions tests/test_text/test_freqdist.py
Original file line number Diff line number Diff line change
@@ -1,13 +1,14 @@
# tests.test_text.test_freqdist
# Tests for the frequency distribution visualization
#
# Author: Rebecca Bilbro <[email protected]>
# Author: Rebecca Bilbro
# Github: @rebeccabilbro
# Created: 2017-03-22 15:27
#
# Copyright (C) 2017 District Data Labs
# Copyright (C) 2018
# For license information, see LICENSE.txt
#
# ID: test_freqdist.py [bd9cbb9] rebecca.bilbro@bytecubed.com $
# ID: test_freqdist.py [bd9cbb9] rbilbro@districtdatalabs.com $

"""
Tests for the frequency distribution text visualization
Expand Down

0 comments on commit 55b2485

Please sign in to comment.