NLPBuddy - Open Source Text Analysis Tool

About the project

NLPBuddy is a text analysis application for performing common NLP tasks through a web dashboard interface and an API.

It leverages Spacy for the NLP tasks plus Gensim's implementation of the TextRank algorithm for text summarization.

It supports texts in the following languages: Greek, English, German, Spanish, Portoguese, French, Italian and Dutch. Language identification is performed automatically through langid

Tasks include:

Text tokenization
Sentence splitting (lemmatized sentences too)
Part of Speech tags identification (verbs, nouns etc)
Named Entity Recognition (Location, Person, Organisation etc)
Text summarization (using TextRank algorithm, implemented by Gensim)
Keywords extraction
Language identification
For the Greek language, Categorization of text

Text can either be provided or imported after specifying a url - we use library python readability for this plus BeautifulSoup4

The Greek classifier is built with FastText and is trained in 20.000 articles labeled in these categories.

Demo

A working demo can be found on http://www.nlpbuddy.io/

Usage

Enter text and hit 'Analyze it',

API Usage

https://github.com/eellak/text-analysis/wiki/API-usage

Installation

Find development and deployment instructions here: https://github.com/eellak/text-analysis/wiki/Install

License

The code is provided under the GNU AGPL v3.0 License.

Name		Name	Last commit message	Last commit date
Latest commit History 129 Commits
datasets/sentiment_analysis		datasets/sentiment_analysis
demo		demo
deploy		deploy
nlp		nlp
static		static
templates		templates
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
manage.py		manage.py
requirements.txt		requirements.txt
uwsgi.ini		uwsgi.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLPBuddy - Open Source Text Analysis Tool

About the project

Demo

Usage

API Usage

Installation

License

About

Releases

Packages

Contributors 2

Languages

License

eellak/nlpbuddy

Folders and files

Latest commit

History

Repository files navigation

NLPBuddy - Open Source Text Analysis Tool

About the project

Demo

Usage

API Usage

Installation

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages