Forte is a toolkit for building Natural Language Processing pipelines. This repository tries to wrap the fantastic collections of NLP libraries built by the community.
This project is part of the CASL Open Source family.
- First, install the library along with the desired tools. Let's take AllenNLP as an example:
git clone https://github.com/asyml/forte-wrappers.git
cd forte-wrappers
pip install src/allennlp
- NLTK (Processors)
pip install src/nltk
- POS Tagger
- Sentence Segmenter
- Tokenizer
- Lemmatizer
- NER
- spaCy (Processors)
pip install src/spacy
- Tokenizer, Lemmatizer and POS Tagging
- NER
- AllenNLP (Processors)
pip install src/allennlp
- Tokenizer, POS Tagging
- Semantic Role Labeling
- Dependency Parsing
- Stanza (Processors)
pip install src/stanza
- Tokenization, POS Tagging, Lemmatizer
- Dependency Parsing
- HuggingFace Models
pip install src/huggingface
- BioBERT NER (Processors)
- Zero Shot Classifier(Processors)
- Question Answering(Processors)
- Vader Sentiment (Processors)
pip install src/vader
- Sentiment Analysis
- Elastic Search (Processors)
pip install src/elastic
- Elastic Indexer
- Elastic Search
- Faiss (Processors)
pip install src/faiss
- Faiss Indexer
- GPT2 (Processors)
pip install src/gpt2
- GPT2 Text Generation
- Tweepy (Processors)
pip install src/tweepy
- TwitterAPI Search
If you are interested in making enhancement to this repository, Forte or other ASYML/CASL projects, please first go over our Code of Conduct and Contribution Guideline
This project is licensed by Apache License 2.0. The project provides wrappers to other open-sourced projects. To use them in your project, please check the license of the corresponding project.