Document binary classification (signed or not)

The dataset contains information relative to different documents. The goal is to develop a predictive model to decide whether a document will be signed or not.

The metrics chosen were accuracy but as the customer commented that "to us predicting that a document will be signed when in reality it won’t is slightly worse than otherwise (predicting a document to not be signed when it is in fact signed)." we have to pay attention to minimize the False Positives increasing the precission playing with the threshold feature of sklearn.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
Signed_or_not_notext.ipynb		Signed_or_not_notext.ipynb
Text_classification_with_TF_Hub.ipynb		Text_classification_with_TF_Hub.ipynb
signed_text.ipynb		signed_text.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Document binary classification (signed or not)

About

Releases

Packages

Languages

juanluisrosaramos/document_classification

Folders and files

Latest commit

History

Repository files navigation

Document binary classification (signed or not)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages