GitHub - aidoraide/sif: SIF classification of StackOverflow questions by tag from body and title

To classify data, run the pipeline scripts in this order:
trim_data.py
word_weights.py
sentence_embeddings.py
train_model.py
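
The script names suggest the usual SIF (smooth inverse frequency) recipe: estimate word frequencies, turn them into weights a / (a + p(w)), then average word vectors using those weights. The sketch below illustrates that idea in plain Python under that assumption. The function names, the toy vectors, and the value a = 1e-3 are hypothetical and not taken from this repository, and the common-component removal step of full SIF is left out.

```python
from collections import Counter


def sif_weights(tokenized_docs, a=1e-3):
    """Map each word to a / (a + p(word)), where p is its relative corpus frequency."""
    counts = Counter(tok for doc in tokenized_docs for tok in doc)
    total = sum(counts.values())
    return {w: a / (a + c / total) for w, c in counts.items()}


def sif_embedding(tokens, word_vectors, weights):
    """Weighted average of the vectors of known tokens; zero vector if none are known."""
    dim = len(next(iter(word_vectors.values())))
    acc = [0.0] * dim
    # Only tokens that have both a vector and a weight contribute.
    known = [t for t in tokens if t in word_vectors and t in weights]
    for t in known:
        for i, x in enumerate(word_vectors[t]):
            acc[i] += weights[t] * x
    n = len(known)
    return [x / n for x in acc] if n else acc


# Toy corpus and toy 2-d vectors (assumptions, for illustration only).
docs = [["how", "to", "sort", "a", "list"], ["how", "to", "merge", "a", "dict"]]
weights = sif_weights(docs)
vectors = {"sort": [1.0, 0.0], "list": [0.0, 1.0], "how": [1.0, 1.0]}
embedding = sif_embedding(docs[0], vectors, weights)
```

Frequent words such as "how" get a smaller weight than rarer words such as "sort", so they contribute less to the sentence vector.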

To see which tags are most frequent, run get_tag_counts.py.
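
As a rough illustration of what counting tags involves, here is a minimal sketch using collections.Counter. It assumes each question's tags arrive as a single string like `<python><pandas>`. That format, the function name count_tags, and the sample data are assumptions, not taken from get_tag_counts.py.

```python
import re
from collections import Counter


def count_tags(tag_strings):
    """Count tags across questions whose tags look like '<python><pandas>'."""
    counts = Counter()
    for s in tag_strings:
        # Pull out every name enclosed in angle brackets.
        counts.update(re.findall(r"<([^<>]+)>", s))
    return counts


sample = ["<python><pandas>", "<python><numpy>", "<java>"]  # hypothetical data
tag_counts = count_tags(sample)
top_tags = tag_counts.most_common(2)  # most frequent tags first
```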

For a baseline to compare your classifiers against, run dumb_algorithm.py and see how it performs.
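
One common trivial baseline is to always predict the most frequent tag from the training data. The sketch below shows that idea. It is an assumption about what a "dumb" baseline might look like, not a description of what dumb_algorithm.py actually does, and all names and data in it are hypothetical.

```python
from collections import Counter


def majority_baseline(train_labels):
    """Return a predictor that always outputs the most frequent training label."""
    most_common_label = Counter(train_labels).most_common(1)[0][0]
    return lambda _question: most_common_label


def accuracy(predict, questions, labels):
    """Fraction of questions whose predicted label matches the true label."""
    hits = sum(predict(q) == y for q, y in zip(questions, labels))
    return hits / len(labels)


train_labels = ["python", "java", "python", "c++"]  # hypothetical labels
predict = majority_baseline(train_labels)
score = accuracy(predict, ["q1", "q2"], ["python", "java"])
```

A trained classifier is only worth keeping if its score clearly beats a baseline like this on the same test split.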

Repository files:
.gitignore
__init__.py
const.py
dumb_algorithm.py
get_tag_counts.py
readme.md
sentence_embeddings.py
train_model.py
trim_data.py
trim_html.py
word_weights.py
