This is a modified PyTorch implementation of the original SMASH-RNN paper, built on top of a HAN (Hierarchical Attention Network). It uses a three-level hierarchical encoder (word level, sentence level, paragraph level) to encode the input with bi-GRUs, and a Siamese network of MASH-RNNs to learn the similarity between two documents. It works for long documents as well. Since the ELMo model itself uses a bi-LSTM, ELMo representations are fed in directly in place of the word encoder. Feed-forward attention is used instead of dot-product attention, and Binary Cross-Entropy is used as the loss function.
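As a rough illustration of one such level, here is a minimal PyTorch sketch of a bi-GRU followed by feed-forward (additive) attention pooling; the class name and layer sizes are assumptions for illustration, not taken from this repo:

```python
import torch
import torch.nn as nn

class AttentiveGRULevel(nn.Module):
    """One level of the hierarchical encoder: a bi-GRU followed by
    feed-forward attention pooling. Hypothetical sketch, not this repo's code."""

    def __init__(self, input_size: int, hidden_size: int):
        super().__init__()
        self.gru = nn.GRU(input_size, hidden_size,
                          batch_first=True, bidirectional=True)
        # Feed-forward attention: score each timestep with a small MLP
        # instead of taking a dot product against a query vector.
        self.attn = nn.Sequential(
            nn.Linear(2 * hidden_size, hidden_size),
            nn.Tanh(),
            nn.Linear(hidden_size, 1),
        )

    def forward(self, x):                  # x: (batch, seq_len, input_size)
        h, _ = self.gru(x)                 # h: (batch, seq_len, 2*hidden)
        scores = self.attn(h)              # (batch, seq_len, 1)
        weights = torch.softmax(scores, dim=1)
        return (weights * h).sum(dim=1)    # (batch, 2*hidden) pooled vector
```

Stacking three of these (word level over tokens, sentence level over word-level outputs, paragraph level over sentence-level outputs) yields the multi-level document representation; here the word level is replaced by ELMo representations as noted above.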
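The Siamese part can be sketched the same way: a single shared encoder runs over both documents and a small classifier scores the pair, trained with BCE. The pair-feature construction below (concatenating the two embeddings with their absolute difference and element-wise product) is a common Siamese choice and an assumption here, not necessarily what this repo does:

```python
class SiameseSmash(nn.Module):
    """Siamese wrapper: the same (weight-shared) MASH-RNN encoder embeds both
    documents, and a linear layer scores the pair. Hypothetical sketch."""

    def __init__(self, encoder: nn.Module, doc_dim: int):
        super().__init__()
        self.encoder = encoder             # shared weights for both documents
        self.classifier = nn.Linear(4 * doc_dim, 1)

    def forward(self, doc_a, doc_b):
        ea, eb = self.encoder(doc_a), self.encoder(doc_b)
        # Pair features: both embeddings, their |difference|, and product.
        pair = torch.cat([ea, eb, torch.abs(ea - eb), ea * eb], dim=-1)
        return self.classifier(pair).squeeze(-1)   # raw similarity logit

# Training with Binary Cross-Entropy on the similarity logit:
# loss_fn = nn.BCEWithLogitsLoss()
# loss = loss_fn(model(doc_a, doc_b), labels.float())
```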