
Decoder head

Exploration of unsupervised translation using fastai and PyTorch

Imagine you have trained a language model (LM) to perform some task, say predicting the next word or doing sentiment analysis. You freeze the model while removing one piece of information from it: the mapping from embeddings to words.

Can you create a setup in which the mapping from words to embeddings becomes learnable? Something the model could learn using the information already encoded in the LM? Surprisingly, the answer is yes. This repository is based on original work by Aza Raskin exploring this idea.
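To make the idea concrete, here is a minimal PyTorch sketch of one possible setup (an illustration under assumptions, not the repository's actual code): the trained LM body and output head are frozen, and a freshly initialized input embedding is the only part left to learn.

```python
import torch
import torch.nn as nn

class FrozenLMWithLearnableEmbedding(nn.Module):
    """Illustrative sketch: the trained LM body and its output head are
    frozen; only a fresh input embedding (the word -> embedding mapping)
    remains learnable."""

    def __init__(self, lm_body: nn.Module, output_head: nn.Module,
                 vocab_size: int, emb_dim: int):
        super().__init__()
        self.lm_body = lm_body          # e.g. the recurrent/transformer layers of the trained LM
        self.output_head = output_head  # frozen mapping from hidden states to word logits
        for p in self.lm_body.parameters():
            p.requires_grad = False
        for p in self.output_head.parameters():
            p.requires_grad = False
        # The only trainable component: a new mapping from words to embeddings.
        self.new_embedding = nn.Embedding(vocab_size, emb_dim)

    def forward(self, tokens):
        emb = self.new_embedding(tokens)   # gradients flow only into this layer
        hidden, _ = self.lm_body(emb)      # assumes an RNN-style body returning (output, state)
        return self.output_head(hidden)
```

Only `new_embedding.parameters()` would then be handed to the optimizer, so the model can recover the word-to-embedding mapping only by exploiting structure already stored in the frozen LM.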

Decoder Head

Please read this for a fuller description of a potential Decoder Head architecture.

How well will the model be able to learn this mapping? How will it handle synonyms? Is there a way to present this task to the model that makes learning more efficient? The experiments in this repository center on iterating on the original idea and answering these questions.

Special thanks

Special thanks to the authors of MultiFiT (Efficient Multi-lingual Language Model Fine-tuning): Julian Eisenschlos, Sebastian Ruder, Piotr Czapla, Marcin Kardas, Sylvain Gugger, and Jeremy Howard! We trained our models on the Wikipedia dumps that you were so kind to provide!

Another round of thanks goes out to the authors of the MUSE framework at @facebookresearch, Guillaume Lample and Alexis Conneau. We used their ground-truth bilingual dictionaries to evaluate the performance of our models.
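For illustration, a sketch of what such an evaluation could look like (all names are hypothetical; this is not the repository's evaluation code): given learned source embeddings, reference target embeddings, and a MUSE-style list of ground-truth (source, target) word pairs, report precision@1 under nearest-neighbour retrieval.

```python
import numpy as np

def precision_at_1(src_vecs, tgt_vecs, dictionary, src_word2id, tgt_word2id):
    """Hypothetical sketch: for each (src, tgt) pair in a MUSE-style
    ground-truth dictionary, check whether the nearest target embedding
    (by cosine similarity) to the source word's embedding is the
    correct translation."""
    # Normalize rows so the dot product equals cosine similarity.
    src = src_vecs / np.linalg.norm(src_vecs, axis=1, keepdims=True)
    tgt = tgt_vecs / np.linalg.norm(tgt_vecs, axis=1, keepdims=True)

    hits, total = 0, 0
    for src_word, tgt_word in dictionary:
        if src_word not in src_word2id or tgt_word not in tgt_word2id:
            continue  # skip out-of-vocabulary pairs
        sims = tgt @ src[src_word2id[src_word]]          # similarity to every target word
        if sims.argmax() == tgt_word2id[tgt_word]:       # nearest neighbour is the gold translation
            hits += 1
        total += 1
    return hits / max(total, 1)
```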
