
Consider using DictVectorizer and semi-supervised learning to see if any generalizations arise from using a neural network. Review contrastive loss and ideas here #44

Open
Shuyib opened this issue Feb 16, 2023 · 1 comment

Shuyib (Owner) commented Feb 16, 2023

The DictVectorizer will not work so well here: our sequences have variable lengths. Embeddings, by contrast, have a padding argument that makes the sequences the same length.
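As a minimal sketch of the padding step described above (the function name, pad value, and example sequences are illustrative, not from this repo):

```python
import numpy as np

def pad_sequences(seqs, pad_value=0):
    """Right-pad variable-length integer sequences to a common length
    so they can be stacked into one array and fed to an embedding layer."""
    max_len = max(len(s) for s in seqs)
    return np.array([list(s) + [pad_value] * (max_len - len(s)) for s in seqs])

# Three sequences of lengths 3, 2, and 4 become one (3, 4) array.
padded = pad_sequences([[3, 1, 4], [1, 5], [9, 2, 6, 5]])
```

Libraries such as Keras provide an equivalent `pad_sequences` utility; the reserved `pad_value` (conventionally 0) can then be masked out downstream.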

This workflow lets us build a representation of the dictionary-structured data, that is, an embedding, which we can then use with semi-supervised or unsupervised methods, applying loss functions such as contrastive loss to examine similarities and differences.
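A sketch of the contrastive loss mentioned above, in the classic pairwise form (Hadsell et al., 2006); the function name, margin, and embeddings are illustrative assumptions, not code from this repo:

```python
import numpy as np

def contrastive_loss(emb_a, emb_b, similar, margin=1.0):
    """Pairwise contrastive loss: pull similar pairs together,
    push dissimilar pairs apart until they are at least `margin` away.

    emb_a, emb_b: (n_pairs, dim) embedding arrays
    similar:      (n_pairs,) labels, 1.0 = similar pair, 0.0 = dissimilar
    """
    d = np.linalg.norm(emb_a - emb_b, axis=1)  # Euclidean distance per pair
    loss = similar * d**2 + (1.0 - similar) * np.maximum(0.0, margin - d)**2
    return loss.mean()

# A similar pair at distance 0 and a dissimilar pair beyond the margin
# both contribute zero loss.
emb_a = np.array([[0.0, 0.0], [1.0, 0.0]])
emb_b = np.array([[0.0, 0.0], [3.0, 0.0]])
zero_loss = contrastive_loss(emb_a, emb_b, similar=np.array([1.0, 0.0]))
```

In a real training loop this would be written against the autodiff framework in use (e.g. as a Keras or PyTorch loss) so gradients flow back into the embedding.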

Shuyib converted this from a draft issue Feb 16, 2023
Shuyib (Owner, Author) commented Feb 16, 2023

Seems this was repeated. I'll try to make a naive version so that we can build upon it.
