This is a Kaggle task in which I'm trying to produce useful protein function information by mapping protein sequences to the Gene Ontology (GO) dataset.
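
To make the mapping concrete, here is a minimal sketch of what "sequence to GO term" pairing could look like. It assumes the sequences come as FASTA text and the annotations as (protein ID, GO term) pairs; the helper names `read_fasta` and `group_annotations`, the protein IDs `P1`/`P2`, and the toy data are all made up for illustration and are not taken from the actual competition files.

```python
from collections import defaultdict
from io import StringIO


def read_fasta(handle):
    """Yield (protein_id, sequence) pairs from FASTA-formatted text."""
    protein_id, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if protein_id is not None:
                yield protein_id, "".join(chunks)
            # Assumption: the ID is the first token after ">".
            protein_id, chunks = line[1:].split()[0], []
        else:
            chunks.append(line)
    if protein_id is not None:
        yield protein_id, "".join(chunks)


def group_annotations(pairs):
    """Collect the set of GO terms annotated to each protein ID."""
    grouped = defaultdict(set)
    for protein_id, go_term in pairs:
        grouped[protein_id].add(go_term)
    return grouped


# Toy inputs (hypothetical, for illustration only).
FASTA_TEXT = """>P1 example protein
MKTAYIAK
QRQISFVK
>P2 another example
MSDNE
"""
ANNOTATIONS = [
    ("P1", "GO:0005515"),
    ("P1", "GO:0005737"),
    ("P2", "GO:0003677"),
]

sequences = dict(read_fasta(StringIO(FASTA_TEXT)))
go_terms = group_annotations(ANNOTATIONS)

# Pair every sequence with its sorted GO terms (empty list if none).
labelled = {
    pid: (seq, sorted(go_terms.get(pid, ())))
    for pid, seq in sequences.items()
}

for pid, (seq, terms) in labelled.items():
    print(pid, len(seq), terms)
```

In a real run the two toy inputs would be replaced by whatever sequence and annotation files the task provides; the pairing step itself stays the same.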