MPT for Sequence Classification #611

boomanaiden154 · 2023-09-19T23:27:33Z

I'm interested in using llm-foundry infrastructure for training LLMs for sequence classification/regression tasks. I currently have a fork of llm-foundry where I got this working (in a fairly hacky manner that definitely needs to be cleaned up) within the MPT models provided by the repository (creating a new MPTForSequenceRegression class and associated composer model). HuggingFace also has sequence classification versions of most of the LLMs that they have available (which would just require a composer wrapper.

Is there an interest in having tooling for sequence classification/regression live upstream in llm-foundry? I'd be interested in cleaning up and upstreaming what I have so far in addition to probably writing some documentation on performing finetuning for these tasks if such patches would be accepted.

The text was updated successfully, but these errors were encountered:

dakinggg · 2023-09-20T16:49:51Z

Hey @boomanaiden154, the approach seems right! You should still be able to use the base HuggingFaceModel in composer, and just add the head classes as you described. Support for sequence classification/regression would be great!

boomanaiden154 mentioned this issue Sep 25, 2023

Add support for pre-tokenized streaming dataset finetuning #601

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MPT for Sequence Classification #611

MPT for Sequence Classification #611

boomanaiden154 commented Sep 19, 2023

dakinggg commented Sep 20, 2023

MPT for Sequence Classification #611

MPT for Sequence Classification #611

Comments

boomanaiden154 commented Sep 19, 2023

dakinggg commented Sep 20, 2023