Using streaming from HuggingFace to fine-tune document understanding models for classification

My goal is to fine-tune the base models for classification on RVL-CDIP.

These models are currently publicly available:
- DiT [Paper]
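
To give a rough idea of what streaming means here, the sketch below consumes examples lazily and groups them into batches without ever holding the full dataset in memory. It is an illustration only, not the code from the notebooks: `fake_stream`, `batched`, and the `image`/`label` field names are hypothetical stand-ins. With the `datasets` library, the stream would instead come from `load_dataset(..., streaming=True)`, using whichever Hub ID hosts RVL-CDIP.

```python
from itertools import islice
from typing import Any, Dict, Iterable, Iterator, List

NUM_CLASSES = 16  # RVL-CDIP has 16 document classes


def fake_stream(n: int) -> Iterator[Dict[str, Any]]:
    """Hypothetical stand-in for a streamed dataset: yields one example at a time."""
    for i in range(n):
        # A real stream would yield a page image; a string placeholder is used here.
        yield {"image": f"page-{i}", "label": i % NUM_CLASSES}


def batched(stream: Iterable[Dict[str, Any]], batch_size: int) -> Iterator[Dict[str, List[Any]]]:
    """Group a lazy stream of examples into batches without materializing it."""
    iterator = iter(stream)
    while True:
        chunk = list(islice(iterator, batch_size))
        if not chunk:
            return
        yield {
            "image": [example["image"] for example in chunk],
            "label": [example["label"] for example in chunk],
        }


if __name__ == "__main__":
    for batch in batched(fake_stream(5), batch_size=2):
        print(batch["label"])
```

Because `batched` accepts any iterable of dictionaries, the same loop should work unchanged once `fake_stream` is swapped for a real streamed split, assuming its examples expose the same two fields.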
I'll be making the notebooks for the following models available soon: