We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
arxiv: https://arxiv.org/abs/2411.02393 github: https://github.com/ShivamDuggal4/adaptive-length-tokenizer
Evaluation 일환으로 depth estimation / image captioning (using GPT4)을 활용한게 인상적
Encode 과정에서 2D 정보를 1D latent에 담기도록 하고, Decode 하기 전에 1D latent + mask objective (SSL) 통해 2D token을 recon한다.
후속 연구: visual alignment (decomposition)
The text was updated successfully, but these errors were encountered:
No branches or pull requests
arxiv: https://arxiv.org/abs/2411.02393
github: https://github.com/ShivamDuggal4/adaptive-length-tokenizer
Evaluation 일환으로 depth estimation / image captioning (using GPT4)을 활용한게 인상적
Encode 과정에서 2D 정보를 1D latent에 담기도록 하고,
Decode 하기 전에 1D latent + mask objective (SSL) 통해 2D token을 recon한다.
후속 연구: visual alignment (decomposition)
The text was updated successfully, but these errors were encountered: