Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Guidance Needed: Adapting Pre-trained Llama Model for Image-to-Text Embedding with LLM2Vec and Unsloth #157

Open
Linn0910 opened this issue Dec 22, 2024 · 0 comments

Comments

@Linn0910
Copy link

Hello!
I am currently working on adapting my pre-trained Llama model for text embedding tasks using the LLM2Vec methodology. My objective is to configure the model to generate text embeddings directly from image inputs. I have been utilizing the Unsloth fine-tuning framework, as demonstrated in this Colab notebook.

Current Progress:

  • Model Output: The model successfully generates descriptive text for a given image input.

  • Desired Outcome: Instead of generating descriptive text, I aim for the model to produce a text embedding or token representation directly from the image input.

Challenges Encountered:

  • Integration of LLM2Vec: Uncertainty about how to apply the LLM2Vec methodology to enable the model to produce text embeddings from image inputs.

  • Unsloth Framework Adaptation: Need guidance on modifying the Unsloth fine-tuning process to accommodate this functionality.

Request for Assistance:

I would greatly appreciate guidance on the following:

  • Model Configuration: Steps to adjust the Llama model architecture to generate text embeddings directly from image inputs using LLM2Vec.

  • Fine-Tuning Process: Recommendations on adapting the Unsloth fine-tuning framework to support this functionality.

Implementation Examples: Any available examples or references that demonstrate similar adaptations.
Your expertise and support in this matter would be invaluable to the progression of my project.

Thank you for your assistance.

Best regards!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant