Model conversion problem #449

Open
yuanzhiyong1999 opened this issue Sep 26, 2024 · 1 comment

Comments

@yuanzhiyong1999

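I want to do continued pre-training, so I am using the tools/hf2megads_weight_converter.py script to convert an HF model to the megads format. What I don't understand is why model conversion also requires all sorts of training-related arguments.
[image]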

@busishengui

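> I want to do continued pre-training, so I am using the tools/hf2megads_weight_converter.py script to convert an HF model to the megads format. What I don't understand is why model conversion also requires all sorts of training-related arguments. [image]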

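Because that is how its conversion strategy works: it first uses the training code to build the model skeleton, initializes it with random values, then reads the HF model and copies its parameters into the corresponding positions, and finally saves the model.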
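
To make the strategy above concrete, here is a toy sketch in plain Python. It is not the actual code of tools/hf2megads_weight_converter.py; the names `build_skeleton`, `to_target_name`, and `convert`, the `hidden_size`/`num_layers` arguments, and the `model.layers.` to `layers.` name mapping are all hypothetical, chosen only for illustration. It shows why architecture arguments are needed: the skeleton is built from them first, and the HF weights can only be copied in if the shapes and names line up.

```python
import random


def build_skeleton(hidden_size, num_layers, seed=0):
    """Build a randomly initialised parameter dict from training-style arguments.

    Hypothetical stand-in for "use the training code to build the model".
    """
    rng = random.Random(seed)
    return {
        f"layers.{i}.weight": [
            [rng.random() for _ in range(hidden_size)] for _ in range(hidden_size)
        ]
        for i in range(num_layers)
    }


def to_target_name(hf_name):
    """Map an HF-style parameter name to the target name (hypothetical mapping)."""
    return hf_name.replace("model.layers.", "layers.", 1)


def convert(hf_state, hidden_size, num_layers):
    """Fill a freshly built skeleton with the values from hf_state."""
    target = build_skeleton(hidden_size, num_layers)
    filled = set()
    for hf_name, value in hf_state.items():
        name = to_target_name(hf_name)
        if name not in target:
            raise KeyError(f"no slot for {hf_name!r} in the skeleton")
        slot = target[name]
        same_shape = len(value) == len(slot) and all(
            len(src_row) == len(dst_row) for src_row, dst_row in zip(value, slot)
        )
        if not same_shape:
            raise ValueError(f"shape mismatch for {name!r}")
        # Overwrite the random values with the pretrained ones.
        target[name] = [list(row) for row in value]
        filled.add(name)
    missing = sorted(set(target) - filled)
    if missing:
        raise ValueError(f"skeleton parameters left at random init: {missing}")
    return target  # the final "save the model" step is omitted in this sketch


hf_state = {
    "model.layers.0.weight": [[1.0, 2.0], [3.0, 4.0]],
    "model.layers.1.weight": [[5.0, 6.0], [7.0, 8.0]],
}
converted = convert(hf_state, hidden_size=2, num_layers=2)
```

In this sketch, calling `convert` with a `hidden_size` or `num_layers` that does not match the checkpoint raises an error instead of silently producing a partly random model, which is the practical reason the arguments have to agree with the HF model.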
