Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

mini-monky学习率设置问题 #140

Open
pangr opened this issue Sep 13, 2024 · 5 comments
Open

mini-monky学习率设置问题 #140

pangr opened this issue Sep 13, 2024 · 5 comments

Comments

@pangr
Copy link

pangr commented Sep 13, 2024

image
请问这个学习率为什么设置的这么低,好像无法学习到什么东西

@mxin262
Copy link
Collaborator

mxin262 commented Sep 14, 2024

Hi~, 这里的训练是为了让模型能接受论文提出的MSAC的输入,但是我们没有这么多的数据去训练模型,所以学习率设置比较小,让模型学习到MSAC的输入,而不破坏预训练好的能力

@pangr
Copy link
Author

pangr commented Sep 14, 2024

Hi~, 这里的训练是为了让模型能接受论文提出的MSAC的输入,但是我们没有这么多的数据去训练模型,所以学习率设置比较小,让模型学习到MSAC的输入,而不破坏预训练好的能力

明白,感谢回复,还有个问题是mini-monkey的整个预训练中都只训练了文本大模型、没有训练视觉模块和模态对齐的模块吗

@mxin262
Copy link
Collaborator

mxin262 commented Sep 14, 2024

是的,只训练了文本大模型,全部模块都进行训练的话,8卡3090会爆显存

@pangr
Copy link
Author

pangr commented Sep 24, 2024

是的,只训练了文本大模型,全部模块都进行训练的话,8卡3090会爆显存

全参训练会有提升吗

@mxin262
Copy link
Collaborator

mxin262 commented Oct 30, 2024

应该是会有提升的,但是由于计算资源有限,没有进行这个实验

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants