baseline #4

Open
betoobusy opened this issue Dec 30, 2024 · 7 comments

Comments

@betoobusy

Dear Author,

To better understand your approach and apply it in my research, could you please share the relevant experimental code or implementation? It would greatly assist my work and help me explore the related techniques in more depth. In particular, I am interested in how the dataset is processed and in the baseline-related code.

@palelaughing

Has your problem been solved? I encountered the same issue. The code seems incomplete and it appears that some files are missing.

@AkaliKong
Owner

AkaliKong commented Feb 1, 2025 via email

@palelaughing

Thank you for your email and for your interest in our work. 

Regarding your request for the experimental code, I’m pleased to inform you that we have completed fixing the code, and it is now fully functional and reproducible. If you encounter any issues while trying to replicate our results, please don’t hesitate to reach out to us. We’ll be more than happy to assist you.

As for the baseline-related code you mentioned, I would like to direct your attention to another work from our lab, LLaRA. Our current work is built upon the foundation laid by LLaRA. We followed the same settings and used the same dataset as described in LLaRA. You might find the baseline-related code and additional insights in the LLaRA repository, which could be beneficial for your research.

Please let me know if you need any further information or assistance.

Best regards

JOKER

Thank you very much for the author's prompt response. The code can now be run.

@palelaughing

palelaughing commented Feb 2, 2025 via email

@betoobusy
Author

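Dear Author,
While reproducing your code, I also found that the validation metric stays at 0. I am running the lastfm dataset, and after some time in training the loss becomes NaN. I am using a single A100 GPU, yet running LLaRA does not show this problem.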

@AkaliKong
Owner

Thank you all for your attention to our work. I will promptly investigate the root cause of the issue you mentioned and proceed with an update.

@AkaliKong
Owner

Thank you for bringing this matter to my attention. I sincerely apologize if this situation has affected your research progress.

Based on my initial assessment, the issue might be related to the increased complexity of the ilora structure, which could require adjustments to the learning rate. I believe using a smaller learning rate might resolve the problem you've encountered. Another potential solution would be to implement gradient clipping.
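As a rough illustration of the two suggestions above, here is a minimal, framework-free sketch of clipping gradients by their global L2 norm before a small-learning-rate update. It is not code from this repository: the function names `clip_by_global_norm` and `sgd_step`, the plain-list gradient layout, and the values of `lr` and `max_norm` are all assumptions made for the example. In a PyTorch training loop the equivalent step would normally be `torch.nn.utils.clip_grad_norm_`, and PyTorch Lightning exposes the same idea through the `gradient_clip_val` argument of `Trainer`.

```python
import math


def clip_by_global_norm(grads, max_norm):
    """Rescale a list of gradient vectors so their joint L2 norm is at most max_norm.

    Hypothetical helper for illustration; gradients are plain lists of floats.
    Returns the (possibly rescaled) gradients and the norm measured before clipping.
    """
    total_norm = math.sqrt(sum(g * g for vec in grads for g in vec))
    if not math.isfinite(total_norm):
        # A NaN or inf norm means the step is already corrupted; refuse to update.
        raise ValueError("non-finite gradient norm, skip this optimizer step")
    # The scale never exceeds 1, so small gradients pass through unchanged.
    scale = min(1.0, max_norm / (total_norm + 1e-6))
    return [[g * scale for g in vec] for vec in grads], total_norm


def sgd_step(params, grads, lr, max_norm):
    """One plain SGD update using clipped gradients (illustrative values only)."""
    clipped, _ = clip_by_global_norm(grads, max_norm)
    return [[p - lr * g for p, g in zip(pv, gv)] for pv, gv in zip(params, clipped)]


if __name__ == "__main__":
    params = [[1.0, 1.0]]
    grads = [[3.0, 4.0]]  # global norm is 5.0, well above max_norm
    print(sgd_step(params, grads, lr=0.1, max_norm=1.0))
```

Raising on a non-finite norm is one possible design choice; it makes the first bad batch visible instead of letting NaN values propagate into the weights. Suitable values for the learning rate and clipping threshold depend on the model and dataset and would need to be tuned.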

I will carefully consider the issue you've raised and will update my GitHub repository as soon as possible with the necessary adjustments.

Thank you for your understanding and patience.
