Yarn support #69
2 change build.sbt to avoid deduplicate errors
@rjagerman Could you please help me review the change? Thanks.
Hi @batizty, Thanks! This looks really nice! I haven't had the time yet to review it due to several projects and deadlines at work. I hope to review it some time next week.
Hi @rjagerman, understood. The YARN support feature is used at weibo.com (you may or may not have heard of it; it is a top-5 website in China, similar to Twitter, with more users in China), and it works well. I have also developed some other features on Glint, including additional operations such as Save and Load, which can be used to quickly store and read models in HDFS. I believe they would be useful for most Glint users working on machine learning with big vectors and matrices. If possible, I would like to become a contributor to Glint, because it is very simple and stable for large-scale machine learning. Thank you for your work on Glint.
Still haven't found the time to do it, too many deadlines unfortunately :-( I'll let you know when I get around to it.
Got it. Later I will send another patch for Glint, which can be used by each node to store all of its parameters in HDFS independently. Thanks.
Hi, @batizty
Hi @baukloze, could you please wait one or two days? I will send out my modification ASAP. By the way, @rjagerman, my colleagues and I have implemented basic ML algorithms based on Glint, but the implementation is not stable enough yet. When our data size reaches 1000B and the matrix/vector width reaches 500B, the heavy traffic load causes some of the Akka nodes to enter the Quarantined state. Any suggestions or methods to fix this problem?
@batizty ok, thanks.
Hi rjagerman,
Could you please help me review the request?
All the code has been tested online in my cluster environment.
Any questions are welcome, and I appreciate your previous work.
Thanks