-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update documentation for 4.0.0 #2960
Comments
Fixes go into PR #2954. |
@mpenkov release notes below. Can you please edit this comment to add (auto-generate?) the Next up, I'll fill in the performance numbers & finish the Migration guide, and we're ready to release 4.0.0beta. I don't think any of the other tickets are blocking (CC @gojomo). 1. TweetSo, Gensim 4.0 is nearly here. Faster models, less RAM, prettier documentation. Want to help? Install the pre-release and let us know how it went! pip install --pre --upgrade gensim https://github.com/RaRe-Technologies/gensim/releases/tag/4.0.0beta 2. Release notes (maybe also a blog post?)Moved to CHANGELOG: see #2981. |
Nice progress! A few thoughts:
|
Thanks. Re. migrations: I replied in #2967, to keep it tidy. |
I think it's better to do this directly in the changelog, because that gets version-controlled and is generally easier to edit than comments in a github PR. It already has some entries. I will double-check that everything merged after 3.8.3 is in there. |
Thanks. So to be clear, your proposed workflow is:
I was under the impression you had some script that takes merged PRs + closed tickets and auto-generates this list (which can then be edited). Was I confusing that with something else? |
Sort of.
Why do we need to copy them here? They can go straight into the release notes when we do the release.
There's a script that takes a PR number and then grabs its title, etc. We use it for individual PRs - it's not quite what we need here, because there have been dozens of merged PRs since the last release. |
Please see #2981 |
Because I don't know what to put in the release notes :D Relying on my memory is not a good idea. |
Can't we just link to the changelog in the comment instead of copying? e.g.
The risk is we copy stuff to here, then change it in the changelog, and forget to update the comment. |
Yes. I can edit the changelog in #2981, instead of here. |
@gojomo @mpenkov I updated the Migration guide: https://github.com/RaRe-Technologies/gensim/wiki/Migrating-from-Gensim-3.x-to-4 There are two FIXMEs I still wasn't sure about, one for @gojomo and one for @mpenkov . Can you please have a look? And fix / expand / improve anything else you might see there. Otherwise ready for the beta release (with #2981) this week. Thanks! |
I addressed the FIXME with my name beside it, clarifying the I also added extra clarification around I think the commitment "All your existing stored models... will continue to work" is an overly-strong commitment, as our testing doesn't really guarantee that for all models/all features, and I doubt it will be expanded to guarantee that, and if some stray reports of problems with old/rare configurations come up, I know I won't necessarily have time to fix them. The truth is closer to: "most will work & we haven't gratuitously/knowingly broken anything in recent/common use - try it & check results." (For example, some shallow testing of pre-1.0 models used to be present but I disabled-via-renaming during other updates. Similarly. some other testing of old-models-of-inexact-vintage is disabled. Will these old models load & work? Should they work? Would re-enabling this test code actually test anything useful? I don't know, but I suspect it'd be a waste of time given the real possibility that there are zero people in the world who'd actually need this.) Similarly, and for reasons I've also alluded to elsewhere, I think the process authoritatively described at https://github.com/RaRe-Technologies/gensim/wiki/Gensim-And-Compatibility#upgrading-trained-models is somewhat over-involved for the common case. Usually, things will just work, no multiple-hops necessary, because most 4.X to 4.[X+1] releases won't even touch most models, and even where it does, we'll try to support forward-migration when practical. But in the occasional case where bigger changes happen and/or we didn't notice an issue, the guidance risks creating too much confidence in this "migration" script when it runs without error. That won't, practically, be a true guarantee everything is OK given our current (& foreseeable) level of testing. Saying less would be be more reasonable. |
Thanks. What I meant is "if loading your model worked in 3.8.3, it will continue to work in 4.0.0". Of course we don't support & test everything meticulously, all the way back to Gensim 0.4.0. I'll check your links and reword as appropriate. About https://github.com/RaRe-Technologies/gensim/wiki/Gensim-And-Compatibility#upgrading-trained-models : was discussed here #2967 (comment) |
I checked your links and don't think that's something we need to draw attention to or split hairs over, pedantically. If there are any people who need to load models from Gensim<1.0 or some such "support", we can handle them on a case-by-case basis. I suspect there are none, just as you said. And by "handle" I expect saying "go and retrain your model in 4.0". |
My main concern is that we don't project a false confidence - to do so increases the risk of users being less vigilant in their upgrades, and increases the implied obligations about what might be fixed if buggy. The current wording without extra emphasis on "all... will work" is better than before, but my own preferred wording would still be "should work", implying a goal not a guarantee. More like: "Our goal has been that most older models should load & function as before, but you should independently verify that your models load & work, especially if older than 3.8.3". It's hair-splitting, but it just seems safer to me to lean in the direction of lower-expectation/lower-commitments. |
OK, I'll relax the language further. It's a Wiki page, so we can update it any time easily, even post-release. I'm hoping the beta release will shine light on several places not covered by the migration guide, or not covered adequately. Or nobody will care, which is also quite possible. |
@piskvorky This got closed by automation: not sure if that's what we actually wanted to do, so double-checking. |
Yes, I'm done here. |
Internal checklist so I don't forget something:
rst
docs up-to-date for the new website, check hyperlinks, update stats & copy.The text was updated successfully, but these errors were encountered: