Missing files on MIND dataset #8

Open
LUUUUUUZ opened this issue Dec 1, 2022 · 6 comments

Comments


LUUUUUUZ commented Dec 1, 2022

Could you share some files that are missing for the MIND dataset? When I run
python mind_preprocess.py --content_info
it seems that ../data/mind/articles_embeddings_1.pkl is missing. I assume this file is generated by get_content_vec.py, but how do I load the pre-trained model?

Also, when I run
python data_process/generate_neighbor.py
it seems that publish_time_1.txt is missing.

I would really appreciate it if you could provide these two files.
Thank you!

Owner

summmeer commented Dec 1, 2022

articles_embeddings_1.pkl: articles_embeddings_1.zip

publish_time_1.txt can be generated by running mind_preprocess.py.
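For anyone picking up the attached archive, the following is a minimal sketch of how the pickle inside it could be loaded and inspected. It assumes the zip contains a single .pkl member; the helper names load_pickle_from_zip and describe are hypothetical and not part of the repository, and the structure of the unpickled object is not stated in this thread. If the object holds NumPy arrays, NumPy must be installed for the load to succeed.

```python
import pickle
import zipfile


def load_pickle_from_zip(zip_path):
    """Load the first .pkl member found inside a zip archive."""
    with zipfile.ZipFile(zip_path) as zf:
        pkl_names = [n for n in zf.namelist() if n.endswith(".pkl")]
        if not pkl_names:
            raise FileNotFoundError("no .pkl file inside " + zip_path)
        with zf.open(pkl_names[0]) as f:
            # Only unpickle files that come from a source you trust.
            return pickle.load(f)


def describe(obj):
    """Return (type name, length or None) for an unpickled object."""
    size = len(obj) if hasattr(obj, "__len__") else None
    return type(obj).__name__, size
```

Calling describe(load_pickle_from_zip("articles_embeddings_1.zip")) shows the container type and size, which is usually enough to see how the preprocessing script expects to index it.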

Author

LUUUUUUZ commented Dec 1, 2022

Thanks!
Have a good day ;)

Author

LUUUUUUZ commented Dec 1, 2022

Sorry to bother you again.
It seems MIND/articles_category.pkl is also missing.
It should be simple to generate, but I don't know the structure of the data.
I would appreciate it if you could provide this file.
Thanks!

Author

LUUUUUUZ commented Dec 1, 2022

Could you also share your environment, e.g. the Python and TensorFlow versions?

Owner

summmeer commented Dec 2, 2022

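articles_category is a dictionary that maps each news ID to its category. Pseudocode:

import pickle

articles_category = {}
# news_items is a placeholder for any iterable of (news_id, news_category) pairs
for news_id, news_category in news_items:
    articles_category[news_id] = news_category
with open('../articles_category.pkl', 'wb') as f:
    pickle.dump(articles_category, f)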
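The dictionary described above could be built from the MIND news.tsv file along the following lines. This is a minimal sketch under stated assumptions: news.tsv is tab-separated with no header row and has the news ID and category as its first two columns (as in the public MIND release), and the keys are the raw news IDs. If the repository's preprocessing remaps IDs to integers, the keys would need the same mapping. The function names build_articles_category and save_articles_category are hypothetical.

```python
import pickle


def build_articles_category(news_tsv_path):
    """Return {news_id: category} read from a MIND-style news.tsv file."""
    articles_category = {}
    with open(news_tsv_path, encoding="utf-8") as f:
        for line in f:
            fields = line.rstrip("\n").split("\t")
            if len(fields) < 2:
                continue  # skip blank or malformed lines
            news_id, news_category = fields[0], fields[1]
            articles_category[news_id] = news_category
    return articles_category


def save_articles_category(articles_category, out_path):
    """Pickle the dictionary to out_path."""
    with open(out_path, "wb") as f:
        pickle.dump(articles_category, f)
```

A typical call would be save_articles_category(build_articles_category("news.tsv"), "../articles_category.pkl"), with both paths adjusted to the local data layout.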

Environment info:
Python: 3.6.9, TensorFlow: 1.14.0
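
Before running the scripts, it may help to confirm that the local setup is close to the versions listed above. The snippet below is a rough sketch of such a check; the helper names are hypothetical, and matching only on major and minor version is an assumption about what counts as "close enough", not something stated by the maintainer.

```python
import sys

# Versions reported in this thread.
EXPECTED_PYTHON = (3, 6, 9)
EXPECTED_TF = "1.14.0"


def parse_version(text):
    """Turn '1.14.0' into (1, 14, 0); non-numeric parts are dropped."""
    return tuple(int(p) for p in text.split(".") if p.isdigit())


def same_minor(found, expected):
    """True when two version tuples agree on major and minor numbers."""
    return tuple(found[:2]) == tuple(expected[:2])


def installed_tf_version():
    """Return the installed TensorFlow version string, or None if absent."""
    try:
        import tensorflow as tf
    except ImportError:
        return None
    return tf.__version__


def environment_matches():
    """Check the running interpreter and TensorFlow against the thread."""
    tf_version = installed_tf_version()
    python_ok = same_minor(sys.version_info, EXPECTED_PYTHON)
    tf_ok = tf_version is not None and same_minor(
        parse_version(tf_version), parse_version(EXPECTED_TF)
    )
    return python_ok, tf_ok
```

environment_matches() returns a pair of booleans, one for Python and one for TensorFlow, so a mismatch can be spotted before a long preprocessing run fails on a TensorFlow 1.x API.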

Author

LUUUUUUZ commented Dec 2, 2022

Thank you so much!
It's so nice of you to help me with so many problems.
Enjoy your weekend!
:)

2 participants