yandex-q

Scripts that were used to scrape and process data from Yandex.Q. The resulting dataset can be found here.
Some scripts are messy, but they get the job done.

Scripts used

parse_questions_search.py - to parse questions by searching all 4 letter combinations, because of the 1000 items limit per search
parse_question_ids.py - to parse question ids by using question recommendation endpoint
get_ids.py - to extract ids from questions that were retrieved from search
parse_qa.py - to parse all question info from ids collected

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
get_ids.py		get_ids.py
parse_qa.py		parse_qa.py
parse_question_ids.py		parse_question_ids.py
parse_questions_search.py		parse_questions_search.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

yandex-q

Scripts used

About

Releases

Packages

Languages

License

its5Q/yandex-q

Folders and files

Latest commit

History

Repository files navigation

yandex-q

Scripts used

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages