-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
hanging? #7
Comments
I have similar issues with a set of articles, also by Elsevier. DOIs resolve fine in browser, but http://doi.org/10.1016/j.dsr2.2016.12.010 Adding a timeout and wrapping the
but I'd be curious to know if anyone has an idea why |
Is it possibly some kind of anti-crawling device? simplifying a bit to a wget call from the command line shows that wget also doesn't complete.
|
@sje30 you're right. You see I wrote this in R to make it a bit more cross-platform / portable. So this will work for you:
So I need to update the script to use a user-agent string with |
Hmmm...
But when I try to set a user-agent with
|
For debugging purposes, it may be useful to compare
|
I think Nature returns a 401 if the article is paywalled. Wget log
|
I think Nature returns a 401 if one if the article is paywalled.
Maybe at this stage we can link tools up with http://dissem.in
( CC @wetneb)
dissem.in does a great job of parsing ORCID records and working out
which entries are freely available and which are paywalled.
|
I haven't read everything but I'm not sure dissemin would help much here. Maybe Zotero's translation server? |
I haven't read everything but I'm not sure dissemin would help much
here. Maybe Zotero's translation server?
sorry Antonin... just assuming given my ORCID profile, you must have a
way of dividing the DOIs for my papers into those that are OA and those
that are not?
|
dissemin creates yet another set of problems for me: It won't parse publications from Inter Research (a small independent publisher), even though the papers have valid DOIs and are recognized by ORCID. Thanks for the |
@sje30 yes, but we don't do any scraping (which is necessary for accurate results). The issue seems to be about making requests to the publisher's website, which is something we can't help with. |
ok, thanks @wetneb. |
when I try this (one of my DOIs) it seems to hang a long time. Do we need a timeout?
The paper works ok though when I visit the DOI in a browser.
The text was updated successfully, but these errors were encountered: