Search GB for OCLC/LCCN/ISBN matches #3

Closed
ryanfb opened this issue Mar 15, 2016 · 5 comments

ryanfb commented Mar 15, 2016

Use the Google Books search API to look up the identifiers (OCLC/LCCN/ISBN) found in HT/IA results and see whether any GB volumes match.
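
A minimal sketch of what such a lookup could look like, written in Python purely for illustration. It assumes the public Google Books volumes endpoint and its `isbn:`, `lccn:` and `oclc:` search keywords; the function name `google_books_search_url` is hypothetical and not taken from this project.

```python
from urllib.parse import urlencode

# Public Google Books volumes search endpoint (assumed here; no API key shown).
GOOGLE_BOOKS_VOLUMES = "https://www.googleapis.com/books/v1/volumes"

# Identifier keywords accepted in the `q` parameter.
GB_KEYWORDS = {"isbn", "lccn", "oclc"}


def google_books_search_url(id_type: str, identifier: str) -> str:
    """Build a search URL such as ...?q=isbn%3A9780472106493 (hypothetical helper)."""
    id_type = id_type.lower()
    if id_type not in GB_KEYWORDS:
        raise ValueError(f"unsupported identifier type: {id_type}")
    query = urlencode({"q": f"{id_type}:{identifier}"})
    return f"{GOOGLE_BOOKS_VOLUMES}?{query}"


url = google_books_search_url("ISBN", "9780472106493")
```

The JSON response would then need to be checked for volumes whose identifiers actually match, since a search hit alone is not proof of the same edition.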

ryanfb commented Apr 10, 2018

For ISBN, some example data, using "Parody in the Middle Ages".

GB https://books.google.com/books?id=LSh_IPyVIEIC under volumeInfo (see #15):

  "industryIdentifiers": [
   {
    "type": "ISBN_10",
    "identifier": "047210649X"
   },
   {
    "type": "ISBN_13",
    "identifier": "9780472106493"
   }
  ],

HT https://babel.hathitrust.org/cgi/pt?id=mdp.39015038607985: the ISBN "9780472106493" is repeated 3 times, so we need to uniq the array before iterating over it (probably the same for OCLC/LCCN).
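
As a rough illustration of the uniq step, here is a Python sketch that removes duplicates while keeping the original order, then checks the HT values against the GB `industryIdentifiers` shown above. The helper names `unique_in_order` and `gb_identifiers` are hypothetical.

```python
def unique_in_order(values):
    """Drop duplicates but keep first-seen order (dict keys preserve insertion order)."""
    return list(dict.fromkeys(values))


def gb_identifiers(volume_info):
    """Collect the identifier strings from a GB volumeInfo object."""
    return {entry["identifier"] for entry in volume_info.get("industryIdentifiers", [])}


# HT returned the same ISBN three times for this volume.
ht_isbns = ["9780472106493", "9780472106493", "9780472106493"]

# The GB volumeInfo fragment from the example above.
volume_info = {
    "industryIdentifiers": [
        {"type": "ISBN_10", "identifier": "047210649X"},
        {"type": "ISBN_13", "identifier": "9780472106493"},
    ]
}

known = gb_identifiers(volume_info)
matches = [isbn for isbn in unique_in_order(ht_isbns) if isbn in known]
```

Deduplicating first means each identifier triggers at most one lookup, which matters if every identifier results in a separate API request.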


ryanfb commented Apr 10, 2018

We should also query HT/IA with these identifiers, so that we can go in the other direction as well, e.g. (above) from https://books.google.com/books?id=LSh_IPyVIEIC to https://babel.hathitrust.org/cgi/pt?id=mdp.39015038607985, instead of only HT->GB.
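
One possible way to do the GB->HT direction is the HathiTrust Bibliographic API, which can be queried by identifier type. The sketch below only builds the request URL; the endpoint pattern is my assumption about that API, and `hathitrust_bib_url` is a hypothetical name, not something from this repository.

```python
from urllib.parse import quote

# Assumed URL pattern for the HathiTrust Bibliographic API (brief records).
HT_BIB_API = "https://catalog.hathitrust.org/api/volumes/brief"

# Identifier types relevant to this issue.
HT_ID_TYPES = {"isbn", "lccn", "oclc"}


def hathitrust_bib_url(id_type: str, identifier: str) -> str:
    """Build a brief-record lookup URL for one identifier (hypothetical helper)."""
    id_type = id_type.lower()
    if id_type not in HT_ID_TYPES:
        raise ValueError(f"unsupported identifier type: {id_type}")
    return f"{HT_BIB_API}/{id_type}/{quote(identifier, safe='')}.json"


# ISBN_13 taken from the GB industryIdentifiers example above.
ht_url = hathitrust_bib_url("isbn", "9780472106493")
```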

ryanfb added a commit that referenced this issue Apr 10, 2018
… biblio query functions, have IA metadata queries handle arrays OR strings (see #3)

ryanfb commented Apr 10, 2018

For IA volumes, we should also use e.g.:

"related-external-id": [
      "urn:isbn:0714120588",
      "urn:lccn:90213619",
      "urn:oclc:611878143",
      "urn:oclc:185754288",
      "urn:oclc:22731242",
      "urn:oclc:246635459",
      "urn:oclc:460183744",
      "urn:oclc:831416100",
      "urn:isbn:0520070879",
      "urn:lccn:89052107",
      "urn:oclc:869257238",
      "urn:oclc:22279285",
      "urn:oclc:750645767",
      "urn:isbn:071411703X",
      "urn:oclc:438658389",
      "urn:oclc:462786221",
      "urn:oclc:801861458",
      "urn:oclc:848632400"
    ],
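
A sketch of how these URNs could be split into per-type identifier lists. It accepts either a single string or an array, since IA metadata fields can come back in either form, and it drops duplicates along the way. The function name `parse_related_external_ids` is hypothetical.

```python
from collections import defaultdict


def parse_related_external_ids(value):
    """Group 'urn:<type>:<id>' strings by type, e.g. {'isbn': [...], 'oclc': [...]}."""
    if value is None:
        return {}
    if isinstance(value, str):
        # IA may return a bare string when there is only one value.
        value = [value]
    grouped = defaultdict(list)
    for urn in value:
        parts = urn.split(":", 2)
        if len(parts) != 3 or parts[0].lower() != "urn":
            continue  # skip anything that is not a urn:<type>:<id> triple
        id_type, identifier = parts[1].lower(), parts[2]
        if identifier not in grouped[id_type]:
            grouped[id_type].append(identifier)
    return dict(grouped)


# A few entries from the example above, with one repeated on purpose.
sample = [
    "urn:isbn:0714120588",
    "urn:lccn:90213619",
    "urn:oclc:611878143",
    "urn:oclc:185754288",
    "urn:oclc:611878143",
]
ids = parse_related_external_ids(sample)
```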


ryanfb commented Apr 10, 2018

Note that we have to use the OpenLibrary API to search the Internet Archive, because of the CORS issue in ArchiveLabs/archive.org#1.
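
For reference, a sketch of building such a request against the Open Library Books API, which takes `bibkeys` such as `ISBN:...`, `LCCN:...` or `OCLC:...`. The endpoint and parameters are my assumptions about that API rather than what this project necessarily uses, and `open_library_books_url` is a hypothetical name. How the response links back to an Internet Archive item should be checked against the actual JSON.

```python
from urllib.parse import urlencode

# Assumed Open Library Books API endpoint.
OPEN_LIBRARY_BOOKS = "https://openlibrary.org/api/books"


def open_library_books_url(identifiers):
    """Build one request for several (type, identifier) pairs (hypothetical helper)."""
    bibkeys = ",".join(f"{id_type.upper()}:{identifier}" for id_type, identifier in identifiers)
    params = {"bibkeys": bibkeys, "format": "json", "jscmd": "details"}
    return f"{OPEN_LIBRARY_BOOKS}?{urlencode(params)}"


# Identifiers taken from the related-external-id example above.
ol_url = open_library_books_url([("isbn", "0714120588"), ("oclc", "611878143")])
```

Batching several keys into one request keeps the number of cross-origin calls down, at the cost of having to map each result key back to the identifier that produced it.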
