This repository has been archived by the owner on Oct 1, 2024. It is now read-only.

Broken UTF-8 characters in indexed literals #1

Open
ewinge opened this issue May 31, 2016 · 2 comments

ewinge commented May 31, 2016

Sublima indexes all string literals in the predicate sub:literals. The indexing breaks Unicode characters:

De nordiske Juristmøder. Litteratur. Nordic countries. Literature. Nordic. Københavns Universitet. Norden. Rettskilder generelt. De nordiske juristmøter Artikler og debatter fra de nordiske juristmøtene siden 1948. Juraportal.dk driftet av det juridiske fakultetsbibliotek i København.. The Nordic countries. 

http://juridisk.net:8890/sparql?default-graph-uri=&query=Select+%3Fp+%3Fo%0D%0AWhere{%0D%0A%3Chttp%3A%2F%2Fjura.ku.dk%2Fnjm%2F%3E+%3Fp+%3Fo%0D%0A}&format=text%2Fhtml&timeout=0&debug=on

SPARQL query:

select ?p ?o
where {
    <http://jura.ku.dk/njm/> ?p ?o
}
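
For anyone who wants to re-run the query without the long hand-encoded link, here is a minimal Python sketch that builds the same request URL from the query text. The endpoint and parameter names are copied from the link above; the helper name `build_url` is made up for illustration, and nothing here has been checked against the live server.

```python
from urllib.parse import urlencode, parse_qs, urlsplit

# Endpoint taken from the link in this issue.
ENDPOINT = "http://juridisk.net:8890/sparql"

# The query from this issue, unchanged.
QUERY = """select ?p ?o
where {
    <http://jura.ku.dk/njm/> ?p ?o
}"""


def build_url(query, result_format="text/html"):
    """Return the request URL for `query` (hypothetical helper).

    The parameter names mirror the ones in the link above.
    """
    params = {
        "default-graph-uri": "",
        "query": query,
        "format": result_format,
        "timeout": "0",
        "debug": "on",
    }
    # urlencode percent-encodes each value, so '?', '<', '{' and
    # newlines in the query are escaped consistently.
    return ENDPOINT + "?" + urlencode(params)


url = build_url(QUERY)
print(url)
```

Letting `urlencode` do the escaping avoids hand-written links in which some characters (such as the braces) end up unescaped.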

ewinge commented Jun 2, 2016

Possibly caused by this issue in Virtuoso:
openlink/virtuoso-opensource#17
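
The issue does not say exactly how the characters come out broken. One common pattern, assumed here purely for illustration, is UTF-8 bytes being decoded as Latin-1 somewhere along the way. The Python sketch below shows what that would do to a word from the literal above and how such a value could be detected. The function names are hypothetical, and this is not a statement about what Virtuoso actually does.

```python
def garble(text):
    """Simulate the assumed failure: UTF-8 bytes read as Latin-1."""
    return text.encode("utf-8").decode("latin-1")


def repair(text):
    """Try to undo the assumed failure; return text unchanged if it does not apply."""
    try:
        return text.encode("latin-1").decode("utf-8")
    except UnicodeError:
        # Either the text has characters outside Latin-1, or its bytes
        # are not valid UTF-8, so it was not garbled in this particular way.
        return text


def looks_garbled(text):
    """True if the text can be 'repaired', i.e. it matches the assumed pattern."""
    return repair(text) != text


broken = garble("København")
print(broken)                      # KÃ¸benhavn
print(repair(broken))              # København
print(looks_garbled("København"))  # False
```

If the values in sub:literals show this two-characters-per-letter pattern, that would point at a decoding step with the wrong charset. A different pattern (for example question marks) would suggest a different cause.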

ewinge closed this as completed Jun 2, 2016
ewinge reopened this Jun 2, 2016

ewinge commented Jun 2, 2016

Examples:

  1. Broken RDF/XML
  2. Compare HTML
  3. Describe query works

In the last example, literals and externalliterals are broken because of this bug, although the title is correct.
