Why `’` is converted to `'`? #247

dimus · 2023-09-12T15:14:30Z

Another issue is that "D'Orbigny" in the original is "D’Orbigny" in the gnparser output. Why change UTF-8 27 to e2 80 99?

dimus · 2023-09-12T15:19:41Z

I do try to normalize/simplify characters if it does not change semantic meaning. My impression is that ' and ’ are used interchangeably for authors in scientific names, and I picked ' because it is ASCII, meaning it will generate less problems for people with weird default encoding.

The original spelling of the authorship is preserved in JSON format in the verbatim field:

"authorship": {
    "verbatim": "B.D’Orbigny",
    "normalized": "B. D' Orbigny",
    "authors": [
      "B. D' Orbigny"
    ],
    "originalAuth": {
      "authors": [
        "B. D' Orbigny"
      ]
    }
  },

It might make sense to leave verbatim authorship in csv/tsv output, let me think about it a bit.

Mesibov · 2023-09-15T07:31:02Z

@dimus, I've rechecked the original dataset and found that the compilers used both characters:
3 records Acteocina candei (D’Orbigny, 1841)
37 records Acteocina candei (D'Orbigny, 1842)

gnparser converted both to apostrophe in Author, which is OK. I was looking at "D’Orbigny" in the verbatim field and thinking I had inputted "D'Orbigny", so my mistake, all is well. In my pseudo-duplicate search the results are fine:

Acteocina candei (D’Orbigny, 1841) [3]
Acteocina candei (D'Orbigny, 1842) [37]

dimus changed the title ~~Why "’" is converted to "'"?~~ Why ’ is converted to '? Sep 12, 2023

dimus transferred this issue from gnames/gnames Sep 12, 2023

dimus closed this as completed Sep 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why `’` is converted to `'`? #247

Why `’` is converted to `'`? #247

dimus commented Sep 12, 2023

dimus commented Sep 12, 2023 •

edited

Loading

Mesibov commented Sep 15, 2023 •

edited

Loading

Why ’ is converted to '? #247

Why ’ is converted to '? #247

Comments

dimus commented Sep 12, 2023

dimus commented Sep 12, 2023 • edited Loading

Mesibov commented Sep 15, 2023 • edited Loading

Why `’` is converted to `'`? #247

Why `’` is converted to `'`? #247

dimus commented Sep 12, 2023 •

edited

Loading

Mesibov commented Sep 15, 2023 •

edited

Loading