Change single backticks in Sphinx to render as code #13519

asmeurer · 2017-10-23T06:33:57Z

Right now single backticks, like `this` in a docstring render as math. You need double backticks like ``this`` to render as code.

This confuses just about everyone because it's different from markdown, which is what GitHub uses for comments and so on.

Apparently you can set the single backticks to anything. I think the mathjax extension just sets it to math by default.

We do have a decent amount of math in our docstrings, but you can also use :math:`math`. I wonder if it would also be possible to make it use dollar signs. I suspect for every docstring currently using single backquotes for math there are two using it incorrectly for code.

gxyd · 2017-10-23T06:38:30Z

Yup a lot of places make use of single and double backticks inconsistently. Add 'Easy to Fix'?

asmeurer · 2017-10-23T06:40:32Z

Well first we should discuss if we actually want to do this.

Also I don't know how easy this is to fix, since it requires modifying the Sphinx configuration. Hopefully it's not too hard, though.

skirpichev · 2017-10-23T14:20:59Z

This confuses just about everyone because it's different from markdown

More important, probably, that this break numpy docstring standard. Using single backticks required to enclose variables, e.g. in Parameters section.

I think the mathjax extension just sets it to math by default.

It's not mathjax, it's sympy devs (default_role sphinx option).

jksuom · 2017-10-24T06:34:59Z

Changing the role of single backticks is easy to do in Sphinx config file, but I think that should be done only if single dollars can be used for inline math. MathJax can be configured to do that; in fact, this is in its config/default.js file:

    inlineMath: [
//    ['$','$'],      // uncomment this for standard TeX math delimiters
      ['\\(','\\)']
    ],

The hard part is making the MathJax server use the modified configuration. It is not necessary to completely replace the standard combined configuration file (TeX-AMS-HTML-full); it will suffice to add a small local file to the config list provided the server can access it. That can apparently be arranged in two ways, either having the file locally on the server location or having it at some publicly accessible URL. There is already a skeleton local config file that does nothing. We could prepare a similar file local/inline-math.js for inline math. Our problem would then be solved if that could be placed at some public location. It could even be possible to have it at cdn.mathjax.org since we would certainly not be the only ones interested in it.

asmeurer · 2017-10-24T18:45:37Z

It might be simpler to use a Sphinx extension to convert $math$ to the appropriate inline math directive.

asmeurer · 2017-10-24T18:47:00Z

For example https://github.com/certik/sphinx-jax/blob/master/exts/math_dollar.py (CC @certik)

certik · 2017-10-24T20:37:25Z

I agree, I think `this` should render code, and $ can render math. One can use the extension, that seems to work pretty well.

vishalg2235 · 2017-11-06T09:07:16Z

Should I change `this` to render code and $this$ to render math part in matrices.py docstring ?

asmeurer · 2017-11-06T09:30:46Z

The change here will need to happen in all modules at once when we make it, as it would change a single backtick to do code everywhere.

vishalg2235 · 2017-11-06T16:05:22Z

ok.. is there something that I could help ?

asmeurer · 2017-11-06T18:43:24Z

Since it will be a lot of work to change all single backticks throughout the code, a good first step would be to add the ability to use $math$, without messing with backticks just yet. Then we can transition over a few PRs instead of trying to do it all in one.

vishalg2235 · 2017-11-06T19:01:19Z

can you please guide me how to add that ability ?

asmeurer · 2017-11-06T20:28:41Z

We need to copy the Sphinx extension from @certik that I mentioned above.

vishalg2235 · 2017-11-07T16:15:25Z

Ok so we pass every module from that extension and then every math part will be changed to $math$ . can we also use https://regexr.com/ for that

vishalg2235 · 2017-11-08T13:08:23Z

@asmeurer I have gone through extension code. So how can I help (what is my task) ?
@certik https://github.com/certik/sphinx-jax/blob/master/exts/math_dollar.py#L36 I didn't understood this ?

certik · 2017-11-08T17:50:10Z

@vishalg2235 I tried to document this part here:

https://github.com/certik/sphinx-jax/blob/master/exts/math_dollar.py#L15

There is an example there ($f(n) = 0 \text{ if $n$ is prime}$) where you don't want to change the inner $n$ into math, since mathjax or latex themselves will do that. So then the comment here:

https://github.com/certik/sphinx-jax/blob/master/exts/math_dollar.py#L27

explains that you substitute $n$ for a temporary string, then convert the rest of dollar signs to math, and then substitute back.

@vishalg2235 let me know if it is clear, and then once it is, can you please improve the documentation and comments in the code using your own words so that it's clear to others as well? That would be very helpful.

skirpichev · 2017-11-08T18:17:29Z

Besides backticks vs dollar, using latex in docstring - probably a bad idea in general (except for optional sections like suggested by numpy standard, e.g. Notes).

certik · 2017-11-08T18:35:02Z

Yeah, I agree, I think it's best is to keep latex to minimum for docstrings.

asmeurer · 2017-11-08T20:14:22Z

I like the use of LaTeX math. It makes docstrings like the ones in the integral transforms docs easier to read. And even for small things, I think using LaTeX makes the docs look more professional. I think most people consume the SymPy documentation online.

Even for those that use ?, I think most people use the notebook. It is possible to render docstrings as html in the notebook. Unfortunately, it doesn't yet support LaTeX math. It also isn't enabled by default. So we should work with the IPython guys to get that fixed. But when we do the docs will render nicely with LaTeX math for almost everyone.

The main downside to too much LaTeX is for those of us who primarily look at docstrings in the terminal, or in the SymPy sources (myself included). It may actually be possible to extend the above docrepr idea to the terminal IPython so that ? opens the docstring in the browser.

Regardless, I think we should primarily focus on the majority of users, who use the html docs or the notebook.

certik · 2017-11-08T20:22:15Z

I meant especially things like like $f(n) = 0 \text{ if $n$ is prime}$, which looks complicated, i.e., I think we should use rest or markdown to do the formatting, not latex. A simple latex math, like the one used in the integrals page, is fine I think.

skirpichev · 2017-11-08T21:10:32Z

It is possible to render docstrings as html in the notebook. Unfortunately, it doesn't yet support LaTeX math.

numpy docstring from example seems to be too ugly even without LaTeX rendering. Yet this feature looks promising.

But when we do the docs will render nicely with LaTeX math for almost everyone.

For Diofant, I'm thinking about using unicode pretty-printing for math, e.g. for above integral transform definitions (except for optional heavy-math sections). This should work both for sphinx docs and in any Jupyter frontend. (Or even in plain CPython console.)

moorepants · 2017-11-08T21:51:43Z

I would also vote for moving most, if not all, latex into the "Notes" section of the docstrings. The main portion of the docstrings should be readable in ascii or unicode. I, for example, almost exclusively read SymPy doc strings in the terminal or notebook which only show the ascii/unicode. We followed this pattern in the mechanics package and the balance works pretty good. You can understand the docstrings when working interactively and look up the rendered versions if you want to know more details. The primary use of the docstring is to figure out quickly what to type as arguments to a method/function. The first lines of the docstring should reflect this.

asmeurer · 2017-11-09T00:49:26Z

If you want to refactor our docstrings, go for it. It's a separate question of whether we should use ` for code and $ for math, which I assume we all agree on.

vishalg2235 · 2017-11-09T12:50:53Z

@certik Ok I got that.. I'm getting an idea what math_dollar.py is doing. Still I don't understand many lines because regular expressions is new to me.
I can improve documentation and comments in code. So should I send a PR in sphinx_jax repository ?

vishalg2235 · 2017-11-10T13:35:57Z

also while checking $...$ why we need r"(?<!$)(?<!\)$([^\$]+?)$" in Line 41 . What is it doing

asmeurer · 2018-09-14T21:46:15Z

I suggest a multi-stage process for changing this:

Make $math$ work for inline math.
Change default_role to "error" and modify all the warnings to use double backticks if they should be code or dollar signs if they should be math. The recently added --keep-going flag to Sphinx should help with this.
Do a mass find and replace of all double backticks with single backticks. Double backticks will continue to work, but single backticks will be preferred.

Step 1 can and should be done as a separate pull request.

The default_role="error" should only be done to find and replace, not merged into master. Steps 2 and 3 can be done in the same PR.

The idea is that there are a lot of misuses of single backticks still in the codebase that should be double backticks, so we should do an audit first to fix them all. We know that audit will be complete if they are all replaced with double backticks or dollar signs, which shouldn't be in error (hence the default_role="error").

As a side note, let's discuss whether or not LaTeX should be used at #14964, and not do any mass changes as part of this issue.

asmeurer · 2019-09-10T22:55:25Z

I added math_dollar at #17605.

I don't plan on replacing the backticks throughout SymPy with dollars, so if someone wants to take that up, please do. Note that it is not as simple as constructing a regular expression replacement, because many backticks in the current docstrings are actually supposed to be code (double backticks), so each replacement needs to be manually checked.

moorepants · 2019-09-30T23:53:38Z

The numpydoc spec does say "Enclose variables in single backticks. The colon must be preceded by a space, or omitted if the type is absent." But I guess changing single backticks to code is better than having them render as math.

mgeier · 2019-10-11T15:40:02Z

The numpydoc spec does say "Enclose variables in single backticks. [...]"

I think that's an unfortunate choice from their part.

This assumes that backticks have the default docutils meaning of "emphasis".

I think it would be much better to recommend enclosing variables in asterisks, like this:

Description of parameter *x*.

This has the same result as intended by the NumPy people, but it doesn't depend on the default_role setting.

Ideally, this should be changed in the NumPy docstring guidelines (https://numpydoc.readthedocs.io/en/latest/format.html#sections).

I personally always simply ignored this specific recommendation (and used asterisks), and I think SymPy should do so, too.

asmeurer · 2019-10-11T16:09:11Z

To me, variables should be rendered as code, not italics. So it should be double backticks. Unless the variable can be cross referenced, then you can use the colon-backtick syntax.

mgeier · 2019-10-11T16:18:18Z

This is of course a matter of taste (and my taste isn't important here), but I wanted to stress that both NumPy and Python use "emphasis" instead of "code" in this case.

asmeurer · 2019-10-11T16:26:13Z

"code" is obviously the correct thing to use in terms of strict markup elements. Most importantly, code with single backticks matches the way things are typically written in Markdown, which is used in more and more places, including here on GitHub issues. In my experience, people are so used to Markdown that they tend to write things that way in docstrings. Sometimes they do it even if they know better (I do it by accident myself all the time). So for that reason alone I think we should use single backticks.

Note that it's quite easy to change the way that code elements look in the final HTML. It's just a matter of tweaking the CSS in the theme. IMO the best formatting is to make variables monospace, with the exact same font as is used for any source code examples such as doctests and for the function parameters (the current docs don't quite do this right). That way, it is clear that they refer to elements of code, as opposed to say a mathematical variable. And it helps avoid ambiguities for variables that are also normal English words (like a).

moorepants · 2019-10-11T16:38:47Z

This statement in the sphinx docs is relevant: https://www.sphinx-doc.org/en/master/usage/restructuredtext/roles.html#roles

The default role (content) has no special meaning by default. You are free to use it for anything you like, e.g. variable names; use the default_role config value to set it to a known role – the any role to find anything or the py:obj role to find Python objects are very useful for this.

mgeier · 2019-10-11T16:44:47Z

I don't want to argue about taste, I just want to point out what others (NumPy, Python) are doing.

Philosophically, one could argue that variables (at least positional arguments) are not actually code, right?

In fact, Sphinx (or the autodoc extension, to be more specific) doesn't format them like code, see for example:

I'm talking about the highlighted lines, BTW.

So in reality it would be less consistent to format function arguments as "code" in the docstrings!

asmeurer · 2019-10-11T16:54:26Z

I guess you mean because the user wouldn't use those variable names directly?

Technically, any positional argument can be used as a keyword argument. It also is "code" in the sense that it appears literally in the source code, which is useful for anyone reading the documentation at that level.

But either way, I think that's looking at the wrong thing. The important thing is the user knowing what a piece of text in the documentation refers to. In SymPy in particular there is a lot of potential for ambiguity, because something could be referring to a variable name (including a function parameter), or a mathematical variable, or something else. And sometimes they would all use the exact same spelling, so you have to use formatting to distinguish them. There are some examples of this in the style guide PR.

In fact, Sphinx (or the autodoc extension, to be more specific) doesn't format them like code, see for example:

I guess it uses different formatting to distinguish the different parts of the function definition. Also the function definition line in Sphinx isn't strictly valid Python (at least the ones in the Python docs aren't), so it wouldn't be correct for them to be monospace code formatted.

mgeier · 2019-10-12T09:30:42Z

I guess it uses different formatting to distinguish the different parts of the function definition. Also the function definition line in Sphinx isn't strictly valid Python (at least the ones in the Python docs aren't), so it wouldn't be correct for them to be monospace code formatted.

Yeah, I guess that was the original idea behind formatting them with "emphasis".
And then, to be consistent, they also used "emphasis" for function arguments mentioned in the description text.

This is now the default in Sphinx, and just about everyone uses this. Most importantly, NumPy and probably most of the scientific Python projects use it like this.

Now if you (@asmeurer) want to format function arguments as "code" in the description text, to be consistent, you should also change the formatting of the (auto-generated) line with the function definition.

That's one of my points: consistency.

My other point: why not just do it like everybody else?

But either way, I think that's looking at the wrong thing. The important thing is the user knowing what a piece of text in the documentation refers to. In SymPy in particular there is a lot of potential for ambiguity, because something could be referring to a variable name (including a function parameter), or a mathematical variable, or something else. And sometimes they would all use the exact same spelling, so you have to use formatting to distinguish them.

AFAICT, there is no ambiguity:

variables, meaning function arguments: "emphasis", e.g. *a* (typeset as slanted)
variables, meaning math symbols: "math", e.g. $a$ (typeset with MathJax font, italics)
Python objects: e.g. :attr:`a` (typeset as link with a bold upright font)
actual code: e.g. `a` (typeset as monospaced, assuming that default_role will be changed)
normal text: e.g. a (typeset in the default font)

Did I forget something?
All those have different and distinguishable formatting.

One counter-argument: people might want to use "emphasis" (using asterisks) just to emphasize things in the description text, which could be confused with function arguments.
Response: I think it should be obvious from context what is what. And it won't happen that often anyway (I guess?).

@moorepants (#13519 (comment)):

This statement in the sphinx docs is relevant: https://www.sphinx-doc.org/en/master/usage/restructuredtext/roles.html#roles

The default role (content) has no special meaning by default. You are free to use it for anything you like, e.g. variable names; use the default_role config value to set it to a known role – the any role to find anything or the py:obj role to find Python objects are very useful for this.

Sadly, this isn't really solid advice (and should be changed in the Sphinx docs).

If you use default_role = any, you'll get warnings like this (referring to a parameter `a` in the example):

WARNING: 'any' reference target not found: a

So this is a bad choice, because you would flood your Sphinx output with warnings, hiding more important warnings.

If you use default_role = py:obj, you won't get any warnings, and variables will be typeset in an upright bold font like links, but they won't be actual links, because no Python object will be found with that name (or it would be an unrelated Python object that happens to have the same name).
But even worse, if you use this for actual Python objects, but make a typo, you will not get a warning (whereas with default_role = any you will get a warning).

So neither one of those choices should be recommended.

Apart from that, it doesn't make sense anyway, because function argument names simply aren't part of the linkable API. They aren't even Python objects in the module namespace, they are only Python objects within the function scope (which isn't really relevant for the documentation).

Anyway, this is just a response to @moorepants, and it is not really relevant in the discussion because using any or py:obj as default_role wasn't actually suggested.

asmeurer · 2019-10-12T20:33:22Z

So when you say "variables" you specifically mean things that are function parameters, not Python variables in general?

mgeier · 2019-10-13T11:31:44Z

Oh, sorry if that has been confusing ... yes, I was using the word "variables" for "function parameters", so were other people in the comments above.
It is used like this in https://numpydoc.readthedocs.io/en/latest/format.html#sections, and we were just repeating it. I'm pretty sure they do not mean Python variables in general. This should probably be changed in the NumPy styleguide.

And no, I don't mean Python variables in general. Those would be "actual code" (as mentioned in my comment above), and I would suggest formatting those in a monospace font, by using backticks (once the default_role is changed, or by double backticks before that).
However, I don't think that "normal" Python variables will appear much in docstrings, right?
Probably when talking about a code snippet in the "Examples" section?

asmeurer · 2019-10-14T00:36:57Z

If you only mean parameters I'm not opposed to it. My main concern is it would be another rule that will require more work to actually maintain the consistency.

theWiseAman · 2023-02-25T13:44:17Z

@asmeurer I have a little question. Why we didn't use single backticks as "code" snippets in the first place? Let's say I am willing to undo the double backticks, can we now implement single backticks as "code" so that it is consistent with the GitHub markdown typing habit styles? Can't the style configuration for single backticks be changed to render as "code" rather than "math"? Is there some problem with changing the style configuration?

I acknowledge it would be much simpler to correct single backtick uses but if we consider long-term perspective using single backticks as code would mean lesser errors from future contributors. Anyways correcting single backticks or converting double backticks amounts to the same effort at the end of the day.

asmeurer mentioned this issue Oct 23, 2017

Fix issue #13449 related to matrices.py docStrings #13461

Closed

gxyd mentioned this issue Oct 28, 2017

Update doc strings in matrices.py #13459

Closed

sidhantnagpal mentioned this issue Jul 24, 2018

Should ASCII/Unicode or LaTeX formulas be used in docstrings? #14964

Open

asmeurer mentioned this issue Sep 14, 2018

documentation mods for transolve #15204

Merged

asmeurer mentioned this issue Sep 10, 2019

Allow dollar signs for math in the documentation #17605

Merged

asmeurer added this to the Docstring Style Guide milestone Sep 23, 2019

This was referenced Oct 18, 2019

Add SymPy Documentation Style Guide #17715

Merged

Add a guard against people using * in LaTeX in the docs #17803

Open

asmeurer added the GSoD label Mar 23, 2020

asmeurer mentioned this issue Jun 14, 2020

RST needs double backticks #19549

Merged

brandondavid mentioned this issue Aug 8, 2020

[WIP] DOC: docstyle enforcement for stats module #19863

Draft

4 tasks

pauladkisson mentioned this issue Oct 12, 2020

Standardize docs graspologic-org/graspologic#525

Merged

mwaskom mentioned this issue Oct 27, 2020

Error if row/col colors are indexed but data isn't mwaskom/seaborn#2313

Merged

tlambert03 mentioned this issue Feb 18, 2021

Standardize usage of single backticks for variables in docstrings. napari/napari#2286

Open

paulmand3l mentioned this issue Feb 24, 2021

Expanding documentation #20984

Merged

asmeurer mentioned this issue Apr 25, 2022

Testing sphinx-lint. #23412

Merged

asmeurer mentioned this issue Jan 25, 2023

Use validated strict numpydoc format #17205

Open

sylee957 linked a pull request Jan 28, 2024 that will close this issue

Change default role of single backticks from math to code #26137

Open

pplantinga mentioned this issue Apr 18, 2024

Add fetch local_strategy parameter and disable symlinks by default speechbrain/speechbrain#2476

Merged

13 tasks

mdhaber mentioned this issue Jun 2, 2024

DOC: Clarify recommendations regarding use of backticks numpy/numpydoc#525

Merged

Change single backticks in Sphinx to render as code #13519

Change single backticks in Sphinx to render as code #13519

Comments

asmeurer commented Oct 23, 2017

gxyd commented Oct 23, 2017

asmeurer commented Oct 23, 2017

skirpichev commented Oct 23, 2017

jksuom commented Oct 24, 2017

asmeurer commented Oct 24, 2017

asmeurer commented Oct 24, 2017

certik commented Oct 24, 2017

vishalg2235 commented Nov 6, 2017

asmeurer commented Nov 6, 2017

vishalg2235 commented Nov 6, 2017

asmeurer commented Nov 6, 2017

vishalg2235 commented Nov 6, 2017

asmeurer commented Nov 6, 2017

vishalg2235 commented Nov 7, 2017 • edited Loading

vishalg2235 commented Nov 8, 2017

certik commented Nov 8, 2017

skirpichev commented Nov 8, 2017

certik commented Nov 8, 2017

asmeurer commented Nov 8, 2017

certik commented Nov 8, 2017

skirpichev commented Nov 8, 2017

moorepants commented Nov 8, 2017

asmeurer commented Nov 9, 2017

vishalg2235 commented Nov 9, 2017

vishalg2235 commented Nov 10, 2017 • edited Loading

asmeurer commented Sep 14, 2018

asmeurer commented Sep 10, 2019

moorepants commented Sep 30, 2019

mgeier commented Oct 11, 2019

asmeurer commented Oct 11, 2019

mgeier commented Oct 11, 2019

asmeurer commented Oct 11, 2019

moorepants commented Oct 11, 2019

mgeier commented Oct 11, 2019

asmeurer commented Oct 11, 2019

mgeier commented Oct 12, 2019

asmeurer commented Oct 12, 2019

mgeier commented Oct 13, 2019

asmeurer commented Oct 14, 2019

theWiseAman commented Feb 25, 2023

vishalg2235 commented Nov 7, 2017 •

edited

Loading

vishalg2235 commented Nov 10, 2017 •

edited

Loading