Header-like structures in content should be escaped #76

marijnvdwerf · 2015-11-05T16:40:09Z

Input:

<p>Foo<br>--<br>Bar</p>
<p>Foo<br>Bar<br>--</p>

Actual:

Foo  
--  
Bar

Foo  
Bar  
--

Expected:

Foo  
\--  
Bar

Foo  
Bar  
\--

andreskrey · 2016-10-08T11:11:42Z

I don't understand why the last -- is not escaped. What's the logic behind this? Only the -- tags should be escaped if they have more text below?

marijnvdwerf · 2016-10-10T20:54:41Z

This was changed in version 0.24 of the spec. 'setext' headings couldn't span multiple lines before. Updated the expected output, thanks!

andreskrey · 2016-10-11T18:44:20Z

I could fix this in a similar way I did for #77, but since there are other escaping issues waiting to be fixed, adding new rules to that same chunk of code will make it quite messy.

I'm thinking of creating a separate function, like escapeSpecialCharacters, that will call other functions (like escapeBlockquotelikeCharacters, escapeHeaderlikeCharacters, etc) but I'm not sure what's the correct approach.

Since I'll be adding new functions to the ParagraphConverter class, should I also create a proper interface for it that would define this escaping functions? Is this necessary? Is there a better approach for this issue? Perhaps instead of escaping this characters at the Converter level, another solution could be to create a function inside the HtmlConverter Class, within the convertChildren function and before the convertToMarkdown call, to catch the string before the convertion and sanitize it there.

Any thoughts @colinodell?

andreskrey · 2016-10-21T21:53:20Z

I just went ahead and created a PR for this. It's #105

colinodell · 2016-10-22T14:16:19Z

Fixed via #105

colinodell added bug commonmark-compatibility character-escaping labels Nov 5, 2015

andreskrey mentioned this issue Oct 21, 2016

Sanitization function for ParagraphConverter #105

Merged

colinodell closed this as completed Oct 22, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Header-like structures in content should be escaped #76

Header-like structures in content should be escaped #76

marijnvdwerf commented Nov 5, 2015 •

edited

Loading

andreskrey commented Oct 8, 2016

marijnvdwerf commented Oct 10, 2016

andreskrey commented Oct 11, 2016

andreskrey commented Oct 21, 2016

colinodell commented Oct 22, 2016

Header-like structures in content should be escaped #76

Header-like structures in content should be escaped #76

Comments

marijnvdwerf commented Nov 5, 2015 • edited Loading

andreskrey commented Oct 8, 2016

marijnvdwerf commented Oct 10, 2016

andreskrey commented Oct 11, 2016

andreskrey commented Oct 21, 2016

colinodell commented Oct 22, 2016

marijnvdwerf commented Nov 5, 2015 •

edited

Loading