Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Markdown improvements regarding tables, nested statements and special character support in links #1647

Open
wants to merge 13 commits into
base: main
Choose a base branch
from

Conversation

weissm
Copy link
Contributor

@weissm weissm commented Oct 31, 2024

IS YOUR COMMIT SIGNED?

yes

Enhancement or Defect Addressed by This PR

Enhancement

Description of Proposed Changes

When generating markdown nested tables the handled was not done properly. Also the handling of numbered list in tables was not properly processed.
The following changes are applied:

  • A Class PrefixClass is introduced to pass sticky parameters by reference.
  • The actual writing of text was pushed to the "T" object.
  • The anchor link is now surrounded by <>. To protect these characters, the Replace statement is moved before.

The push requested was tested on the OneNote TextToTable. For clarification another example onenote page NestedTableContent is added.

@weissm weissm changed the title squased Markdown commits Markdown improvements regarding tables, nested statements and special character support in links Nov 4, 2024
@stevencohn
Copy link
Owner

stevencohn commented Nov 5, 2024

I will take a look at this. Just need a little time. Thanks!

@stevencohn
Copy link
Owner

stevencohn commented Nov 7, 2024

@weissm Using your Nested Tables example page, this results in markdown as follows, with a table header divider line after the table, not before:

| **🛈**   | **Test Protocol**  Markdown generation of nested tables  <ol><li>Here is a nested table</li><li>Export this to an Markdown file or Copy as Markdown</li><li>Confirm that a netsted table is generate</li></ol> |  |  | 
|  :--- | :--- | :--- | :--- |

and when converted back to OneNote content, looks like this:

image

The divider line should be removed, or before the table and translated to an actual divider

OneMore/Commands/File/Markdown/MarkdownWriter.cs Outdated Show resolved Hide resolved
OneMore/Commands/File/Markdown/MarkdownWriter.cs Outdated Show resolved Hide resolved

public PrefixClass(string set_indents = "", string set_tags = "", string set_bullets = "", string tablelistid = "")
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Remove the set_ prefix on these parameter names.
Use a default constructor to take advantage of the field default values above, otherwise the default parameter values are redundant here.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

set_ removed. The default parameters I left equal to allow instantiation with detault parameters.

OneMore/Commands/File/Markdown/MarkdownWriter.cs Outdated Show resolved Hide resolved
break;

case "T":
{
var context = DetectQuickStyle(element);
if (context is not null)
if (contained) // not in table cell
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You've lost calls to DetectQuickStyle and Stylize here; OneNote can add quickStyleIndex to T, OE, OEChildren - one or any of them.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added DetectQuickStyle again. Let it out as I did not see this actually been used in OneNote so far.

}

if (index >= 0)
if (element.GetAttributeValue("quickStyleIndex", out int index))
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The code as-is, only detects quick style on OE. It's missing detection on OEChildren and T. That why the deleted code look at parent elements until it found a quickStyleIndex. Text may not be styled correctly.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The intention here was to have the check only in the current level for both OE and T as in the intially code. The recursive nature did not work here and it should not be needed, as it is handled at each stage already.


switch (symbol)
{
case 3: // to do
case 8: // client request
case 12: // schedule/callback
case 28: // to do prio 1
case 71: // to do prio 2
case 28: // todo prio 1
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

OneNote calls them "To do" tags, not "Todo" tags

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

case 64: retValue = (":star: "); break; // custom 1
case 70: retValue = (":one: "); break; // one
case 116: retValue = (":busts_in_silhouette: "); break; // busts_in_silhouette
case 117: retValue = (":notebook: "); break; // notebook
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

117 is a bell, not a notebook

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

case 70: retValue = (":one: "); break; // one
case 116: retValue = (":busts_in_silhouette: "); break; // busts_in_silhouette
case 117: retValue = (":notebook: "); break; // notebook
case 118: retValue = (":mailbox: "); break; // contact
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

118 is a letter, not a mailbox

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done.

while (nestedtables.Count() != 0)
{
var nestedtable = nestedtables.First();
writer.WriteLine(prefix.indents + "<details id=\"nested-table" + nestedtable.index + "\" open>");
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Shouldn't <details> have a closure </details> ?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

good catch. Added closing and additional

pair to allow collapse tables.

Copy link
Contributor Author

@weissm weissm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Worked in changes as suggested.

OneMore/Commands/File/Markdown/MarkdownWriter.cs Outdated Show resolved Hide resolved
while (nestedtables.Count() != 0)
{
var nestedtable = nestedtables.First();
writer.WriteLine(prefix.indents + "<details id=\"nested-table" + nestedtable.index + "\" open>");
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

good catch. Added closing and additional

pair to allow collapse tables.

case 70: retValue = (":one: "); break; // one
case 116: retValue = (":busts_in_silhouette: "); break; // busts_in_silhouette
case 117: retValue = (":notebook: "); break; // notebook
case 118: retValue = (":mailbox: "); break; // contact
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done.

case 64: retValue = (":star: "); break; // custom 1
case 70: retValue = (":one: "); break; // one
case 116: retValue = (":busts_in_silhouette: "); break; // busts_in_silhouette
case 117: retValue = (":notebook: "); break; // notebook
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done


switch (symbol)
{
case 3: // to do
case 8: // client request
case 12: // schedule/callback
case 28: // to do prio 1
case 71: // to do prio 2
case 28: // todo prio 1
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

}

prefix.bullets = "";
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

break;

case "T":
{
var context = DetectQuickStyle(element);
if (context is not null)
if (contained) // not in table cell
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added DetectQuickStyle again. Let it out as I did not see this actually been used in OneNote so far.


public PrefixClass(string set_indents = "", string set_tags = "", string set_bullets = "", string tablelistid = "")
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

set_ removed. The default parameters I left equal to allow instantiation with detault parameters.

}

if (index >= 0)
if (element.GetAttributeValue("quickStyleIndex", out int index))
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The intention here was to have the check only in the current level for both OE and T as in the intially code. The recursive nature did not work here and it should not be needed, as it is handled at each stage already.

@stevencohn
Copy link
Owner

@weissm This PR is now far beyond the original intent described by the title. I cannot accept this as a single PR.

@weissm
Copy link
Contributor Author

weissm commented Nov 24, 2024

@weissm This PR is now far beyond the original intent described by the title. I cannot accept this as a single PR.

Understood. This PR extension was meant to address:
@weissm Using your Nested Tables example page, this results in markdown as follows, with a table header divider line after the table, not before:

Will create separate PRs to ease handling.

@weissm weissm force-pushed the improveMarkdownPullRequest branch from b3c24d7 to a4e4ada Compare November 28, 2024 20:45
@weissm
Copy link
Contributor Author

weissm commented Nov 28, 2024

Reworked the complete change set in cluster changes into indivdiual commits.

@weissm weissm force-pushed the improveMarkdownPullRequest branch from a4e4ada to 8bbae19 Compare December 1, 2024 19:32
@weissm weissm force-pushed the improveMarkdownPullRequest branch 3 times, most recently from 1160af0 to bda1f0e Compare December 21, 2024 07:18
@@ -122,6 +121,33 @@ public void OptimizeForSave(bool keep)

public bool IsValid => Root != null;

public static List<(string name, string id, string topic, int type)> taglist = new List<(string name, string id, string topic, int type)>
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This list does not belong in the Page model

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was looking for a common place to handle the taglist in both MarkDownWriter and OneMoreDigExtension. The class page looked like a suitable place. Let me know, which place would fit better.

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add it as a separate static class under the Markdown folder to keep it within its domain, maybe something like MarkdownEmojis.cs

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Created a dedicated class and file as proposed.

@@ -39,24 +39,32 @@ public static QuickStyleDef GetDefaults(this StandardStyles key)
{
case StandardStyles.Heading1:
style.FontSize = "16.0";
style.SpaceAfter = "0.5";
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This extension is used from other commands beyond just those related to markdown. Does this adversely affect those? These styles are supposed to emulate the built-in OneMore styles as they appear on the page, not how markdown treats similar style types.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

seperated the spaceafter/spacebefore handling into RewriteHeadings

OneMore/Models/Page.cs Show resolved Hide resolved
.RewriteTodo(touched)
.SpaceOutParagraphs(touched, 12);
.RewriteTodo(touched);
// disabled, as it will space out also short lines. Added space in heading defintion instead
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

See my note below about changing the heading definitions. They need to match the OneNote built-in definitions. This needs to be solved a diffferent way

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Moved the spacebefore/after handling into RewriteHeadings

@@ -122,6 +121,33 @@ public void OptimizeForSave(bool keep)

public bool IsValid => Root != null;

public static List<(string name, string id, string topic, int type)> taglist = new List<(string name, string id, string topic, int type)>
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add it as a separate static class under the Markdown folder to keep it within its domain, maybe something like MarkdownEmojis.cs

OneMore/Commands/File/Markdown/MarkdownConverter.cs Outdated Show resolved Hide resolved
@weissm weissm force-pushed the improveMarkdownPullRequest branch 2 times, most recently from 5e99072 to b24e7f6 Compare December 30, 2024 15:29
@weissm weissm requested a review from stevencohn December 30, 2024 15:36
@weissm weissm force-pushed the improveMarkdownPullRequest branch 3 times, most recently from 82429f4 to ac90721 Compare January 5, 2025 08:49
@weissm weissm force-pushed the improveMarkdownPullRequest branch from ac90721 to 24f15db Compare January 6, 2025 08:51
Copy link
Owner

@stevencohn stevencohn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for your work here. I really do appreciate it. I'm going to adopt your changes, except under a different branch so I can incorporate some of my own changes. I've left some comments here to explain those changes. I'll merge to main soon. Best Regards.

@@ -51,6 +52,28 @@ public void RewriteHeadings()
{
RewriteHeadings(outline.Descendants(ns + "OE"));
}

// added header spacing specific to markdown
var quickstyles = page.Root.Elements(ns + "QuickStyleDef");
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a bad assumption. Should only affect Outline where we are converting markdown. Should abide by existing quick style definitions for page. If user wants spacing, they can create their own custom OneMore Style set and apply that after the conversion.

/// on the page
/// </summary>
public void RewriteTodo()
{
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Here also, should only affect Outlines that were affected, not all Outlines on page. User can apply custom style after if they want, otherwise abide by existing styles on page.

@@ -158,21 +195,43 @@ public MarkdownConverter RewriteTodo(IEnumerable<XElement> paragraphs)
{
var cdata = run.GetCData();
var wrapper = cdata.GetWrapper();
if (wrapper.FirstNode is XText)
{
cdata.Value = wrapper.GetInnerXml();
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This does absolutely nothing? Convert cdata to wrapper and then set cdata to wrapper XML? That's the same thing.

@@ -204,7 +263,7 @@ public MarkdownConverter SpaceOutParagraphs(float spaceAfter)
public MarkdownConverter SpaceOutParagraphs(
IEnumerable<XElement> paragraphs, float spaceAfter)
{
var after = $"{spaceAfter:0.0}";
var after = spaceAfter.ToString("####0.00", CultureInfo.InvariantCulture);
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We have an extension method, spaceAfter.ToInvariantString()

/// <summary>
/// Extended version from RemindCommand to handle also non Todo Tags
/// </summary>
public string SetTag(XElement paragraph, string tagSymbol, string tagName, int tagType = 0, bool tagStatus = false)
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm moving this to PageEditor class. Trying to keep the Page class focused on page-level properties and definitions and let PageEditor modify the content of the page.

@@ -600,6 +603,7 @@ private async Task ImportMarkdownFile(string filepath, CancellationToken token)

converter = new MarkdownConverter(page);
converter.RewriteHeadings();
converter.RewriteTodo();
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

See my comment on this method. Should not affect entire page. I'm removing this line.

@weissm
Copy link
Contributor Author

weissm commented Jan 13, 2025 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants