Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Nuemark2.0 issues #379

Open
tomByrer opened this issue Oct 3, 2024 · 24 comments
Open

Nuemark2.0 issues #379

tomByrer opened this issue Oct 3, 2024 · 24 comments
Assignees
Labels
bug nuemark Related to nuemark package

Comments

@tomByrer
Copy link

tomByrer commented Oct 3, 2024

Doing a quick code review of your dev branch, I am confused if the Blocks or Inline is scanned first? If Blocks, then it seems this code:

 *** Bold Italic ***

Would render as an <hr> since you are just scanning first few characters.?

Also be aware that CommonMark is VERY flexible for <hr>. I personally don't think you need to cover every edge case, but it could help to document "Only the common Markdown/CommonMark rules are covered."

@tomByrer tomByrer added improvement new feature New feature or request labels Oct 3, 2024
@nobkd nobkd added nuemark Related to nuemark package needs investigation and removed new feature New feature or request labels Oct 3, 2024
@nobkd

This comment was marked as resolved.

@nobkd nobkd added bug and removed improvement labels Oct 12, 2024
@nobkd

This comment was marked as resolved.

@nobkd nobkd changed the title Newmark2.0 *** Bold Italic *** & HR Nuemark2.0 *** Bold Italic *** & HR Oct 12, 2024
@nobkd

This comment was marked as resolved.

@tipiirai
Copy link
Contributor

@tomByrer @nobkd fixed all issues mentioned on this article

@nobkd
Copy link
Collaborator

nobkd commented Oct 21, 2024

Thank youu!
I'll check later today, if I find more unexpected results :)

@nobkd
Copy link
Collaborator

nobkd commented Oct 21, 2024

InputExpected (Commonmark Spec Implementation)Nuemark 2
*test**

**test***

***test****

- - -

****test***

***test**

**test*
<p><em>test</em>*</p>
<p><strong>test</strong>*</p>
<p><em><strong>test</strong></em>*</p>
<hr>
<p>*<em><strong>test</strong></em></p>
<p>*<strong>test</strong></p>
<p>*<em>test</em></p>
<p><em>test</em>*</p>
<p><strong>test</strong>*</p>
<p><em><strong>test</strong></em>*</p>
<hr>
<p><em><strong>*test</strong></em></p>
<p>***test**</p>
<p>**test*</p>

The first half (before hr) is correct, the second half is wrong (e.g. position of * in first after hr).

PS: I started building a test using the commonmark-spec test suite. See dev...nobkd:nue:test/cmark-spec. Maybe you want to try using it.

@nobkd

This comment was marked as resolved.

@nobkd nobkd changed the title Nuemark2.0 *** Bold Italic *** & HR Nuemark2.0 issues Oct 22, 2024
@tipiirai
Copy link
Contributor

No more loops with unclosed image tags such as ![foo *bar*]

@tipiirai
Copy link
Contributor

tipiirai commented Oct 23, 2024

Are image reflinks like ![foo][bar] supported in Markdown?

@tipiirai
Copy link
Contributor

btw: Nuemark will not support raw HTML, because that violates the separation of concerns principle

@nobkd
Copy link
Collaborator

nobkd commented Oct 23, 2024

Are image reflinks like ![foo][bar] supported in Markdown?

The commonmark reference implementation does support it: https://spec.commonmark.org/dingus/?text=!%5Bimg%5D%5Btag%5D%0A%0A%5Btag%5D%3A%20%2Fimg.png%0A#result

(PS: did you do the strike through implementation with one tilde (~) or with two (~~)? ~strike~ oder ~~strike~~?
Also, may I know why you used the pipe symbol |marked text| for <mark> and not the (in my opinion) more commonly used ==marked text==? (To be more similar to glow?)
(PPS: do we have a simple way to write <details> with <summary>?)

Edit : you can btw remove marked from this tourimage: https://github.com/nuejs/nue/blob/dev/packages%2Fnuejs.org%2Ftour%2Fimg%2Fnpm-nue.png (maybe remove node modules and do a clean install on dev, to get all the dependency changes)

@nobkd
Copy link
Collaborator

nobkd commented Oct 28, 2024

Found more problems:

1. **bold** not bold **bold**
2. **test**:

expected:

<ol>
  <li><p><strong>bold</strong> not bold <strong>bold</strong></p></li>
  <li><p><strong>test</strong>:</p></li>
</ol>

reality:

<ol>
  <li><p><strong>bold** not bold **bold</strong></p></li>
  <li><p>**test**:</p></li>
</ol>

You can see this issue e.g. on the docs index page in on dev branch: http://localhost:8080/docs/

Edit: maybe the ps should not belong to the list items?

@nobkd
Copy link
Collaborator

nobkd commented Nov 2, 2024

Nuemark also doesn't support <br> line breaks using two or more spaces at the end of a line or a backslash at the end of a line:

foo  
bar

foo\
bar

expected:

<p>foo<br />
bar</p>
<p>foo<br />
bar</p>

nobkd pushed a commit to nobkd/nue that referenced this issue Nov 14, 2024
nobkd pushed a commit to nobkd/nue that referenced this issue Nov 14, 2024
nobkd pushed a commit to nobkd/nue that referenced this issue Nov 14, 2024
nobkd pushed a commit to nobkd/nue that referenced this issue Nov 14, 2024
nobkd pushed a commit to nobkd/nue that referenced this issue Nov 14, 2024
nobkd pushed a commit to nobkd/nue that referenced this issue Nov 14, 2024
@nobkd
Copy link
Collaborator

nobkd commented Nov 17, 2024

Collapsed part fixed by #420

Escaping seems to not work properly: E.g. https://nuejs.org/docs/content-syntax.html#code-blocks
image

or
image


I also tried using code blocks with more than three backticks to wrap the md code block, but that didn't work. I tried this:

````md
```md
// here is a CSS code block
:root {
  --base-100: #f3f4f6;
  --base-200: #e5e7eb;
  --base-300: #d1d5db;
  --base-400: #6b7280;
}
```
````

But it results in this:
image

<pre></pre>
<p>:root { }</p>
<pre></pre>

@viktor-yakubiv
Copy link

@tipiirai I am wondering what was the original decision to write own parser in contrast to extending/adopting an existing one?

It seems to me, that micromark could be a great foundation that supports CommonMark out of the box hense does not have a behaviour mentioned in #379 (comment)

@viktor-yakubiv
Copy link

Are image reflinks like ![foo][bar] supported in Markdown?

Yes, image reflinks are supported by CommonMark. You may test it directly on GitHub.

@Dorifor
Copy link

Dorifor commented Nov 28, 2024

what was the original decision to write own parser in contrast to extending/adopting an existing one?

It was explained here : https://nuejs.org/blog/nue-release-candidate/#new-markdown-parser

I don't really know if the same applies to micromark

@viktor-yakubiv
Copy link

viktor-yakubiv commented Nov 28, 2024

I don't really know if the same applies to micromark

Thank you for the provided extra information. No, none of the listed problems applies to micromark (remark, unified project). Moreover, the unified project was started to solve the listed problems and provide a unified AST that can be easily transformed on any step.

I asked the reasoning for writing another parser due to the following observations:

I assume that adopting and already existing powerful foundation would reduce a lot of pain in maintaining of another parser and provide a match to the existing standard. I highly recommend you looking into the unified ecosystem. Here are a few benefits:

  • remark is widely adopted as it is already a foundation of MDX;
  • it is well documented, well typed and has an actual AST specification;
  • plugins are very easy to write and this would provide even an option for Nue users write own extensions.

@Dorifor
Copy link

Dorifor commented Nov 28, 2024

I have discovered from the #414 that Nuemark renders lists differently from the standard

I'm here for the same reason 😅

I honestly don't care much what is used I just want the thing to work out well and without issues in the end, hope it'll come to that !

@viktor-yakubiv
Copy link

Users write other Markdown much more than Nue's Markdown. This means that users would prefer Nuemark work in the same way as CommonMark (or GitHub Flavoured Markdown).
— (paraphrased) Jakob's Law

Comming from #414 (comment) and continuing the previous discussion, a few remarks on that point below.

  1. Markdown by definition is a superset of HTML, i. e. it empowers the users write a simpler form of HTML but it still is an HTML code. Forbidding HTML in Markdown can be acceptable due to security reasons but it still violates this original intention.

    Although, I support Nue's team intetion to invent a simple yet more powerful syntax than CommonMark currently is (provide blocks etc.). There is a relevant directive proposal and mast-util-directive.

    However, while forbidding HTML seems to be a technically practical decision and a good enforcment for the user to write nicer looking markup, it takes the power from the user disabling them doing things they might want in some exceptional cases. And it's important to remember: "you are not the user". As an example, there might be a hightly technical post explaining some bits of HTML and a need to provide live examples in place, or some small bits where providing ARIA would be necessary. These are out of my mind but I am sure there will be very unpredicted real use cases.

  2. Markdown in 2024 is well specified. CommonMark is the most popular and well adopted standard of Markdown syntax. The website explains why it exists and provides a lint to versioned specification.

    There is another popular standard that we all use — GitHub Flafoured Markdown (GFM) that is an extension of CommonMark and has own specification.

    There is also Multimarkdown that has own specification and toolkit but is less adopted. It aims to provide more powerful markup toolset: footnotes, citations, abreviations etc. Multimarkdown still supports all basic Markdown features (I am not sure about HTML) but it's diverges a lot and therefore provides own file extension .mmd.

    There might be others that can be found in the Interner and Wikipedia but I didn't dig into the topic further.


Coming from the point 2, Nuemark can head in one of the following directions:

  • diverge from Markdown, provide a (significantly) different markup, an independent toolkit (parser, compiler, syntax highlighters, editor extensions) for it and use unassociated with Markdown file extension like .nm;
  • adopt the Markdown (Commonmark) fully, either extend an existing toolkit or provide an independent one, and keep using common file extension .md providing and optional one .nmd if a proper distinguishing is necessary.

In any of those cases, Nuemark requires a proper specification, in my opinion. Currently, it is vaguely defined.

@nobkd
Copy link
Collaborator

nobkd commented Dec 9, 2024

I personally think, that nuemark should be commonmark compliant.
Maybe not in the area of indented code blocks (maybe clashing with multiline components), but if they were supported, it would be great.

I think, we should not reuse the --- for section splitting, because it is an already used syntax in cmark, and might confuse people. Maybe adding a new syntax, like +++ which isn't used yet would be a good alternative.

What do you think?

@yerlaser
Copy link

yerlaser commented Jan 1, 2025

Markdown has a lot of issues and despite it being the most popular, it is worth considering alternatives sometimes.
Especially because switching dialects isn't as difficult for the users as switching programming languages.
My personal favorite is Typst (https://typst.app/docs/tutorial/advanced-styling/), but ASCIIDOC is also very good (https://docs.asciidoctor.org/asciidoc/latest/text/)

@viktor-yakubiv
Copy link

@yerlaser can you please uncover what many issues there are with Markdown? I am very curious to learn

@yerlaser
Copy link

yerlaser commented Jan 2, 2025

@yerlaser can you please uncover what many issues there are with Markdown? I am very curious to learn

Well, one of the biggest issues you described yourself - it doesn't have a common standard.
And the language features too few formatting primitives so any heavy usage requires creating of a new variant.
Aside from that, the language syntax is not orthogonal where my biggest gripe is that **bold** and __bold__ as well as *italic* and _italic_ mean the same.
On the same note, - can start an unordered list and mark a <h2> title.
Another issue is manual line breaks that are made in markdown with two trailing spaces.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug nuemark Related to nuemark package
Projects
None yet
Development

No branches or pull requests

6 participants