Newline Fix #75

AndrewHawes · 2019-12-20T19:43:37Z

The parser was only looking for the line feed character ('\n'), so when it encountered an empty line with a Windows-style newline beginning with a carriage return ('\r\n'), it would miss it and instead return a PlaintextNode. This would mess up the structure of the generated HTML if the indentation prior to the newline characters was incorrect. For example, a blank line with no indentation would collapse all previous elements and put the following content after the html closing tag.

nodes.py was returning a PlaintextNode upon encountering a carriage return and skipping the following newline character. Added check for '\r'.

Parser was missing newlines on Windows files as it was only checking for the line feed character ('\n'). This was causing it to return a PlaintextNode when encountering a carriage return ('\r'), causing all sorts of problems if the white space before the newline character didn't provide the correct level of indentation (such as collapsing all elements on a completely blank line and placing following content after the html element's end tag).

rowanseymour · 2020-01-07T15:33:37Z

The way new line support usually works in Python is that \n, \r\n, and \r are all translated to just \n at the file level.. isn't that happening in this case? See https://docs.python.org/3/library/functions.html#open

AndrewHawes · 2020-01-07T23:58:19Z

It isn't in this case, because the file is being opened with codecs.open rather than open, which doesn't perform conversion on \n. So the parser is reading the line ending exactly as it's written, coming across the '\r', not recognizing it, and returning a PlaintextNode. https://docs.python.org/3/library/codecs.html?highlight=codecs#codecs.open

So with how it's working now, say you have a blank line between the head and body tags in your haml, and that blank line has no white space. The parser will return a PlaintextNode with an indentation of 0, closing all previously opened elements, and it will place the body tag after the closing html tag. It will only do this for a file with Windows line endings. It will render the same file normally if it's created with Unix line endings.

(I actually like the behavior better like this, as long as all of your blank lines are indented to where they need to be. This way, blank lines after a tag will be placed after the closing tag in the rendered document rather than before it, and there's no longer the problem with duplicate newlines. The problem is that if you're not paying attention, one missing space on a blank line can completely screw up the output.)

I guess I'd accidentally added white space to a couple empty lines when I last edited this, which is slightly ironic given the nature of the modification.

rowanseymour · 2020-01-09T15:39:08Z

hamlpy/parser/nodes.py

@@ -48,6 +48,16 @@ def read_node(stream, prev, compiler):
        if indent:
            indent = indent[0] * len(indent)

+        # empty lines with carriage returns are recorded as newlines on previous node
+        # if followed by a newline, it is skipped
+        if stream.text[stream.ptr] == '\r':


i think it would be easier to understand this if it was happening at the stream level.. what if we added def is_newline() and def read_newline() to the Stream class?

rowanseymour · 2020-01-09T15:39:55Z

@AndrewHawes ah ok that makes sense - have added a comment

AndrewHawes added 3 commits December 6, 2019 04:09

Update nodes.py to fix newlines on Windows

e889a44

nodes.py was returning a PlaintextNode upon encountering a carriage return and skipping the following newline character. Added check for '\r'.

Update nodes.py

2ac9ecb

Deleted white space from empty lines

2c69df4

I guess I'd accidentally added white space to a couple empty lines when I last edited this, which is slightly ironic given the nature of the modification.

rowanseymour reviewed Jan 9, 2020

View reviewed changes

Base automatically changed from master to main January 21, 2021 14:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Newline Fix #75

Newline Fix #75

AndrewHawes commented Dec 20, 2019

rowanseymour commented Jan 7, 2020

AndrewHawes commented Jan 7, 2020

rowanseymour Jan 9, 2020

rowanseymour commented Jan 9, 2020

Newline Fix #75

Are you sure you want to change the base?

Newline Fix #75

Conversation

AndrewHawes commented Dec 20, 2019

rowanseymour commented Jan 7, 2020

AndrewHawes commented Jan 7, 2020

rowanseymour Jan 9, 2020

Choose a reason for hiding this comment

rowanseymour commented Jan 9, 2020