Ignore a note that occurs right after \id marker #203

ddaspit · 2024-05-23T19:08:14Z

fixes "Stack empty" error for invalid USFM serval#393

Enkidu93

Reviewed 3 of 4 files at r1, all commit messages.
Reviewable status: 3 of 4 files reviewed, 1 unresolved discussion (waiting on @ddaspit)

src/SIL.Machine/Corpora/UsfmTokenizer.cs line 393 at r1 (raw file):

                            if (
                                usfm[usfm.Length - 1] == ' '
                                && ((prevToken != null && prevToken.ToUsfm().Trim() != "") || !tokensHaveWhitespace)

What's the point of this logic here? I'm not quite getting it.

- fixes sillsdev/serval#393

codecov-commenter · 2024-05-31T22:11:18Z

Codecov Report

Attention: Patch coverage is 95.65217%, with 1 line in your changes missing coverage. Please review.

Project coverage is 67.32%. Comparing base (a19d577) to head (6cc0bc7).

Files	Patch %	Lines
src/SIL.Machine/Corpora/UsfmTokenizer.cs	93.75%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #203      +/-   ##
==========================================
+ Coverage   67.31%   67.32%   +0.01%     
==========================================
  Files         441      441              
  Lines       35001    35021      +20     
  Branches     4695     4700       +5     
==========================================
+ Hits        23560    23579      +19     
  Misses      10352    10352              
- Partials     1089     1090       +1

Enkidu93

Reviewed 1 of 4 files at r1.
Reviewable status: all files reviewed, 1 unresolved discussion (waiting on @ddaspit)

src/SIL.Machine/Corpora/UsfmTokenizer.cs line 393 at r1 (raw file):

Previously, Enkidu93 (Eli C. Lowry) wrote…

What's the point of this logic here? I'm not quite getting it.

Sorry, I haven't approved this yet because I was hoping to get a better handle on this logic. Could you explain what's going on, @ddaspit?

ddaspit

Reviewable status: all files reviewed, 1 unresolved discussion (waiting on @Enkidu93)

src/SIL.Machine/Corpora/UsfmTokenizer.cs line 393 at r1 (raw file):

Previously, Enkidu93 (Eli C. Lowry) wrote…

Sorry, I haven't approved this yet because I was hoping to get a better handle on this logic. Could you explain what's going on, @ddaspit?

This checks to see if we need to strip off the space at the end before a newline. There are two cases that we have to handle:

- The tokens contain whitespace. This occurs when we are trying to preserve the whitespace from the original USFM.
- The tokens do not contain whitespace. This occurs when we want to normalize the whitespace.

This logic is preserved from the original USFM parser code in Paratext.
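
As a rough illustration of the check described above, here is a minimal C# sketch. The helper name `ShouldStripTrailingSpace` and the `prevTokenUsfm` parameter are hypothetical, standing in for the `prevToken.ToUsfm()` call in the quoted snippet. Only the boolean condition is taken from the line under review; the empty-string guard is an addition for this sketch, and none of this is the actual `UsfmTokenizer` code.

```csharp
using System;

// Hypothetical helper mirroring the condition quoted in the review.
// prevTokenUsfm stands in for prevToken.ToUsfm(); null means there is no previous token.
static bool ShouldStripTrailingSpace(string usfm, string? prevTokenUsfm, bool tokensHaveWhitespace)
{
    // Guard added for this sketch so that an empty string does not throw.
    if (usfm.Length == 0 || usfm[usfm.Length - 1] != ' ')
        return false;

    // Same shape as the quoted condition: the previous token has non-whitespace
    // content, or the tokens do not carry whitespace at all.
    bool prevTokenHasContent = prevTokenUsfm != null && prevTokenUsfm.Trim() != "";
    return prevTokenHasContent || !tokensHaveWhitespace;
}

// Tokens carry whitespace and the previous token has content.
Console.WriteLine(ShouldStripTrailingSpace("\\v 1 text ", "\\v 1 ", true));  // True
// Tokens carry whitespace and the previous token is whitespace only.
Console.WriteLine(ShouldStripTrailingSpace("text ", " ", true));             // False
// Tokens do not carry whitespace (normalizing), no previous token.
Console.WriteLine(ShouldStripTrailingSpace("text ", null, false));           // True
// No trailing space, so there is nothing to strip.
Console.WriteLine(ShouldStripTrailingSpace("text", "\\v 1 ", false));        // False
```

When the tokens do not carry whitespace, the condition holds for any trailing space. When they do carry whitespace, it holds only if the previous token has non-whitespace content.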

Enkidu93

Reviewable status: all files reviewed, 1 unresolved discussion (waiting on @ddaspit)

src/SIL.Machine/Corpora/UsfmTokenizer.cs line 393 at r1 (raw file):

Previously, ddaspit (Damien Daspit) wrote…

This checks to see if we need to strip off the space at the end before a newline. There are two cases that we have to handle:

- The tokens contain whitespace. This occurs when we are trying to preserve the whitespace from the original USFM.
- The tokens do not contain whitespace. This occurs when we want to normalize the whitespace.

This logic is preserved from the original USFM parser code in Paratext.

Thank you!

Enkidu93 reviewed May 31, 2024

Ignore a note that occurs right after \id marker

6cc0bc7

- fixes sillsdev/serval#393

ddaspit force-pushed the note-after-id branch from 9743519 to 6cc0bc7 Compare May 31, 2024 22:06

Enkidu93 reviewed Jun 3, 2024

ddaspit commented Jun 3, 2024

Enkidu93 approved these changes Jun 3, 2024

ddaspit merged commit bf2b46d into master Jun 4, 2024
4 checks passed

ddaspit deleted the note-after-id branch June 4, 2024 19:23
