WIP: slightly improve substitutions #562

carenas · 2024-11-13T13:36:08Z

Avoid at least one crash introduced with recent changes to substitute code as well as clarify what the expected offset value should be when overflowing the provided buffer.

While at it, make sure that the returned string is always NUL terminated, and do some minor cleanup.

NOTE: at least truncation is wrong so posting mainly as a FYI with the hopes someone else might give it some love

Avoid at least one crash introduced with recent changes to substitute code as well as clarify what the expected offset value should be when overflowing the provided buffer. While at it, make sure that the returned string is always NUL terminated, and do some minor cleanup.

zherczeg · 2024-11-15T17:28:37Z

Could you update this patch?

NWilson · 2024-11-20T13:36:04Z

Could you also split out the unrelated changes into their own PRs? It should be quick to do, and would let us merge the cosmetic changes and behaviour-altering changes in their own commits.

NWilson · 2024-11-20T13:39:12Z

src/pcre2_compile.c

@@ -1113,7 +1112,7 @@ in the decoded tables. */

 if ((code->flags & PCRE2_DEREF_TABLES) != 0)
  {
-  ref_count = (PCRE2_SIZE *)(code->tables + TABLES_LENGTH);
+  PCRE2_SIZE *ref_count = (PCRE2_SIZE *)(code->tables + TABLES_LENGTH);


Personally, I'm very happy with these changes.

I know Philip likes the old style of defining variables high up, at the top of a scope, and with a blank line after variable definitions.

But I don't see any benefit to having variables available for use, but not yet initialised. Much better to define & initialise at the same time (safer).

The compiler will hoist all the variables up to the top anyway (it will bump the stack pointer just once at the start of a block, rather than bump the stack pointer multiple times, when it sees a new variable).

Partly it's because I'm a dinosaur from the age when one had to define variables like that, but partly also I find it makes it easier when looking back up some code to find where a variable is defined. However, I am not going to try to impose my own preferences on the future. I can certainly see the advantage of always initializing at definition time. So please don't worry about me too much.

Funny is that this change is still valid C89 code and the main motivation wasn't to go against Philip's advice of defining variables at the beginning of blocks, but just reducing the scope of this variable to where it was actually needed/used.

Since we have at least one CI job with -Wshadow and I wanted to minimize churn didn't rename the variable to reflect its "temp" holder (might be even optimized out) status.

src/pcre2_substitute.c

NWilson · 2024-11-20T13:41:36Z

src/pcre2_substitute.c

+  extra_needed++;
+  lengthleft = 0;
+  }
+if (!overflowed || lengthleft == 0) buffer[buff_offset] = 0; else extra_needed++;


Could you explain why you need to inline the CHECKMEMCPY here for, for the trailing NUL?

What was wrong before? Do you want the returned string to be NUL-terminated, even if the function returns an error?

Do you want the returned string to be NUL-terminated, even if the function returns an error?

Correct, I found the way this function behaves strange and the fact that it will return non NUL terminated strings on overflow, potentially risky.

NWilson · 2024-11-20T13:43:17Z

testdata/testinput2

+    123abc123\=substitute_overflow_length,replace=[9]XYZ
+    123abc123\=substitute_overflow_length,replace=[6]XYZ
+    123abc123\=substitute_overflow_length,replace=[1]XYZ
+    123abc123\=substitute_overflow_length,replace=[0]XYZ


I'd be curious to run these new tests against the old code, just to see which (if any) of the test outputs have changed.

None does, but could add split the tests in a "setup" patch of its own so it will be obvious

carenas mentioned this pull request Nov 13, 2024

Several heap-BoF crashes from pcre2_compile related to parsed_pattern #561

Closed

NWilson reviewed Nov 20, 2024

View reviewed changes

src/pcre2_substitute.c Show resolved Hide resolved

NWilson reviewed Nov 20, 2024

View reviewed changes

NWilson modified the milestone: 10.45-RC1 Dec 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: slightly improve substitutions #562

WIP: slightly improve substitutions #562

carenas commented Nov 13, 2024

zherczeg commented Nov 15, 2024

NWilson commented Nov 20, 2024

NWilson Nov 20, 2024

PhilipHazel Nov 20, 2024

carenas Nov 30, 2024

NWilson Nov 20, 2024

carenas Nov 20, 2024

NWilson Nov 20, 2024

carenas Nov 20, 2024

WIP: slightly improve substitutions #562

Are you sure you want to change the base?

WIP: slightly improve substitutions #562

Conversation

carenas commented Nov 13, 2024

zherczeg commented Nov 15, 2024

NWilson commented Nov 20, 2024

NWilson Nov 20, 2024

Choose a reason for hiding this comment

PhilipHazel Nov 20, 2024

Choose a reason for hiding this comment

carenas Nov 30, 2024

Choose a reason for hiding this comment

NWilson Nov 20, 2024

Choose a reason for hiding this comment

carenas Nov 20, 2024

Choose a reason for hiding this comment

NWilson Nov 20, 2024

Choose a reason for hiding this comment

carenas Nov 20, 2024

Choose a reason for hiding this comment