fix: Handle "filename*" field in MP header #3239

airween · 2024-08-25T20:27:11Z

what

Add handling of Multipart Header's "filename*" (asterisk at the end!) field.

why

mod_security2 (v2) does not handle the multipart header's "filename*" field, e.g.:

Content-Disposition: form-data; name="file"; filename*=UTF-8''r%C3%A9sum%C3%A9.pdf

where the filename is a UTF-8 encoded string with the value "résumé.pdf".
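To make the format concrete, here is a minimal sketch of how such a value could be split and percent-decoded. It is not the code from this PR: `decode_ext_value` and `hex_value` are hypothetical helper names, and the sketch assumes the value has the shape charset'language'percent-encoded-bytes, as in the example above.

```c
#include <stddef.h>
#include <string.h>

/* Convert one hex digit to its value, or -1 if it is not a hex digit. */
static int hex_value(char c) {
    if (c >= '0' && c <= '9') return c - '0';
    if (c >= 'a' && c <= 'f') return c - 'a' + 10;
    if (c >= 'A' && c <= 'F') return c - 'A' + 10;
    return -1;
}

/*
 * Hypothetical helper, not the ModSecurity implementation.
 * Splits a value of the form charset'language'pct-encoded-value and
 * percent-decodes the last part into out (NUL-terminated).
 * Returns 0 on success, -1 on malformed input or if out is too small.
 */
static int decode_ext_value(const char *in, char *out, size_t out_len) {
    const char *p;
    size_t n = 0;

    if (out_len == 0) return -1;
    p = strchr(in, '\'');
    if (p == NULL || p == in) return -1;   /* charset must not be empty */
    p = strchr(p + 1, '\'');               /* language may be empty */
    if (p == NULL) return -1;
    p++;                                   /* first byte of the value */

    while (*p != '\0') {
        unsigned char byte;
        if (*p == '%') {
            /* Only look at p[2] if p[1] was a valid hex digit. */
            int hi = hex_value(p[1]);
            int lo = (hi < 0) ? -1 : hex_value(p[2]);
            if (hi < 0 || lo < 0) return -1;
            byte = (unsigned char)(hi * 16 + lo);
            p += 3;
        } else {
            byte = (unsigned char)*p++;
        }
        if (n + 1 >= out_len) return -1;   /* keep room for the terminator */
        out[n++] = (char)byte;
    }
    out[n] = '\0';
    return 0;
}
```

Called with the value from the example header, this yields the raw UTF-8 bytes of "résumé.pdf"; a value without the two single quotes is rejected.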

references

RFC 7578

additional notes

v3 already handles this header; I copied most of that part of the code.

Thanks @fzipi for bringing it to our attention.

apache2/msc_multipart.c

theseion · 2024-08-26T06:14:38Z

apache2/msc_multipart.c

+                p++;
+            }
+            if (*p != '\'') {
+                return -17; // Single quote for end-of-language not found


Shouldn't you use something like this here, as below?

msr->mpd->flag_invalid_quoting = 1;

It's a good idea - thanks.

@fzipi, @marcstern - what do you think about this?

I already added this in 2b22261, but I can remove it.

If we keep this feature, we should add it to v3 too.

apache2/msc_multipart.c

marcstern · 2024-08-26T06:31:42Z

https://www.rfc-editor.org/rfc/rfc5987#section-3.2
This RFC says that only UTF-8 & ISO-8859-1 are standardized:

Producers MUST use either the "UTF-8" ([RFC3629]) or the "ISO-8859-1" ([ISO-8859-1]) character set. Extension character sets (mime-charset) are reserved for future use.

We have been restricting the languages to these two for years and have never found a false positive.
Should we allow all of them?
If the code allows it, we should add a new collection (FILES_LANG?) that allows checking this syntax.
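For illustration, restricting the charset to the two values named in the RFC quote could look like the sketch below. The function names are hypothetical and not part of this PR, and the comparison is assumed to be case-insensitive.

```c
#include <ctype.h>
#include <stddef.h>

/* Compare the first len bytes of s with the literal lit, ignoring ASCII case.
 * Returns 1 only if lit is exactly len characters long and matches. */
static int equals_ignore_case(const char *s, size_t len, const char *lit) {
    size_t i;
    for (i = 0; i < len; i++) {
        if (lit[i] == '\0') return 0;   /* lit is shorter than len */
        if (tolower((unsigned char)s[i]) != tolower((unsigned char)lit[i])) {
            return 0;
        }
    }
    return lit[len] == '\0';            /* lit must not be longer than len */
}

/* Hypothetical check: is the charset one of the two standardized values? */
static int charset_is_standardized(const char *charset, size_t len) {
    return equals_ignore_case(charset, len, "UTF-8")
        || equals_ignore_case(charset, len, "ISO-8859-1");
}
```

The length is passed explicitly so the check can run directly on the header buffer, where the charset is followed by a single quote rather than a terminator.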

marcstern · 2024-08-26T06:41:57Z

apache2/msc_multipart.c

+                return -16; // Must be at least one legit char before ' for start of language
+            }
+            p++;
+            while ((*p != '\0') && (*p != '\'')) {


Languages can only contain alphanumeric characters and '-':
while (isalnum(*p) || *p == '-') p++;
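As a self-contained variant of the loop above, the sketch below wraps it in a hypothetical helper, `skip_language`, which is not part of the PR. The cast to `unsigned char` is added because passing a negative `char` value to `isalnum` is undefined behavior, which matters when the header contains bytes above 0x7F.

```c
#include <ctype.h>

/* Hypothetical helper: advance past the characters allowed in a language
 * tag (alphanumerics and '-') and return a pointer to the first character
 * that is not allowed. */
static const char *skip_language(const char *p) {
    while (isalnum((unsigned char)*p) || *p == '-') {
        p++;
    }
    return p;
}
```

After the call, the caller would still need to verify that the returned pointer addresses the closing single quote; anything else means the language part contained an invalid character.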

You're right, and here - as you suggested - we can control this.

The question is: is it allowed to send the request without a charset? I mean:

Content-Disposition: form-data; name="post"; filename*=resume.pdf

I can't find any relevant information about that.

https://www.rfc-editor.org/rfc/rfc7578#section-4.2 also says:

Some commonly deployed systems use multipart/form-data with file
names directly encoded including octets outside the US-ASCII range.
The encoding used for the file names is typically UTF-8, although
HTML forms will use the charset associated with the form.

To me, this sounds as if any charset is valid, regardless of what the other RFC says.

The charset is required, AFAICT; only the language is optional.

So, do you think we do not need to check the charset and we should allow anything? Or restrict the charset content only to alnum chars (+ -)?

IMO, the charset should be restricted to the ABNF from the RFC.
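A sketch of what restricting the charset to the ABNF could look like, assuming the `mime-charset` production from RFC 5987 section 3.2.1 (ALPHA, DIGIT, and the characters ``!#$%&+-^_`{}~``). The helper names are hypothetical and not taken from the PR.

```c
#include <stddef.h>
#include <string.h>

/* Is c a mime-charsetc character (RFC 5987, section 3.2.1)? */
static int is_mime_charsetc(unsigned char c) {
    if ((c >= 'A' && c <= 'Z') || (c >= 'a' && c <= 'z') ||
        (c >= '0' && c <= '9')) {
        return 1;
    }
    /* strchr would also match the terminating NUL, so exclude it first. */
    return c != '\0' && strchr("!#$%&+-^_`{}~", c) != NULL;
}

/* Hypothetical helper: return the length of the charset at the start of p.
 * The charset must be at least one character long and must be followed by
 * a single quote; otherwise 0 is returned. */
static size_t charset_length(const char *p) {
    size_t n = 0;
    while (is_mime_charsetc((unsigned char)p[n])) {
        n++;
    }
    if (n == 0 || p[n] != '\'') {
        return 0;
    }
    return n;
}
```

With this check, a value such as `resume.pdf` (no charset, no quotes) is rejected, which matches the reading that the charset is required.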

apache2/msc_multipart.c

sonarqubecloud · 2024-08-26T10:32:44Z

Quality Gate failed

Failed conditions
1 Security Hotspot
C Maintainability Rating on New Code (required ≥ A)

See analysis details on SonarCloud

airween · 2024-08-26T10:36:00Z

https://www.rfc-editor.org/rfc/rfc5987#section-3.2 This RFC says that only UTF-8 & ISO-8859-1 are standardized:

Producers MUST use either the "UTF-8" ([RFC3629]) or the "ISO-8859-1" ([ISO-8859-1]) character set. Extension character sets (mime-charset) are reserved for future use.

We have been restricting the languages to these two for years and have never found a false positive. Should we allow all of them? If the code allows it, we should add a new collection (FILES_LANG?) that allows checking this syntax.

Well, you are right, but as the RFC puts it: "Extension character sets (mime-charset) are reserved for future use." If someone introduces a new feature in the future, we will have to align the code again.

Introducing a new setting would be confusing (in this case).

And I think we should first investigate whether it is possible to check those languages with rules.

Handle "filename*" field in MP header

cd4905b

airween requested a review from marcstern August 25, 2024 20:27

airween self-assigned this Aug 25, 2024

fzipi reviewed Aug 25, 2024

theseion requested changes Aug 26, 2024

marcstern reviewed Aug 26, 2024

marcstern reviewed Aug 26, 2024

airween added 2 commits August 26, 2024 12:02

Fix string comparison

524a799

Set MULTIPART_INVALID_QUOTING variable if it is necessary

2b22261

marcstern added the 2.x Related to ModSecurity version 2.x label Aug 27, 2024

amonachesi mentioned this pull request Aug 29, 2024

Blog post on releases 4.6.0 and 3.3.6 coreruleset/website#144

Merged
