Minimize unnecessary casts and check for overflows in witness invariants #1665

karoliineh · 2025-01-27T18:21:15Z

Removing unnecessary casts

Does not add the LL suffix for constant values that fit the int range: 1LL → 1
If a variable does not overflow the int range, it is not cast: (long long )h → h
If an arithmetic operation does not overflow the int range, its subexpressions are not cast: (long long )j + 1LL → j + 1

Dealing with overflows

If an arithmetic operation would overflow the int range but does not overflow in long long, only one of the variables or constants is cast: ((long long )k + (long long )d) + 2147483647LL → ((long long )k + d) + 2147483647
If an expression would overflow the long long range, it is discarded with a warning

TODO

~~n != (unsigned char)0 → n != 0~~
~~(unsigned long )arg == 0UL → arg == 0~~
Make sure that unsigned types are also checked for overflows within invariants
Fix issue revealed by Add abstract interface to Invariant #1668 before merging this.

I also observed that using the relational domain for overflow checks would be beneficial for eliminating some of the unnecessary casts. I.e. in 36-apron/52-queuesize.c, we can successfully verify that assertion __goblint_check(used + free == capacity) succeeds. However, as the overflow query only queries IntDomain values, the addition used+free will overflow the int range and is thus cast in the invariant. Furthermore, making the overflow queries use the information from relationAnalysis could also increase our precision. However, this should be addressed in a separate PR.

…ants not fitting into long in invariants

…bles

michael-schwarz · 2025-01-28T08:36:39Z

Nice, I think it's good to have better readable assertions! However, I'm a bit worried about one of these aspects:

If an arithmetic operation would overflow the int range but does not overflow in long long, only one of the variables or constants is cast.

This breaks invariants about the CIL AST, where all arguments to BinOp (except some Pointer + Integer ones) are supposed to have the same type as the resulting type. I don't think creating such illegal expressions is a good idea, especially given that @DrMichaelPetter's inter-domain refinement and similar features will likely also rely on the invariant feature, which also makes sense architecturally.

Could such logic not be moved to the pretty-printer where we know that the CIL expressions will not be fed to anything in Goblint before going through CIL parsing again? It seems printing one cast in general is enough to force the effective type to what we want it to be, regardless of overflow?

sim642 · 2025-01-28T09:06:47Z

This concern is already handled by constructing two exps in parallel: one without all the casts for the witness and another one with all the necessary implicit casts that's used for Goblint-internal things (here for the overflow queries).

It seems printing one cast in general is enough to force the effective type to what we want it to be, regardless of overflow?

That's not so simple in general. All the variables in an invariant may have different integer types. Sure, when cast to long long, everything else is probably bound to be integer promoted and arithmetic converted also to long long. But when trying to minimize casts, you can get many different intermediate types, again due to integer promotion and arithmetic conversion.

I don't think this could be done while pretty-printing because this relies on overflow queries. The secondary goal is to never return invariant expressions that we cannot guarantee overflow-free when going from mathematical integers from Apron to C.
This is probably also desirable for the inter-domain refinement expressions.

michael-schwarz · 2025-01-28T09:35:30Z

Where is this handling? Currently it looks to be like we have modified the function cil_exp_of_lincons1 to yield invalid expressions that CIL would immediately flag as invalid if one were to call GoblintCil.Check.checkStandaloneExp. We should not abuse CIl types to represent illegal things --- this will just cause problems.

If you want such things I suggest stringifying them immediately so the invariant about Cil expressions is violated only locally in the function they originate from.

I don't think this could be done while pretty-printing because this relies on overflow queries.

Why? Either no cast is necessary and then the expression with all casts should also not contain any casts (because everything already is of the right type), or at least one of the arguments needs to be cast because the usual arithmetic promotions are not enough. In this case, CIL has both casts and your pretty printer could forgo the cast in one of the branches? What am I missing?

karoliineh · 2025-01-28T10:49:25Z

Either no cast is necessary and then the expression with all casts should also not contain any casts (because everything already is of the right type)

Casts are also unnecessary for invariants, where char-type variables are used in an arithmetic expression; however, for CIL, they are cast to int, and thus, they would not be the right shape for both. The problem is not only with casting to long, but also casting char types to int.

sim642 · 2025-01-28T12:46:13Z

Where is this handling? Currently it looks to be like we have modified the function cil_exp_of_lincons1 to yield invalid expressions that CIL would immediately flag as invalid if one were to call GoblintCil.Check.checkStandaloneExp. We should not abuse CIl types to represent illegal things --- this will just cause problems.

CIL expressions without implicit casts turned explicit are not invalid! CIL has the option insertImplicitCasts which we choose to enable, but CIL doesn't care and works fine without them as well. So from CIL's perspective, they're perfectly valid and would pass the check.

If you want such things I suggest stringifying them immediately so the invariant about Cil expressions is violated only locally in the function they originate from.

Of course they could cause problems if used in unintended contexts, but that's not the case. These CIL expressions (with or without implicit casts) are passed to Invariant.of_exp, where Invariant is a module for

analyzer/src/cdomain/value/domains/invariant.ml

Line 1 in bce9f92

(** Invariants for witnesses. *)

Witness invariants are intentionally wrapped into the Invariant.t type and cannot be accidentally used as normal CIL expressions.
So the cast-less CIL expressions do only exist in that local scope of the conversion and the rest of Goblint just sees an Invariant.t.

Right now there isn't invariant.mli that fully abstracts it (so it's possible to pattern match the expression out), but that could/should be added to prevent such misuse outright.

For non-witness use of such domains converted to expressions, there should be a separate query anyway (Queries.Invariant returns Invariant.t). Since the implementation currently constructs both copies of the expression, it's trivial to use one for the existing query for witnesses and the other for a separate query for refining expressions. It's likely that the two use cases might benefit from different expressions anyway (based on which domain is refining which). Meanwhile witness invariants are desirable to optimize differently, e.g. restrict to widened variables (#1219).

…thout overflows

sim642 · 2025-01-31T08:58:25Z

Right now there isn't invariant.mli that fully abstracts it (so it's possible to pattern match the expression out), but that could/should be added to prevent such misuse outright.

I now tried that in #1668 and that revealed a problem with precondition loop invariants instead.
The refinement stuff in #1635 already uses separate expression generation from invariant anyway and would not even be affected by this change.

karoliineh added 7 commits January 22, 2025 17:53

Move no_overflow from relationAnalysis to sharedFunctions

019227b

Add cram test for not suffixing int constants with LL in loop invariants

f3fd468

Use IInt ikind for constants fitting into int range and discard const…

2ee30c3

…ants not fitting into long in invariants

Check for overflows in invariants and discard overflowing expressions

6ac5ad2

Rename test

6a39867

Add CRAM test for minimizing casts in invariants with char-type varia…

a823c1d

…bles

Minimize unnecessary casts in invariants

70309dd

karoliineh self-assigned this Jan 27, 2025

karoliineh added sv-comp SV-COMP (analyses, results), witnesses relational Relational analyses (Apron, affeq, lin2var) explainability labels Jan 27, 2025

karoliineh added this to the SV-COMP 2026 milestone Jan 27, 2025

karoliineh added 4 commits January 29, 2025 16:31

Create a query to ask for overflows for unsigned types

7310d98

Update CRAM tests: avoid unnecessary casts for unsigned arithmetic wi…

3062125

…thout overflows

Avoid unnecessary casts for unsigned arithmetic without overflows

ad76af7

Match both exp and exp_plain when constructing invariants

78a7383

karoliineh marked this pull request as ready for review January 30, 2025 13:40

sim642 mentioned this pull request Jan 31, 2025

Add abstract interface to Invariant #1668

Draft

2 tasks

sim642 added the pr-dependency Depends or builds on another PR, which should be merged before label Jan 31, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Minimize unnecessary casts and check for overflows in witness invariants #1665

Minimize unnecessary casts and check for overflows in witness invariants #1665

karoliineh commented Jan 27, 2025 •

edited by sim642

Loading

michael-schwarz commented Jan 28, 2025 •

edited

Loading

sim642 commented Jan 28, 2025

michael-schwarz commented Jan 28, 2025

karoliineh commented Jan 28, 2025 •

edited

Loading

sim642 commented Jan 28, 2025

sim642 commented Jan 31, 2025

Minimize unnecessary casts and check for overflows in witness invariants #1665

Are you sure you want to change the base?

Minimize unnecessary casts and check for overflows in witness invariants #1665

Conversation

karoliineh commented Jan 27, 2025 • edited by sim642 Loading

Removing unnecessary casts

Dealing with overflows

TODO

michael-schwarz commented Jan 28, 2025 • edited Loading

sim642 commented Jan 28, 2025

michael-schwarz commented Jan 28, 2025

karoliineh commented Jan 28, 2025 • edited Loading

sim642 commented Jan 28, 2025

sim642 commented Jan 31, 2025

karoliineh commented Jan 27, 2025 •

edited by sim642

Loading

michael-schwarz commented Jan 28, 2025 •

edited

Loading

karoliineh commented Jan 28, 2025 •

edited

Loading