C++: Generate IR for destruction of unconditionally constructed temporaries #16125

MathiasVP · 2024-04-04T14:11:37Z

(what a mouthful 😂)

This PR adds destructor calls for temporaries that are "unconditionally constructed". For example, we now get a destructor call for S() in:

struct S {
  int a;
  S();
  ~S();
};

void test() {
  int x = S().a;
}

The "unconditionally constructed" part refers to the fact that we still don't get destructor calls in examples such as:

struct S {
  int a;
  S();
  ~S();
};

void test(bool b) {
  int x = b ? S().a : 0;
}

since the destruction has to happen "at the semicolon", but only if we actually evaluated S().a. Generating destructor calls in such cases is for a subsequent PR in the Glorious Future.

Once this PR is merged we should be able to pull the query added in #15939 out of experimental (as it will then actually have results).

Commit-by-commit review recommended. Each commit is either a "fix things" or a "accept test changes" commit which represents the test changes caused by the the previous "fix things" commit.

I don't think we should add a change note for this just yet. We can do so once we've pulled the cpp/iterator-to-expired-container query out of experimental.

…n if destructors are called.

…rations since these are also conditional.

have multiple parents (the 'new' expression, the call to 'operator new', and the size expression). This happens because the latter two are 'TranslatedExpr's that return the 'new' expression as their expression even though they don't technically represent the translation of this expression. To prevent this bug we tell the IR construction that the latter two handle their destructors explicitly which means that IR construction doesn't try to synthesize them.

…PhiOperand' consistency errors.

…onversion to the temporary object conversion.

cpp/ql/lib/semmle/code/cpp/exprs/Expr.qll

…ary object expression.

…t expression at the top-level.

cpp/ql/lib/semmle/code/cpp/ir/implementation/raw/internal/TranslatedElement.qll

…slatedElement.qll Co-authored-by: Jeroen Ketema <[email protected]>

cpp/ql/lib/semmle/code/cpp/ir/implementation/raw/internal/TranslatedExpr.qll

…e reused expression is a temporary.

andersfugmann · 2024-04-09T09:28:50Z

cpp/ql/test/experimental/query-tests/Security/CWE/CWE-416/test.cpp

@@ -724,15 +724,15 @@ void iterate(const std::vector<T>& v) {
 }

 std::vector<int>& ref_to_first_in_returnValue_1() {
-  return returnValue()[0]; // BAD [NOT DETECTED] (see *)
+  return returnValue()[0]; // BAD
 }

 std::vector<int>& ref_to_first_in_returnValue_2() {
  return returnValue()[0]; // BAD [NOT DETECTED]


Why is this case not detected? The function signature and implementation is identical to ref_to_first_in_returnValue_1

I haven't investigated why yet. Note that, while function signature and implementation is identical to ref_to_first_in_returnValue_1, they differ in how they're used. See here for this. Specifically:

In the one we detect (i.e., ref_to_first_in_returnValue_1) the range-based for loop looks like:

for (auto x : ref_to_first_in_returnValue_1()) {}

and in the one we fail to detect the range-based for loop looks like:

{ auto value = ref_to_first_in_returnValue_2(); for (auto x : value) {} }

We could improve the UX on this by moving the alert location to the for-loop (instead of at the place where the temporary is destructed). The problem with this is that the elements in the DB for range-based for loops don't always have a location (which makes for an even worse UX 😂)

jketema

LGTM, but @rdmarsh2 should probably also approve this before we merge.

dbartol

I really appreciate the curation into small commits. It made this much easier to review than it otherwise would have been.

I have two main questions about the approach, but no objections to merging as-is:

Determining if an object is conditionally destructed

The extractor must already know whether a given implicit destructor call is conditional or not. Do we just not have this info in the DB?

Approaching it from another direction, don't we already know when we're in a "conditional context" in IR construction, since we need to know whether to turn short-circuit expressions into Boolean values or vice-versa? Would it be easier and more reliable to share that logic to detect conditional destructors? (The answer to the latter may very well be "no").

Destruction due to exception

First, I thought we were already ignoring destruction on exception edges anyway, even for named locals, so I'm not sure it's important to get temporaries destructed on a throw right now either.

That said, if I understand the new code correctly, a throw will now have an Exception edge to any needed destructor calls, after which there will be another Exception edge to wherever the exception is first handled (whether a Catch or an Unwind). Does that mean that the last destructor call will have an exception successor but no fall-through successor? If so, that looks like the destructor call is throwing an exception, rather than completing successfully and "falling through" to the handler.

My original plan for the IR was to handle destructors in Finally blocks. Control could enter a Finally block via an Exception edge or a Goto edge, and then would exit the finally block via either an Unwind edge or a Goto edge. The actual outgoing edge taken would be determined by the kind of the incoming edge that was taken. This would introduce an annoying diamond in the CFG, though.

If you wanted to do control-flow splitting to get rid of the diamond, you could duplicate the Finally block into a regular block (for the non-exceptional case) and a Fault block (basically, a Finally that is only ever reached via an exception). I think that splitting winds up being very similar to what you've done in this PR, except with the addition of Fault and EndFault instructions to make the control flow more explicit.

In any case, I think what you've got in this PR is fine, so maybe don't worry about the above until you run into something your current approach can't handle. I suspect that if you ever do attempt to handle conditionally-destructed temporaries, you'll have to revisit this.

MathiasVP · 2024-04-12T15:25:51Z

That said, if I understand the new code correctly, a throw will now have an Exception edge to any needed destructor calls, after which there will be another Exception edge to wherever the exception is first handled (whether a Catch or an Unwind). Does that mean that the last destructor call will have an exception successor but no fall-through successor? If so, that looks like the destructor call is throwing an exception, rather than completing successfully and "falling through" to the handler.

It's not quite correct to say that the throw will have an Exception edge to any needed destructor calls. Rather, there will be a Goto edge from the throw to the destructor call(s), and an Exception edge from the final destructor call to the exceptional successor (see for example here). In some ways, I guess that's even worse than what you describe since it looks like the throw actually doesn't throw an exception 😂

So yes, this is probably not the best possible modeling since it looks like the destructor is throwing the exception.

My original plan for the IR was to handle destructors in Finally blocks. Control could enter a Finally block via an Exception edge or a Goto edge, and then would exit the finally block via either an Unwind edge or a Goto edge. The actual outgoing edge taken would be determined by the kind of the incoming edge that was taken. This would introduce an annoying diamond in the CFG, though.

Good suggestion. That would indeed be a better way to model this. There are many things about exceptional control-flow that we could improve, and I'll your comment to that internal issue.

If you wanted to do control-flow splitting to get rid of the diamond, you could duplicate the Finally block into a regular block (for the non-exceptional case) and a Fault block (basically, a Finally that is only ever reached via an exception). I think that splitting winds up being very similar to what you've done in this PR, except with the addition of Fault and EndFault instructions to make the control flow more explicit.

That's a good point. We've already agreed that we want to introduce control-flow splitting to handle conditional destruction at some point in the future as well, and introducing a split here would align perfectly well with this.

rdmarsh2 and others added 12 commits April 4, 2024 10:29

C++: Unsuppress temporary destructors in IR

75c453f

C++: Accept test changes.

894d934

C++: suppress destructors on conditional temporaries

17e8c95

C++: Accept test changes.

cf996f8

C++: Add a simple test that fails.

d4e2d37

C++: Only report implicit destructors if we need to translate them.

a756f14

C++: Accept test changes.

56a132f

C++: Handle conversions in 'isInConditionalEvaluation'.

796fcfe

C++: Accept test changes.

a6a0e20

C++: Also suppress destructor calls on throwing ternary expressions.

73602dc

C++: Accept test changes.

0b7070f

Merge branch 'main' into destructors-for-unconditional-unnamed

774efb5

github-actions bot added the C++ label Apr 4, 2024

MathiasVP changed the title ~~C++: Generate IR for destruction of unconditionally constructed unnamed temporaries~~ C++: Generate IR for destruction of unconditionally constructed temporaries Apr 4, 2024

MathiasVP added 15 commits April 4, 2024 16:01

C++: Add a failing testcase.

805b4d6

C++: Properly handle the case where a TranslatedElement has no children.

1808886

C++: Accept test changes.

8f11cb6

C++: Accept query test changes.

587ae07

C++: Make sure the edge kind out of a throw is an 'ExceptionEdge' eve…

f098b8e

…n if destructors are called.

C++: Accept test changes.

b6ddb97

C++: Add another test with conditional construction.

e63a607

C++: Suppress destructor calls for the right-hand side of logical ope…

d279e3f

…rations since these are also conditional.

C++: Accept test changes.

bb2c690

C++: Add a failing testcase.

b042366

C++: Accept test changes.

4c01c06

C++: Add a failing testcase.

955f9c7

C++: Fix another multiple parents problem.

54e4103

C++: Accept test changes.

45e7154

MathiasVP marked this pull request as ready for review April 7, 2024 00:19

MathiasVP requested a review from a team as a code owner April 7, 2024 00:19

MathiasVP added the no-change-note-required This PR does not need a change note label Apr 7, 2024

MathiasVP added 5 commits April 7, 2024 14:26

C++: Add testcases that produces an 'missingOperandType' and 'missing…

a0de95d

…PhiOperand' consistency errors.

C++: Move destructor calls from expressions with a temporary object c…

89eaadd

…onversion to the temporary object conversion.

C++: Accept test changes.

fcd0e99

C++: Also handle destructor calls on converted expressions in PrintAST.

8a6a60e

C++: Accept test changes.

d40fa4c

MathiasVP force-pushed the destructors-for-unconditional-unnamed branch from 50c6b3b to d40fa4c Compare April 7, 2024 14:50

MathiasVP commented Apr 8, 2024

View reviewed changes

cpp/ql/lib/semmle/code/cpp/exprs/Expr.qll Show resolved Hide resolved

C++: Add testcase where two destructor calls are remapped to a tempor…

febd060

…ary object expression.

MathiasVP mentioned this pull request Apr 8, 2024

C++: Add example with missing destructor call on parameter #16146

Merged

C++: Add testcase with two destructor calls without a temporary objec…

9c25ce4

…t expression at the top-level.

MathiasVP force-pushed the destructors-for-unconditional-unnamed branch from 670ca5a to 9c25ce4 Compare April 8, 2024 14:36

Merge branch 'main' into destructors-for-unconditional-unnamed

4fa53b6

MathiasVP requested a review from rdmarsh2 April 8, 2024 14:48

jketema reviewed Apr 9, 2024

View reviewed changes

cpp/ql/lib/semmle/code/cpp/ir/implementation/raw/internal/TranslatedElement.qll Outdated Show resolved Hide resolved

jketema reviewed Apr 9, 2024

View reviewed changes

cpp/ql/lib/semmle/code/cpp/ir/implementation/raw/internal/TranslatedElement.qll Outdated Show resolved Hide resolved

Update cpp/ql/lib/semmle/code/cpp/ir/implementation/raw/internal/Tran…

17c8fa3

…slatedElement.qll Co-authored-by: Jeroen Ketema <[email protected]>

jketema reviewed Apr 9, 2024

View reviewed changes

cpp/ql/lib/semmle/code/cpp/ir/implementation/raw/internal/TranslatedExpr.qll Show resolved Hide resolved

C++: Ensure 'isConditionalTemporaryDestructorCall' only holds when th…

c325a79

…e reused expression is a temporary.

andersfugmann reviewed Apr 9, 2024

View reviewed changes

jketema previously approved these changes Apr 9, 2024

View reviewed changes

MathiasVP requested review from dbartol and removed request for rdmarsh2 April 10, 2024 16:00

Merge branch 'main' into destructors-for-unconditional-unnamed

7172e2f

MathiasVP dismissed jketema’s stale review via 7172e2f April 10, 2024 16:34

Merge branch 'main' into destructors-for-unconditional-unnamed

736d59c

dbartol approved these changes Apr 12, 2024

View reviewed changes

MathiasVP merged commit 1166645 into github:main Apr 12, 2024
14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

C++: Generate IR for destruction of unconditionally constructed temporaries #16125

C++: Generate IR for destruction of unconditionally constructed temporaries #16125

MathiasVP commented Apr 4, 2024 •

edited

Loading

andersfugmann Apr 9, 2024

MathiasVP Apr 9, 2024 •

edited

Loading

MathiasVP Apr 9, 2024

jketema left a comment

dbartol left a comment •

edited

Loading

MathiasVP commented Apr 12, 2024

C++: Generate IR for destruction of unconditionally constructed temporaries #16125

C++: Generate IR for destruction of unconditionally constructed temporaries #16125

Conversation

MathiasVP commented Apr 4, 2024 • edited Loading

andersfugmann Apr 9, 2024

Choose a reason for hiding this comment

MathiasVP Apr 9, 2024 • edited Loading

Choose a reason for hiding this comment

MathiasVP Apr 9, 2024

Choose a reason for hiding this comment

jketema left a comment

Choose a reason for hiding this comment

dbartol left a comment • edited Loading

Choose a reason for hiding this comment

Determining if an object is conditionally destructed

Destruction due to exception

MathiasVP commented Apr 12, 2024

MathiasVP commented Apr 4, 2024 •

edited

Loading

MathiasVP Apr 9, 2024 •

edited

Loading

dbartol left a comment •

edited

Loading