Fix memleaks in work with struct types #1885

viktormalik · 2021-06-18T10:07:24Z

Using std::shared_ptr to store the inner struct in SizedType causes memleaks when working with recursive types (due to cycles in shared pointers). This PR replaces the std::shared_ptr by a raw pointer.

Fixes #1879.

Checklist

Language changes are updated in docs/reference_guide.md
User-visible and non-trivial changes updated in CHANGELOG.md
The new behaviour is covered by tests

src/struct.h

danobi

So I see in SemanticAnalyser::visit(Tuple &tuple) I see an unconditional CreateTuple(). Doesn't this cause us to store 10x the values in StructManager::tuples_? If so, anything we can do to avoid that?

src/struct.cpp

src/types.h

viktormalik · 2021-06-22T11:16:41Z

So I see in SemanticAnalyser::visit(Tuple &tuple) I see an unconditional CreateTuple(). Doesn't this cause us to store 10x the values in StructManager::tuples_? If so, anything we can do to avoid that?

You're right, this is unnecessary. Two solutions come to mind:

Only store the type in the first/last iteration of the semantic analyser, which is easy, though I'm not sure if it's correct.
Change StructManager::tuples_ into a hash map, then we'd have each tuple type stored only once. The hash could be the string representation of the SizedType (obtained by operator<<), which should be fine for this purpose, I hope.

danobi · 2021-06-22T23:10:20Z

Option 1 sounds incorrect although I haven't thought about it too hard.

Option 2 sounds reasonable to me. Couple thoughts on option 2:

Using a string representation as key would work. But maybe also check if we can implement std::hash for SizedType so it could just be a a std::unordered_set. std::unique_ptr already implements std::hash: https://en.cppreference.com/w/cpp/memory/unique_ptr/hash . SizedType being hashable sounds generally useful to have (though I wouldn't go crazy over this to
Tuples types may change at each pass. We'll still be leaving behind the old, unused tuple types. Out of scope for this PR but I wonder if we can get rid of is_final_pass() altogether. It not only is not guaranteed to work (what if something takes 11 passes to resolve?) but causes subtle issues like this.

viktormalik · 2021-06-28T10:37:48Z

Using a string representation as key would work. But maybe also check if we can implement std::hash for SizedType so it could just be a a std::unordered_set. std::unique_ptr already implements std::hash: https://en.cppreference.com/w/cpp/memory/unique_ptr/hash . SizedType being hashable sounds generally useful to have (though I wouldn't go crazy over this to

I implemented std::hash for SizedType so now StructManager::tuples_ is an unordered_set. It seems to work without problems but it definitely needs a detailed review.

Tuples types may change at each pass. We'll still be leaving behind the old, unused tuple types. Out of scope for this PR but I wonder if we can get rid of is_final_pass() altogether. It not only is not guaranteed to work (what if something takes 11 passes to resolve?) but causes subtle issues like this.

I'm not sure if getting rid of is_final_pass would help here as we leave unused types behind in cases when the type changes between passes, which would still happen.

Still, the approach of running a fixed number of 10 iterations is not very good, but that's for another discussion. We could be able to implement a more "clever" AST pass that would not re-visit nodes for which the state didn't change in 2 successive iterations. Then we could get rid of is_final_pass as we'd just run the semantic analysis until the fixpoint is found.

danobi · 2021-06-29T17:56:52Z

I'm not sure if getting rid of is_final_pass would help here as we leave unused types behind in cases when the type changes between passes, which would still happen.

So I think is_final_pass() exists b/c maps are not always declared before they are used (needs citation). So in theory, if we could figure out the exact types of all the maps before we figure out the rest of the types, we only need a single pass, as non-maps are declared + defined at the same time.

But I agree, we should discuss this somewhere else.

fbs · 2021-06-29T20:11:34Z

I originally tagged this for the 0.13 release but I'm hoping to do that release on the first. Should we try and get this in or just leave it for the next release? The leak isn't too bad

src/types.cpp

danobi · 2021-06-29T21:11:45Z

I originally tagged this for the 0.13 release but I'm hoping to do that release on the first. Should we try and get this in or just leave it for the next release? The leak isn't too bad

If we can get it in by the 1st that'd be nice. If not, not a huge deal IMO

viktormalik · 2021-06-30T19:51:44Z

I originally tagged this for the 0.13 release but I'm hoping to do that release on the first. Should we try and get this in or just leave it for the next release? The leak isn't too bad

If we can get it in by the 1st that'd be nice. If not, not a huge deal IMO

Updated based on @danobi's review. Should be ready for merge if we want to get this in.

danobi

Discussed on IRC: this PR has potential to cause subtle bugs. So better if we let this get tested through another development cycle to iron out issues.

CHANGELOG.md

Using std::shared_ptr to store the inner struct in SizedType causes memleaks when working with recursive types (due to cycles in shared pointers). This commit replaces the std::shared_ptr by a raw pointer. Since we use the same pointer to store the inner structure of tuples, we move the tuple definitions into StructMap.

Until now, array and tuple types were compred by size only. As this may be imprecise, compare them by the inner structure.

This allows to use SizedType and related types (such as Struct) in std::unordered_set/map. Also will be handy once we implement type uniqueness.

Store tuple types inside an unordered_set instead of a vector. Thanks to this, we don't store a new tuple type in each iteration of the semantic analyser. This required to re-implement std::hash and std::equal_to for std::unique_ptr<Struct>.

viktormalik force-pushed the struct-memleak-fix branch from bb999ab to b45623c Compare June 18, 2021 10:09

viktormalik commented Jun 18, 2021

View reviewed changes

src/struct.h Outdated Show resolved Hide resolved

viktormalik force-pushed the struct-memleak-fix branch from b45623c to 34a1494 Compare June 18, 2021 10:14

fbs added this to the 0.13.0 milestone Jun 19, 2021

danobi reviewed Jun 21, 2021

View reviewed changes

src/struct.cpp Show resolved Hide resolved

src/types.h Outdated Show resolved Hide resolved

viktormalik force-pushed the struct-memleak-fix branch from 34a1494 to 5553ee0 Compare June 28, 2021 09:58

danobi reviewed Jun 29, 2021

View reviewed changes

src/types.cpp Outdated Show resolved Hide resolved

src/types.cpp Outdated Show resolved Hide resolved

fbs added the version-freeze label Jun 30, 2021

viktormalik force-pushed the struct-memleak-fix branch from 5553ee0 to db75183 Compare June 30, 2021 19:48

danobi approved these changes Jun 30, 2021

View reviewed changes

danobi modified the milestones: 0.13.0, 0.14.0 Jun 30, 2021

fbs requested changes Jul 1, 2021

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

viktormalik added 4 commits July 2, 2021 07:47

Better SizedType equality for arrays and tuples

1b4c87a

Until now, array and tuple types were compred by size only. As this may be imprecise, compare them by the inner structure.

Implement std::hash for SizedType

0e371fa

This allows to use SizedType and related types (such as Struct) in std::unordered_set/map. Also will be handy once we implement type uniqueness.

Make tuple types unique

cd21f1e

Store tuple types inside an unordered_set instead of a vector. Thanks to this, we don't store a new tuple type in each iteration of the semantic analyser. This required to re-implement std::hash and std::equal_to for std::unique_ptr<Struct>.

viktormalik force-pushed the struct-memleak-fix branch from db75183 to cd21f1e Compare July 2, 2021 05:48

danobi merged commit 8b32ed5 into bpftrace:master Jul 8, 2021

viktormalik deleted the struct-memleak-fix branch November 24, 2021 12:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix memleaks in work with struct types #1885

Fix memleaks in work with struct types #1885

viktormalik commented Jun 18, 2021 •

edited

Loading

danobi left a comment

viktormalik commented Jun 22, 2021

danobi commented Jun 22, 2021

viktormalik commented Jun 28, 2021

danobi commented Jun 29, 2021

fbs commented Jun 29, 2021

danobi commented Jun 29, 2021

viktormalik commented Jun 30, 2021

danobi left a comment

Fix memleaks in work with struct types #1885

Fix memleaks in work with struct types #1885

Conversation

viktormalik commented Jun 18, 2021 • edited Loading

Checklist

danobi left a comment

Choose a reason for hiding this comment

viktormalik commented Jun 22, 2021

danobi commented Jun 22, 2021

viktormalik commented Jun 28, 2021

danobi commented Jun 29, 2021

fbs commented Jun 29, 2021

danobi commented Jun 29, 2021

viktormalik commented Jun 30, 2021

danobi left a comment

Choose a reason for hiding this comment

viktormalik commented Jun 18, 2021 •

edited

Loading