Add minimal support for sampling #22

Phantomical · 2022-11-09T06:48:13Z

This PR introduces an absolutely minimal skeleton of an API for reading sampled perf events.

Highlights

Adds a new Sampler type which is a counter that can also read Records from the kernel ring buffer.
Adds support for PERF_EVENT_MMAP records and nothing else.
A bunch of enum bindings and parsing code needed to get the whole thing off of the ground.

This is just enough to write a simple mmap end-to-end test to check that the whole thing is working. The rest of the features can be added backwards-compatibly in follow-up PRs. See the non-goals section for stuff that I have excluded from this PR.

API Summary

This is just a subset of the API prototypes but since this PR is so large I felt that it would be helpful to have it summarized.

API Prototypes

struct Sampler { .. };
impl Sampler {
  fn next(&mut self) -> Option<samples::Record>;
  fn next_blocking(&mut self, timeout: Option<Duration>) -> Option<samples::Record>;
}

impl Builder {
  // Note that this is additive
  fn sample(self, sample: Sample) -> Self;
  fn mmap2(self, bool) -> Self;

  fn build_sampler(self, buffer_len: usize) -> Sampler;
}

struct Sample {
   ... bitflags go here
}

mod samples {
  struct Record {
    pub ty: RecordType,
    pub misc: RecordMiscFlags,
    pub event: RecordEvent,
    pub sample_id: SampleId,
  }

  enum RecordEvent {
    Mmap(Mmap),
    .. new variants go here
    Unknown(Vec<u8>),
  }

  .. more binding types
}

How the API Works

Construct a Builder like normal, configure it, and call Builder::build_sampler to create a Sampler.
Call Sampler::next or Sampler::next_blocking to read the next sample::Record out of the ring buffer.
Use the Record for whatever profiling thing you wanted to do.

Future Compatibility

This is important part of any binding to perf_event_open. Here's how this API can be extended in a semver compatible manner.

Adding new record types: The RecordEvent enum is #[non_exhaustive]. When a new record type is added to the perf API then we can add support without having to break backwards compatibility. In the meantime, users can use RecordEvent::Unknown to handle records that perf-event does not support.
Adding new common fields to Record: I don't think this one will happen but nevertheless Record is also #[non_exhaustive]
Adding new fields to a record type: Not relevant to this PR since I haven't actually added support for those records either. However, this can be addressed by #[non_exhaustive] as well.
Supporting new values for kernel flags: In order to remain forward compatible, none of the enums used in the bindings are rust enums. Instead, they are newtypes around an integer with associated constants. That way if the kernel adds a new enum variant in the future it is still representable.

Non-goals for this specific PR

I'm trying to keep this PR as small as possible because it is already too big. Except for the last one, all of these can be added in follow-up PRs.

Support for any record types besides PERF_RECORD_MMAP.
Support for all the configuration options available within perf_event_attr.
Support for working with the aux buffer.
Accessors for reading some of the extra information available within the memory map.
Support for using the record bindings and parsing code in any context outside of reading events from a kernel ringbuffer. I think the requirements to do this well are too different from what perf-event needs.

Supersedes #1

Phantomical · 2022-11-10T07:00:38Z

@jimblandy I think this is ready for review. It shouldn't be merged until a new version of bytes is available with my upstream PR but that doesn't affect the code.

jimblandy · 2022-11-10T17:04:31Z

@Phantomical This is very exciting. I will have time to review this on Friday.

jimblandy · 2022-11-11T21:47:57Z

Support for using the record bindings and parsing code in any context outside of reading events from a kernel ringbuffer. I think the requirements to do this well are too different from what perf-event needs.

I don't agree - but I think it's fine to proceed as you suggest anyway.

I do think we're going to need to be able to process data from files eventually. If we don't, then we basically force our users who also want to consume perf files to either use two separate crates, or switch entirely to some other crate that does support both use cases.

But I don't see the work here as likely to cut off future possibilities, in practice. I'm pretty sure we're going to end up with a lot of code that doesn't care where the data came from, which can serve both live and recorded sources. I think we can just proceed with the work here and think about files later.

jimblandy · 2022-11-11T23:32:34Z

All new features are behind an unstable feature gate - I can remove this if you think it's not worth it.

I think we can just leave out the feature. We'll be bumping the major version anyway.

jimblandy

I wasn't able to get into the substance of this today, but here are a few minor comments I have so far:

perf-event/src/lib.rs

jimblandy · 2022-11-12T01:02:53Z

perf-event/src/lib.rs

+impl std::ops::Deref for Sampler {
+    type Target = Counter;
+
+    fn deref(&self) -> &Self::Target {
+        &self.counter
+    }
+}
+
+impl std::ops::DerefMut for Sampler {
+    fn deref_mut(&mut self) -> &mut Self::Target {
+        &mut self.counter
+    }
+}


This way of making Sampler support the operations of Counter is considered an anti-pattern in Rust:
https://rust-unofficial.github.io/patterns/anti_patterns/deref.html

Possible alternatives:

Add a counter method to Sampler, and let people call that to get the underlying Counter.

Create a trait with the methods, and have both Sampler and Counter implement that.

Unfortunately, both of those alternatives aren't great :(

As a third alternative:

Copy the methods from Counter to Sampler and have them delegate to the real implementations in Counter

I think that either keeping the Deref impls (despite it being an antipattern) or copying the methods is the best choice. Ultimately, though, the choice is yours and I'll implement whichever one you prefer.

Just thinking this over: Some of the methods only make sense for counted events. Not all sampled events are counted events. But there are counters that record their overflows to the mmap area, so certainly Sampler needs the counter-related methods. So I guess there's nothing that can be omitted.

Let's just copy the methods for now.

I've removed the deref impls and used a macro to add the methods to both Counter and Sampler.

perf-event/src/lib.rs

This means we only need one #[cfg(feature = "unstable")] for the block instead of one per method.

perf-event/Cargo.toml

perf-event/src/lib.rs

janaknat · 2022-11-14T16:53:35Z

A small println type example would help in understanding the API usage.

Phantomical · 2022-11-16T23:06:57Z

@janaknat I agree. However, what's actually been added in this PR is almost too limited to do a proper example (beyond what is in the tests here). I will add that as one of the (many) follow-up PRs that I'll be making once this one gets merged.

This adds nix as a dev-dependency so that it can be used to replace the direct libc calls within the test cases.

Phantomical · 2022-11-21T07:36:24Z

Bytes v1.3.0 has now been released with the get_x_ne methods we depend on 🎉. This means that all the external blockers for this PR have now been resolved.

@jimblandy this is now ready for final review and/or merging, whichever you think is appropriate.

Clippy doesn't like having a method named `next` on a struct when it's not an implementation of Iterator. This silences the clippy warning by renaming the method.

The issue was a missing ! in the if guard on config.sample_id_all. This would cause SampleId to parse fields when _not_ expected to and ignore everything when it was expected to parse fields. This commit also includes some test cases to exercise both options.

These are present within perf_event.h but aren't documented in the manpage yet (not even the one for Linux 6.0).

This is also present within the source headers but undocumented in the manpage.

Phantomical · 2022-12-02T21:33:45Z

@jimblandy Do you think you'll have time to look at this anytime soon?

janaknat · 2023-02-09T20:24:05Z

@Phantomical Is there a branch in your repo which has all the changes for full sampling support that I can fork off of? I'd love to give it a run.

Phantomical · 2023-02-10T18:20:47Z

@janaknat I do! I have everything up in a branch here: https://github.com/Phantomical/perf-event/tree/sampling-work. I haven't touched it in a few months but it is more or less feature complete.

janaknat · 2023-02-14T22:46:40Z

@Phantomical Thanks. I'll give it a go.

janaknat · 2023-03-17T19:50:27Z

@Phantomical Is it possible to create a perf report compatible perf.data with the changes in your branch. Ideally, I'd like to use the perf.data that is generated to form flamegraphs through Rust.

Also, is there a source you used to fully understand the perf record functionality of perf_event_open()? I am looking at https://github.com/torvalds/linux/blob/master/tools/perf/Documentation/perf.data-file-format.txt . Curious to know what you are using as reference.

Phantomical · 2023-03-17T20:31:26Z

@janaknat It is technically possible but it might not be the best approach. As far as I know, perf.data is basically a dump of the samples that were read from the perf_event_open ringbuffer. What you probably want is some sort of RawSampler that just gives the byte records without parsing them. That actually sounds pretty useful so I'll try and write it up this weekend and see what I get.

The best documentation on perf_event_open is the manpage. The arch linux one is more recent but is harder to read. The later sections of it cover how reading from the ringbuffer works. For anything not documented in the manpage I looked at the kernel source: everything relevant is defined in the perf-event.h header.

Phantomical · 2023-03-21T07:13:42Z

Closing this in favour of #30 since I believe that to be a better approach.

Phantomical force-pushed the sampling branch from 8b6cc1f to 2c4eb0d Compare November 9, 2022 06:49

Phantomical marked this pull request as ready for review November 10, 2022 06:58

jimblandy requested changes Nov 12, 2022

View reviewed changes

Phantomical added 23 commits November 13, 2022 12:04

First pass sampling bindings

4266ca9

Hide sampling behind an "unstable" feature

f816e07

Only put #[cfg(feature = "unstable")] on public API items

cff0c36

Remove useless pub(crate)

bb97c9d

Bring drop_guard into the crate

4499097

Replace parse method parameters with ParseConfig

4b3d9a1

Add support for MMAP records

289143d

Add support for generating MMAP records to Builder

7fdede3

Mark new methods as unstable

558e394

Parse records using native-endian integers, not big-endian

886cd06

Add integration test for testing the mmap record

e28fc3c

Move new methods to their own impl Builder block

46dd732

This means we only need one #[cfg(feature = "unstable")] for the block instead of one per method.

Allow creating Sample directly from the underlying bits

1e21a70

Add new methods for Samples and RecordMiscFlags

0c2b3d8

Add some more docs

a390bb1

Formatting

5b536c0

Replace MmapMut with MmapRaw

3a42222

Rewrite Sampler::next to only read page through pointers

8a7c922

Add Sampler::next_blocking

6b1053c

Add unit test for Mmap parsing

6ecc31a

Formatting

3210d81

Doc tweaks

ca6a489

Reorganize some impls

92807d5

Phantomical commented Nov 13, 2022

View reviewed changes

perf-event/Cargo.toml Outdated Show resolved Hide resolved

perf-event/src/lib.rs Show resolved Hide resolved

Fix doctest on Builder::sample

fd744e4

Phantomical added 3 commits November 16, 2022 22:04

Update bytes to use upstream master

717fa29

Remove unsafe methods from test cases by using nix instead

fac5864

This adds nix as a dev-dependency so that it can be used to replace the direct libc calls within the test cases.

Make sure that next_blocking returns events present after POLLHUP

32eee0c

Phantomical mentioned this pull request Nov 19, 2022

Release a new version of bytes tokio-rs/bytes#578

Closed

Phantomical added 3 commits November 19, 2022 18:46

Use imported path to bitflags macro

9d542bb

Update bytes dependency constraint to v1.3.0

e5e5a06

Remove Hex formatting from asserts which don't need it

886b395

Phantomical added 9 commits November 21, 2022 15:55

Avoid ever using std::os::unix::prelude module

ce954ef

Rename Sampler::next -> Sampler::next_record

8491697

Clippy doesn't like having a method named `next` on a struct when it's not an implementation of Iterator. This silences the clippy warning by renaming the method.

Fix cargo doc broken link warning

ad04123

Fix another broken doc link

9029052

Rewrite Debug impl for SampleId

4d82ee5

Add some not-yet-documented variants to SampleType

acf7f69

These are present within perf_event.h but aren't documented in the manpage yet (not even the one for Linux 6.0).

Add SampleType::AUX binding

9c060b2

This is also present within the source headers but undocumented in the manpage.

Update nix to version 0.26

a617a9c

Phantomical mentioned this pull request Mar 21, 2023

Even more minimal support for perf-event sampling #30

Closed

Phantomical closed this Mar 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add minimal support for sampling #22

Add minimal support for sampling #22

Phantomical commented Nov 9, 2022 •

edited

Loading

Phantomical commented Nov 10, 2022

jimblandy commented Nov 10, 2022

jimblandy commented Nov 11, 2022

jimblandy commented Nov 11, 2022

jimblandy left a comment

jimblandy Nov 12, 2022

Phantomical Nov 12, 2022

jimblandy Nov 12, 2022

Phantomical Nov 13, 2022

janaknat commented Nov 14, 2022

Phantomical commented Nov 16, 2022 •

edited

Loading

Phantomical commented Nov 21, 2022 •

edited

Loading

Phantomical commented Dec 2, 2022

janaknat commented Feb 9, 2023

Phantomical commented Feb 10, 2023

janaknat commented Feb 14, 2023

janaknat commented Mar 17, 2023

Phantomical commented Mar 17, 2023

Phantomical commented Mar 21, 2023

Add minimal support for sampling #22

Add minimal support for sampling #22

Conversation

Phantomical commented Nov 9, 2022 • edited Loading

Highlights

API Summary

How the API Works

Future Compatibility

Non-goals for this specific PR

Phantomical commented Nov 10, 2022

jimblandy commented Nov 10, 2022

jimblandy commented Nov 11, 2022

jimblandy commented Nov 11, 2022

jimblandy left a comment

Choose a reason for hiding this comment

jimblandy Nov 12, 2022

Choose a reason for hiding this comment

Phantomical Nov 12, 2022

Choose a reason for hiding this comment

jimblandy Nov 12, 2022

Choose a reason for hiding this comment

Phantomical Nov 13, 2022

Choose a reason for hiding this comment

janaknat commented Nov 14, 2022

Phantomical commented Nov 16, 2022 • edited Loading

Phantomical commented Nov 21, 2022 • edited Loading

Phantomical commented Dec 2, 2022

janaknat commented Feb 9, 2023

Phantomical commented Feb 10, 2023

janaknat commented Feb 14, 2023

janaknat commented Mar 17, 2023

Phantomical commented Mar 17, 2023

Phantomical commented Mar 21, 2023

Phantomical commented Nov 9, 2022 •

edited

Loading

Phantomical commented Nov 16, 2022 •

edited

Loading

Phantomical commented Nov 21, 2022 •

edited

Loading