tracepoint: Support __data_loc fields in tracepoints #1542

danobi · 2020-09-29T05:08:29Z

This PR teaches bpftrace to read __data_loc arguments from tracepoints.

Note that this PR causes a minor compatibility break where fields previously
named args->data_loc_name are now named args->name.

This closes #385 . This supercedes #770 .

Checklist

Language changes are updated in docs/reference_guide.md
User-visible and non-trivial changes updated in CHANGELOG.md
The new behaviour is covered by tests

danobi · 2020-09-29T05:14:42Z

There's still one thing I'm really confused about. So the typical case works ok:

$ sudo ./build/src/bpftrace -e 't:irq:irq_handler_entry { print(str(args->name)) }' | head -10
Attaching 1 probe...
nvidia
nvidia
nvidia
nvidia
nvidia
nvidia
nvidia
nvidia

But printing the address fails:

$ sudo ./build/src/bpftrace -e 't:irq:irq_handler_entry { print(args->name) }' -v
BTF: using data from /sys/kernel/btf/vmlinux
Attaching 1 probe...

Error log:
0: (bf) r6 = r1
1: (61) r1 = *(u32 *)(r6 +12)
2: (b7) r2 = 30007
3: (7b) *(u64 *)(r10 -24) = r2
4: (57) r1 &= 65535
5: (0f) r1 += r6
last_idx 5 first_idx 0
regs=2 stack=0 before 4: (57) r1 &= 65535
regs=2 stack=0 before 3: (7b) *(u64 *)(r10 -24) = r2
regs=2 stack=0 before 2: (b7) r2 = 30007
regs=2 stack=0 before 1: (61) r1 = *(u32 *)(r6 +12)
6: (7b) *(u64 *)(r10 -8) = r1
7: (b7) r1 = 0
8: (7b) *(u64 *)(r10 -16) = r1
last_idx 8 first_idx 0
regs=2 stack=0 before 7: (b7) r1 = 0
9: (18) r7 = 0xffff89de0a4b1600
11: (85) call bpf_get_smp_processor_id#8
12: (bf) r4 = r10
13: (07) r4 += -24
14: (bf) r1 = r6
15: (bf) r2 = r7
16: (bf) r3 = r0
17: (b7) r5 = 24
18: (85) call bpf_perf_event_output#25
invalid indirect read from stack off -24+16 size 24
processed 18 insns (limit 1000000) max_states_per_insn 0 total_states 0 peak_states 0 mark_read 0

ERROR: Error loading program: tracepoint:irq:irq_handler_entry

As far as I can tell, the "-24+16 size 24" error is bogus. Instructions 3, 6 and 8 completely initialize the 24 byte region starting at -24 + 16 = -8.

If anyone has ideas I would be ecstatic to hear them.

fbs · 2020-09-29T09:36:57Z

Maybe its unhappy about adding ctx into the mix? The ir and output look valid in both cases, but this seems to fix it:

diff --git a/src/ast/codegen_llvm.cpp b/src/ast/codegen_llvm.cpp
index 7c72ce3e..a32e9af1 100644
--- a/src/ast/codegen_llvm.cpp
+++ b/src/ast/codegen_llvm.cpp
@@ -1552,7 +1552,7 @@ void CodegenLLVM::visit(FieldAccess &acc)
                             b_.CreateGEP(ctx_, b_.getInt64(field.offset)));
       expr_ = b_.CreateIntCast(expr_, b_.getInt64Ty(), false);
       expr_ = b_.CreateAnd(expr_, b_.getInt64(0xFFFF));
-      expr_ = b_.CreateAdd(expr_, b_.CreatePtrToInt(ctx_, b_.getInt64Ty()));
+      // expr_ = b_.CreateAdd(expr_, b_.CreatePtrToInt(ctx_, b_.getInt64Ty()));
     }

And this works too

diff --git a/src/ast/codegen_llvm.cpp b/src/ast/codegen_llvm.cpp
index 7c72ce3e..b2d3d6b3 100644
--- a/src/ast/codegen_llvm.cpp
+++ b/src/ast/codegen_llvm.cpp
@@ -1552,7 +1552,8 @@ void CodegenLLVM::visit(FieldAccess &acc)
                             b_.CreateGEP(ctx_, b_.getInt64(field.offset)));
       expr_ = b_.CreateIntCast(expr_, b_.getInt64Ty(), false);
       expr_ = b_.CreateAnd(expr_, b_.getInt64(0xFFFF));
-      expr_ = b_.CreateAdd(expr_, b_.CreatePtrToInt(ctx_, b_.getInt64Ty()));
+      // expr_ = b_.CreateAdd(expr_, b_.CreatePtrToInt(ctx_, b_.getInt64Ty()));
+      expr_ = b_.CreateAdd(expr_, b_.getInt64(1234));
     }

But here it does an i16 load to avoid the masking:

Bytecode:
0: (bf) r6 = r1
1: (69) r1 = *(u16 *)(r6 +12)
2: (b7) r2 = 30007
3: (7b) *(u64 *)(r10 -24) = r2
4: (07) r1 += 1234
5: (7b) *(u64 *)(r10 -8) = r1
6: (b7) r1 = 0
7: (7b) *(u64 *)(r10 -16) = r1
last_idx 7 first_idx 0

danobi · 2020-09-29T16:58:00Z

Maybe its unhappy about adding ctx into the mix? The ir and output look valid in both cases, but this seems to fix it:

Commenting out the ctx + offset fixed it for me. Not sure there's anything we can do from our side. It seems like a kernel issue to me.

src/ast/codegen_llvm.cpp

danobi · 2020-09-29T18:35:03Z

Chatted w/ Alexei. When we add an offset to ctx and store the result to the stack, the verifier thinks it's a register spill and doesn't let anyone read from the location on stack. I'm not sure that's what the correct behavior should be but it's the current limitation.

We'll probably have to live with the unprintable data_loc integers. Shouldn't be too bad functionally b/c it's not like userspace could do anything with that pointer.

Edit: there may be a kernel fix coming sometime.

fbs · 2020-09-30T07:45:29Z

Just landed the new tuple code, so this might need a rebase

It's unclear why the renaming exists. It's not like the renamed field name made __data_loc fields work properly. This makes things more predictable. Note that this will break the workaround described in bpftrace#999 .

danobi · 2020-09-30T17:23:50Z

Rebased.

Also moved the type rewrite from semantic analyser into clang parser. Probably better to reduce the spread of the logic. Also made it easier to write a better clang parser test.

This commit coordinates __data_loc annotation between the TracepointFormatParser and ClangParser. The next commit will use `Field::is_data_loc` to generate the proper code for field accesses.

This commit teaches codegen how to decode the __data_loc fields. Accessing `args->name`, where `name` is a __data_loc field now returns the address the data portion begins. We also have to do some hacky type rewriting in the semantic analyzer to have a wide enough integer to store pointers in. Not sure if there's a better way.

danobi mentioned this pull request Sep 29, 2020

[WIP] tracepoint __data_loc support #770

Closed

danobi force-pushed the data_loc branch from c7bc18f to 25a04ac Compare September 29, 2020 05:15

danobi force-pushed the data_loc branch from 25a04ac to a2420c9 Compare September 29, 2020 16:56

fbs reviewed Sep 29, 2020

View reviewed changes

src/ast/codegen_llvm.cpp Outdated Show resolved Hide resolved

danobi force-pushed the data_loc branch from a2420c9 to 106c555 Compare September 29, 2020 18:59

tracepoint: Do not rename __data_loc fields to data_loc_<name>

cac43d5

It's unclear why the renaming exists. It's not like the renamed field name made __data_loc fields work properly. This makes things more predictable. Note that this will break the workaround described in bpftrace#999 .

danobi force-pushed the data_loc branch from 106c555 to 123afcc Compare September 30, 2020 17:23

danobi added 3 commits September 30, 2020 10:26

tracepoint: Annotate __data_loc fields during parse time

56138f4

This commit coordinates __data_loc annotation between the TracepointFormatParser and ClangParser. The next commit will use `Field::is_data_loc` to generate the proper code for field accesses.

tracepoint: Add runtime test for data_loc

b65fb14

danobi force-pushed the data_loc branch from 123afcc to b65fb14 Compare September 30, 2020 17:28

Update changelog

2cdaf61

fbs approved these changes Oct 1, 2020

View reviewed changes

danobi merged commit cc3daa2 into bpftrace:master Oct 1, 2020

viktormalik mentioned this pull request Aug 4, 2023

Can't read filename field from tracepoint:sched:sched_process_exec #999

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tracepoint: Support __data_loc fields in tracepoints #1542

tracepoint: Support __data_loc fields in tracepoints #1542

danobi commented Sep 29, 2020 •

edited

Loading

danobi commented Sep 29, 2020 •

edited

Loading

fbs commented Sep 29, 2020

danobi commented Sep 29, 2020

danobi commented Sep 29, 2020 •

edited

Loading

fbs commented Sep 30, 2020

danobi commented Sep 30, 2020

tracepoint: Support __data_loc fields in tracepoints #1542

tracepoint: Support __data_loc fields in tracepoints #1542

Conversation

danobi commented Sep 29, 2020 • edited Loading

Checklist

danobi commented Sep 29, 2020 • edited Loading

fbs commented Sep 29, 2020

danobi commented Sep 29, 2020

danobi commented Sep 29, 2020 • edited Loading

fbs commented Sep 30, 2020

danobi commented Sep 30, 2020

danobi commented Sep 29, 2020 •

edited

Loading

danobi commented Sep 29, 2020 •

edited

Loading

danobi commented Sep 29, 2020 •

edited

Loading