Tetragon based file integrity monitoring (FIM) #2409

anfedotoff · 2024-05-03T12:02:17Z

Is there an existing issue for this?

I have searched the existing issues

Is your feature request related to a problem?

No response

Describe the feature you would like

We could use Tetragon for file integrity monitoring: collect hashes of executed binaries and opened files and put this information in events. Hashes are calculated using IMA-measurement Linux integrity subsystem.

Describe your proposed solution

We already talked about FIM. I found some technical issues during my research, so I decided to provide a CFP before PR.

Code of Conduct

I agree to follow this project's Code of Conduct

anfedotoff · 2024-05-03T12:08:02Z

Hi 👋 , @kkourt!
If you have time, please, have a look. I'll be happy to have some discussion on implementation details.

xmulligan · 2024-05-03T13:14:38Z

Once it is ready, please also add the CfP to the repo https://github.com/cilium/design-cfps

kkourt · 2024-05-17T15:16:43Z

Thanks @anfedotoff!

Here are some first thoughts:

Considering your proposal:

spec:
  lsm:
  - call: "bprm_check_security"
    args:
    - index: 0
      type: "linux_binprm" # file type also is allowed
    selectors:
      - matchArgs:
          - index: 0
            operator: "Prefix"
            values:
              - "/usr/bin"
      - matchActions:
          - action: FileHash
            argHash 0

In the BPF code, what we do is:

For the linux_binprm type, we first copy the path:

tetragon/bpf/process/types/basic.h

Lines 2591 to 2598 in 82c4b13

    
           case linux_binprm_type: { 
        
           	struct linux_binprm *bprm = (struct linux_binprm *)arg; 
        
           	struct file *file; 
        
           	arg = (unsigned long)_(&bprm->file); 
        
           	probe_read(&file, sizeof(file), (const void *)arg); 
        
           	path_arg = _(&file->f_path); 
        
           	goto do_copy_path;

We then filter:

tetragon/bpf/process/types/basic.h

Lines 1815 to 1820 in 82c4b13

    
           case string_type: 
        
           case net_dev_ty: 
        
           case data_loc_type: 
        
           	/* for strings, we just encode the length */ 
        
           	pass &= filter_char_buf(filter, args, 4); 
        
           	break;

And finally do the action:

tetragon/bpf/process/types/basic.h

Line 2358 in 82c4b13

    
           do_actions(void *ctx, struct msg_generic_kprobe *e, struct selector_action *actions,

So by the time we reach the action, we only have the string and we cannot get the hash. Hence, I believe we need to get the hash at the first step.

So I was thinking something like:

spec:
  lsm:
  - call: "bprm_check_security"
    args:
    - index: 0
      type: "linux_binprm" # file type also is allowed
    - index: 1 # argument 1 will be the result of applying operation ima_file_hash() to argument index 0
      type: "hash"
      sourceIndex: 0
      operator: "ima_file_hash"
    selectors:
      - matchArgs:
          - index: 0
            operator: "Prefix"
            values:
              - "/usr/bin"

I'm still not sure about the syntax, but the basic idea would be to push the computation of the hash early, when we extract the arguments.

anfedotoff · 2024-05-17T15:52:24Z

LGTM! We still able to filter by file path, before collecting a hash in your approach, right? In other words I mean not to call ima bpf-helpers if filtering is not passed.

As far as I concerned, IMA bpf-helpers just retrieve the hash from IMA-measurement list. Difference between bpf_ima_inode_hash (5.15) and bpf_ima_file_hash (5.18): if there is no hash in IMA-measurement list bpf_ima_file_hash will calculate the hash, update IMA-measurement list and return it to the caller.

operator: "ima_file_hash"

Here you mean to call appropriate bpf-helper according to kernel version? Or user specifies the helper it prefers? I think, the first way is better.

kkourt · 2024-05-17T16:56:50Z

LGTM! We still able to filter by file path, before collecting a hash in your approach, right? In other words I mean not to call ima bpf-helpers if filtering is not passed.

I think it should be possible to collect the hash after the filtering, but it's more tricky. In that case, collecting the hash in the action makes more sense to me, but we will need to maintain the necessary arguments to call the helpers.

As far as I concerned, IMA bpf-helpers just retrieve the hash from IMA-measurement list. Difference between bpf_ima_inode_hash (5.15) and bpf_ima_file_hash (5.18): if there is no hash in IMA-measurement list bpf_ima_file_hash will calculate the hash, update IMA-measurement list and return it to the caller.

operator: "ima_file_hash"

Here you mean to call appropriate bpf-helper according to kernel version? Or user specifies the helper it prefers? I think, the first way is better.

I would do the simple thing first, allowing users to specify exactly what they want. We can add a detection function to reject the policy if the helper does not exist.

anfedotoff · 2024-05-18T07:10:47Z

I think it should be possible to collect the hash after the filtering, but it's more tricky. In that case, collecting the hash in the action makes more sense to me, but we will need to maintain the necessary arguments to call the helpers.

Ah, I understood. Before args filtering, we need to retrieve all arguments. I think we can try to implement your approach. To get hash using an action, we need to store arguments for bpf-helpers somewhere (suppose in separate bpf-map). So, for now, using actions looks more complicated for me:)).

I would do the simple thing first, allowing users to specify exactly what they want. We can add a detection function to reject the policy if the helper does not exist.

It makes sense. I'll take time to learn more about how to validate tracing policy for correctness.

kkourt · 2024-05-22T08:18:59Z

It makes sense. I'll take time to learn more about how to validate tracing policy for correctness.

Here's an example of checking whether the "multi kprobe" feature is supported:

tetragon/pkg/bpf/detect.go

Line 45 in e7c9ec3

func detectKprobeMulti() bool {

What we can do then is check to see whether a specific feature is supported iff it's used by a tracing policy. See for example:

tetragon/pkg/sensors/tracing/enforcer.go

Line 283 in e7c9ec3

    
           useMulti := !specOpts.DisableKprobeMulti && !option.Config.DisableKprobeMulti && bpf.HasKprobeMulti()

.

anfedotoff · 2024-07-29T12:11:16Z

We already have #2566 merged, so I can start implementing IMA FIM 🚀!

I came to the conclusion that Action for IMA Hash is better at the end and it is not so hard to implement as I think before LSM sensor PR. Maybe I became more familiar with Tetragon, who knows). So, let's consider the following tracingPolicy and imagine that we want to get IMA hash for file being opened:

apiVersion: cilium.io/v1alpha1
kind: TracingPolicy
metadata:
  name: "lsm-file-open"
spec:
  lsmhooks:
  - hook: "file_open"
    args:
      - index: 0
        type: "file"
    selectors:
    - matchArgs:
      - index: 0
        operator: "Equal"
        values:
        - "/etc/passwd"

File_open hook is triggered very often. So if we get hash at the time the args being resolved we will retrieve/calculate hash for every file! If we use action, we will have args filter and we will calculate hash only fo /etc/passwd. The problem is how to get parameters that are needed to call ima_helpers. IIUC, arguments are saved to msg_generic_kprobe here:

tetragon/bpf/process/generic_calls.h

Lines 225 to 234 in 2b07de6

    
           #ifdef GENERIC_LSM 
        
           	struct bpf_raw_tracepoint_args *raw_args = (struct bpf_raw_tracepoint_args *)ctx; 
        
           	e->a0 = BPF_CORE_READ(raw_args, args[0]); 
        
           	e->a1 = BPF_CORE_READ(raw_args, args[1]); 
        
           	e->a2 = BPF_CORE_READ(raw_args, args[2]); 
        
           	e->a3 = BPF_CORE_READ(raw_args, args[3]); 
        
           	e->a4 = BPF_CORE_READ(raw_args, args[4]); 
        
           	generic_process_init(e, MSG_OP_GENERIC_LSM, config); 
        
           #endif

These arguments are pointers that we need to pass to the ima_helpers. All other information that we need is also available at the Action phase.

Another question is where to store an ima_hash? I think msg_generic_kprobe is good and easy choice for that. @kkourt what's your opinion on this?

anfedotoff · 2024-08-02T10:03:42Z

Another question is where to store an ima_hash?

I think we can use a separate map BPF_MAP_TYPE_HASH for passing hashes to user space. The key can be u64 value (pid+4bytes of hash). So, this implementation Action + map will be look like stacktrace Action implementation. I think this will be a good design decision, because this new action will have week coupling with other tetragon code base.

kkourt · 2024-08-12T08:02:33Z

The usual way of passing arguments to userspace is to store them in ->args of msg_generic_kprobe

I think we can use a separate map BPF_MAP_TYPE_HASH for passing hashes to user space. The key can be u64 value (pid+4bytes of hash). So, this implementation Action + map will be look like stacktrace Action implementation. I think this will be a good design decision, because this new action will have week coupling with other tetragon code base.

Adding a map seems like a premature optimization to me. Do we really need it? What is the use-case we are trying to optimize?

anfedotoff · 2024-08-12T09:24:32Z

The usual way of passing arguments to userspace is to store them in ->args of msg_generic_kprobe

I think we can use a separate map BPF_MAP_TYPE_HASH for passing hashes to user space. The key can be u64 value (pid+4bytes of hash). So, this implementation Action + map will be look like stacktrace Action implementation. I think this will be a good design decision, because this new action will have week coupling with other tetragon code base.

Adding a map seems like a premature optimization to me. Do we really need it? What is the use-case we are trying to optimize?

For me it's OK to use ->args to pass hashes to user space. I suppose it is possible to put hashes in ->args at Action phase? Maybe it is better to use ->args, as you suggest.

kkourt · 2024-08-12T09:27:40Z

I suppose it is possible to put hashes in ->args at Action phase? Maybe it is better to use ->args, as you suggest.

That's a good question! I don't see why not, but it's not something we have done before I believe.

anfedotoff · 2024-08-15T15:28:11Z

I suppose it is possible to put hashes in ->args at Action phase? Maybe it is better to use ->args, as you suggest.

That's a good question! I don't see why not, but it's not something we have done before I believe.

Yes, we can use ->args! I found a solution about how to reserve space for a hash and pass hashes to user space!

anfedotoff · 2024-08-20T13:51:22Z

I started to develop IMA hashes collection for LSM events: (#2818) and met some problems:

bpf_ima_inode_hash/bpf_ima_file_hash can be called only from BPF_F_SLEEPABLE lsm programs: lsm.s.
lsm.s programs are strictly limited for maps usage. Only arrays, hashes and ringbuffer are allowed. We can't use BPF_MAP_TYPE_PROG_ARRAY and BPF_MAP_TYPE_PERF_EVENT_ARRAY. It means for us that we can't use bpf-to-bpf calls from lsm.s programs and send events from lsm.s programs with perfbuffer.
verfier doesn't allow us to call ima helpers from generic programs even if we make them sleepable and use allowed maps: 38: (85) call bpf_ima_file_hash#193 R1 type=scalar expected=ptr_, trusted_ptr. R1 is always turns out as scalar value.

Good news is that we can use bpf-to-bpf to lsm.s from lsm programs. In #2818 I managed to get IMA hash and print it to tracing_pipe. At least this is possible. I used such tracingpolicy:

apiVersion: cilium.io/v1alpha1
kind: TracingPolicy
metadata:
  name: "lsm"
spec:
  lsmhooks:
  - hook: "file_open"
    args:
      - index: 0
        type: "file"
    selectors:
    - matchBinaries:
      - operator: "Postfix"
        values:
        - "cat"
      matchArgs:
      - index: 0
        operator: "Equal"
        values:
        - "/etc/passwd"
      matchActions:
      - action: NoPost

Filtration worked I managed to get only hash for /etc/passwd according to policy. This success and Kevin's suggestions in the Tetragon Slack channel gave me an idea for solving the IMA problem. I'll describe my solution a little bit later. Solution is might not be so pretty as we have for generic bpf programs due to limitations of lsm.s programs. But we can discus it!

cc: @kkourt, @kevsecurity

anfedotoff · 2024-08-21T11:01:22Z

As I promised, I put my thoughts about overcoming our problems. First of all, I think we need to make as less changes in bpf code part as possible. Adding new code is better than changing generic concepts. Below I put a picture of generic_lsm sensor bpf part.

We can support ima_hash collection for this hooks (I think with them we can handle most of the cases):

security_bprm_check(struct linux_binprm *bprm)
security_file_open(struct file *file)
security_mmap_file(struct file *file, unsigned long prot, unsigned long flags)

Depending on tracing policy, we will load an appropriate bpf program for hash calculation. Basically, proposed approach is look like stacktrace collection we already have in tetragon.

@kkourt , @kevsecurity looking forward for your comments:)

anfedotoff · 2024-08-21T11:53:35Z

Btw, we can use struct msg_execve_key current, as key for ima_hashes map. So we only need to set
e->common.flags at action phase, that we need to tail call to lsm.s after generic_output event. Also I think we can use BPF_MAP_LRU_HASH to store ima hashes. (I hope we able to use this type of maps).

anfedotoff · 2024-10-07T13:00:40Z

Implemented in #2818

anfedotoff added the kind/enhancement This improves or streamlines existing functionality label May 3, 2024

anfedotoff mentioned this issue Jul 1, 2024

LSM sensor #2566

Merged

anfedotoff mentioned this issue Aug 26, 2024

IMA hashes in LSM events #2818

Merged

anfedotoff closed this as completed Oct 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tetragon based file integrity monitoring (FIM) #2409

Tetragon based file integrity monitoring (FIM) #2409

anfedotoff commented May 3, 2024

anfedotoff commented May 3, 2024

xmulligan commented May 3, 2024

kkourt commented May 17, 2024 •

edited

Loading

anfedotoff commented May 17, 2024

kkourt commented May 17, 2024

anfedotoff commented May 18, 2024

kkourt commented May 22, 2024

anfedotoff commented Jul 29, 2024

anfedotoff commented Aug 2, 2024

kkourt commented Aug 12, 2024

anfedotoff commented Aug 12, 2024 •

edited

Loading

kkourt commented Aug 12, 2024

anfedotoff commented Aug 15, 2024

anfedotoff commented Aug 20, 2024 •

edited

Loading

anfedotoff commented Aug 21, 2024 •

edited

Loading

anfedotoff commented Aug 21, 2024 •

edited

Loading

anfedotoff commented Oct 7, 2024

Tetragon based file integrity monitoring (FIM) #2409

Tetragon based file integrity monitoring (FIM) #2409

Comments

anfedotoff commented May 3, 2024

Is there an existing issue for this?

Is your feature request related to a problem?

Describe the feature you would like

Describe your proposed solution

Code of Conduct

anfedotoff commented May 3, 2024

xmulligan commented May 3, 2024

kkourt commented May 17, 2024 • edited Loading

anfedotoff commented May 17, 2024

kkourt commented May 17, 2024

anfedotoff commented May 18, 2024

kkourt commented May 22, 2024

anfedotoff commented Jul 29, 2024

anfedotoff commented Aug 2, 2024

kkourt commented Aug 12, 2024

anfedotoff commented Aug 12, 2024 • edited Loading

kkourt commented Aug 12, 2024

anfedotoff commented Aug 15, 2024

anfedotoff commented Aug 20, 2024 • edited Loading

anfedotoff commented Aug 21, 2024 • edited Loading

anfedotoff commented Aug 21, 2024 • edited Loading

anfedotoff commented Oct 7, 2024

kkourt commented May 17, 2024 •

edited

Loading

anfedotoff commented Aug 12, 2024 •

edited

Loading

anfedotoff commented Aug 20, 2024 •

edited

Loading

anfedotoff commented Aug 21, 2024 •

edited

Loading

anfedotoff commented Aug 21, 2024 •

edited

Loading