Return traffic can be denied for a short duration once the policies are reconciled on a new pod #345

Pavani-Panakanti · 2024-12-06T19:29:28Z

What happened:
In standard mode, we do a default allow at pod startup and all traffic is allowed before policies are reconciled. It takes 1-2secs for the policies to be reconciled on the new pod. Once the network policy reconciliation happens, we start tracking the flows in conntrack table. For return traffic we check if entry is present in conntrack table and allow it accordingly. For traffic which exited the pod before network policies were applied and return traffic came after policies were applied, the return traffic will be denied as entry is not tracked in conntrack table

As a mitigation, 2-5secs delay can be added at the pod startup using init container. As a result, traffic will start going out of the pod only after network policies were applied and there will be no denies in the return traffic

We are actively working on fixing this issue, so that cx can use standard mode without the need to add sleep at pod startup. Fix for this issue can be tracked here

Please note that this issue happens only with standard mode and not in strict mode

What you expected to happen:

How to reproduce it (as minimally and precisely as possible):

Anything else we need to know?:

Environment:

Kubernetes version (use kubectl version):
CNI Version
Network Policy Agent Version
OS (e.g: cat /etc/os-release):
Kernel (e.g. uname -a):

The text was updated successfully, but these errors were encountered:

youwalther65 · 2024-12-09T07:57:23Z

NetPol are always asynchronously reconciled. What makes standard mode so special here compared to strict mode? Can you please elaborate on the technical side a bit.

m00lecule · 2024-12-09T12:02:23Z

Secondly I believe the strict mode should be also reviewed in context of postponed startup - https://docs.aws.amazon.com/eks/latest/userguide/cni-network-policy-configure.html#cni-network-policy-configure-policy.

In strict mode the pods are starting in default deny mode (for 1-2s workloads cannot access anything), indicating the startup should be also postponed by 5s to ensure the networkpolicies are reconciled. Before the initial reconciliation the pods are isolated from networking perspective which is not a useful state.

Eventually all of strict mode users will consider postponing startup by few seconds to ensure smooth operations. I believe we could do a favor to strict mode users and delay the workloads startup for everybody. Te goal is to ensure they won't be obligated to introduce some home crafted startup commands after trying our EKS + vpc-cni + networkpolicy enabled, which would lead to much smoother experience for upcoming EKS users.

Pavani-Panakanti · 2024-12-12T02:27:45Z

@youwalther65 In strict mode, we do default deny before policies are applied on the first pod, so no egress traffic goes out of the pod before policies were applied. So above issue will not happen where response packet will be denied as entry is missing in conntrack table for traffic that egressed out of pod before applying policies

Pavani-Panakanti · 2024-12-18T01:22:24Z

@m00lecule We are looking into improving the user experience for strict mode. This is something we are prioritizing. We will provide more details soon

janavenkat · 2025-03-06T09:25:45Z

Syncing here as well aws/amazon-vpc-cni-k8s#3206 (comment)

m00lecule · 2025-03-06T22:05:51Z

@Pavani-Panakanti The issue is still present after upgrading vpc-cni to v1.19.3-eksbuild.1.

Pavani-Panakanti · 2025-03-07T20:01:30Z

Looking into this. Will add an update soon

Pavani-Panakanti added the bug Something isn't working label Dec 6, 2024

This was referenced Dec 6, 2024

Network policy blocks established connections to STS. #73

Closed

Race condition causes quickly opened connections to fail #186

Closed

Pavani-Panakanti changed the title ~~[Standard mode] Return traffic can be denied for a short duration once the policies are reconciled on a new pod~~ Return traffic can be denied for a short duration once the policies are reconciled on a new pod Dec 7, 2024

This was referenced Dec 7, 2024

Response traffic from allowed egress denied on short lived pods #189

Closed

Network policy blocks established connections to RDS #236

Closed

m00lecule mentioned this issue Dec 8, 2024

Network Policy Rule Evaluation Blocks Traffic to DNS Server aws/amazon-network-policy-controller-k8s#146

Open

This was referenced Feb 3, 2025

Fix standard mode return packet drop at pod startup #361

Merged

Changes to attach probes at pod start aws/amazon-vpc-cni-k8s#3188

Open

orsenthil mentioned this issue Feb 13, 2025

Connection Issues with VPC CNI Network Policy Enforcement aws/amazon-vpc-cni-k8s#3203

Open

haouc mentioned this issue Feb 18, 2025

Changes to attach probes at pod start aws/amazon-vpc-cni-k8s#3206

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Return traffic can be denied for a short duration once the policies are reconciled on a new pod #345

Return traffic can be denied for a short duration once the policies are reconciled on a new pod #345

Pavani-Panakanti commented Dec 6, 2024 •

edited

Loading

youwalther65 commented Dec 9, 2024

m00lecule commented Dec 9, 2024 •

edited

Loading

Pavani-Panakanti commented Dec 12, 2024

Pavani-Panakanti commented Dec 18, 2024

janavenkat commented Mar 6, 2025

m00lecule commented Mar 6, 2025

Pavani-Panakanti commented Mar 7, 2025

Return traffic can be denied for a short duration once the policies are reconciled on a new pod #345

Return traffic can be denied for a short duration once the policies are reconciled on a new pod #345

Comments

Pavani-Panakanti commented Dec 6, 2024 • edited Loading

youwalther65 commented Dec 9, 2024

m00lecule commented Dec 9, 2024 • edited Loading

Pavani-Panakanti commented Dec 12, 2024

Pavani-Panakanti commented Dec 18, 2024

janavenkat commented Mar 6, 2025

m00lecule commented Mar 6, 2025

Pavani-Panakanti commented Mar 7, 2025

Pavani-Panakanti commented Dec 6, 2024 •

edited

Loading

m00lecule commented Dec 9, 2024 •

edited

Loading