[sub-interface] | Err msg detected in syslog when adding a portchannel sub-interface: "Can't connect to teamd LAG='Po58.58', error='No such file or directory'" #11218

Open
nhe-NV opened this issue Jun 22, 2022 · 3 comments
Assignees
Labels
BRCM, Issue for 202205, Triaged

Comments

@nhe-NV

nhe-NV commented Jun 22, 2022

Description

An error message is detected in syslog: "Can't connect to teamd LAG='Po58.58', error='No such file or directory'"

Steps to reproduce the issue:

1. /home/admin# config portchannel add PortChannel58
2. /home/admin# config portchannel member add PortChannel58 Ethernet0
3. /home/admin# config subinterface add Po58.58 58
4. Check the syslog; it contains the error message: "Can't connect to teamd LAG='Po58.58', error='No such file or directory'"

Describe the results you received:

WARNING teamd#tlm_teamd: :- try_add_lag: Can't connect to teamd LAG='Po58.58', error='No such file or directory'. attempt=1
WARNING teamd#tlm_teamd: :- try_add_lag: Can't connect to teamd LAG='Po58.58', error='No such file or directory'. attempt=2
WARNING teamd#tlm_teamd: :- try_add_lag: Can't connect to teamd LAG='Po58.58', error='No such file or directory'. attempt=3
WARNING teamd#tlm_teamd: :- try_add_lag: Can't connect to teamd LAG='Po58.58', error='No such file or directory'. attempt=4
WARNING teamd#tlm_teamd: :- try_add_lag: Can't connect to teamd LAG='Po58.58', error='No such file or directory'. attempt=5
WARNING teamd#tlm_teamd: :- try_add_lag: Can't connect to teamd LAG='Po58.58', error='No such file or directory'. attempt=6
WARNING teamd#tlm_teamd: :- try_add_lag: Can't connect to teamd LAG='Po58.58', error='No such file or directory'. attempt=7
WARNING teamd#tlm_teamd: :- try_add_lag: Can't connect to teamd LAG='Po58.58', error='No such file or directory'. attempt=8
WARNING teamd#tlm_teamd: :- try_add_lag: Can't connect to teamd LAG='Po58.58', error='No such file or directory'. attempt=9
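
To confirm the symptom on a captured syslog, the retry warnings above can be summarised per LAG name. The following is a minimal sketch, not part of SONiC: the regular expression is modelled only on the lines quoted in this report, and the helper name `max_attempts_per_lag` is hypothetical.

```python
import re
from collections import defaultdict

# Pattern modelled on the warning lines quoted in this issue (an assumption
# about the exact format; adjust it if your syslog differs).
TEAMD_RETRY = re.compile(
    r"try_add_lag: Can't connect to teamd LAG='(?P<lag>[^']+)', "
    r"error='(?P<error>[^']+)'\. attempt=(?P<attempt>\d+)"
)


def max_attempts_per_lag(lines):
    """Return {lag_name: highest attempt number seen} for matching lines."""
    attempts = defaultdict(int)
    for line in lines:
        match = TEAMD_RETRY.search(line)
        if match:
            lag = match.group("lag")
            attempts[lag] = max(attempts[lag], int(match.group("attempt")))
    return dict(attempts)


if __name__ == "__main__":
    sample = [
        "WARNING teamd#tlm_teamd: :- try_add_lag: Can't connect to teamd "
        "LAG='Po58.58', error='No such file or directory'. attempt=1",
        "WARNING teamd#tlm_teamd: :- try_add_lag: Can't connect to teamd "
        "LAG='Po58.58', error='No such file or directory'. attempt=9",
    ]
    print(max_attempts_per_lag(sample))
```

A LAG name containing a dot (such as Po58.58) in the result indicates that a sub-interface, rather than a real portchannel, is being retried.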

Describe the results you expected:

There should be no such error message.

Output of show version:

 202111.74-ca284a1bf_Internal

Output of show techsupport:

(paste your output here or download and attach the file here )

Additional information you deem important (e.g. issue happens only occasionally):

@yxieca
Contributor

yxieca commented Jun 22, 2022

Encountered on a Mellanox platform; however, the issue is generic. I believe BRCM made some changes recently that caused the regression.

@yxieca added the BRCM and Triaged labels Jun 22, 2022
neethajohn pushed a commit to sonic-net/sonic-mgmt that referenced this issue Jul 13, 2022
Summary:
Use loganalyzer ignore marker to ignore 'config load' error logs.
Ignore LAG and FDB related error logs found in test case sub_port_interfaces/test_sub_port_interfaces.py::test_admin_status_down_disables_forwarding

Jul 8 07:00:39.302949 foo-DUT ERR teamd#tlm_teamd: :- process_add_queue: Can't connect to teamd after 10 attempts. LAG 'PortChannel1.20'
Jul 8 07:28:02.105624 foo-DUT ERR swss#orchagent: :- update: Failed to get port by bridge port ID 0x3a00000000083f.

How did you do it?
For error log 1, Jul 8 07:00:39.302949 foo-DUT ERR teamd#tlm_teamd: :- process_add_queue: Can't connect to teamd after 10 attempts. LAG 'PortChannel1.20'
It's raised after the test case creates some portchannel-sub-interfaces.
But the portchannel-sub-interfaces have been created successfully:

admin@foo:~/tmp/sub_port_interfaces$ show ip int
Interface        Master    IPv4 address/mask    Admin/Oper    BGP Neighbor    Neighbor IP
---------------  --------  -------------------  ------------  --------------  -------------
Ethernet80.10              10.0.0.56/31         up/up         ARISTA01BT1     10.0.0.57
Ethernet84.10              10.0.0.58/31         up/up         ARISTA02BT1     10.0.0.59
Ethernet88.10              10.0.0.60/31         up/up         ARISTA03BT1     10.0.0.61
Ethernet92.10              10.0.0.62/31         up/up         ARISTA04BT1     10.0.0.63
Ethernet96.10              10.0.0.64/31         up/up         ARISTA05BT1     10.0.0.65
Ethernet100.10             10.0.0.66/31         up/up         ARISTA06BT1     10.0.0.67
Ethernet104.10             10.0.0.68/31         up/up         ARISTA07BT1     10.0.0.69
Ethernet108.10             10.0.0.70/31         up/up         ARISTA08BT1     10.0.0.71
Ethernet112.10             10.0.0.72/31         up/up         ARISTA09BT1     10.0.0.73
Ethernet116.10             10.0.0.74/31         up/up         ARISTA10BT1     10.0.0.75
Ethernet120.10             10.0.0.76/31         up/up         ARISTA11BT1     10.0.0.77
Ethernet124.10             10.0.0.78/31         up/up         ARISTA12BT1     10.0.0.79
Loopback0                  10.1.0.32/32         up/up         N/A             N/A
PortChannel1.20            172.16.0.1/30        up/up         N/A             N/A
PortChannel1.30            172.16.0.5/30        up/up         N/A             N/A
PortChannel1.40            172.16.0.9/30        up/up         N/A             N/A
PortChannel1.50            172.16.0.13/30       up/up         N/A             N/A
PortChannel2.20            172.16.4.1/30        up/up         N/A             N/A
PortChannel2.30            172.16.4.5/30        up/up         N/A             N/A
PortChannel2.40            172.16.4.9/30        up/up         N/A             N/A
PortChannel2.50            172.16.4.13/30       up/up         N/A             N/A
Vlan1000                   192.168.0.1/21       up/up         N/A             N/A
docker0                    240.127.1.1/24       up/down       N/A             N/A
eth0                       10.64.247.28/23      up/up         N/A             N/A
lo                         127.0.0.1/16         up/up         N/A             N/A

admin@foo:~/tmp/sub_port_interfaces$ ping 172.16.0.1
PING 172.16.0.1 (172.16.0.1) 56(84) bytes of data.
64 bytes from 172.16.0.1: icmp_seq=1 ttl=64 time=0.095 ms
64 bytes from 172.16.0.1: icmp_seq=2 ttl=64 time=0.078 ms
64 bytes from 172.16.0.1: icmp_seq=3 ttl=64 time=0.124 ms
According to: sonic-net/sonic-buildimage#11218
I think we can safely ignore it in the sub port test case.

For error log 2, Jul 8 07:28:02.105624 foo-DUT ERR swss#orchagent: :- update: Failed to get port by bridge port ID 0x3a00000000083f.
It's raised after removing a member interface from the vlan, because the deleted interface loses its fdb entry.

As for the loganalyzer ignore marker around 'config load', I had to do that slight refactor; otherwise ignoring the 2 patterns in ignore_regex_list would not be enough, and we would also have to include every error log caused by 'config load'.
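
The two ignore patterns described above could look roughly like the sketch below. This is an illustration only: the regular expressions are derived from the two log lines quoted in this commit message, the names `IGNORE_PATTERNS` and `is_ignored` are hypothetical, and how the patterns are wired into the loganalyzer's ignore list in sonic-mgmt is not shown here.

```python
import re

# Hypothetical patterns modelled on the two error logs quoted above.
IGNORE_PATTERNS = [
    # Error log 1: tlm_teamd gives up connecting for a LAG sub-interface.
    r"ERR teamd#tlm_teamd: :- process_add_queue: "
    r"Can't connect to teamd after \d+ attempts\. LAG '.*'",
    # Error log 2: orchagent cannot resolve a bridge port ID after a
    # member interface was removed from the VLAN.
    r"ERR swss#orchagent: :- update: "
    r"Failed to get port by bridge port ID 0x[0-9a-fA-F]+\.",
]

_COMPILED = [re.compile(pattern) for pattern in IGNORE_PATTERNS]


def is_ignored(log_line):
    """Return True if the log line matches any of the ignore patterns."""
    return any(regex.search(log_line) for regex in _COMPILED)


if __name__ == "__main__":
    line = (
        "Jul 8 07:00:39.302949 foo-DUT ERR teamd#tlm_teamd: :- "
        "process_add_queue: Can't connect to teamd after 10 attempts. "
        "LAG 'PortChannel1.20'"
    )
    print(is_ignored(line))
```

Keeping the patterns this specific means that other ERR lines from teamd or orchagent would still be reported by the test.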

How did you verify/test it?
Run it on physical testbed:

=== Running tests in groups ===
/usr/local/lib/python2.7/dist-packages/ansible/parsing/vault/__init__.py:44: CryptographyDeprecationWarning: Python 2 is no longer supported by the Python core team. Support for it is now deprecated in cryptography, and will be removed in the next release.
  from cryptography.exceptions import InvalidSignature
================================================================================================================== test session starts ===================================================================================================================
platform linux2 -- Python 2.7.17, pytest-4.6.5, py-1.10.0, pluggy-0.13.1
ansible: 2.8.12
rootdir: /var/src/sonic-mgmt-int/tests, inifile: pytest.ini
plugins: celery-4.4.7, forked-1.3.0, metadata-1.11.0, xdist-1.28.0, html-1.22.1, repeat-0.9.1, ansible-2.2.2
collecting ... /usr/local/lib/python2.7/dist-packages/ansible/parsing/vault/__init__.py:44: CryptographyDeprecationWarning: Python 2 is no longer supported by the Python core team. Support for it is now deprecated in cryptography, and will be removed in the next release.
  from cryptography.exceptions import InvalidSignature
collected 1 item                                                                                                                                                                                                                                         

sub_port_interfaces/test_sub_port_interfaces.py::TestSubPorts::test_admin_status_down_disables_forwarding[port_in_lag] PASSED                                                                                                                      [100%]

--------------------------------------------------------------------------------------------- generated xml file: /var/src/sonic-mgmt-int/tests/logs/tr.xml ----------------------------------------------------------------------------------------------
=============================================================================================================== 1 passed in
@dgsudharsan
Collaborator

@adyeung @preetham-singh Can you please share status on this issue?

yejianquan added a commit to sonic-net/sonic-mgmt that referenced this issue Jul 18, 2022
@zhangyanzhao
Collaborator

BRCM will take a look

@judyjoseph judyjoseph self-assigned this Aug 17, 2022
tshalvi pushed a commit to tshalvi/sonic-buildimage that referenced this issue Dec 20, 2022
…ABLE (sonic-net#2408)

* Fix for issue sonic-net#11218
Avoid processing portchannel subinterfaces in teamd
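
The idea named in that commit message, skipping portchannel sub-interfaces before teamd is contacted, can be illustrated as follows. This is a sketch under stated assumptions, not the referenced commit's code: it assumes a sub-interface name is recognisable by the "." separator seen in this thread (Po58.58, PortChannel1.20), and the helper names `is_subinterface` and `lags_for_teamd` are hypothetical.

```python
# Assumption: sub-interface names contain a "." between the parent
# interface and the sub-interface ID, as in "Po58.58" or "PortChannel1.20".
SUBINTF_SEPARATOR = "."


def is_subinterface(name):
    """Return True if the interface name looks like a sub-interface."""
    return SUBINTF_SEPARATOR in name


def lags_for_teamd(names):
    """Keep only names that should be handed to teamd (no sub-interfaces)."""
    return [name for name in names if not is_subinterface(name)]


if __name__ == "__main__":
    candidates = ["PortChannel58", "Po58.58", "PortChannel1.20"]
    print(lags_for_teamd(candidates))
```

With a filter of this kind, no teamd connection would be attempted for Po58.58, so the retry warnings from the report would not be generated for it.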