Skip to content

Fix SR-IOV test failures on HPC/GPU SKUs with InfiniBand interfaces#4313

Draft
umfranci wants to merge 2 commits intomainfrom
umfranci/vf-count-ib-28022026
Draft

Fix SR-IOV test failures on HPC/GPU SKUs with InfiniBand interfaces#4313
umfranci wants to merge 2 commits intomainfrom
umfranci/vf-count-ib-28022026

Conversation

@umfranci
Copy link
Collaborator

@umfranci umfranci commented Mar 2, 2026

On SKUs with InfiniBand, RDMA interfaces (ib*) are discovered as PCI NICs, inflating VF counts beyond what ARM reports and crashing tests that assume all PCI NICs have IP addresses. Following are the Key changes made:

  • Added 2 new functions - get_pci_nics_except_ib() and get_device_slots_except_ib() to filter out IB interfaces from VF/slot counts.
  • Used get_pci_nics_except_ib() in check_sriov() and get_pci_nics_except_ib() in check_sriov_count() VF count validation.
  • Fixed get_pci_slot_from_device_path() to return the leaf PCI device slot instead of a parent bridge.
  • Used IB-excluded counts in initialize_nic_info() and sriov_basic_test() and added IB skip logic in VF connection tests.
  • Skipped IB interfaces in verify_sriov_interrupts_change() to prevent crashes on NICs without IPs.
  • Skipped IB interfaces in server NIC matching; fixed .is_not_none() → .is_not_empty() bug.

@umfranci umfranci requested a review from SRIKKANTH March 2, 2026 07:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant