Skip to content

[release-4.21] Handle missing sriov_numvfs during Mellanox firmware reset#1240

Open
SchSeba wants to merge 2 commits into
openshift:release-4.21from
SchSeba:cp-release-4.21-sriov-numvfs-20260702
Open

[release-4.21] Handle missing sriov_numvfs during Mellanox firmware reset#1240
SchSeba wants to merge 2 commits into
openshift:release-4.21from
SchSeba:cp-release-4.21-sriov-numvfs-20260702

Conversation

@SchSeba

@SchSeba SchSeba commented Jul 2, 2026

Copy link
Copy Markdown
Contributor

Summary

  • skip resetting sriov_numvfs to 0 when the sysfs file is not present after staging Mellanox firmware changes
  • avoid failing the Mellanox firmware reset flow while SRIOV_EN is still disabled until reboot
  • mount writable /tmp for mstconfig lockfiles when the config-daemon runs with readOnlyRootFilesystem

Test plan

  • Not run (backport only)

Made with Cursor

rollandf and others added 2 commits July 2, 2026 15:28
When running with readOnlyRootFilesystem enabled, mstconfig fails to
create its lockfiles because the path /tmp/mstflint_lockfiles is
hardcoded in the mstflint source code.

This patch adds an emptyDir volume mounted at /tmp to provide a
writable location for these lockfiles while maintaining the security
benefits of a read-only root filesystem for the rest of the container.

Signed-off-by: Fred Rolland <frolland@nvidia.com>
(cherry picked from commit 77a9221)
Signed-off-by: Sebastian Sch <sebassch@gmail.com>
When SRIOV_EN is false the sriov_numvfs sysfs file does not exist.
SetSriovNumVfs now checks for the file before writing and returns
success when numVfs is 0 and the file is absent, so callers like
MlxResetFW do not need special error handling for this case.

Signed-off-by: Sebastian Sch <sebassch@gmail.com>
(cherry picked from commit b93b9be)
Signed-off-by: Sebastian Sch <sebassch@gmail.com>
@openshift-ci openshift-ci Bot requested review from MrSanketkumar and dougbtv July 2, 2026 15:30
@openshift-ci

openshift-ci Bot commented Jul 2, 2026

Copy link
Copy Markdown
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: SchSeba

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci Bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jul 2, 2026
@openshift-ci

openshift-ci Bot commented Jul 2, 2026

Copy link
Copy Markdown
Contributor

@SchSeba: all tests passed!

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants