Enable RMS_NORM and VIEW in OpenVINO backend, remove op_case 2 for rope #153

Open
zhaixuejun1993 wants to merge 4 commits into ravi9:dev_backend_openvino from zhaixuejun1993:xuejun/op_case_rope

@zhaixuejun1993
Collaborator

This pull request adjusts the OpenVINO backend integration in ggml in three places: how GgmlOvDecoder computes the operator case, how it detects output indices, and which operations the backend rejects when their input is a VIEW.

Operator handling improvements:

  • Updated the logic in GgmlOvDecoder::compute_op_case to no longer set a specific bit for nodes whose first source is a VIEW operation, simplifying the operator case determination process.
  • Refined the is_output_idx static method in GgmlOvDecoder to additionally check that the second source tensor's operation is GGML_OP_NONE, making output index detection more precise.
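The extra condition described for is_output_idx can be sketched as follows. This is only an illustration under stated assumptions: the Op enum and Tensor struct are hypothetical stand-ins for the ggml types, second_src_is_plain_tensor is a made-up helper name, and the other conditions that the real is_output_idx evaluates are not described in this PR and are not shown.

```cpp
#include <array>

// Hypothetical stand-ins for ggml's op enum and tensor struct.
// Only NONE and VIEW are modelled after GGML_OP_NONE / GGML_OP_VIEW.
enum class Op { NONE, VIEW, RMS_NORM, ROPE };

struct Tensor {
    Op op = Op::NONE;
    std::array<const Tensor*, 2> src{nullptr, nullptr};
};

// Assumed shape of the added check: the second source must exist and must
// not be produced by another operation (its op is NONE).
bool second_src_is_plain_tensor(const Tensor& node) {
    const Tensor* s1 = node.src[1];
    return s1 != nullptr && s1->op == Op::NONE;
}
```

Under this sketch, a node whose second source is itself the result of a VIEW (or any other op) would no longer pass the check, which is one plausible reading of "more precise" output index detection.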

Backend support adjustments:

  • Removed GGML_OP_RMS_NORM from the set of operations that are not supported with a VIEW input in the OpenVINO backend, potentially enabling new operation combinations.
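As a rough sketch of what this kind of support check looks like, assume the backend keeps a set of ops it rejects whenever their first source is a VIEW. Everything here is hypothetical: the Op enum, the Node struct, the rejected_for_view_input helper, and the OTHER_OP placeholder for whatever entries remain in the real set, which this PR description does not list.

```cpp
#include <set>

// Hypothetical stand-ins; OTHER_OP is a placeholder for the ops that stay
// in the backend's real list, which is not shown in this PR.
enum class Op { NONE, VIEW, RMS_NORM, OTHER_OP };

struct Node {
    Op op;            // operation of the node being checked
    Op first_src_op;  // operation that produced its first source
};

// A node is rejected when its first source is a VIEW and its own op is in
// the blocked set.
bool rejected_for_view_input(const Node& n, const std::set<Op>& blocked) {
    return n.first_src_op == Op::VIEW && blocked.count(n.op) > 0;
}
```

With RMS_NORM in the blocked set, an RMS_NORM node fed by a VIEW is rejected; once RMS_NORM is taken out of the set, the same node passes this particular check, while the remaining entries behave as before.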

@cavusmustafa
Collaborator

Cannot verify currently since dev_backend_openvino is broken for gemma4 due to a previous change: 335cf5c
