Skip to content

Revert fattn q static shape#155

Closed
cavusmustafa wants to merge 33 commits into
ravi9:masterfrom
cavusmustafa:revert_fattn_q_static_shape
Closed

Revert fattn q static shape#155
cavusmustafa wants to merge 33 commits into
ravi9:masterfrom
cavusmustafa:revert_fattn_q_static_shape

Conversation

@cavusmustafa
Copy link
Copy Markdown
Collaborator

reverting static Q shape for attention layers due to issues with llama-perplexity.
will raise another PR for final solution.

zhaixuejun1993 and others added 30 commits April 22, 2026 14:33
* added translate_1to1_match_1_input function and updated gelu and tanh translations

* Remove unused translation function calls

---------

Co-authored-by: Mustafa Cavus <mustafacavus@intel.com>
* OpenVINO backend: refactor VIEW related operation

* Enable VIEW handling in following ops

* OpenVINO backend does not support GGML_OP_NORM & GGML_OP_L2_NORM with VIEW input accuracy issue from OpenVINO
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants