Skip to content

Enable minicpm3 for OpenVINO backend architecture tests#175

Merged
zhaixuejun1993 merged 1 commit into
ravi9:dev_backend_openvinofrom
zhaixuejun1993:xuejun/arch-test-minicpm3
May 20, 2026
Merged

Enable minicpm3 for OpenVINO backend architecture tests#175
zhaixuejun1993 merged 1 commit into
ravi9:dev_backend_openvinofrom
zhaixuejun1993:xuejun/arch-test-minicpm3

Conversation

@zhaixuejun1993
Copy link
Copy Markdown
Collaborator

This pull request introduces improvements to how sequence and attention parameters are handled in the OpenVINO decoder implementation. The main focus is on ensuring that parameters such as token_len_per_seq and attention_size are correctly set and passed as inputs when available.

Parameter handling improvements:

  • In compute_llm_params, the token_len_per_seq parameter is now set based on the size of node->src[1] if it hasn't been set yet and the source exists. This ensures accurate tracking of token lengths for each sequence.
  • In add_extra_inputs, both attention_size and token_len_per_seq are now only added as extra inputs if their values are set (not equal to -1), preventing the creation of unnecessary or invalid inputs.## Overview

Additional information

Requirements

@zhaixuejun1993 zhaixuejun1993 merged commit fba54ea into ravi9:dev_backend_openvino May 20, 2026
2 of 12 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant