Skip to content

Conversation

@anmyachev
Copy link
Contributor

@anmyachev anmyachev commented Oct 27, 2025

Inspired by intel/torch-xpu-ops@779f899

CI:

Test all models: https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/18856801306 (see summary)

Summary:

=========================================
Summary of only failed models:
Real failed models: 3 [['meta-llama/Llama-3.2-1B', 'eager_fail_to_run'], ['google/gemma-2-2b', 'eager_fail_to_run'], ['CamemBert', 'eager_fail_to_run']]
Real failed models: 3 [['google/gemma-2-2b', 'eager_fail_to_run'], ['meta-llama/Llama-3.2-1B', 'eager_fail_to_run'], ['CamemBert', 'eager_fail_to_run']]
Real failed models: 4 [['google/gemma-2-2b', 'eager_fail_to_run'], ['meta-llama/Llama-3.2-1B', 'eager_fail_to_run'], ['openai/whisper-tiny', 'fail_accuracy'], ['CamemBert', 'eager_fail_to_run']]
Real failed models: 3 [['meta-llama/Llama-3.2-1B', 'eager_fail_to_run'], ['CamemBert', 'eager_fail_to_run'], ['google/gemma-2-2b', 'eager_fail_to_run']]
Real failed models: 3 [['CamemBert', 'eager_fail_to_run'], ['meta-llama/Llama-3.2-1B', 'eager_fail_to_run'], ['google/gemma-2-2b', 'eager_fail_to_run']]
Real failed models: 1 [['CamemBert', 'eager_fail_to_run']]
Real failed models: 1 [['CamemBert', 'eager_fail_to_run']]
Real failed models: 1 [['CamemBert', 'eager_fail_to_run']]
Real failed models: 1 [['CamemBert', 'eager_fail_to_run']]
Real failed models: 1 [['CamemBert', 'eager_fail_to_run']]
Real failed models: 1 [['convit_base', 'eager_fail_to_run']]
Real failed models: 1 [['convit_base', 'eager_fail_to_run']]
Real failed models: 2 [['convit_base', 'eager_fail_to_run'], ['sebotnet33ts_256', 'fail_accuracy']]
Real failed models: 1 [['convit_base', 'eager_fail_to_run']]
Real failed models: 2 [['maml_omniglot', 'eager_fail_to_run'], ['functorch_maml_omniglot', 'eager_fail_to_run']]
Real failed models: 3 [['detectron2_fasterrcnn_r_50_fpn', 'eager_1st_run_OOM'], ['functorch_maml_omniglot', 'eager_fail_to_run'], ['maml_omniglot', 'eager_fail_to_run']]
Real failed models: 2 [['maml_omniglot', 'eager_fail_to_run'], ['functorch_maml_omniglot', 'eager_fail_to_run']]
Real failed models: 2 [['functorch_maml_omniglot', 'eager_fail_to_run'], ['maml_omniglot', 'eager_fail_to_run']]
Real failed models: 6 [['detectron2_fasterrcnn_r_50_dc5', 'eager_1st_run_OOM'], ['functorch_maml_omniglot', 'eager_fail_to_run'], ['detectron2_fasterrcnn_r_101_c4', 'eager_1st_run_OOM'], ['detectron2_fasterrcnn_r_50_c4', 'eager_1st_run_OOM'], ['maml_omniglot', 'eager_fail_to_run'], ['detectron2_fasterrcnn_r_101_dc5', 'eager_1st_run_OOM']]
Real failed models: 2 [['functorch_maml_omniglot', 'eager_fail_to_run'], ['maml_omniglot', 'eager_fail_to_run']]
Real failed models: 2 [['functorch_maml_omniglot', 'eager_fail_to_run'], ['maml_omniglot', 'eager_fail_to_run']]
Real failed models: 2 [['functorch_maml_omniglot', 'eager_fail_to_run'], ['maml_omniglot', 'eager_fail_to_run']]
Real failed models: 2 [['functorch_maml_omniglot', 'eager_fail_to_run'], ['maml_omniglot', 'eager_fail_to_run']]
Real failed models: 2 [['functorch_maml_omniglot', 'eager_fail_to_run'], ['maml_omniglot', 'eager_fail_to_run']]
ERROR: Found failed models!

Error checking:

    return F.linear(input, self.weight, self.bias)
RuntimeError: expected mat1 and mat2 to have the same dtype, but got: float != c10::BFloat16

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
@anmyachev
Copy link
Contributor Author

It looks like HUGGING_FACE_HUB_TOKEN needs to be updated:

HUGGING_FACE_HUB_TOKEN: ${{ secrets.HUGGING_FACE_HUB_TOKEN }}

@kwasd could you take a look? This has P0 priority.

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
@anmyachev anmyachev changed the title [E2E] Align pytorch/benchmarks/dynamo configs to match what torch-xpu-ops uses [E2E] Align pytorch/benchmarks/dynamo configs and torchbench dependencies to match what torch-xpu-ops uses Oct 27, 2025
@anmyachev anmyachev marked this pull request as ready for review October 27, 2025 21:44
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants