[Test] pt-nightly-hf-mlm-roberta-b-pre-conv-v2-8-1vm without functionalization by ManfeiBai · Pull Request #868 · GoogleCloudPlatform/ml-testing-accelerators

ManfeiBai · 2023-04-14T18:28:14Z

Description

Please include a summary of relevant context/issue and your changes.

Tests

Please describe the tests that you ran on TPUs to verify changes.

Instruction and/or command lines to reproduce your tests: ...

List links for your tests (use go/shortn-gen for any internal link): ...

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run one-shot tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed.

…IZATION

alanwaketan · 2023-04-18T18:04:32Z

@ManfeiBai You can try one shot tests here:

ml-testing-accelerators/tests/README.md

Line 36 in 583e71a

## Running a One Shot Test

just to validate the test run. No need to wait until it finishes to see the result.

alanwaketan · 2023-04-18T18:26:50Z

tests/pytorch/nightly/hf-lm.libsonnet

        pip install tensorboardX google-cloud-storage
        echo 'export PATH=~/.local/bin:$PATH' >> ~/.bash_profile
        echo 'export XLA_USE_BF16=1' >> ~/.bash_profile
+        echo 'export XLA_DISABLE_FUNCTIONALIZATION=1' >> ~/.bash_profile


Can we have a new tpuVm (tpuVmNoFunc) setup that inherits tpuVm and only use that for "hf_lm + v2_8 + roberta_base_pre"?

You can see how pjrt doest it: https://github.com/GoogleCloudPlatform/ml-testing-accelerators/blob/master/tests/pytorch/nightly/resnet50-mp.libsonnet#L188

@will-cromar Feel free to chime in.

Will you be using it in other tests? If it's just for this file, we can add a mixin here that sets this variable, e.g.

noFunctionalization:: { tpuSettings+: { tpuvmExports+: ||| XLA_DISABLE_FUNCTIONALIZATION=1 ||| } }

The correct place to set environment variables for these tests is tpuVmExports+. The use of .bash_profile is probably just historical baggage.

If it's shared by other tests, feel free to add this field to common instead.

test pt-nightly-hf-mlm-roberta-b-pre-conv-v2-8-1vm without FUNCTIONAL…

93e7dfc

…IZATION

ManfeiBai changed the title ~~[Test] test pt-nightly-hf-mlm-roberta-b-pre-conv-v2-8-1vm without FUNCTIONAL…~~ [Test] pt-nightly-hf-mlm-roberta-b-pre-conv-v2-8-1vm without functionalization Apr 14, 2023

ManfeiBai added 2 commits April 14, 2023 18:33

comment

fe22218

delet duplicate

e5a02f9

alanwaketan requested review from alanwaketan and will-cromar April 18, 2023 18:02

alanwaketan reviewed Apr 18, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Test] pt-nightly-hf-mlm-roberta-b-pre-conv-v2-8-1vm without functionalization#868

[Test] pt-nightly-hf-mlm-roberta-b-pre-conv-v2-8-1vm without functionalization#868
ManfeiBai wants to merge 3 commits intoGoogleCloudPlatform:masterfrom
ManfeiBai:newforhfmlmprerebortptnightly

ManfeiBai commented Apr 14, 2023 •

edited

Loading

Uh oh!

alanwaketan commented Apr 18, 2023

Uh oh!

alanwaketan Apr 18, 2023

Uh oh!

will-cromar Apr 18, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ManfeiBai commented Apr 14, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Tests

Checklist

Uh oh!

alanwaketan commented Apr 18, 2023

Uh oh!

alanwaketan Apr 18, 2023

Choose a reason for hiding this comment

Uh oh!

will-cromar Apr 18, 2023

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ManfeiBai commented Apr 14, 2023 •

edited

Loading