Fix Dawn nightly: select SDPA attn output by shape, not numel (#20283) by JulianCloudNTH · Pull Request #20283 · pytorch/executorch

JulianCloudNTH · 2026-06-15T16:04:20Z

Summary:

The WebGPU Dawn native nightly (`webgpu_native_test`) fails deterministically on the `llama1b_prefill` SDPA config with `FAIL: ambiguous attention output: 3 tensors match numel 262144`. That failure fails the test binary and turns the job red.

`sdpa_with_kv_cache` returns three tensors, `[k_cache, v_cache, attn_output]`. `test_sdpa_config` identified the attention output purely by element count (`numel == S*Hq*D`). For `llama1b_prefill` (`Hq=32, Hkv=8, D=64, S=128, Cmax=512`) the attention count `S*Hq*D = 128*32*64 = 262144` coincides exactly with each cache count `Cmax*Hkv*D = 512*8*64 = 262144`. All three outputs therefore match `numel`, and the existing ambiguity guard correctly bails before any numeric comparison. The kernel output itself is fine: the sibling `llama1b_decode` config (same `Hq/Hkv/D`) passes at `~1e-9`, so only the test's output-selection heuristic was wrong. The colliding config and the numel selector were introduced together in D107595144.
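For reviewers who want to check the collision arithmetic, here is a minimal standalone sketch. `SdpaConfig`, `attn_numel`, `cache_numel`, and `kLlama1bPrefill` are hypothetical names made up for this illustration and are not taken from the test source; only the numeric values come from the summary above.

```cpp
#include <cstdint>

// Hypothetical config struct; field names mirror the symbols in the summary.
struct SdpaConfig {
  std::int64_t S;     // sequence length
  std::int64_t Hq;    // number of query heads
  std::int64_t Hkv;   // number of key/value heads
  std::int64_t D;     // head dimension
  std::int64_t Cmax;  // maximum cache length
};

// Flat element count of the attention output [1, S, Hq, D].
constexpr std::int64_t attn_numel(const SdpaConfig& c) {
  return c.S * c.Hq * c.D;
}

// Flat element count of each cache tensor [1, Cmax, Hkv, D].
constexpr std::int64_t cache_numel(const SdpaConfig& c) {
  return c.Cmax * c.Hkv * c.D;
}

// Values quoted in the summary for llama1b_prefill.
constexpr SdpaConfig kLlama1bPrefill{128, 32, 8, 64, 512};

// Both counts are 262144, so a numel-only selector cannot tell them apart.
static_assert(attn_numel(kLlama1bPrefill) == 262144, "attention numel");
static_assert(cache_numel(kLlama1bPrefill) == 262144, "cache numel");
```

The collision follows from `S*Hq == Cmax*Hkv` (128*32 == 512*8 == 4096), so any config with that property would trip a count-based selector regardless of `D`.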

Fix: disambiguate by shape instead of flat count. The attention output is `[1, S, Hq, D]` while each cache is `[1, Cmax, Hkv, D]`; these differ in dims 1 and 2 even when the flat count collides. Match `dim()==4 && size(1)==S && size(2)==Hq && size(3)==D`, keeping the `attn_matches > 1` ambiguity guard as a backstop.
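The selection rule can be sketched roughly as follows. This is not the code from `test_sdpa_config`: `Shape` is a hypothetical stand-in for a tensor's sizes, `select_attn_output` is an illustrative helper, and returning `-1` for the ambiguous case is an assumption made here in place of the test's `FAIL` path.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a tensor: only its sizes matter for selection.
using Shape = std::vector<std::int64_t>;

// Returns the index of the single output shaped [1, S, Hq, D].
// Returns -1 if no output matches or if more than one does (the
// "attn_matches > 1" backstop described above).
int select_attn_output(const std::vector<Shape>& outputs,
                       std::int64_t S, std::int64_t Hq, std::int64_t D) {
  assert(S > 0 && Hq > 0 && D > 0);
  int attn_matches = 0;
  int attn_index = -1;
  for (std::size_t i = 0; i < outputs.size(); ++i) {
    const Shape& s = outputs[i];
    // Mirrors dim()==4 && size(1)==S && size(2)==Hq && size(3)==D.
    if (s.size() == 4 && s[1] == S && s[2] == Hq && s[3] == D) {
      ++attn_matches;
      attn_index = static_cast<int>(i);
    }
  }
  return attn_matches == 1 ? attn_index : -1;
}
```

With the `llama1b_prefill` shapes, the two caches `[1, 512, 8, 64]` no longer match and only `[1, 128, 32, 64]` is selected. The guard still matters for configs where `S == Cmax` and `Hq == Hkv`, since the cache and attention shapes would then be identical.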

Scope: test-only, one function (test_sdpa_config); no kernel, runtime, or export change.

Authored with Claude Code.

Reviewed By: Gasoonjia

Differential Revision: D108625761

pytorch-bot · 2026-06-15T16:04:25Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20283

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

meta-codesync · 2026-06-15T16:04:30Z

@JulianCloudNTH has exported this pull request. If you are a Meta employee, you can view the originating Diff in D108625761.

github-actions · 2026-06-15T16:05:22Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.


meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 15, 2026

JulianCloudNTH had a problem deploying to cadence June 15, 2026 16:04 — with GitHub Actions Failure

meta-codesync Bot added the meta-exported label Jun 15, 2026

Gasoonjia approved these changes Jun 15, 2026

View reviewed changes

meta-codesync Bot changed the title ~~Fix Dawn nightly: select SDPA attn output by shape, not numel~~ Fix Dawn nightly: select SDPA attn output by shape, not numel (#20283) Jun 15, 2026

JulianCloudNTH force-pushed the export-D108625761 branch from 3186b53 to 63b3389 Compare June 15, 2026 16:44

JulianCloudNTH had a problem deploying to cadence June 15, 2026 16:44 — with GitHub Actions Error

JulianCloudNTH force-pushed the export-D108625761 branch from 63b3389 to 717a011 Compare June 15, 2026 16:45

JulianCloudNTH had a problem deploying to cadence June 15, 2026 16:47 — with GitHub Actions Failure

meta-codesync Bot merged commit a9dd615 into pytorch:main Jun 15, 2026
175 of 180 checks passed

JulianCloudNTH deleted the export-D108625761 branch June 15, 2026 17:22

shoumikhin mentioned this pull request Jun 15, 2026

Fix webgpu native SDPA test: select attention output by position, not numel #20282

Closed

JulianCloudNTH mentioned this pull request Jun 15, 2026

SDPA tests: select attn output by shape #20285

Open
