Initial support gfx950 #64

xinyazhang · 2025-01-09T06:10:59Z

Supported under the name "Unidentified"
Bump the Triton compiler
Add pyaotriton.HipMemory to manage GPU memory when Torch is CPU-only
Add env var AOTRITON_TORCH_ONLY_USE_CPU to run unit tests with CPU-only Torch

Note: as initial support, there is no tuning database for gfx950

This name is intentionally long and unaligned with others to indicate its experimental status.

compiler complains when using num_stages = 4

This allows us using CPU-only Torch for testing.

xinyazhang · 2025-01-09T06:18:25Z

All FOR_RELEASE=1 test/test_backward.py passed on real hardware.

* Supported under the name "Unidentified" * Bump the Triton compiler * Add `pyaotriton.HipMemory` to manage GPU memory when Torch is CPU-only * Add env var `AOTRITON_TORCH_ONLY_USE_CPU` to run unit tests with CPU-only Torch Note: as initial support, there is no tuning database for gfx950

xinyazhang added 12 commits January 3, 2025 15:59

Switch to new Triton with gfx950 support.

b646bef

Add gfx950 target under the name "Unidentified"

e382110

This name is intentionally long and unaligned with others to indicate its experimental status.

Add gfx950 in C++ piece

2a0e4fb

Change default compiler options

4c9ae87

compiler complains when using num_stages = 4

Sync the compile.py with current Triton

e6d05fb

Skip checking of LUT entries for Unidentified GPU

62e1945

Remove Unidentified from default target

7519c35

Add HipMemory python binding and env var AOTRITON_TORCH_ONLY_USE_CPU

e74d974

This allows us using CPU-only Torch for testing.

Adjust fudge factor for CPU only Torch

90dce81

Fix the remaining UTs

04fc32f

Fix test_gqa and fudge factor for q

35a97dd

Remove legacy bindings/Makefile

f52a1df

xinyazhang requested review from pruthvistony, jeffdaily and alugorey January 9, 2025 06:16

xinyazhang marked this pull request as ready for review January 9, 2025 06:17

jeffdaily approved these changes Jan 9, 2025

View reviewed changes

xinyazhang merged commit e8515a8 into main Jan 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial support gfx950 #64

Initial support gfx950 #64

xinyazhang commented Jan 9, 2025 •

edited

Loading

xinyazhang commented Jan 9, 2025 •

edited

Loading

Initial support gfx950 #64

Initial support gfx950 #64

Conversation

xinyazhang commented Jan 9, 2025 • edited Loading

xinyazhang commented Jan 9, 2025 • edited Loading

xinyazhang commented Jan 9, 2025 •

edited

Loading

xinyazhang commented Jan 9, 2025 •

edited

Loading