Releases: tenstorrent/tt-metal
v0.56.0-rc45
Note
If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.
The changelog will now follow, showing the changes from last release.
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/13621564810
📦 Uncategorized
- #0: Add programming examples for TT-Metalium multi-device native APIs
- PR: #17331
v0.56.0-rc44
Note
If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.
The changelog will now follow, showing the changes from last release.
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/13599594043
📦 Uncategorized
- #0: Add programming examples for TT-Metalium multi-device native APIs
- PR: #17331
v0.56.0-rc43
Note
If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.
The changelog will now follow, showing the changes from last release.
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/13579442965
📦 Uncategorized
- #0: Add programming examples for TT-Metalium multi-device native APIs
- PR: #17331
v0.56.0-rc42
Note
If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.
The changelog will now follow, showing the changes from last release.
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/13556960790
📦 Uncategorized
- #0: Add programming examples for TT-Metalium multi-device native APIs
- PR: #17331
v0.56.0-rc41
Note
If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.
The changelog will now follow, showing the changes from last release.
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/13534601244
📦 Uncategorized
- #0: Add programming examples for TT-Metalium multi-device native APIs
- PR: #17331
v0.56.0-rc38
Note
If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.
The changelog will now follow, showing the changes from last release.
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/13468231055
📦 Uncategorized
v0.56.0-rc37
Note
If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.
The changelog will now follow, showing the changes from last release.
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/13448174485
📦 Uncategorized
v0.56.0-rc36
Note
If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.
The changelog will now follow, showing the changes from last release.
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/13425954330
📦 Uncategorized
- Implement JointAttention
- PR: #17079
- #17374: Add concurrency group for _produce-data.yaml
- PR: #17402
- Extra cleanup. With emphasis
- PR: #17466
- Add
.test_durations
for ttnn pytest splits- PR: #17533
- #13856: update gelu_bw op documentation on bfloat8_b limitations
- PR: #17407
- #17134: Enable test_transformer_2d_model UT in SD
- PR: #17534
- Add test for create_qkv_heads_decode on subcoregrids
- PR: #17507
- Always flatten convolution input tensors
- PR: #17463
- #0: Use kernel time instead of fw time for calculating device time
- PR: #17424
- Fix golden function for
ttnn.linear
so it can be used with comparison mode- PR: #17288
- [skip ci] update codeowners with matmul+ team changes
- PR: #17540
- #0: Fix NCRISC delay
- PR: #17526
- Change create_tt_tensor_from_py_data to use from_vector - fixed branch
- PR: #17389
- Revert "Change create_tt_tensor_from_py_data to use from_vector - fixed branch"
- PR: #17547
- #0: Add missing flush in send_chunk_from_address
- PR: #17545
- Pick up latest actions setup venv
- PR: #17553
- #16364: split prefetch dependent config
- PR: #17548
- #16339: let
DispatchMemMap
get created with customDispatchSettings
- PR: #17449
- #0: update CODEOWNERS based on eltwise team changes
- PR: #17552
- [skip ci] add links and standardize pull request checklist text
- PR: #17539
- [skip ci] Revert "disable-pr-gate"
- PR: #17561
- #0: Simplify MeshDevice construction by eradicating mesh_type
- PR: #17416
- #17541: Guard against passing 0 to counting zero intrinsics in program.cpp
- PR: #17549
- Revamp install_dependencies.sh
- PR: #17565
- Jjiang/fixed tensor creation and conversion
- PR: #17564
- Create Tutorial - Adding a Model
- PR: #16462
- [skip ci] Update README.md
- PR: #17571
- #17201: Implement IDevice tracing APIs for MeshDevice
- PR: #17465
- [skip ci] Update README.md
- PR: #17572
- Optimize EDM-fabric flow-control protocols
- PR: #17495
v0.56.0-rc35
Note
If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.
The changelog will now follow, showing the changes from last release.
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/13403584877
📦 Uncategorized
- Implement JointAttention
- PR: #17079
- #17374: Add concurrency group for _produce-data.yaml
- PR: #17402
- Extra cleanup. With emphasis
- PR: #17466
- #17324: port Padding CPP Unit OP Tests to TTNN
- PR: #17354
- Add dprint/watcher support for BH second erisc
- PR: #17387
- Fix outdated call to
ttnn.open_device
in ttnn tutorial notebooks- PR: #17501
- Use three arguments for distro, version, architecture
- PR: #17502
- update deprecated API use
- PR: #17509
- Fix pgm dispatch sweep to use new args
- PR: #17511
- Fix nanogpt failure in tt-train after shape changes
- PR: #17515
- Fix init calls for binary sfpu pow and quant
- PR: #17267
- Aliu/quanta bringup
- PR: #17245
- Split ttnn pytests into more groups
- PR: #17512
- Remove MeshCQ's dependence on HardwareCommandQueue for loading SubDevices
- PR: #17478
- Fix two broken workflows by updating arguments
- PR: #17518
- Aliu/quanta bringup
- PR: #17517
- #17203: Remove set_worker_queue_mode method from
IDevice
- PR: #17108
- [skip ci] Update metal-api-surface.yaml
- PR: #17471
- Fix workflow call
- PR: #17521
- [skip-ci] Missed one
- PR: #17522
- #0: Fix T3000 unit tests and suppress chatty logs
- PR: #17520
- Use transaction id for eth receiver write to worker
- PR: #17369
- [skip ci] Fix name bug
- PR: #17527
- Update data_formats.md with mantissa rounding description
- PR: #17529
- #0: Use stream autoinc register to send word credits over ethernet.
- PR: #17504
- #17134: Fix SD resnet block ut parametrizations
- PR: #17532
- Add
.test_durations
for ttnn pytest splits- PR: #17533
- #13856: update gelu_bw op documentation on bfloat8_b limitations
- PR: #17407
- #17134: Enable test_transformer_2d_model UT in SD
- PR: #17534
- Add test for create_qkv_heads_decode on subcoregrids
- PR: #17507
- Always flatten convolution input tensors
- PR: #17463
- #0: Use kernel time instead of fw time for calculating device time
- PR: #17424
- Fix golden function for
ttnn.linear
so it can be used with comparison mode- PR: #17288
- [skip ci] update codeowners with matmul+ team changes
- PR: #17540
- #0: Fix NCRISC delay
- PR: #17526
- Change create_tt_tensor_from_py_data to use from_vector - fixed branch
- PR: #17389
- Revert "Change create_tt_tensor_from_py_data to use from_vector - fixed branch"
- PR: #17547
- #0: Add missing flush in send_chunk_from_address
- PR: #17545
- Pick up latest actions setup venv
- PR: #17553
- #16364: split prefetch dependent config
- PR: #17548
- #16339: let
DispatchMemMap
get created with customDispatchSettings
- PR: #17449
- #0: update CODEOWNERS based on eltwise team changes
- PR: #17552
- [skip ci] add links and standardize pull request checklist text
- PR: #17539
- [skip ci] Revert "disable-pr-gate"
- PR: #17561
- #0: Simplify MeshDevice construction by eradicating mesh_type
- PR: #17416
- #17541: Guard against passing 0 to counting zero intrinsics in program.cpp
- PR: #17549
- Revamp install_dependencies.sh
- PR: #17565
- Jjiang/fixed tensor creation and conversion
- PR: #17564
- Create Tutorial - Adding a Model
- PR: #16462
- [skip ci] Update README.md
- PR: #17571
- #17201: Implement IDevice tracing APIs for MeshDevice
- PR: #17465
- [skip ci] Update README.md
- PR: #17572
- Optimize EDM-fabric flow-control protocols
- PR: #17495
v0.56.0-rc34
Note
If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.
The changelog will now follow, showing the changes from last release.
This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/13401445732
📦 Uncategorized
- Implement JointAttention
- PR: #17079
- #17374: Add concurrency group for _produce-data.yaml
- PR: #17402
- Extra cleanup. With emphasis
- PR: #17466
- #17324: port Padding CPP Unit OP Tests to TTNN
- PR: #17354
- #17013: Update add_bw doc
- PR: #17363
- #0: Fix produce data crash when testcase is missing properties block
- PR: #17499
- #0: Update Llama3 PERF.md and llama3 vision PCC after cache regeneration
- PR: #17505
- #17483: Fix untilize with unpad for 1d tensors
- PR: #17484
- Add dprint/watcher support for BH second erisc
- PR: #17387
- Fix outdated call to
ttnn.open_device
in ttnn tutorial notebooks- PR: #17501
- Use three arguments for distro, version, architecture
- PR: #17502
- update deprecated API use
- PR: #17509
- Fix pgm dispatch sweep to use new args
- PR: #17511
- Fix nanogpt failure in tt-train after shape changes
- PR: #17515
- Fix init calls for binary sfpu pow and quant
- PR: #17267
- Aliu/quanta bringup
- PR: #17245
- Split ttnn pytests into more groups
- PR: #17512
- Remove MeshCQ's dependence on HardwareCommandQueue for loading SubDevices
- PR: #17478
- Fix two broken workflows by updating arguments
- PR: #17518
- Aliu/quanta bringup
- PR: #17517
- #17203: Remove set_worker_queue_mode method from
IDevice
- PR: #17108
- [skip ci] Update metal-api-surface.yaml
- PR: #17471
- Fix workflow call
- PR: #17521
- [skip-ci] Missed one
- PR: #17522
- #0: Fix T3000 unit tests and suppress chatty logs
- PR: #17520
- Use transaction id for eth receiver write to worker
- PR: #17369
- [skip ci] Fix name bug
- PR: #17527
- Update data_formats.md with mantissa rounding description
- PR: #17529
- #0: Use stream autoinc register to send word credits over ethernet.
- PR: #17504
- #17134: Fix SD resnet block ut parametrizations
- PR: #17532
- Add
.test_durations
for ttnn pytest splits- PR: #17533
- #13856: update gelu_bw op documentation on bfloat8_b limitations
- PR: #17407
- #17134: Enable test_transformer_2d_model UT in SD
- PR: #17534
- Add test for create_qkv_heads_decode on subcoregrids
- PR: #17507
- Always flatten convolution input tensors
- PR: #17463
- #0: Use kernel time instead of fw time for calculating device time
- PR: #17424
- Fix golden function for
ttnn.linear
so it can be used with comparison mode- PR: #17288
- [skip ci] update codeowners with matmul+ team changes
- PR: #17540
- #0: Fix NCRISC delay
- PR: #17526
- Change create_tt_tensor_from_py_data to use from_vector - fixed branch
- PR: #17389
- Revert "Change create_tt_tensor_from_py_data to use from_vector - fixed branch"
- PR: #17547
- #0: Add missing flush in send_chunk_from_address
- PR: #17545
- Pick up latest actions setup venv
- PR: #17553
- #16364: split prefetch dependent config
- PR: #17548
- #16339: let
DispatchMemMap
get created with customDispatchSettings
- PR: #17449
- #0: update CODEOWNERS based on eltwise team changes
- PR: #17552
- [skip ci] add links and standardize pull request checklist text
- PR: #17539
- [skip ci] Revert "disable-pr-gate"
- PR: #17561
- #0: Simplify MeshDevice construction by eradicating mesh_type
- PR: #17416
- #17541: Guard against passing 0 to counting zero intrinsics in program.cpp
- PR: #17549
- Revamp install_dependencies.sh
- PR: #17565
- Jjiang/fixed tensor creation and conversion
- PR: #17564
- Create Tutorial - Adding a Model
- PR: #16462
- [skip ci] Update README.md
- PR: #17571
- #17201: Implement IDevice tracing APIs for MeshDevice
- PR: #17465
- [skip ci] Update README.md
- PR: #17572
- Optimize EDM-fabric flow-control protocols
- PR: #17495