-
Notifications
You must be signed in to change notification settings - Fork 42
Pull requests: nod-ai/shark-ai
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add --use-attention-mask for tp8 as workaround for sharded compilation
#1188
opened Mar 28, 2025 by
aviator19941
Loading…
[sharktank][kernels] restores unmasked attn functionality
#1187
opened Mar 28, 2025 by
dan-garvey
Loading…
Make move_shards_to_new_devices help function more useful
#1185
opened Mar 27, 2025 by
Alex-Vasile
Loading…
[tuner] expose the function merge_td_specs as utility executable
#1184
opened Mar 27, 2025 by
bangtianliu
Loading…
[sharktank] Perplexity refactor + add features
#1172
opened Mar 27, 2025 by
archana-ramalingam
Loading…
[tuner] add the option of providing starter td spec
#1171
opened Mar 27, 2025 by
bangtianliu
Loading…
[shortfin] Fix shortfin broken build due to iree changes
#1170
opened Mar 27, 2025 by
zphoenixrises
•
Draft
Bump IREE requirement pins to 3.4.0rc20250325 and remove dependencies on
--iree-hal-target-backends
#1145
opened Mar 25, 2025 by
shark-pr-automator
bot
Loading…
(shortfin-apps)(flux) Align flux response handling with shark-ui compatibility requirements
#1128
opened Mar 20, 2025 by
monorimet
Loading…
Bump IREE requirement pins to 3.3.0rc20250320
#1126
opened Mar 20, 2025 by
shark-pr-automator
bot
Loading…
Correcting usage of Something isn't working
--iree-hal-target-device=llvm-cpu
.
bug
#1122
opened Mar 19, 2025 by
benvanik
Loading…
Add attn dtype to config json and sfnp float8 support
#1118
opened Mar 19, 2025 by
aviator19941
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.