Support int8 KVCacheQuant and W8A8 inference in vllm #1112

Closed
AniZpZ wants to merge 52 commits into vllm-project:main from AniZpZ:kv-quant-merge

Commits

Commits on Sep 20, 2023

Commits on Sep 21, 2023

Commits on Sep 22, 2023

Commits on Sep 26, 2023

Commits on Sep 27, 2023

Commits on Sep 28, 2023

Commits on Oct 9, 2023

Commits on Oct 16, 2023

Commits on Oct 17, 2023

Commits on Oct 18, 2023

Commits on Oct 19, 2023

Commits on Oct 24, 2023

Commits on Oct 26, 2023