Add support for Baichuan 13b model #512
Conversation
A single GPU? With 24 GB like a 4090, it is not possible to load the model on a single GPU in fp16 (13B parameters at 2 bytes each is roughly 26 GB of weights alone).
A100s do have 40 GB.
/usr/bin/python3: Error while finding module specification for 'vllm.entrypoints.openai.api_server' (ImportError: cannot import name 'activation_ops' from partially initialized module 'vllm' (most likely due to a circular import) (.local/lib/python3.8/site-packages/vllm/__init__.py))
I solved this problem; you only need to run pip install vllm again.
python3 benchmarks/benchmark_throughput.py --dataset benchmarks/ShareGPT_V3_unfiltered_cleaned_split.json --backend hf --hf-max-batch-size 4 --model .//baichuan-inc--Baichuan-13B-Chat --trust-remote-code
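For comparison, the same script can benchmark the vLLM backend (a sketch; --backend vllm is the script's other supported backend, and the model path here just mirrors the one above):

python3 benchmarks/benchmark_throughput.py --dataset benchmarks/ShareGPT_V3_unfiltered_cleaned_split.json --backend vllm --model .//baichuan-inc--Baichuan-13B-Chat --trust-remote-code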
Closing this PR since #643 is a simpler solution. Please check out the latest main branch to test Baichuan-13B! Again, thanks @ericzhou571 for the great work!
Have you solved it? I also encountered this problem.
This pull request introduces a server that can be initiated using the following command:
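The exact flags may vary; a plausible invocation, assuming the OpenAI-compatible entrypoint mentioned elsewhere in this thread and the Hugging Face model ID for Baichuan-13B-Chat, would be:

python3 -m vllm.entrypoints.openai.api_server --model baichuan-inc/Baichuan-13B-Chat --trust-remote-code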
We've conducted tests of baichuan-13b-chat inference on a single GPU. With the temperature set to 0 and identical prompts, the outputs were consistent with those of the same model deployed using a standard FastChat worker.
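A minimal sketch of such a determinism check using vLLM's Python API (the model ID and prompt here are illustrative placeholders, not taken from the PR):

```python
from vllm import LLM, SamplingParams

# Greedy decoding: temperature=0 removes sampling randomness.
params = SamplingParams(temperature=0, max_tokens=128)

# trust_remote_code is needed because Baichuan ships custom modeling code.
llm = LLM(model="baichuan-inc/Baichuan-13B-Chat", trust_remote_code=True)

prompts = ["Briefly introduce the city of Beijing."]  # placeholder prompt
first = [out.outputs[0].text for out in llm.generate(prompts, params)]
second = [out.outputs[0].text for out in llm.generate(prompts, params)]

# Identical prompts at temperature 0 should yield identical outputs.
assert first == second
```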
However, please be aware of the following limitations:
- Our code is currently only compatible with non-distributed deployments, i.e., setups with a single GPU serving a single model.
- While the code does run under distributed deployment with tensor parallelism (see the sketch below this list), the results it produces are not yet accurate. We are actively looking for community help to rectify this issue.

Any contributions to improving this implementation would be greatly appreciated.
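For reference, a tensor-parallel launch that exhibits the accuracy issue above would look something like this (a sketch using vLLM's standard --tensor-parallel-size flag; two GPUs are assumed purely for illustration):

python3 -m vllm.entrypoints.openai.api_server --model baichuan-inc/Baichuan-13B-Chat --trust-remote-code --tensor-parallel-size 2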