Here you can find benchmarking scripts for large language model (LLM) text generation. These scripts:
- Support the Llama, GPT-J, Qwen, OPT, and Bloom model families, as well as other models such as ChatGLMv3-6B, Baichuan2-13B, and Phi3-mini.
- Include both single-instance and distributed (DeepSpeed) use cases for FP16 optimization.
- Cover model generation inference in low-precision modes (FP16 AMP and weight-only quantization) for the best performance and accuracy; a minimal sketch follows this list.
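The sketch below illustrates the FP16 single-instance generation path that these scripts automate. It is an orientation example only, not a replacement for the provided scripts; the model ID and the `ipex.llm.optimize(..., device="xpu")` call are assumptions based on recent Intel® Extension for PyTorch* releases, so check the inference scripts for the exact invocation.

```python
# Minimal FP16 generation sketch (assumed API usage; see the provided scripts).
import torch
import intel_extension_for_pytorch as ipex
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-2-7b-hf"  # any of the supported model families
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16)
model = model.eval().to("xpu")

# Apply LLM-specific FP16 optimizations for the XPU device (assumed API).
model = ipex.llm.optimize(model, dtype=torch.float16, device="xpu", inplace=True)

inputs = tokenizer("What is AI?", return_tensors="pt").to("xpu")
with torch.no_grad(), torch.xpu.amp.autocast(enabled=True, dtype=torch.float16):
    output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```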
Docker-based environment setup with compilation from source:

```bash
# Get the Intel® Extension for PyTorch* source code
git clone https://github.com/intel/intel-extension-for-pytorch.git
cd intel-extension-for-pytorch
git checkout xpu-main
git submodule sync
git submodule update --init --recursive

# Build an image with the provided Dockerfile by compiling Intel® Extension for PyTorch* from source
docker build -f examples/gpu/llm/Dockerfile --build-arg COMPILE=ON -t ipex-llm:xpu-main .

# Run the container with the command below
docker run -it --rm --privileged -v /dev/dri/by-path:/dev/dri/by-path ipex-llm:xpu-main bash

# When the command prompt appears inside the docker container, enter the llm examples directory
cd llm

# Activate environment variables
source ./tools/env_activate.sh [inference|fine-tuning]
```
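Optionally, once inside the container, the standard sanity-check snippet from the Intel® Extension for PyTorch* documentation can confirm that the XPU build loads and the GPUs are visible. This is a suggestion, not a required step:

```python
# Optional sanity check: verify the XPU build and list visible GPU devices.
import torch
import intel_extension_for_pytorch as ipex

print(torch.__version__)
print(ipex.__version__)
for i in range(torch.xpu.device_count()):
    print(f"[{i}]: {torch.xpu.get_device_properties(i)}")
```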
For a conda-based environment setup (also compiling Intel® Extension for PyTorch* from source), make sure the GPU driver and the Intel® oneAPI Base Toolkit are installed first. Refer to the Installation Guide.
```bash
# Get the Intel® Extension for PyTorch* source code
git clone https://github.com/intel/intel-extension-for-pytorch.git
cd intel-extension-for-pytorch
git checkout xpu-main
git submodule sync
git submodule update --init --recursive

# Make sure GCC >= 11 is installed on your system.
# Create a conda environment
conda create -n llm python=3.10 -y
conda activate llm

# Set up the environment with the provided script
cd examples/gpu/llm
# If you want to install Intel® Extension for PyTorch* from source, use the commands below:
# e.g. bash ./tools/env_setup.sh 3 /opt/intel/oneapi/compiler/latest /opt/intel/oneapi/mkl/latest /opt/intel/oneapi/ccl/latest /opt/intel/oneapi/mpi/latest /opt/intel/oneapi/pti/latest pvc
bash ./tools/env_setup.sh 3 <DPCPP_ROOT> <ONEMKL_ROOT> <ONECCL_ROOT> <MPI_ROOT> <PTI_ROOT> <AOT>

conda deactivate
conda activate llm
export OCL_ICD_VENDORS=/etc/OpenCL/vendors
source ./tools/env_activate.sh [inference|fine-tuning]
```
where `AOT` is a text string that enables Ahead-Of-Time compilation for specific GPU models. For example, `pvc,ats-m150` covers the Intel® Data Center GPU Max Series, Intel® Data Center GPU Flex Series, and Intel® Arc™ A-Series Graphics (A770). Check the tutorial for details.
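If you are unsure which `AOT` string matches your hardware, one option is to print the device names from an environment where Intel® Extension for PyTorch* is already installed (for example, the Docker image above) and map them to the AOT targets listed in the tutorial. This helper is a hedged suggestion; `torch.xpu.get_device_name` is assumed to be available as in other XPU builds:

```python
# Hedged helper: print visible XPU device names to help pick an AOT target.
# Requires an existing PyTorch + Intel® Extension for PyTorch* XPU install.
import torch
import intel_extension_for_pytorch as ipex  # noqa: F401 -- registers the XPU backend

for i in range(torch.xpu.device_count()):
    print(i, torch.xpu.get_device_name(i))
```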
Inference and fine-tuning examples are provided in separate directories.
For inference example scripts, visit the inference directory.
For fine-tuning example scripts, visit the fine-tuning directory.