Strip for LoRA modules #3331
base: develop
Conversation
tests/torch/ptq/test_fq_lora.py (outdated)
    ),
    ids=["asym", "sym"],
)
def test_fq_lora_tuning(mode, backup_mode, compression_kwargs, _seed):
Suggest extending this test by calling strip, exporting to OV, and checking similarity for the OV model (you can pre-compute the similarity for the "stripped to float" model or compute it in the test).
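A rough sketch of what that extension could look like; the helper name, example input, and the 0.99 threshold are illustrative assumptions, not part of the PR:

import numpy as np
import openvino as ov
import torch

import nncf


def _cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    a, b = a.flatten(), b.flatten()
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))


def check_stripped_ov_similarity(tuned_model: torch.nn.Module, example_input: torch.Tensor) -> None:
    # Reference output from the tuned Torch model with FQ + LoRA still in place.
    with torch.no_grad():
        ref_output = tuned_model(example_input).numpy()

    # Strip NNCF auxiliary modules and export the plain model to OpenVINO.
    stripped_model = nncf.strip(tuned_model)
    ov_model = ov.convert_model(stripped_model, example_input=example_input)
    compiled = ov.compile_model(ov_model, device_name="CPU")
    ov_output = compiled(example_input.numpy())[0]

    # Threshold is a placeholder; pre-compute it for the "stripped to float" model.
    assert _cosine_similarity(ref_output, ov_output) > 0.99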
nncf/torch/quantization/strip.py (outdated)
    result_dtype=original_dtype,
)

elif isinstance(quantizer, SymmetricLoraQuantizer):
Ordinary FQ should also be supported here, since some layers can be selected for INT8 (first/last or by mixed precision), and those will be represented by ordinary FQ without LoRA.
You can check the number of u8/u4 constants after export to OV.
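A rough sketch of that check, walking the exported OV graph (the helper name is an assumption):

import openvino as ov


def count_low_precision_constants(ov_model: ov.Model) -> dict[str, int]:
    # Count u8/u4 Constant nodes to see which weights ended up INT8 vs INT4.
    counts = {"u8": 0, "u4": 0}
    for op in ov_model.get_ops():
        if op.get_type_name() != "Constant":
            continue
        if op.get_element_type() == ov.Type.u8:
            counts["u8"] += 1
        elif op.get_element_type() == ov.Type.u4:
            counts["u4"] += 1
    return counts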
nncf/torch/quantization/strip.py (outdated)
original_shape = original_weight.shape
original_eps = torch.finfo(original_dtype).eps

# Quantize-dequantize using universal quantization formula
I'd reference the markdown that defines this "universal" formula; otherwise it may not be clear what you mean here.
It would be helpful to include a note explaining why the weights are not directly quantized. Please mention that this approach is necessary to prevent floating-point errors that can occur due to the different order of operations during quantization when using Torch for tuning and OpenVINO (OV) for inference.
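For context, a generic fake-quantize (quantize-dequantize) step looks roughly like the sketch below; this is a sketch of the general formula, not a copy of the code in strip.py, and the parameter names are assumptions:

import torch


def quant_dequant(weight: torch.Tensor, scale: torch.Tensor, zero_point: torch.Tensor,
                  level_low: int, level_high: int) -> torch.Tensor:
    # Emulate integer quantization followed by dequantization so that the stored
    # float weight matches what the inference runtime would reconstruct, keeping
    # the order of operations consistent between Torch tuning and OV inference.
    q = torch.clamp(torch.round(weight / scale + zero_point), level_low, level_high)
    return (q - zero_point) * scale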
Force-pushed from 00ba62e to ba00566.
Changes
Reason for changes
Related tickets
Tests
On top of #3322