[FX] Support weight quantization for operations where `weight_port_id` != 1 #3334

siddhant-0707 · 2025-03-08T02:18:54Z

Changes

Updated the FX backend’s _get_input_scale_shape to use the FX insertion point shape and, when available, the actual weight tensor’s shape to compute the per‑channel scale shape.
Adjusted statistics collector in _get_stat_collector so that the reduction and aggregation axes are derived using the same channel axes as used for scale shape computation.

Related tickets

Issue #3206

Tests

All tests run successfully

nncf/quantization/algorithms/min_max/algorithm.py

nncf/quantization/algorithms/min_max/torch_fx_backend.py

anzr299 · 2025-03-11T05:52:15Z

nncf/quantization/algorithms/min_max/algorithm.py

+            if is_weight:
+                channel_axes = self._backend_entity.get_weight_quantization_axes(node, target_point, len(shape))
+            else:
+                channel_axes = (1,)


This is not required, it can be reverted back to the old code.

anzr299 · 2025-03-11T05:52:32Z

nncf/quantization/algorithms/min_max/algorithm.py


-        # Weight statistics is constant, so only one collection is enough.
+        range_estimator_params = self._get_range_estimator_parameters(target_point, qconfig)


This is also unnecessary

anzr299 · 2025-03-11T07:24:14Z

nncf/quantization/algorithms/min_max/torch_fx_backend.py

-        )
+        channel_idx = channel_axes[0] if channel_axes else 0
+
+        if is_weights and not channel_axes:


Suggested change

if is_weights and not channel_axes:

if not len(channel_axes):

to cover the case of vector weights which are being quantized per channel

anzr299 · 2025-03-11T07:26:01Z

nncf/quantization/algorithms/min_max/torch_fx_backend.py

-        scale_shape = tuple(
-            get_scale_shape(input_shape, is_weights=is_weights, per_channel=per_channel, channel_idx=channel_idx)
-        )
+        channel_idx = channel_axes[0] if channel_axes else 0


Suggested change

channel_idx = channel_axes[0] if channel_axes else 0

Since channel axes is already being checked and handled in the if-else block below. channel_axes[0] can directly be passed to channel_idx parameter of get_scale_shape

siddhant-0707 added 2 commits March 7, 2025 20:59

Support weight channel axes

b8203a5

Change minmax algo to support channel axes for ConvTranspose

474e6b7

siddhant-0707 requested a review from a team as a code owner March 8, 2025 02:18

github-actions bot added the NNCF PTQ Pull requests that updates NNCF PTQ label Mar 8, 2025

siddhant-0707 added 3 commits March 7, 2025 21:36

add comment back algorithm.py

7319f9f

add comment algorithm.py

3308b40

refactor parameter names in

0ee4f8b

alexsu52 requested a review from anzr299 March 10, 2025 07:41

alexsu52 self-assigned this Mar 10, 2025

anzr299 requested changes Mar 10, 2025

View reviewed changes

nncf/quantization/algorithms/min_max/algorithm.py Outdated Show resolved Hide resolved

nncf/quantization/algorithms/min_max/algorithm.py Outdated Show resolved Hide resolved

nncf/quantization/algorithms/min_max/torch_fx_backend.py Outdated Show resolved Hide resolved

siddhant-0707 added 2 commits March 10, 2025 19:01

use torch weight_channel_axes in torchfx

74f677a

streamline channel axes handling

3565ba3

siddhant-0707 requested a review from anzr299 March 10, 2025 23:06

anzr299 requested changes Mar 11, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FX] Support weight quantization for operations where `weight_port_id` != 1 #3334

[FX] Support weight quantization for operations where `weight_port_id` != 1 #3334

siddhant-0707 commented Mar 8, 2025 •

edited

Loading

anzr299 Mar 11, 2025

anzr299 Mar 11, 2025

anzr299 Mar 11, 2025

anzr299 Mar 11, 2025


		# Weight statistics is constant, so only one collection is enough.
		range_estimator_params = self._get_range_estimator_parameters(target_point, qconfig)

	if is_weights and not channel_axes:
	if not len(channel_axes):

[FX] Support weight quantization for operations where weight_port_id != 1 #3334

Are you sure you want to change the base?

[FX] Support weight quantization for operations where weight_port_id != 1 #3334

Conversation

siddhant-0707 commented Mar 8, 2025 • edited Loading

Changes

Related tickets

Tests

anzr299 Mar 11, 2025

Choose a reason for hiding this comment

anzr299 Mar 11, 2025

Choose a reason for hiding this comment

anzr299 Mar 11, 2025

Choose a reason for hiding this comment

anzr299 Mar 11, 2025

Choose a reason for hiding this comment

[FX] Support weight quantization for operations where `weight_port_id` != 1 #3334

[FX] Support weight quantization for operations where `weight_port_id` != 1 #3334

siddhant-0707 commented Mar 8, 2025 •

edited

Loading