Skip to content

Commit cfd4804

Browse files
mengdilinfacebook-github-bot
authored andcommitted
write distributed_kmeans centroids and assignments to hive tables (facebookresearch#4017)
Summary: Pull Request resolved: facebookresearch#4017 Exposing an option to run kmeans centroids and assignments to hive table which should bring us close in parity with Digraph's Kmeans API. This is needed for cluster balance data quality checks for large scale centroids Reviewed By: kuarora Differential Revision: D64835789 fbshipit-source-id: 95cbea00bb6b4733c03836049bc379be813bf9e5
1 parent a11c1db commit cfd4804

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

benchs/bench_fw/descriptors.py

+2
Original file line numberDiff line numberDiff line change
@@ -83,6 +83,8 @@ class DatasetDescriptor:
8383

8484
embedding_column: Optional[str] = None
8585

86+
embedding_id_column: Optional[str] = None
87+
8688
sampling_rate: Optional[float] = None
8789

8890
# sampling column for xdb

0 commit comments

Comments
 (0)