Tuning for multiple columns part 2: Find candidate parameters for multiple aggregation #524

dvadym · 2024-08-30T16:52:48Z

This PR adds candidate-parameter search to utility-analysis tuning for the case where utility analysis is computed for multiple metrics and SUM can be computed over multiple columns. It covers DP aggregations that can be written in pseudo-SQL as

SELECT partition_key, DP_COUNT(), DP_SUM(column1), DP_SUM(column2)
GROUP BY partition_key
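
For reference, the shape of that query's output can be sketched in plain Python. This is only a non-private illustration of the grouping (no noise, no contribution bounding, no partition selection), and the helper name `aggregate_per_partition` and the result keys are hypothetical, not part of this PR:

```python
from collections import defaultdict


def aggregate_per_partition(rows):
    """Non-private illustration: COUNT and two SUMs per partition key.

    rows: iterable of (partition_key, column1, column2) tuples.
    Hypothetical helper; a DP version would also bound contributions
    and add noise to every output value.
    """
    result = defaultdict(lambda: {"count": 0, "sum1": 0.0, "sum2": 0.0})
    for key, c1, c2 in rows:
        acc = result[key]
        acc["count"] += 1
        acc["sum1"] += c1
        acc["sum2"] += c2
    return dict(result)


example = aggregate_per_partition([("a", 1, 2), ("a", 3, 4), ("b", 5, 6)])
```

Each partition yields one count and one sum per column, which is why tuning has to produce a candidate bound for every summed column rather than a single one.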

miracvbasaran · 2024-09-10T16:13:29Z

analysis/dp_strategy_selector.py

            # This is Select partitions case.
            return self._get_strategy_for_select_partition(sensitivities.l0)
+
+        n_metrics = len(self._metrics)
+        # Having n metrics is equivalent of multiplying of contributing for


nit: the equivalent of... or equivalent to

miracvbasaran · 2024-09-10T16:16:21Z

analysis/parameter_tuning.py

-                               num=max_candidates)).astype(int)
+    # In order to ensure that max_sum_per_partition > 0, let us skip 0-th
+    # bin if max = 0.
+    # TODO(dvadym): better algorithm for finding candidates.


Do you have some ideas here already?

Yes, I have. There is work in this direction.

dvadym

Thanks!
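
For readers following this thread, the idea in the code comment (skip the 0-th bin when its maximum is 0, so that every candidate for max_sum_per_partition is positive) might look roughly like the sketch below. It is not the code from this PR: the function name `pick_candidates`, its `bin_maxes` input, and the even spacing over bins are all assumptions made for illustration.

```python
def pick_candidates(bin_maxes, max_candidates):
    """Hypothetical sketch: choose up to max_candidates positive bin maxima.

    bin_maxes: per-bin maximum values of a histogram, in increasing order
    (an assumed input format, not the library's histogram type).
    """
    maxes = list(bin_maxes)
    # Skip the 0-th bin if its max is 0, so that no candidate equals 0.
    if maxes and maxes[0] == 0:
        maxes = maxes[1:]
    if max_candidates <= 0:
        return []
    n = len(maxes)
    if n <= max_candidates:
        return maxes
    if max_candidates == 1:
        return [maxes[-1]]
    # Spread indices evenly over the remaining bins using integer
    # arithmetic, which always includes the first and the last bin.
    indices = sorted({i * (n - 1) // (max_candidates - 1)
                      for i in range(max_candidates)})
    return [maxes[i] for i in indices]


candidates = pick_candidates([0, 1, 5, 10, 50], max_candidates=3)
```

With the input above, the zero bin is dropped and three of the four remaining maxima are kept. Even spacing is only one possible choice; the TODO in the diff indicates that a better candidate-selection algorithm is planned.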

dvadym · 2024-09-10T17:17:11Z

analysis/parameter_tuning.py

-                               num=max_candidates)).astype(int)
+    # In order to ensure that max_sum_per_partition > 0, let us skip 0-th
+    # bin if max = 0.
+    # TODO(dvadym): better algorithm for finding candidates.


Yes, I have. There is work in this direction.

dvadym · 2024-09-10T17:19:35Z

analysis/dp_strategy_selector.py

            # This is Select partitions case.
            return self._get_strategy_for_select_partition(sensitivities.l0)
+
+        n_metrics = len(self._metrics)
+        # Having n metrics is equivalent of multiplying of contributing for


dvadym added 5 commits August 30, 2024 18:52

Find candidate parameters for multiple aggregation

364aaca

test

536810c

fixes

c736de7

Fix tests

1af1c22

test added

39d0f3a

dvadym changed the title ~~(WIP) Find candidate parameters for multiple aggregation~~ Tuning for multiple columns part 2: Find candidate parameters for multiple aggregation Sep 4, 2024

dvadym requested a review from miracvbasaran September 4, 2024 14:31

dvadym added 2 commits September 5, 2024 13:41

Comments

809d597

merge

1581b8e

miracvbasaran approved these changes Sep 10, 2024

Addressed comments

387b65a

dvadym commented Sep 10, 2024

dvadym merged commit 916bd8e into OpenMined:main Sep 10, 2024
6 of 14 checks passed
