Update algorithm for budget deduction #28

bmcase · 2024-10-02T19:36:46Z

Update the algorithm for budget deduction to more closely follow the paper. Especially as it relates to using the sensitivity to compute the privacy loss.

bmcase · 2024-10-04T01:56:52Z

@martinthomson updated this PR to also only specify the L1-norm instead of p-norm.

bmcase · 2024-10-08T16:19:55Z

@martinthomson can we go ahead and merge this PR? I think this is probably good for the budget section for now -- keeping it aligned with the L1 case of the paper.

martinthomson

I think that we'll need something better integrated into the attribution logic for this to be workable.

martinthomson · 2024-10-09T02:56:04Z

api.bs

@@ -6,7 +6,8 @@ URL: https://private-attribution.github.io/api/
 Editor: Martin Thomson, w3cid 68503, Mozilla https://mozilla.org/, [email protected]
 Editor: Andy Leiserson, w3cid 147715, Mozilla https://mozilla.org/, [email protected]
 Editor: Benjamin Savage, w3cid 114877, Meta https://www.meta.com/, [email protected]
-Abstract: This specifies a browser API for the measurement of advertising performance.  The goal is to produce aggregate statistics about how advertising leads to conversions, without creating a risk to the privacy of individual web users.  This API collates information about people from multiple web origins, which could be a significant risk to their privacy.  To manage this risk, the information that is gathered is aggregated using an aggregation service that is trusted by the user-agent to perform aggregation within strict limits.  Noise is added to the aggregates produced by this service to provide differential privacy. Websites may select an aggregation service from the list of approved aggregation services provided by the user-agent.
+Editor: Benjamin Case, w3cid 128082, Meta https://www.meta.com/, [email protected]
+Abstract: This specifies a browser API for the measurement of advertising performance. The goal is to produce aggregate statistics about how advertising leads to conversions, without creating a risk to the privacy of individual web users. This API collates information about people from multiple web origins, which could be a significant risk to their privacy. To manage this risk, the information that is gathered is aggregated using an aggregation service that is trusted by the user-agent to perform aggregation within strict limits. Noise is added to the aggregates produced by this service to provide differential privacy. Websites may select an aggregation service from the list of approved aggregation services provided by the user-agent.


What diff tool are you using? You didn't change this line, but GitHub seems to think that you did...

martinthomson · 2024-10-09T02:56:59Z

api.bs

+When a conversion requests attribution the call includes several querier-provided
+parameters:


Suggested change

When a conversion requests attribution the call includes several querier-provided

parameters:

When a site requests attribution, they provide several parameters:

martinthomson · 2024-10-09T02:58:38Z

api.bs

-the impressions from that week are not used.
+When a conversion requests attribution the call includes several querier-provided
+parameters:
+1. the window of epochs to search for relevant events (`epochs` parameter);


Suggested change

1. the window of epochs to search for relevant events (`epochs` parameter);

1. the length of time over which to select impressions ({{PrivateAttributionConversionOptions/lookbackDays}});

martinthomson · 2024-10-09T02:59:06Z

api.bs

+When a conversion requests attribution the call includes several querier-provided
+parameters:
+1. the window of epochs to search for relevant events (`epochs` parameter);
+2. the requested privacy budget (`requested_epsilon`);


Suggested change

2. the requested privacy budget (`requested_epsilon`);

2. the requested [=privacy budget=] ({{PrivateAttributionConversionOptions/epsilon}});

martinthomson · 2024-10-09T02:59:31Z

api.bs

+parameters:
+1. the window of epochs to search for relevant events (`epochs` parameter);
+2. the requested privacy budget (`requested_epsilon`);
+3. the `filterData` value used for selecting relevant events;


Suggested change

3. the `filterData` value used for selecting relevant events;

3. the {{PrivateAttributionConversionOptions/filterData}} value used for selecting relevant events;

martinthomson · 2024-10-09T03:00:20Z

api.bs

+1. the window of epochs to search for relevant events (`epochs` parameter);
+2. the requested privacy budget (`requested_epsilon`);
+3. the `filterData` value used for selecting relevant events;
+4. the `PrivateAttributionLogic` such as last-touch or equal-credit;


Suggested change

4. the `PrivateAttributionLogic` such as last-touch or equal-credit;

4. the attribution {{PrivateAttributionConversionOptions/logic}} to use in selecting and attributing credit;

martinthomson · 2024-10-09T03:01:23Z

api.bs

+5. two sensitivity parameters: `report_global_sensitivity` which is a cap on how much attributed
+    value can come from this one conversion (e.g. the conversion value) and `query_global_sensitivity`
+    which is a maximum sensitivity across all reports to be processed the aggregation query.


This doesn't match the terminology that we've described. What we have defined has a value and a maxValue. It would help if you used the same words.

I don't regard the value as being sensitivity measure in that way, either. I view the sensitivity as a measure that applies to the entire query. That's a measure that is supplied when a batch of reports is sent to the aggregation service. Individual reports will have a maxValue that the browser will guarantee is at least as much as the actual contained value and no greater than the batch sensitivity. The concrete budget deduction will be no higher than both these measures.

martinthomson · 2024-10-09T03:04:40Z

api.bs

+    which is a maximum sensitivity across all reports to be processed the aggregation query.
+
+The algorithm to <dfn>deduct privacy budget</dfn> and compute the attributed histogram will first look across
+epochs for eligible impressions. It will deduct budget from any epoch with eligible


I'd rather not use the word "epoch" here if we're going to use "week" elsewhere.

martinthomson · 2024-10-09T03:12:11Z

api.bs

+Step 2: For each epoch compute the individual privacy loss of the query following Thm 4 of [[PPA-DP]]. There are three cases
+* Case 1: If the epoch has no relevant impressions the privacy loss is 0.
+* Case 2: If the window of epochs contains only a single epoch, the `individual_sensitivity` is the L1-norm of attribution function
+    applied to only the impressions in this epoch. The privacy loss deducted from the epoch's budget is
+    then `requested_epsilon * individual_sensitivity / query_global_sensitivity`.
+* Case 3: If multiple epochs are considered, the privacy loss deducted from the epoch's budget is
+    `requested_epsilon * report_global_sensitivity / query_global_sensitivity`


I think that it would be easier for this to be integrated into the attribution logic than have it be standalone.

Ideally, that means that you would execute the attribution logic and return a value that this function uses. Right now, I think that ends up being a list of tuples, each containing (impression, value, week). The part that I find a little difficult to parse here is that you need to return (impression, value=0, week) for every week that contains an impression in order to make this theorem work. That's non-intuitive to me and requires a better explanation than I was able to find.

Benjamin Case added 4 commits October 2, 2024 15:32

update algorithm for budget deduction

39bda30

checkpoint

2b96cfa

citation and markdown fixes

2a4fd23

line indents

978b16c

bmcase mentioned this pull request Oct 3, 2024

Flush out aggregation section #29

Open

bmcase and others added 2 commits October 3, 2024 20:08

Merge branch 'main' into idp_updates

c52fecd

only specify L1-norm, not p-norm

dc7d57d

fmt

212e7a9

martinthomson reviewed Oct 9, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update algorithm for budget deduction #28

Update algorithm for budget deduction #28

bmcase commented Oct 2, 2024

bmcase commented Oct 4, 2024

bmcase commented Oct 8, 2024

martinthomson left a comment

martinthomson Oct 9, 2024

martinthomson Oct 9, 2024

martinthomson Oct 9, 2024

martinthomson Oct 9, 2024

martinthomson Oct 9, 2024

martinthomson Oct 9, 2024

martinthomson Oct 9, 2024

martinthomson Oct 9, 2024

martinthomson Oct 9, 2024

		When a conversion requests attribution the call includes several querier-provided
		parameters:

	When a conversion requests attribution the call includes several querier-provided
	parameters:
	When a site requests attribution, they provide several parameters:

	1. the window of epochs to search for relevant events (`epochs` parameter);
	1. the length of time over which to select impressions ({{PrivateAttributionConversionOptions/lookbackDays}});

	2. the requested privacy budget (`requested_epsilon`);
	2. the requested [=privacy budget=] ({{PrivateAttributionConversionOptions/epsilon}});

	3. the `filterData` value used for selecting relevant events;
	3. the {{PrivateAttributionConversionOptions/filterData}} value used for selecting relevant events;

	4. the `PrivateAttributionLogic` such as last-touch or equal-credit;
	4. the attribution {{PrivateAttributionConversionOptions/logic}} to use in selecting and attributing credit;

Update algorithm for budget deduction #28

Are you sure you want to change the base?

Update algorithm for budget deduction #28

Conversation

bmcase commented Oct 2, 2024

bmcase commented Oct 4, 2024

bmcase commented Oct 8, 2024

martinthomson left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment