From 6d807c94ee4905edf82766abf49a569f735e5828 Mon Sep 17 00:00:00 2001
From: Nurbek Tastan <47779789+tnurbek@users.noreply.github.com>
Date: Tue, 14 May 2024 17:20:48 +0400
Subject: [PATCH] Add files via upload

---
 index.md | 245 +++++++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 245 insertions(+)
 create mode 100644 index.md
diff --git a/index.md b/index.md
new file mode 100644
index 0000000..4054998
--- /dev/null
+++ b/index.md
@@ -0,0 +1,245 @@
+
+<br>
+
+<p align="center">
+<iframe width="560" height="315" src="https://www.youtube.com/embed/<videoid>" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>
+</p>
+
+<br>
+
+
+![main figure](docs/ShapFed.png)
+<p align="justify"> **Overview of our proposed ShapFed algorithm:** Each participant $i$ transmits their locally computed iterates $w_i$ to the server. The server then, <span style="color: blue">(i)</span> computes class-specific Shapley values (CSSVs) using the last layer parameters (gradients) $\hat{w}$  <span style="color: blue">(ii)</span> aggregates the weights by employing normalized contribution assessment values $\tilde{\gamma}_i$ for each participant $i$, and <span style="color: blue">(iii)</span> broadcasts the personalized weights $\bar{w}_i$ to each participant, using their individual, not-normalized contribution values $\gamma_i$. 
+ </p>
+
+ 
+## Abstract
+<p align="justify">
+Federated learning (FL) has emerged as a pivotal approach in machine learning, enabling multiple participants to collaboratively train a global model without sharing raw data. While FL finds applications in various domains such as healthcare and finance, it is challenging to ensure global model convergence when participants do not contribute equally and/or honestly. To overcome this challenge, principled mechanisms are required to evaluate the contributions made by individual participants in the FL setting. Existing solutions for contribution assessment rely on general accuracy evaluation, often failing to capture nuanced dynamics and class-specific influences. This paper proposes a novel contribution assessment method called ShapFed for fine-grained evaluation of participant contributions in FL. Our approach uses Shapley values from cooperative game theory to provide a granular understanding of class-specific influences. Based on ShapFed, we introduce a weighted aggregation method called ShapFed-WA, which outperforms conventional federated averaging, especially in class-imbalanced scenarios. Personalizing participant updates based on their contributions further enhances collaborative fairness by delivering differentiated models commensurate with the participant contributions. Experiments on CIFAR-10, Chest X-Ray, and Fed-ISIC2019 datasets demonstrate the effectiveness of our approach in improving utility, efficiency, and fairness in FL systems. The code can be found at https://github.com/tnurbek/shapfed.</p>
+
+
+## ShapFed Main Algorithm
+![Pseudocode](docs/pseudo.png)
+
+<p align="justify">**Weighted Aggregation:** The optimal weights $w_s^{\star}$ are derived using Equation 7 from the paper, while $w_s$ represents the result of applying equal weights (FedAvg). **Personalization:** Rather than distributing a uniform global model to all users, we provide personalized weights $\bar{w}_i$, which are $\gamma_i$ combinations of individual user weights $w_i$ and the optimally aggregated weight $w_s^{\star}$.</p>
+
+![Weighted ShapFed and Personalized ShapFed](docs/alignment.png) 
+ 
+## Results: Contribution Assessment of Our Approach
+<p align="justify"> 1. Comparison of our proposed contribution assessment algorithm (CSSV) with CGSV and true Shapley value computations using ResNet-34 architecture on Chest X-Ray dataset. </p>
+
+![CSSV vs CGSV](docs/contribution_comparison2.png) 
+
+<p align="justify"> 2. Heatmap visualization of class-specific Shapley values for heterogeneous setting (explained in Section 5.2 in the paper) evaluated on CIFAR-10 dataset. </p>
+
+![Heatmap](docs/cifar10-skew.png)
+
+## Results: Weighted Aggregation
+<p align="justify">1. Comparing FedAvg and ShapFed-WA on CIFAR10 under an imbalanced split scenario: insights into the balanced accuracy of four individual participants.</p>
+
+![ShapFed vs FedAvg - CIFAR10](docs/imbalanced_cifar.png)
+
+<p align="justify">2. (**Left**) The balanced accuracy of our methods (ShapFed-WA & ShapFed) vs FedAvg. (**Right**) Per-participant accuracy using all methods evaluated on Fed-ISIC2019 dataset.</p>
+
+![ShapFed vs FedAvg - FEDISIC](docs/fedisic-plot.png)
+
+ 
+## Results: Personalization
+<table border="1" style="width:100%; border-collapse: collapse;">
+    <caption>1. Performance and fairness comparison with our method and FedAvg. We use Pearson's correlation (↑) as a fairness metric on CIFAR-10. The red highlight indicates a negative gain from collaboration.</caption>
+    <thead>
+        <tr style="background-color: lightblue;">
+            <th colspan="2">Dataset / Partition</th>
+            <th>Setting</th>
+            <th>P1</th>
+            <th>P2</th>
+            <th>P3</th>
+            <th>P4</th>
+            <th>P5</th>
+            <th>Corr.</th>
+        </tr>
+    </thead>
+    <tbody>
+        <tr>
+            <td rowspan="3">ChestXRay</td>
+            <td rowspan="3">Hetero.</td>
+            <td>Individual</td>
+            <td>50.0</td>
+            <td>64.7</td>
+            <td>62.0</td>
+            <td>53.7</td>
+            <td>50.0</td>
+            <td>---</td>
+        </tr>
+        <tr>
+            <td>FedAvg</td>
+            <td>50.0</td>
+            <td>55.8</td>
+            <td>61.9</td>
+            <td>54.2</td>
+            <td>50.0</td>
+            <td>0.82</td>
+        </tr>
+        <tr style="background-color: lightgreen;">
+            <td>ShapFed</td>
+            <td style="background-color: lightgreen;">50.0</td>
+            <td style="background-color: lightgreen;">65.2</td>
+            <td style="background-color: lightgreen;">69.5</td>
+            <td style="background-color: lightgreen;">58.5</td>
+            <td style="background-color: lightgreen;">50.0</td>
+            <td style="background-color: lightgreen; font-weight: bold;">0.93</td>
+        </tr>
+        <tr>
+            <td rowspan="8">CIFAR-10</td>
+            <td rowspan="4">Imb.</td>
+            <td>Individual</td>
+            <td>75.8</td>
+            <td>45.4</td>
+            <td>48.6</td>
+            <td>31.6</td>
+            <td>---</td>
+            <td>---</td>
+        </tr>
+        <tr>
+            <td>FedAvg</td>
+            <td style="background-color: lightred;">56.6</td>
+            <td>56.8</td>
+            <td>63.8</td>
+            <td>64.2</td>
+            <td>---</td>
+            <td>-0.60</td>
+        </tr>
+        <tr>
+            <td>CGSV</td>
+            <td style="background-color: lightred;">57.2</td>
+            <td>59.0</td>
+            <td>58.8</td>
+            <td>60.4</td>
+            <td>---</td>
+            <td>-0.98</td>
+        </tr>
+        <tr style="background-color: lightgreen;">
+            <td>ShapFed</td>
+            <td style="background-color: lightgreen;">81.4</td>
+            <td style="background-color: lightgreen;">78.2</td>
+            <td style="background-color: lightgreen;">71.8</td>
+            <td style="background-color: lightgreen;">73.6</td>
+            <td style="background-color: lightgreen;">---</td>
+            <td style="background-color: lightgreen; font-weight: bold;">0.74</td>
+        </tr>
+        <tr>
+            <td rowspan="4">Hetero.</td>
+            <td>Individual</td>
+            <td>75.2</td>
+            <td>68.8</td>
+            <td>66.8</td>
+            <td>69.0</td>
+            <td>---</td>
+            <td>---</td>
+        </tr>
+        <tr>
+            <td>FedAvg</td>
+            <td>74.6</td>
+            <td>70.2</td>
+            <td>70.2</td>
+            <td>76.0</td>
+            <td>---</td>
+            <td>0.53</td>
+        </tr>
+        <tr>
+            <td>CGSV</td>
+            <td style="background-color: lightred;">55.0</td>
+            <td style="background-color: lightred;">55.8</td>
+            <td style="background-color: lightred;">57.2</td>
+            <td style="background-color: lightred;">52.6</td>
+            <td>---</td>
+            <td>-0.26</td>
+        </tr>
+        <tr style="background-color: lightgreen;">
+            <td>ShapFed</td>
+            <td style="background-color: lightgreen;">79.8</td>
+            <td style="background-color: lightgreen;">75.4</td>
+            <td style="background-color: lightgreen;">69.0</td>
+            <td style="background-color: lightgreen;">75.0</td>
+            <td style="background-color: lightgreen;">---</td>
+            <td style="background-color: lightgreen; font-weight: bold;">0.90</td>
+        </tr>
+    </tbody>
+</table>
+
+
+<table border="1" style="width:87.5%; border-collapse: collapse; margin: auto;">
+    <caption>2. Performance and fairness comparison using Pearson's correlation (↑) as a fairness metric on Fed-ISIC2019. The red highlight indicates a negative gain from collaboration.</caption>
+    <thead>
+        <tr style="background-color: lightblue;">
+            <th>Setting</th>
+            <th>P1</th>
+            <th>P2</th>
+            <th>P3</th>
+            <th>P4</th>
+            <th>P5</th>
+            <th>P6</th>
+            <th>Corr.</th>
+        </tr>
+    </thead>
+    <tbody>
+        <tr>
+            <td>Individual</td>
+            <td>67.2</td>
+            <td>25.7</td>
+            <td>42.3</td>
+            <td>31.0</td>
+            <td>18.5</td>
+            <td>15.6</td>
+            <td>---</td>
+        </tr>
+        <tr>
+            <td>FedAvg</td>
+            <td style="background-color: lightred;">65.4</td>
+            <td>40.9</td>
+            <td>57.2</td>
+            <td>59.3</td>
+            <td>51.5</td>
+            <td>56.2</td>
+            <td>0.63</td>
+        </tr>
+        <tr style="background-color: lightgreen;">
+            <td>ShapFed-WA</td>
+            <td>69.3</td>
+            <td>44.3</td>
+            <td>65.0</td>
+            <td>63.1</td>
+            <td>54.8</td>
+            <td>61.2</td>
+            <td>0.62</td>
+        </tr>
+        <tr style="background-color: lightgreen;">
+            <td>ShapFed</td>
+            <td>68.5</td>
+            <td>44.4</td>
+            <td>61.9</td>
+            <td>60.4</td>
+            <td>40.6</td>
+            <td>53.2</td>
+            <td style="font-weight: bold;">0.84</td>
+        </tr>
+    </tbody>
+</table>
+
+
+## Summary
+
+<p align="justify">This work proposes Class-Specific Shapley Values (CSSVs) to quantify participant contributions at a granular level. The contributions of this work include a novel method to deepen the understanding of participant impact and improve fairness analysis. Evaluation against FedAvg shows superior performance and additional experiments reveal enhanced fairness by personalizing client updates based on contributions. Overall, the approach aims to achieve a more equitable distribution of benefits in FL. In future, we plan to conduct an in-depth theoretical analysis aimed at identifying the specific characteristics that contribute to an effective estimation of Shapley values. This analysis will enhance our understanding of the factors that influence the accuracy and reliability of Shapley value approximations. Furthermore, an investigation into what makes our approximation of cosine similarity from the last layer a robust indicator of contributions will be explored. </p>
+
+
+
+## BibTeX
+If you like our work, please consider citing us.
+```
+@article{Tastan2024ShapFed,
+    title={Redefining Contributions: Shapley-Driven Federated Learning},
+    author={Tastan, Nurbek and Fares, Samar and Aremu, Toluwani and Horvarth, Samuel and Nandakumar, Karthik},
+    journal={https://arxiv.org/abs/<id>},
+    year={2024}
+}
+```

Dataset / Partition		Setting	P1	P2	P3	P4	P5	Corr.
ChestXRay	Hetero.	Individual	50.0	64.7	62.0	53.7	50.0	---
		FedAvg	50.0	55.8	61.9	54.2	50.0	0.82
		ShapFed	50.0	65.2	69.5	58.5	50.0	0.93
CIFAR-10	Imb.	Individual	75.8	45.4	48.6	31.6	---	---
		FedAvg	56.6	56.8	63.8	64.2	---	-0.60
		CGSV	57.2	59.0	58.8	60.4	---	-0.98
		ShapFed	81.4	78.2	71.8	73.6	---	0.74
	Hetero.	Individual	75.2	68.8	66.8	69.0	---	---
		FedAvg	74.6	70.2	70.2	76.0	---	0.53
		CGSV	55.0	55.8	57.2	52.6	---	-0.26
		ShapFed	79.8	75.4	69.0	75.0	---	0.90
Setting	P1	P2	P3	P4	P5	P6	Corr.
Individual	67.2	25.7	42.3	31.0	18.5	15.6	---
FedAvg	65.4	40.9	57.2	59.3	51.5	56.2	0.63
ShapFed-WA	69.3	44.3	65.0	63.1	54.8	61.2	0.62
ShapFed	68.5	44.4	61.9	60.4	40.6	53.2	0.84