Companion repository for the paper The Map Equation Goes Neural: Mapping Network Flows with Graph Neural Networks by Christopher Blöcker, Chester Tan, and Ingo Scholtes, to appear at NeurIPS 2024. The paper adapts the map equation, an information-theoretic objective function for community detection, as a loss function for detecting communities with (graph) neural networks.
The map equation is an information-theoretic objective function for community detection in complex networks. It adopts a compression perspective and quantifies how efficiently a random walk on a network can be described based on the given communities. The communities that lead to the best compression are considered to describe the network's organisational structure well. The map equation's basic principles are described in the paper; more detailed explanations can be found in Community Detection with the Map Equation and Infomap: Theory and Applications.
The basic principle goes like this: (1) split the network into communities; (2) assign codewords to the nodes that are unique within each community, for example using Huffman codes; (3) compute how many bits are required per step that a random walker takes on the network, the so-called codelength. The lower the codelength, the better the partition from step (1). In practice, no codewords or random walks are used; instead, the map equation computes the codelength analytically.
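To make step (3) concrete, here is a minimal sketch, not the repository's code, of how the two-level codelength can be computed analytically for a hard partition. The function names and the toy network are illustrative, and the flow matrix is assumed to be the normalised adjacency matrix of an undirected network:

```python
import torch

def plogp(x):
    # x * log2(x) with the convention 0 * log2(0) = 0
    return x * torch.nan_to_num(torch.log2(x), nan = 0.0, neginf = 0.0)

def two_level_codelength(F, s):
    # F: node-level flow matrix whose entries sum to one
    # s: one-hot (hard) cluster assignment matrix
    p = F.sum(dim = 1)                  # node visit rates
    C = s.T @ F @ s                     # module-level flow matrix
    q_exit = C.sum(dim = 1) - C.diag()  # per-module flow leaving each module
    q_enter = C.sum(dim = 0) - C.diag() # per-module flow entering each module
    q = q_exit.sum()                    # total flow between modules
    p_m = q_exit + C.sum(dim = 0)       # module codebook usage rates
    return plogp(q) - plogp(q_enter).sum() - plogp(q_exit).sum() \
        - plogp(p).sum() + plogp(p_m).sum()

# toy example: two triangles connected by a single edge
A = torch.tensor([[0., 1., 1., 0., 0., 0.],
                  [1., 0., 1., 1., 0., 0.],
                  [1., 1., 0., 0., 0., 0.],
                  [0., 1., 0., 0., 1., 1.],
                  [0., 0., 0., 1., 0., 1.],
                  [0., 0., 0., 1., 1., 0.]])
F = A / A.sum()                        # flow for an undirected network
s = torch.eye(2)[[0, 0, 0, 1, 1, 1]]   # partition into the two triangles
print(two_level_codelength(F, s))      # codelength in bits per step
```

For this toy network, splitting it into the two triangles yields a lower codelength than keeping all nodes in a single module, which is exactly the signal that the map equation rewards.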
"Neuromap" refers to using the map equation as a loss function in combination with (graph) neural networks to learn (soft) cluster assignmants (communities). The figure below shows the general setup, where the input consists of a graph, and the output is a soft cluster assignment matrix, optimised in an unsupervised fashion by minimising the map equation through gradient descent.
The core component is the (two-level) map equation, which can be expressed as follows in Python, using PyTorch:
```python
import torch
from torch import Tensor

# mkSmartTeleportationFlow is provided by this repository; it returns the
# smart-teleportation flow matrix F and the node visit rates p for the given adjacency matrix.

class MapEquationPooling(torch.nn.Module):
    def __init__(self, adj: Tensor, device: str = "cpu", *args, **kwargs) -> None:
        super().__init__(*args, **kwargs)
        self.adj = adj
        self.F, self.p = mkSmartTeleportationFlow(self.adj, device = device)
        # this term is constant, so only calculate it once
        self.p_log_p = torch.sum(self.p * torch.nan_to_num(torch.log2(self.p), nan = 0.0))

    def forward(self, x, s):
        # module-level flow matrix induced by the (soft) assignment matrix s
        C = s.T @ self.F @ s
        diag_C = torch.diag(C)
        # flows between modules: q is the total, q_m and m_exit the per-module
        # row-wise and column-wise contributions
        q = 1.0 - torch.trace(C)
        q_m = torch.sum(C, dim = 1) - diag_C
        m_exit = torch.sum(C, dim = 0) - diag_C
        # per-module codebook usage rates
        p_m = q_m + torch.sum(C, dim = 0)
        # two-level map equation codelength
        codelength = torch.sum(q * torch.nan_to_num(torch.log2(q), nan = 0.0)) \
                   - torch.sum(q_m * torch.nan_to_num(torch.log2(q_m), nan = 0.0)) \
                   - torch.sum(m_exit * torch.nan_to_num(torch.log2(m_exit), nan = 0.0)) \
                   - self.p_log_p \
                   + torch.sum(p_m * torch.nan_to_num(torch.log2(p_m), nan = 0.0))

        x_pooled = torch.matmul(s.T, x)
        adj_pooled = s.T @ self.adj @ s

        return x_pooled, adj_pooled, codelength
```
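To illustrate the setup described above, here is a minimal, hypothetical training-loop sketch rather than the experiment code from this repository. It assumes a PyG `Data` object `data`, its dense adjacency matrix `adj`, and the `MapEquationPooling` module above; the GNN architecture and hyperparameters are placeholders:

```python
import torch
from torch_geometric.nn import GCNConv

class SoftAssignmentGNN(torch.nn.Module):
    # hypothetical two-layer GCN that maps node features to soft cluster assignments
    def __init__(self, num_features: int, hidden_dim: int, num_clusters: int):
        super().__init__()
        self.conv1 = GCNConv(num_features, hidden_dim)
        self.conv2 = GCNConv(hidden_dim, num_clusters)

    def forward(self, x, edge_index):
        h = torch.relu(self.conv1(x, edge_index))
        return torch.softmax(self.conv2(h, edge_index), dim = -1)

# assuming a PyG Data object `data` and its dense adjacency matrix `adj`
model = SoftAssignmentGNN(data.num_features, hidden_dim = 64, num_clusters = 16)
pooling = MapEquationPooling(adj)
optimiser = torch.optim.Adam(model.parameters(), lr = 1e-3)

for epoch in range(500):
    optimiser.zero_grad()
    s = model(data.x, data.edge_index)     # soft cluster assignment matrix
    _, _, codelength = pooling(data.x, s)  # the map equation is the loss
    codelength.backward()
    optimiser.step()

with torch.no_grad():
    communities = model(data.x, data.edge_index).argmax(dim = -1)
```

The pooled features and adjacency returned by `MapEquationPooling` are not used in this sketch; only the codelength acts as the loss, and taking the argmax over the soft assignments after training yields a hard partition.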
The self-contained `example-usage.py` notebook provides a minimal example as a starting point.
To run the code in this repository, you will need a few libraries. Since the exact setup varies depending on the available hardware (for example, whether a GPU is available), please refer to the documentation of the respective libraries:
- PyTorch
- PyG
- PyTorch Geometric Signed Directed
- Open Graph Benchmark (OGB)
- The implementation of NOCD, a baseline that is used in the experiments
- Infomap as a baseline, available as the Python package `infomap` (see the short sketch further below)
- Some "standard" Python libraries: `matplotlib`, `seaborn`, `scikit-learn`, `networkx`
- Some synthetic LFR networks that are available here (generated with code available here)
- The results of our experiments can be found here
Or perhaps you are just interested in looking at the plots of the results, which are included in the notebooks.
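For reference, the Infomap baseline can be run directly through its Python package. The following is a small sketch using an illustrative toy graph; see the `infomap` documentation for the full set of options:

```python
import networkx as nx
from infomap import Infomap

G = nx.karate_club_graph()     # illustrative toy graph

im = Infomap("--two-level --silent")
for u, v in G.edges():
    im.add_link(u, v)
im.run()

print(f"codelength: {im.codelength:.3f} bits, modules: {im.num_top_modules}")
modules = im.get_modules()     # dict mapping node ids to module ids
```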
If you're using Neuromap, please cite:
(A citation entry for the NeurIPS version will be added as soon as it becomes available.)
```bibtex
@misc{blöcker2024mapequationgoesneural,
  title         = {The Map Equation Goes Neural: Mapping Network Flows with Graph Neural Networks},
  author        = {Christopher Bl\"ocker and Chester Tan and Ingo Scholtes},
  year          = {2024},
  publisher     = {arXiv},
  doi           = {10.48550/arXiv.2310.01144},
  url           = {https://doi.org/10.48550/arXiv.2310.01144},
  eprint        = {2310.01144},
  archivePrefix = {arXiv},
  primaryClass  = {cs.LG},
  howpublished  = {\href{https://doi.org/10.48550/arXiv.2310.01144}{arXiv:2310.01144}}
}
```