Skip to content

Latest commit

 

History

History
117 lines (87 loc) · 4.71 KB

README.md

File metadata and controls

117 lines (87 loc) · 4.71 KB

Sanctions/Offshores Graph Demo

This repository contains scripts that will merge the OpenSanctions Due Diligence dataset with the ICIJ OffshoreLeaks database in order create a combined graph for analysis.

The result is a Cypher script to load the full graph into the Neo4J database and then browse it using the Linkurious investigation platform.

Based on name-based entity matching between the datasets, an analyst can use this graph to find offshore holdings linked to politically exposed and sanctioned individuals.

This demo will be used in a joint Linkurious/OpenSanctions webinar on Feb. 24, 2022 (5pm CET). Sign up here to participate.

Import the data

Before loading the data into a fresh install of Neo4J (4.4), please make sure the database engine is configured to use enough heap memory for fast bulk imports. Add the following lines to your Neo4J server configuration file:

dbms.transaction.concurrent.maximum=0
dbms.memory.heap.max_size=8g

Once you have configured Neo4J, you can use the load script to import data from published CSV files:

The simplest way to do this is to open the script in a text editor and copy and paste the contained commands into the Neo4J browser web UI.

After the data has loaded into Neo4J, connect your Linkurious instance to the instance. In the "Advanced" section of the "Global configuration", make sure to set indexationChunkSize to 500 (instead of the 5000 default value). You can also copy the contents of linkurious-captions.json and linkurious-styles.json into the relevant sections of the data source configuration dialog.

Fixme: cleanup

//Clean orphan nodes
:auto MATCH (n:identifier) WHERE apoc.node.degree((n)) <= 1 call { with n     DETACH DELETE (n) } in transactions of 50000 rows;
:auto MATCH (n:email) WHERE apoc.node.degree((n)) <= 1 call { with n     DETACH DELETE (n) } in transactions of 50000 rows;
:auto MATCH (n:phone) WHERE apoc.node.degree((n)) <= 1 call { with n     DETACH DELETE (n) } in transactions of 50000 rows;
:auto MATCH (n:name) WHERE apoc.node.degree((n)) <= 1 call { with n     DETACH DELETE (n) } in transactions of 50000 rows;
  
// clean empty properties
CALL apoc.periodic.iterate(
  "MATCH (n) UNWIND keys(n) as k WITH n, k WHERE n[k] = '' RETURN n, k",
  "WITH n, collect(k) as propertyKeys
   CALL apoc.create.removeProperties(n, propertyKeys) YIELD node
   RETURN node",
  {batchSize:50000, parallel:true});

Playing with the data

A query template that works for surprisingly many politicians:

MATCH p = (n)-[*..5]-(b:Offshore)
WHERE id(n) = {{"Node":node}}
    AND NONE(x IN nodes(p)[1..-1] WHERE (x:Offshore))
WITH  p, length(p) as score, n
ORDER BY score ASC
RETURN p, score, n

And an alert template:

MATCH p = (s:SanctionedEntity {nationality: "ru"})-[*..5]-(t:Offshore)
    WHERE NONE(x IN nodes(p)[1..-1] WHERE (x:PoliticalParty OR x:PublicBody OR x:Offshore))
WITH  p, length(p) as score, s, t
ORDER BY score ASC
RETURN p, score, s, t

Links between sanctioned entities and the laundromat core:

MATCH p = (s:SanctionedEntity)-[*..5]-(t:FinancialCrime)
    WHERE NONE(x IN nodes(p)[1..-1] WHERE (x:FinancialCrime OR x:Address))
WITH p LIMIT 10
RETURN p;
MATCH p = (s:SanctionedEntity)-[*..5]-(t:Offshore)
    WHERE NONE(x IN nodes(p)[1..-1] WHERE (x:Offshore OR x:Address))
WITH p LIMIT 10
RETURN p;
MATCH p = (s:Politican)-[*..5]-(t:Offshore)
    WHERE NONE(x IN nodes(p)[1..-1] WHERE (x:Offshore OR x:Address))
WITH p LIMIT 10
RETURN p;
MATCH p = (s:Sanctioned)-[:OWNERSHIP]-(c)
    WHERE NOT c:Sanctioned
WITH p LIMIT 10
RETURN p;
MATCH p = (s:Politician)-[*..4]-(t)-[:PAYMENT]-(c)
    WHERE NONE(x IN nodes(p)[1..-1] WHERE (x:FinancialCrime OR x:Address))
    AND NONE(y IN relationships(p)[1..-1] WHERE (y:PAYMENT))
WITH p, length(p) as score, s, t, c LIMIT 100
RETURN p, score, s, t, c;

Custom actions:

https://offshoreleaks.icij.org/search?cat=2&e=&q={{node.caption}}&utf8=%E2%9C%93
https://www.opensanctions.org/entities/{{node.id}}/

License

The code in this repository is licensed under MIT terms, see LICENSE. The OpenSanctions dataset is licensed under CC-BY-NonCommercial and free for media and NGO use. See the project site for information about commercial licensing.