
fix: Implement graph acquisition #164

Open
wants to merge 1 commit into feat-torch-wl-kernel

Conversation

eddiebergman (Contributor)

@vladislavalerievich here's a working but highly unoptimized acquisition over mixed spaces with categoricals, numericals and graphs.

The main thing done was to not use the overridden SingleTaskGP and instead pass graphs in as a column of the X tensor. This allows us to use almost all of the underlying BoTorch functionality. I made this a PR so it can be annotated; feel free to merge as you need (it's into your branch).

  • This tensor represents indices. As far as the acquisition optimization is concerned, these are fixed_features and are not optimized over. Instead, we optimize them in an outer loop, as was done with the categoricals.
  • These indices map into a graph_lookup that we attach to the TorchWLKernel when we need it. See the associated @contextmanager (a rough sketch follows below). This contextmanager might be overkill but it worked for me.
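
Roughly, the idea of that contextmanager is just to swap the lookup in and out around the call. A minimal sketch (the name set_graph_lookup and the attribute handling are illustrative, not necessarily what's in the PR):

from contextlib import contextmanager

@contextmanager
def set_graph_lookup(kernel, graphs):
    # Temporarily point the WL kernel's index column at a new list of graphs,
    # restoring the previous lookup afterwards.
    old_lookup = kernel.graph_lookup
    kernel.graph_lookup = graphs
    try:
        yield kernel
    finally:
        kernel.graph_lookup = old_lookup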

It's slow, horribly slow ... I changed the parameters to reduce the number of iterations it will do, but ultimately it needs to be sped up. I left a big TODO w.r.t. the optimizations. Most of it is just redundant calculations that need to be fixed up.

If you need some guidance on where it's slow, I highly recommend py-spy:

py-spy record -f speedscope -o profile.speedscope -- python <pythonfile>

You can then upload profile.speedscope to the speedscope web viewer (speedscope.app) to see a flamegraph of where all the time is being spent. I recommend using the Left Heavy view (top left).

If you need to see the underlying non-Python code in the output, you can add the -n argument, although it gets noisy until you're used to looking at it.
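
For example:

py-spy record -f speedscope -o profile.speedscope -n -- python <pythonfile>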

test_dataset = GraphDataset.from_networkx(x2) # x2 should be test graphs
return self._wl_kernel(self._train_graph_dataset, test_dataset)


class MixedSingleTaskGP(SingleTaskGP):

This isn't actually used anymore

Comment on lines +28 to +29
N_GRAPH = 1
assert N_GRAPH == 1, "This example only supports a single graph feature"

We can assume we'll only ever have 1 graph parameter for now. In the future, if we need more, we could have a kernel per graph hyperparameter.
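
If that ever becomes necessary, one option (just a sketch, not part of this PR; lookups and first_graph_col are placeholder names, reusing the kernel classes from the example above) would be one WL kernel per graph column:

graph_kernels = [
    ScaleKernel(
        TorchWLKernel(
            graph_lookup=lookups[i],             # one lookup per graph feature
            n_iter=5,
            normalize=True,
            active_dims=(first_graph_col + i,),  # one index column per graph feature
        )
    )
    for i in range(N_GRAPH)
]
kernels.extend(graph_kernels)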

Comment on lines +34 to +37
X = torch.empty(
    size=(TOTAL_CONFIGS, N_NUMERICAL + N_CATEGORICAL + N_GRAPH),
    dtype=torch.float64,
)

The + N_GRAPH column is where the indices into the graph lookup will go.
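
Concretely, that extra column just holds integer positions into the lookup (a sketch, assuming row i's graph lives at index i of the lookup):

# Indices are stored as floats because X is a single float64 tensor;
# row i of X refers to graph_lookup[int(X[i, -1])].
X[:, -1] = torch.arange(TOTAL_CONFIGS, dtype=torch.float64)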

Comment on lines +94 to +103
if N_GRAPH > 0:
    wl_kernel = ScaleKernel(
        TorchWLKernel(
            graph_lookup=train_graphs,
            n_iter=5,
            normalize=True,
            active_dims=(X.shape[1] - 1,),  # Last column
        )
    )
    kernels.append(wl_kernel)

Now that TorchWLKernel inherits from Kernel, we can use it just like any other kernel, including wrapping it in a ScaleKernel.

Importantly, the graph_lookup we pass in is what the integer indices in column X.shape[1] - 1 will refer to.

train_graphs = graphs[:TRAIN_CONFIGS]
train_y = y[:TRAIN_CONFIGS].unsqueeze(-1) # Add dimension for botorch
# Combine numerical and categorical kernels
kernel = AdditiveKernel(*kernels)

Can now dump the TorchWLKernel in here with the rest.
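
From there it's the standard BoTorch setup (a sketch, assuming train_x is the training slice of X):

from botorch.fit import fit_gpytorch_mll
from botorch.models import SingleTaskGP
from gpytorch.mlls import ExactMarginalLogLikelihood

# The combined AdditiveKernel plugs straight into a plain SingleTaskGP,
# so no custom GP subclass is needed.
gp = SingleTaskGP(train_X=train_x, train_Y=train_y, covar_module=kernel)
mll = ExactMarginalLogLikelihood(gp.likelihood, gp)
fit_gpytorch_mll(mll)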

assert x1.shape[-1] == 1, "Last dimension must be the graph index"
assert x2.shape[-1] == 1, "Last dimension must be the graph index"

# TODO: Optimizations

These are suspected reasons for it being slow, as well as some possible solutions.


# NOTE: The active dim is already selected out for us and is the last dimension
# (not including whatever happens when last_dim_is_batch is True).
if x1.ndim == 3:

This is to do with the batching that goes on during optimize_acqf.

It corresponds to the raw_samples= parameter.
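
For context, a sketch of the call that produces those batches (acq, bounds and graph_idx are placeholders; the values are illustrative): raw_samples controls the size of the extra batch dimension the kernel sees, and the graph index column is pinned via fixed_features so the continuous optimizer never touches it.

from botorch.optim import optimize_acqf

candidates, acq_value = optimize_acqf(
    acq_function=acq,
    bounds=bounds,
    q=1,
    num_restarts=10,
    raw_samples=32,
    # Pin the graph index column; it is optimized in the outer loop instead.
    fixed_features={X.shape[1] - 1: float(graph_idx)},
)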

Comment on lines +89 to +95
q_dim_size = x1.shape[0]
assert x2.shape[0] == q_dim_size

out = torch.empty((q_dim_size, x1.shape[1], x2.shape[1]), device=x1.device)
for q in range(q_dim_size):
    out[q] = self.forward(x1[q], x2[q], diag=diag)
return out

Since the kernel can't natively handle this batching, we essentially do a for loop over the q dimension. With the code comments above, that would be 32 iterations of x1.shape == (5, 1) and x2.shape == (55, 1), the 1 being that we have only one column of indices.

Comment on lines +97 to +116
if x1_is_x2:
    _ixs = x1.flatten().to(torch.int64).tolist()
    all_graphs = [self.graph_lookup[i] for i in _ixs]

    # No selection required
    select = None
else:
    _ixs1 = x1.flatten().to(torch.int64).tolist()
    _ixs2 = x2.flatten().to(torch.int64).tolist()
    all_graphs = [self.graph_lookup[i] for i in _ixs1 + _ixs2]

    # Select out K_x1_x2
    select = lambda _K: _K[: len(_ixs1), len(_ixs1) :]

_kernel = _TorchWLKernel(n_iter=self.n_iter, normalize=self.normalize)
K = _kernel(all_graphs)
K_selected = K if select is None else select(K)
if diag:
    return torch.diag(K_selected)
return K_selected

Here we pull out the graphs from the lookup. I put in a mini-optimization that occurs during fit_gpytorch_mll, i.e. when x1_is_x2, meaning we only compute the K_x1_x1 part and use that.

Otherwise, I end up computing the whole K matrix and just sub-selecting out K_x1_x2. It would be much more efficient if we could calculate just that part.

return K_selected


class _TorchWLKernel(nn.Module):

I just wrapped this class; we could probably lift the logic out into functions, or otherwise move the logic into the class above.
