Possible PyTorch implementation of WL kernel #153

vladislavalerievich · 2024-10-28T16:16:14Z

This pull request introduces a custom PyTorch implementation of the Weisfeiler-Lehman (WL) kernel to replace the existing Grakel-based implementation. The most important changes include adding new example scripts, creating the custom WL kernel class, and adding tests to ensure correctness and compatibility.

New Implementation of Weisfeiler-Lehman Kernel:

grakel_replace/torch_wl_kernel.py: Implemented a custom PyTorch class TorchWLKernel for the WL kernel, including methods to convert NetworkX graphs to sparse adjacency tensors, initialize node labels, perform WL iterations, and compute the kernel matrix. Also added a GraphDataset utility class for preprocessing NetworkX graphs.
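For readers unfamiliar with the steps listed above (initialize labels, run WL iterations, compute the kernel matrix), here is a minimal plain-Python sketch of a WL subtree kernel. It is not the code in `torch_wl_kernel.py`: the helper names `adjacency`, `wl_label_counts`, and `wl_kernel_matrix` are hypothetical, graphs are plain edge lists rather than sparse tensors, and it assumes unlabelled, non-empty graphs where every node starts with the same label.

```python
from collections import Counter


def adjacency(edges):
    """Build an undirected adjacency map {node: set(neighbours)} from an edge list."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def wl_label_counts(adj, n_iter):
    """Count every label the nodes of one graph take over n_iter WL rounds."""
    # Assumption: unlabelled graph, so every node starts with the same label.
    labels = {v: 0 for v in adj}
    counts = Counter(labels.values())
    for _ in range(n_iter):
        # New label = (own label, sorted multiset of neighbour labels).
        labels = {
            v: (labels[v], tuple(sorted(labels[u] for u in adj[v])))
            for v in adj
        }
        counts.update(labels.values())
    return counts


def wl_kernel_matrix(graphs, n_iter=2, normalize=False):
    """Pairwise WL similarities; assumes every graph has at least one node."""
    feats = [wl_label_counts(g, n_iter) for g in graphs]
    n = len(feats)
    # K[i][j] is the dot product of the two label-count vectors.
    K = [[float(sum(c * feats[j][lab] for lab, c in feats[i].items()))
          for j in range(n)] for i in range(n)]
    if normalize:
        diag = [K[i][i] ** 0.5 for i in range(n)]
        K = [[K[i][j] / (diag[i] * diag[j]) for j in range(n)] for i in range(n)]
    return K


g1 = adjacency([(0, 1), (1, 2), (1, 3), (1, 4), (2, 3)])
g2 = adjacency([(0, 1), (1, 2), (2, 3), (3, 4)])
g3 = adjacency([(0, 1), (1, 3), (3, 2)])
K = wl_kernel_matrix([g1, g2, g3], n_iter=2)
```

Because labels here are built from node neighbourhoods only, the result does not depend on the order in which edges are listed. Values may differ from Grakel's depending on how initial labels are chosen.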

Example Scripts:

grakel_replace/grakel_wl_usage_example.py: Added an example script demonstrating the usage of the Grakel library for computing the WL kernel matrix.
grakel_replace/torch_wl_usage_example.py: Added an example script demonstrating the usage of the new PyTorch-based WL kernel implementation.

Testing:

tests/test_torch_wl_kernel.py: Added tests to compare the custom WL kernel against Grakel's implementation, check kernel matrix symmetry, handle empty graphs, validate input types, check the same output for reordered graphs, and test single-node graphs.

eddiebergman

Seems really good as mentioned previously in MM!

There are some more changes we'll need to discuss regarding the representation we pass in and how we'll get it to work with BoTorch, but we can discuss that later. For now, there's a bug which, from the looks of things, extends to either a bug in GraKel or at least in how it's used, based on your comparison test with it.

Reproduction with description:

import networkx as nx
from torch_wl_kernel import GraphDataset, TorchWLKernel

# Create the same graphs as for the Grakel example

G1 = nx.Graph()
G1.add_edges_from([(0, 1), (1, 2), (1, 3), (1, 4), (2, 3)])

G2 = nx.Graph()
G2.add_edges_from([(0, 1), (1, 2), (2, 3), (3, 4)])
G3 = nx.Graph()
G3.add_edges_from([(0, 1), (1, 3), (3, 2)])

# Process graphs
graphs = GraphDataset.from_networkx([G1, G2, G3])

# Initialize and run WL kernel
wl_kernel = TorchWLKernel(n_iter=2, normalize=False)

K = wl_kernel(graphs)

print("Kernel matrix (pairwise similarities):")
print(K)

# Issue: GraphDataset.from_networkx() relabels nodes independently, which is incorrect.
# We assume the edges all refer to the same nodes, i.e. the 0, 1, 2, 3, 4 in the graphs
# above are not independent of each other.
# ------------------------------------------------------------------
# Below, we re-ordered the edges, placing the first edge at the end.
# This is the same graph, yet the kernel returns something different.
#
#
#     v-------------------------------------------v
#  [(0, 1), (1, 2), (1, 3), (1, 4), (2, 3)]
#          [(1, 2), (1, 3), (1, 4), (2, 3), (0, 1)]
#
# Take a look at the implementation of `nx.convert_node_labels_to_integers()`
# that is used in `GraphDataset.from_networkx()`. We likely need to create
# our own mapping and relabel the nodes as they do.
G1 = nx.Graph()
edges_g1 = [(1, 2), (1, 3), (1, 4), (2, 3), (0, 1)]
G1.add_edges_from(edges_g1)
print(list(G1.nodes()))

G2 = nx.Graph()
G2.add_edges_from([(0, 1), (1, 2), (2, 3), (3, 4)])
print(list(G2.nodes()))
G3 = nx.Graph()
G3.add_edges_from([(0, 1), (1, 3), (3, 2)])
print(list(G3.nodes()))

# Process graphs
graphs = GraphDataset.from_networkx([G1, G2, G3])
for g in graphs:
    print(g.edges())

# Initialize and run WL kernel
wl_kernel = TorchWLKernel(n_iter=2, normalize=False)

K = wl_kernel(graphs)

print("Kernel matrix (pairwise similarities):")
print(K)
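The relabeling problem described in the comments above can be shown without NetworkX. The sketch below assumes that integer ids are handed out in first-seen node order, which is what `nx.convert_node_labels_to_integers()` appears to do with its default ordering, since it follows the graph's node iteration order. The helpers `first_seen_ids`, `shared_ids`, and `relabel` are hypothetical names, and building one sorted mapping over all graphs is only one possible fix, not necessarily the one adopted in this PR.

```python
def first_seen_ids(edges):
    """Assign integer ids in the order nodes first appear in the edge list."""
    ids = {}
    for u, v in edges:
        for node in (u, v):
            ids.setdefault(node, len(ids))
    return ids


def shared_ids(edge_lists):
    """One mapping for all graphs, built from the sorted union of node labels."""
    nodes = sorted({node for edges in edge_lists for edge in edges for node in edge})
    return {node: i for i, node in enumerate(nodes)}


def relabel(edges, ids):
    """Apply a mapping and return the edges in a canonical, order-free form."""
    return sorted(tuple(sorted((ids[u], ids[v]))) for u, v in edges)


edges_a = [(0, 1), (1, 2), (1, 3), (1, 4), (2, 3)]
edges_b = [(1, 2), (1, 3), (1, 4), (2, 3), (0, 1)]  # same graph, first edge moved last

# First-seen ids depend on edge order, so the "same" node gets different ids.
print(first_seen_ids(edges_a))
print(first_seen_ids(edges_b))

# A shared, sorted mapping gives both edge lists the same relabeled graph.
ids = shared_ids([edges_a, edges_b])
print(relabel(edges_a, ids) == relabel(edges_b, ids))
```

This assumes the original node labels are mutually sortable (for example, all integers or all strings).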

grakel_replace/grakel_wl_usage_example.py

grakel_replace/torch_wl_kernel.py

tests/test_torch_wl_kernel.py

eddiebergman

Looks mostly good but I have some questions regarding the definition of the class within the actual Kernel.

Otherwise, looks really solid and the tests are really good!

grakel_replace/mixed_single_task_gp.py

eddiebergman · 2024-11-14T12:19:17Z

grakel_replace/mixed_single_task_gp_usage_example.py

+combined_num_cat_kernel = AdditiveKernel(*kernels) if kernels else None
+
+# Create WL kernel for graphs
+wl_kernel = TorchWLKernel(n_iter=5, normalize=True)


I'm guessing there's no way to pass the TorchWLKernel to the AdditiveKernel directly, i.e. to use it just like any other kernel type?

vladislavalerievich · 2024-11-14

We could define a WLKernel class that extends gpytorch.kernels.Kernel and use that class instead of TorchWLKernel.
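One way to read this suggestion: if the graph kernel exposes the same call interface as the other kernels, a sum kernel can combine them without special-casing. The sketch below shows only that pattern in plain Python. `LookupGraphKernel`, `NumericKernel`, `SumKernel`, and `shared_edges` are hypothetical stand-ins, not gpytorch classes, and addressing graphs by index into a stored list is an assumption rather than something stated in this thread.

```python
import math


class LookupGraphKernel:
    """Hypothetical wrapper: inputs are integer indices into a stored list of graphs."""

    def __init__(self, graphs, pair_kernel):
        self.graphs = graphs
        self.pair_kernel = pair_kernel

    def __call__(self, idx1, idx2):
        return [[self.pair_kernel(self.graphs[i], self.graphs[j]) for j in idx2]
                for i in idx1]


class NumericKernel:
    """Stand-in for an ordinary kernel: an RBF over one numeric value per index."""

    def __init__(self, values, lengthscale=1.0):
        self.values = values
        self.lengthscale = lengthscale

    def __call__(self, idx1, idx2):
        scale = 2.0 * self.lengthscale ** 2
        return [[math.exp(-((self.values[i] - self.values[j]) ** 2) / scale)
                 for j in idx2] for i in idx1]


class SumKernel:
    """Element-wise sum of kernels that share the (idx1, idx2) call interface."""

    def __init__(self, *kernels):
        self.kernels = kernels

    def __call__(self, idx1, idx2):
        mats = [k(idx1, idx2) for k in self.kernels]
        return [[sum(m[r][c] for m in mats) for c in range(len(idx2))]
                for r in range(len(idx1))]


def shared_edges(g, h):
    """Toy graph similarity (not WL): number of edges the two graphs share."""
    return float(len(set(g) & set(h)))


graphs = [{(0, 1), (1, 2)}, {(0, 1), (2, 3)}]
values = [0.0, 1.0]
combined = SumKernel(NumericKernel(values), LookupGraphKernel(graphs, shared_edges))
K = combined([0, 1], [0, 1])
```

In this shape the graph kernel is just another term in the sum, which is the behaviour asked about above. A real gpytorch subclass would implement `forward` on tensors instead of `__call__` on index lists.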

Add a PyTorch implementation of WL kernel

36fc3bd

eddiebergman reviewed Oct 29, 2024

vladislavalerievich added 8 commits October 29, 2024 15:02

Fix imports

b0d3842

Remove redundant copy

f87abd6

Increase precision for allclose

358fbb7

Fix calculation for graphs with reordered edges

de140b6

Increase test coverage

08c7aea

Improve readability of TorchWLKernel

6f07858

Add additional comments to TorchWLKernel

896f461

Add MixedSingleTaskGP to process graphs

383e924

eddiebergman reviewed Nov 14, 2024

vladislavalerievich added 7 commits November 20, 2024 22:38

Refactor WLKernelWrapper into a standalone WLKernel class.

65666a3

Update tests

7fa9432

Add a check for empty inputs

4227f22

Improve and combine tests

f194bd2

Update WLKernel

a104840

Add acquisition function with graph sampling

246f9f6

Add a custom __call__ method to pass graphs during optimization

770c626

vladislavalerievich added the enhancement label Nov 22, 2024

vladislavalerievich added 7 commits December 7, 2024 16:14

Update MixedSingleTaskGP

8bf7ea7

Remove not used argument

84d0104

Update sample_graphs

d63239a

Handle different batch dimensions

3db3f89

Set num_restarts=10

f69ddbe

Add acquisition function

1c4cc83

Update WLKernel

dab9a8c

vladislavalerievich removed the enhancement label Dec 7, 2024

vladislavalerievich self-assigned this Dec 7, 2024

vladislavalerievich added 2 commits December 7, 2024 23:06

Make train_inputs private

2999582

Update tests

ad55030

eddiebergman and others added 8 commits December 16, 2024 15:13

fix: Implement graph acquisition

8093d31

fix: Implement graph acquisition (#164)

9f978d6

Delete unused MixedSingleTaskGP

a1a29a8

Add seed_all and min_max_scale

046ad66

Refactor optimize.py

0a609f7

Speed up WL kernel computations

5486dcc

Process wl iterations in batches

f140c56

Use CSR

371b530
