
Generative model #42

Merged May 26, 2020
Conversation

dnguyen1196 (Collaborator)

Major modifications compared to #34:

  • The convolution layer now uses the DGL implementation (see the sketch after this list)
  • Streamlined the training/testing scripts to use the existing report functionality. Added metrics related to the graph generative model (still mostly geared towards edge prediction)
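
For reference, a minimal sketch of what a DGL built-in graph convolution call looks like (GraphConv comes from dgl.nn; the toy graph and dimensions below are placeholders, not code from this PR):

import dgl
import torch
from dgl.nn.pytorch import GraphConv

# Toy example: one DGL graph convolution over node features.
g = dgl.graph(([0, 1, 2], [1, 2, 0]))   # small 3-node graph
g = dgl.add_self_loop(g)                # GraphConv expects no zero-in-degree nodes
feat = torch.randn(3, 16)               # (num_nodes, in_feats)
conv = GraphConv(16, 32)                # in_feats=16, out_feats=32
h = conv(g, feat)                       # (num_nodes, 32)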

@yuanqing-wang (Member)

thanks a lot! this looks neat! I'll do a line-by-line review in just a bit

self.gc3 = GraphConvolution(hidden_dim1, hidden_dim2, dropout, act=lambda x: x)
self.dc = InnerProductDecoder(dropout, act=lambda x: x)

def parameterization(self, x, adj):
Member:

I made this typo before. This should be spelled 'parametrization', without the extra e.

Collaborator (Author):

This is really odd; this file is outdated, and I'm not sure why it shows up here. The GCNModelVAE should not even have this parametrization function.

class GCNModelVAE(nn.Module):
    """Graph convolutional neural networks for VAE
    """
    def __init__(self, input_feat_dim, hidden_dim1=32, hidden_dim2=32,
                 hidden_dim3=16, dropout=0.1, log_lik_scale=1):
        """ Construct a VAE with GCN
        """
        super(GCNModelVAE, self).__init__()
        self.linear = nn.Linear(input_feat_dim, hidden_dim1)
        self.gc1 = GN(hidden_dim1, hidden_dim2)

        # Mapping from "latent graph" to predictive distribution parameter
        self.output_regression = nn.ModuleList([
            GN(hidden_dim2, hidden_dim3),
            GN(hidden_dim2, hidden_dim3),
        ])
        # Decoder
        self.dc = InnerProductDecoder(dropout)
        # Relative weight between the KL divergence and the log likelihood term
        self.log_lik_scale = log_lik_scale

    def forward(self, g):
        """ Compute the latent encodings of the nodes.

        Args:
            g (DGLGraph): input graph with node features stored in
                g.ndata["h"], shape (N, D) where N is the number of nodes
        Returns:
            z (FloatTensor): the latent encodings of the nodes
                Shape (N, hidden_dim2)
        """
        z1 = self.linear(g.ndata["h"])
        z = self.gc1(g, z1)
        return z

    def condition(self, g):
        """ Compute the approximate Normal posterior distribution q(z|x, adj)
        and its parameters (per-node mean and log-variance).
        """
        z = self.forward(g)
        mu, logvar = [parameter(g, z) for parameter in self.output_regression]
        distribution = torch.distributions.normal.Normal(
                    loc=mu,
                    scale=torch.exp(0.5 * logvar))  # std = exp(logvar / 2)

        return distribution, mu, logvar
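
For context, a minimal sketch of how the posterior returned by condition might be used in a training step, with log_lik_scale weighting the reconstruction term against the KL term (model, g, and the dense adjacency target adj are placeholders; whether the decoder outputs probabilities or logits is an assumption, not confirmed by this PR):

import torch

q_z, mu, logvar = model.condition(g)
z = q_z.rsample()                        # reparameterized sample, keeps gradients flowing
adj_recon = model.dc(z)                  # InnerProductDecoder scores every node pair (N, N)
# Assuming the decoder applies a sigmoid, adj_recon holds edge probabilities:
recon = torch.nn.functional.binary_cross_entropy(adj_recon, adj)
prior = torch.distributions.normal.Normal(torch.zeros_like(mu), torch.ones_like(mu))
kl = torch.distributions.kl_divergence(q_z, prior).sum()
loss = model.log_lik_scale * recon + kl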

gcn_reduce = fn.sum(msg='m', out='h')

"""
Standalone convolution layers, separate from representation.dgl_legacy
Member:

Could you comment on how this is different from the vanilla Kipf and Welling graph conv? Is it the explicit dependency on the adjacency matrix that's different?

Collaborator (Author):

This has been removed from the updated code.
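
For reference, the vanilla Kipf and Welling graph conv mentioned above computes H' = act(D^{-1/2} (A + I) D^{-1/2} H W); a rough DGL message-passing sketch of that layer, assuming self-loops have already been added (function and field names here are illustrative, not from this PR):

import torch
import dgl.function as fn

def kipf_welling_conv(g, h, weight):
    # Symmetric-normalized graph convolution on a DGLGraph that already has self-loops.
    norm = torch.pow(g.in_degrees().float().clamp(min=1), -0.5).unsqueeze(-1)
    g.ndata['h'] = h * norm                            # D^{-1/2} H
    g.update_all(fn.copy_u(u='h', out='m'),            # message: copy source features
                 fn.sum(msg='m', out='h'))             # reduce: sum over neighbors
    return (g.ndata.pop('h') * norm) @ weight          # D^{-1/2} A D^{-1/2} H W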

return train_data, test_data, val_data


def prepare_train_test_val(data):
Member:

Could you comment a bit on the necessity of this, as opposed to partitioning / sampling small graphs and then batching them together?

Collaborator (Author):

I don't know the answer to this question yet, but the updated code lets us choose how many molecules get "batched" into a macromolecule per iteration.
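
A minimal sketch of that kind of batching with dgl.batch, which merges several molecular graphs into one disconnected "macromolecule" graph (graphs, batch_size, and model are placeholders, not names from this PR):

import dgl

chunk = graphs[:batch_size]              # batch_size molecules per macromolecule
macromolecule = dgl.batch(chunk)         # one disconnected graph containing all of them
h = model.forward(macromolecule)         # node encodings for every molecule at once
molecules = dgl.unbatch(macromolecule)   # split back into individual graphs if needed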

@yuanqing-wang (Member) left a comment

Thanks a lot! This looks great.

Minor comments:

  • I'm a bit confused by the necessity of doing all the operations on the adjacency matrix; it might not be the most efficient way and could be error-prone.
  • I'd suggest we stick to torch-generic methods when possible.

@yuanqing-wang merged this pull request into master May 26, 2020
@dnguyen1196 deleted the gvae branch May 26, 2020 16:09
yuanqing-wang added a commit that referenced this pull request Jun 27, 2020