Question about subtask-decomposition training #1
Thanks for your interest. We performed ablation studies to illustrate this point in our supplementary information (https://www.nature.com/articles/s43588-024-00698-1).
Sorry, I could only find Supplementary Figure 5 and Supplementary Figure 6 as ablation studies, and they focus on ablating specific subtasks. Could you point me to the exact location in the supplementary information?
Subtask-2 can be considered a coarse-grained version of subtask-3, as it only concerns the direction of DEGs. Subtask-1 is the most important part of STAMP; however, joint training would introduce large amounts of noise into the following subtasks at the initial training stage. In our preliminary testing, this led to lower STAMP performance.
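To make the staged setup concrete, here is a minimal numpy sketch (not STAMP's actual code; the toy data, `train_logistic` helper, and variable names are all illustrative assumptions): subtask-1 is trained first and in isolation, and the subtask-2 head is then fit only on the DEG subset, so early, noisy subtask-1 predictions never leak into its gradients.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy setup: 200 "genes", 5 features each; two independent linear labels
# stand in for subtask-1 (is this gene a DEG?) and subtask-2 (up or down?).
X = rng.normal(size=(200, 5))
w1_true, w2_true = rng.normal(size=5), rng.normal(size=5)
is_deg = (X @ w1_true > 0).astype(float)     # subtask-1 labels
direction = (X @ w2_true > 0).astype(float)  # subtask-2 labels

def train_logistic(X, y, epochs=500, lr=0.5):
    """Plain gradient-descent logistic regression, standing in for one subtask head."""
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        p = 1.0 / (1.0 + np.exp(-(X @ w)))
        w -= lr * X.T @ (p - y) / len(y)
    return w

# Stage 1: train the DEG head (subtask-1) on all genes, in isolation.
w_deg = train_logistic(X, is_deg)

# Stage 2: train the direction head (subtask-2) only on the DEG subset,
# so its gradients are never polluted by noise from non-DEGs or from an
# unconverged subtask-1 head.
mask = is_deg.astype(bool)
w_dir = train_logistic(X[mask], direction[mask])

acc_deg = (((X @ w_deg) > 0) == mask).mean()
acc_dir = (((X[mask] @ w_dir) > 0) == direction[mask].astype(bool)).mean()
```

The key point of the decomposition is visible in stage 2: the direction head only ever sees genes already flagged as DEGs.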
Great! Many thanks, this really helps me understand the paper!
Hi author, I've found a new confusing issue. My first impression was that the separation of subtask-1 from subtask-2 + subtask-3 only occurs in the training stage. But in the prediction stage, it seems the code also takes the true labels of subtask-1, instead of using the predicted labels, as input to subtask-2 + subtask-3. In a real-world setting, there are no true labels telling us which genes are differentially expressed. Did I misunderstand? (Code is attached below.)
Yes, we need to use the true labels of subtask-1 to evaluate performance on subtask-2 + subtask-3, as we focus on the performance on DEGs. For evaluating the accuracy of identifying DEGs, we use the predicted score of subtask-1 against the true labels of subtask-1. In real application cases, you can directly use the model's output, as no benchmarking is involved there.
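The two regimes described above can be sketched in a few lines (a hypothetical helper, not the repository's API): during benchmarking the ground-truth DEG mask gates subtasks 2 and 3, while in deployment the model's own subtask-1 score is thresholded instead.

```python
import numpy as np

def select_degs(deg_score, true_deg=None, threshold=0.5):
    """Choose which genes feed subtasks 2 & 3.

    Benchmarking: pass the ground-truth DEG mask (true_deg) so that
    direction/magnitude metrics are computed on real DEGs only.
    Deployment: no labels exist, so threshold the model's own
    subtask-1 score instead.
    """
    if true_deg is not None:           # evaluation regime
        return true_deg.astype(bool)
    return deg_score > threshold       # real-world regime

deg_score = np.array([0.9, 0.2, 0.7, 0.1])  # subtask-1 predictions
true_deg  = np.array([1, 0, 0, 1])          # ground truth (benchmark only)

eval_mask = select_degs(deg_score, true_deg)  # -> [True, False, False, True]
run_mask  = select_degs(deg_score)            # -> [True, False, True, False]
```

Note the two masks can disagree (gene 3 here): that disagreement is exactly what the subtask-1 benchmark measures, while the subtask-2/3 benchmark deliberately sidesteps it by conditioning on the true mask.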
Thanks for your reply. But I'm still confused: if subtask-1 is fully separated from subtask-2 and -3 in both training and testing, how does subtask-1 benefit subtask-2 and -3?
We used the DEGs identified by statistical methods to constrain the model's learning for subtask-2 and -3, which improves the signal-to-noise ratio to a certain extent. Intuitively, the model should not focus on non-DEGs, as that amounts to fitting noise.
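One common way to realize such a constraint is a masked loss (a sketch under my own assumptions, not necessarily how STAMP implements it): the subtask-2/3 objective is averaged only over the statistically identified DEGs, so non-DEG genes contribute no gradient at all.

```python
import numpy as np

def masked_mse(pred, target, deg_mask):
    """MSE computed only over statistically identified DEGs, so gradients
    from non-DEGs (mostly noise) never reach the subtask-2/3 heads."""
    m = deg_mask.astype(bool)
    return float(np.mean((pred[m] - target[m]) ** 2))

pred   = np.array([1.0, 5.0, 2.0, -3.0])  # model output for 4 genes
target = np.array([1.5, 0.0, 2.5, -2.0])  # observed expression change
mask   = np.array([1, 0, 1, 1])           # DEG calls from a statistical test

loss = masked_mse(pred, target, mask)  # averages over the 3 DEGs only
```

Gene 2 has the largest error (5.0 vs 0.0), but because it is not a DEG it is excluded entirely, which is precisely the signal-to-noise argument above.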
Many thanks! |
Hi. I'm a little confused here. When making predictions, do the results of output_1 help the next two tasks in any way? Why not use the result of output_1 as the DEG label for the input of the next two tasks to get output_2 and output_3, and then evaluate those against the real results?
Thanks for your excellent work! This work is something of a milestone in perturbation prediction.
The question that confuses me is the code concerning subtask-decomposition training. It looks like subtask-1 is separated from the others, while subtask-2 and subtask-3 are trained jointly in a multi-task fashion. I'm curious why subtask-2 and -3 aren't similarly separated, or why all three aren't linked together as a complete multi-task setup. Is there any evidence that this configuration is better?