Store multiple clustering results in the malware analysis JSON #13

So-Cool · 2016-08-01T15:11:22Z

At the moment, at most one parameter setting and its clustering result can be stored per clustering algorithm. This should be extended to allow storing clustering results for multiple parameter settings. See the TODO tags in commit 1727bb0.

greninja · 2017-01-23T21:31:20Z

Hey @So-Cool,

I would like to work on this enhancement.

So basically we want a good hash function, without collisions, that uses the parameters ('eps' and 'min_samples' in the case of dbscan, 'min_samples' and 'min_cluster_size' in the case of hdbscan) to generate the hash? Am I correct?

So-Cool · 2017-01-26T09:34:29Z

Hi @greninja,
it's great that you're willing to work on this. To avoid any mess with your PRs, could you please first finalise the other two issues that you are working on?

A hash function is not really necessary, especially since it would need to be bidirectional. Storing the results is one problem, but retrieving them is another: we want users to be able to understand which parameters were used to get a particular result.
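
One way to keep the parameters readable is to store every run next to its own parameter dictionary rather than under a hashed key. The following is only a minimal sketch under assumptions: the "clustering", "params" and "labels" keys and the `store_result` / `find_result` helpers are hypothetical and are not taken from the project's actual JSON layout or code.

```python
import json


def store_result(analysis, algorithm, params, labels):
    """Record one clustering run together with its human-readable parameters."""
    # Hypothetical layout: analysis["clustering"][algorithm] is a list of runs.
    runs = analysis.setdefault("clustering", {}).setdefault(algorithm, [])
    for run in runs:
        # Overwrite a run that used exactly the same parameters.
        if run["params"] == params:
            run["labels"] = labels
            return
    runs.append({"params": params, "labels": labels})


def find_result(analysis, algorithm, params):
    """Return the labels stored for the given parameters, or None if absent."""
    for run in analysis.get("clustering", {}).get(algorithm, []):
        if run["params"] == params:
            return run["labels"]
    return None


analysis = {}
store_result(analysis, "dbscan", {"eps": 0.5, "min_samples": 5}, [0, 0, 1, -1])
store_result(analysis, "dbscan", {"eps": 0.3, "min_samples": 10}, [0, 1, 1, -1])
store_result(analysis, "hdbscan",
             {"min_samples": 5, "min_cluster_size": 10}, [0, 0, 0, 1])

# Round-trip through JSON to check that the parameters survive as plain data.
restored = json.loads(json.dumps(analysis))
```

With this layout, anyone reading the JSON sees the parameter values directly, and a lookup is a plain dictionary comparison. Note that the comparison is exact, so float parameters such as 'eps' have to be passed with the same value that was stored.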

So-Cool added the enhancement label Aug 1, 2016
