refactor: adapt to cut.prob's new handling of NULL in the C core (sim… #1574

maelle · 2024-11-07T09:07:31Z

…pler default for the R interface)

Work needed in the tests.

aviator-app · 2024-11-07T09:07:35Z

Current Aviator status

Aviator will automatically update this comment as the status of the PR changes.
Comment /aviator refresh to force Aviator to re-examine your PR (or learn about other /aviator commands).

This pull request is currently open (not queued).

How to merge

To merge this PR, comment /aviator merge or add the mergequeue label.

See the real-time status of this PR on the Aviator webapp.

Use the Aviator Chrome Extension to see the status of your PR within GitHub.

…pler default for the R interface)

maelle · 2024-11-07T11:50:25Z

mmh this does not work at all currently.

maelle · 2024-11-07T12:04:40Z

@szhorvat actually, I think things are fine. What do you think of the tests

rigraph/tests/testthat/test-motifs.R

Line 1 in 30518c2

test_that("motif finding works", {

?

They're failing for small differences. Furthermore they do not make any sense to me, why are we testing for the value of the divisions?

── Failure ('test-motifs.R:11:3'): motif finding works ─────────────────────────
c(mno0/mno, mno1/mno, mno2/mno) (`actual`) not equal to c(0.654821903845065, 0.666289144345659, 0.668393831285275) (`expected`).

  `actual`: 0.67454 0.66614 0.66597
`expected`: 0.65482 0.66629 0.66839

szhorvat · 2024-11-07T13:01:46Z

I'm really tired today ... could you please help me by showing me a specific before/after example that changes output? Passing c(0,0,0) vs NULL should NOT change anything.

But as I'm writing this, I think I'm starting to remember what's going on:

I think passing NULL instead of c(0,0,0) vs will cause some (unnecessary) RNG calls to be omitted. This means that later calls that use different values than c(0,0,0), and therefore return stochastic results, should indeed be affected. And yes, this is not a bug, don't worry.

It'll be cleanest for each test to use its own random seed.

szhorvat · 2024-11-07T13:09:43Z

Yes, this is certainly what's going on. If you pass NULL or c(0,0,0,...), the result will be the exactly same. But the RNG state will be mutated differently, which means that any subsequent uses of the RNG are affected.

The results are approximately the same and everything is fine.

Adding a tolerance won't work very well here because the noise in the results is still quite high and will continue to be high unless we use large enough graphs and small enough cut probabilities that the computation time becomes too long for a test.

szhorvat · 2024-11-07T13:14:55Z

Furthermore they do not make any sense to me, why are we testing for the value of the divisions?

The interface is not very nice, unfortunately, but improvements are for a later version and for the C core.

If we give cut probabilities $p_1, p_2, \dots$, then only a fraction of the motifs will be sampled. This fraction is $\prod_i (1-p_i)$. This is the value you should see in the ratio of the counts obtained with a non-zero cut probability and the full counts (with no cuts). Since some motifs are rare, some entries in the result vector will fluctuate wildly.

So, if you give c(1/3, 0, 0), then the ratios should all be about $1-1/3 \approx 0.66$.

szhorvat · 2024-11-10T12:14:48Z

R/motifs.R

+  if (is.null(cut.prob)) {
+    .Call(
+      R_igraph_motifs_randesu_estimate, graph, as.numeric(size),
+      cut.prob, as.numeric(sample.size), as.numeric(sample)
+    )
+  } else {
+    .Call(
+      R_igraph_motifs_randesu_estimate, graph, as.numeric(size),
+      as.numeric(cut.prob), as.numeric(sample.size), as.numeric(sample)
+    )
+
+  }


Instead of putting the call in a conditional, update the value of cut.prob in a conditional, and keep a single call to the C function.

szhorvat · 2024-11-10T12:15:21Z

When you resolve conflicts, be sure that you don't accidentally re-add as.numeric to sample.

maelle force-pushed the cut.prob branch 2 times, most recently from 5b76237 to b0d4610 Compare November 7, 2024 09:15

refactor: adapt to cut.prob's new handling of NULL in the C core (sim…

2a5137a

…pler default for the R interface)

maelle force-pushed the cut.prob branch from b0d4610 to 2a5137a Compare November 7, 2024 09:23

szhorvat approved these changes Nov 7, 2024

View reviewed changes

szhorvat reviewed Nov 10, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: adapt to cut.prob's new handling of NULL in the C core (sim… #1574

refactor: adapt to cut.prob's new handling of NULL in the C core (sim… #1574

maelle commented Nov 7, 2024

aviator-app bot commented Nov 7, 2024

maelle commented Nov 7, 2024

maelle commented Nov 7, 2024

szhorvat commented Nov 7, 2024 •

edited

Loading

szhorvat commented Nov 7, 2024

szhorvat commented Nov 7, 2024

szhorvat Nov 10, 2024

szhorvat commented Nov 10, 2024

refactor: adapt to cut.prob's new handling of NULL in the C core (sim… #1574

Are you sure you want to change the base?

refactor: adapt to cut.prob's new handling of NULL in the C core (sim… #1574

Conversation

maelle commented Nov 7, 2024

aviator-app bot commented Nov 7, 2024

Current Aviator status

How to merge

maelle commented Nov 7, 2024

maelle commented Nov 7, 2024

szhorvat commented Nov 7, 2024 • edited Loading

szhorvat commented Nov 7, 2024

szhorvat commented Nov 7, 2024

szhorvat Nov 10, 2024

Choose a reason for hiding this comment

szhorvat commented Nov 10, 2024

szhorvat commented Nov 7, 2024 •

edited

Loading