Recover plugin to automate recovery in CLN #6853
Conversation
I like the general approach, but we also need to have an idea when gossipd thinks it is synced (probably in getinfo). If we are on regtest, I would simply consider us always up-to-date, but for mainnet you could use some heuristic like "more than 1000 channels and the seeker is in state NORMAL" maybe?
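A minimal sketch of the suggested heuristic, assuming hypothetical inputs (the network name, a channel count, and the seeker's state as plain values; these parameter names are illustrative, not a confirmed CLN API):

```python
def gossip_synced(network: str, channel_count: int, seeker_state: str) -> bool:
    """Sketch of the proposed gossip-sync heuristic: on regtest, always
    consider ourselves up to date; on mainnet, require a reasonably full
    network view and the seeker to have settled into its NORMAL state."""
    if network == "regtest":
        return True
    return channel_count > 1000 and seeker_state == "NORMAL"
```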
force-pushed from 39504f8 to e55230a
force-pushed from e55230a to 76c0394
force-pushed from 76c0394 to 2fb8e6b
Looking good so far! Minor nitpicks...
force-pushed from 7e3a4b4 to b42e775
force-pushed from da38b09 to 6cbf8f5
I like it, but test fails for some reason?
tests/test_misc.py (outdated):

```python
# l2.daemon.wait_for_log(r'All outputs resolved.*')
wait_for(lambda: l2.rpc.listfunds()["channels"][0]["state"] == "ONCHAIN")
wait_for(lambda: l2.rpc.listfunds()["channels"][1]["state"] == "ONCHAIN")
```
CI failing here: seems like l2 only has one channel?
It passes locally, but fails here for some reason. I am trying to resolve it...
force-pushed from 52b73bf to ad35815
force-pushed from ad35815 to e910f4c
Removed milestone as this has a crash bug, and we need to move on with the release process. Happy to re-add it if fixed in the next RC.
force-pushed from da547bc to df36ca9
The CI error is fixed here: #7080
force-pushed from df36ca9 to 7ba8a2d
Rebased on top of …
…'d help us identify if we've fallen behind or lost some state.
… and try to recover the node by entering recover mode.
…ected nodes on the network and call emergency_recover immediately.
…orage and then call restorefrompeer repeatedly.
force-pushed from 7ba8a2d to 89eadcd
… being recovered when we lose some state and enter recover mode.
force-pushed from 89eadcd to b296e3f
@adi2011 notice that if you force-push we have to run all the tests again, whereas if we just restart we can just rerun the failing cases. Let me take care of CI and this will be merged soon.
Finally!! 🎉🎉❤️
This would help users detect that they've lost some data and would automatically try to recover from SCB or peer storage.
Read #6544 for more!
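The recovery flow described in the commit messages above (detect lost state, then ask peers for our backup) could be sketched as a simple retry loop. Here `restorefrompeer` stands in for the CLN RPC call of that name; the `"stubs"` result field and the retry parameters are assumptions for illustration, not the plugin's actual implementation:

```python
import time

def attempt_recovery(restorefrompeer, max_attempts=10, delay=1.0):
    """Call restorefrompeer repeatedly until some peer hands back our
    stored channel backup, then return the result; give up (returning
    None) after max_attempts tries. The 'stubs' field is an assumed
    shape for the RPC result."""
    for _ in range(max_attempts):
        result = restorefrompeer()
        if result.get("stubs"):  # non-empty list => channels recovered
            return result
        time.sleep(delay)
    return None
```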