-
Notifications
You must be signed in to change notification settings - Fork 94
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
combined signature is not valid for cosmos-sdk v0.50 #255
Comments
+1 |
@chillyvee Most likely something else. Like its not syncing or communication is not done properly. Whats your set-up in terms of sentries/nodes? And the connection between the nodes and the signer-nodes. (is it one to many or one to one). |
v3.3.0-rc1 is just 2 commits behind main head. You are right, it shouldn't have any difference. Setup is: sedad single node only + all 3 horcrux cosigners on the same hardware port 1000,1001,1002 - no external network dependency (one chain binary to many horcrux cosigners) |
Resetting to v3.3.0-rc1 and will report back. There are slight differences but none that look significant
|
Can you give any suggestions regarding any expected signing values to check? |
Missed many blocks with 3.3.0-rc1 then suddenly recovered without any restarts. Missed many blocks
Stopped missing blocks
|
Different but possibly related issue cosigner 1 On sentry switch switch blocks miss
cosigner 2 detached
cosigner 3
Recovered after 15 blocks However typically expect horcrux to keep signing even in cases of temporary disconnect |
@chillyvee Not 100% sure from the logs but I believe its something with the nonce pre-sharing. What its warning about its that the unique ID for the preshare doesnt exist. I.e either its not generated or its cleared for other reasons. As you say, the connection (theoretically) shouldn't impact. Thats one of the ideas with Horcrux. |
Reconfigured so that 2 cosigners do not have a chain node to connect to. All cosigners restart All on single machine
cosigner1 log
cosigner2 log
cosigner3 log
message sets repeat combined signature is not valid continues and block misses continuously restarted all 3 cosigners, blocks are getting signed again |
later cosigner1
cosigner2
cosigner3
|
Adjusted nonces CosignerNonceCache target to always return 1. Worked for a while then started to error. Raised defaultNonceExpiration to 20. Continuing to monitor. Failed after some time. |
With 2/3 cosigner and only cosigner 1 connected to node, it does not matter whether cosigner 1 is leader or not. Without restart Logs look the same. Forcing restart of cosigner sometimes fixes the problem. Restarting many times will fix the problem unless it is too many times then it will unfix the problem until it is restarted even more times. |
ignore this issue until further comment |
Hello @chillyvee, were you able to resolve the issue? I attempted to reproduce the errors you were running into above, but was unsuccessful. I used interchaintest and the latest version of seda-chain (which is using SDK v0.50) to spin up several different horcrux clusters under different scenarios. Scenario 1:
Results are normal, blocks are being signed: Scenario 2:
Results are normal, blocks are being signed: Scenario 3:
Cosigners not connected to a node: (timeout is expected) Leader & connected to a node: ConclusionI wasn't able to reproduce your error. Is it possible one of your cluster's cosigners is misconfigured with a signing shard? If there is a different scenario I missed, please let me know! |
Scenario resolved. PR will be provided. |
On seda testnet 2/3 horcrux signers sometimes has this error for hundreds of blocks in a row:
combined signature is not valid
Single signer horcrux does not seem to have an issue
2/3 signer horcrux has an issue sometimes. Restarting 2/3 of the cosigners appears to resolve the issue.
Sometimes the issue occurs after restarting the chain binary and reconnecting to a working horcrux 2/3 cluster.
Is there a known resolution that is waiting to be implemented? We can attempt to assist.
cosigner1 - message set repeats
cosigner2 - message set repeats
cosigner 3 - similar messages repeat
messages are similar when signing works and when signing fails (no major difference)
The text was updated successfully, but these errors were encountered: