
core: don't fail match on misses and redeem errors #772

Merged
merged 5 commits into master from failerr
Oct 27, 2020

Conversation

buck54321
Member

buck54321 commented Oct 22, 2020

dexc

  • Set default log level to debug.

core

  • Retry failed swaps until the broadcast timeout has expired.
  • Retry failed redemptions frequently until the broadcast timeout, and less frequently afterwards until resolution.
  • Do not group swaps or redemptions for retries.
  • Retries of failed swaps and redeems are metered so that they don't run every tick, but reconfiguring a wallet clears the tick metering.

@jholdstock
Member

@buck54321 made this change because of an issue I encountered - my redeem contract did not broadcast successfully because my wallet was locked. I unlocked the wallet, but dexc never retried the broadcast. Running with the changes in this PR resolved the issue - dexc retried the broadcast until it succeeded.

Thanks for the support!

@chappjc
Member

chappjc commented Oct 22, 2020

Thanks jholdstock. You had a peculiar issue with bitcoind reporting unlocked via getwalletinfo, which seems like a bitcoind issue, but we'll consider ways to be more robust. Encrypting an unencrypted wallet while dexc is running seems to be trouble that we'll work around too.

buck54321 force-pushed the failerr branch 2 times, most recently from 4b92346 to 9c92b98, October 25, 2020
client/core/trade.go: 3 review threads (outdated, resolved)
buck54321 marked this pull request as ready for review October 25, 2020
chappjc added this to the 0.1.1 milestone Oct 26, 2020
chappjc changed the base branch from release-0.1 to master October 26, 2020
@chappjc
Member

chappjc commented Oct 26, 2020

Just changed the base branch to master. We'll cherry-pick from master into release-0.1 instead of the other way around. The conflict will go away when rebasing on master.

@@ -162,6 +179,16 @@ func (t *trackedTrade) broadcastTimeout() time.Duration {
return time.Millisecond * time.Duration(t.dc.cfg.BroadcastTimeout)
}

// delayTicks sets the tickGovernor to prevent retrying to quickly after an

too quickly

Comment on lines 1141 to 1152
c.swapMatchGroup(t, []*matchTracker{m}, errs)
}
if len(groupables) > 0 {
c.swapMatchGroup(t, groupables, errs)

I'm wondering whether we should have the groupables go first, since these all run in series now. If one of the suspect ones takes too long (something is funny with them after all, and the server request or something else could be slower than usual), the OK ones in groupables might take too long to get to.

Comment on lines 1250 to 1257
lastActionTime := match.matchTime()
if match.Match.Side == order.Taker {
// It is possible that AuditStamp could be zero if we're
// recovering during startup. The implications of this is that
// if we are 1) already in start up recovery, and 2) we fail
// again, we will not try again even if we have time before the
// broadcast timeout.
lastActionTime = encode.UnixTimeMilli(int64(match.MetaData.Proof.Auth.AuditStamp))

during startup -> after reconnect? or just "if we missed the audit request"

The audit request may be missed while disconnected, and we'd get here after reconnect->match_status resolution, not just a restart of dexc. I think we just observed this, although in the maker-redeem situation, where the audit was of the taker's contract.

Comment on lines 1259 to 1273
if time.Since(lastActionTime) < t.broadcastTimeout() {
t.delayTicks(match, t.dc.tickInterval*3/4)

If they missed the audit request, this would jump to setting swapErr even though there's possibly still time to try again.

If we don't know the audit timestamp, we'd ideally figure out the time that the counterparty's contract reached swapConf. That's not trivial or free of backend RPCs, though, so we could at least set lastActionTime to match.matchTime() + something, or even time.Now - something, if we don't have an AuditStamp.

Another idea is to always allow at least one retry if AuditStamp is zero. To do that might require putting the bogus value mentioned above in AuditStamp, though.

Comment on lines 1440 to 1441
// we will not try again for an hour even if we have time before the
// broadcast timeout.

for an hour -> 5 minutes

Comment on lines 1475 to 1476
(m.suspectSwap && t.wallets.fromAsset.ID == assetID ||
(m.suspectRedeem && t.wallets.toAsset.ID == assetID)) {

Missing parentheses around the m.suspectSwap && t.wallets.fromAsset.ID == assetID.

Perhaps this would read better like:

	isFromAsset := t.wallets.fromAsset.ID == assetID
	for _, m := range t.matches {
		if m.tickGovernor != nil &&
			((m.suspectSwap && isFromAsset) || (m.suspectRedeem && !isFromAsset)) {

chappjc merged commit bd3d828 into decred:master Oct 27, 2020
4 participants