
eth/filters: subscribe history logs #27439

Open: wants to merge 60 commits into master from the eth-filter-subscribe-history branch

Conversation

@jsvisa (Contributor) commented Jun 8, 2023

This is the second part of #15063

@jsvisa marked this pull request as draft June 8, 2023 09:01
@jsvisa force-pushed the eth-filter-subscribe-history branch from 9ec20ce to ea6c486 on June 9, 2023 07:39
@jsvisa marked this pull request as ready for review June 13, 2023 10:17
@s1na (Contributor) commented Jun 15, 2023

I have some misgivings about how we handle block ranges right now. IMO block ranges are not necessary for the live mode: when you subscribe, you get the logs for all new blocks anyway. To make my case, web3.js only lets you specify FromBlock (no ToBlock) and ethers.js doesn't support either parameter, and these are two of the biggest web3 libraries.

My suggestion is that we can simplify this greatly by only allowing FromBlock, and only for specifying historical blocks, i.e. FromBlock should be either a number below the head block, or safe, or finalized. Later on we can allow ToBlock (for historical range queries).
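
For illustration, here is a hedged client-side sketch of the shape described above, using go-ethereum's ethclient. The endpoint URL and block number are placeholders, and whether the server accepts a historical FromBlock for a subscription is precisely what this PR adds:

package main

import (
    "context"
    "log"
    "math/big"

    "github.com/ethereum/go-ethereum"
    "github.com/ethereum/go-ethereum/core/types"
    "github.com/ethereum/go-ethereum/ethclient"
)

func main() {
    client, err := ethclient.Dial("ws://localhost:8546") // placeholder endpoint
    if err != nil {
        log.Fatal(err)
    }
    // Only FromBlock is set: start from a historical block, no ToBlock.
    query := ethereum.FilterQuery{FromBlock: big.NewInt(17000000)} // placeholder block
    logs := make(chan types.Log)
    sub, err := client.SubscribeFilterLogs(context.Background(), query, logs)
    if err != nil {
        log.Fatal(err)
    }
    defer sub.Unsubscribe()
    for {
        select {
        case err := <-sub.Err():
            log.Fatal(err)
        case l := <-logs:
            log.Printf("log in block %d, tx %s", l.BlockNumber, l.TxHash)
        }
    }
}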

@jsvisa (Contributor, Author) commented Jun 19, 2023

@s1na lgtm. As we discussed offline, it seems we have more work to do; I'll post the items here (in case I forget):

  • support block range subscriptions;
  • support the server actively closing the connection;

@fjl (Contributor) commented Jun 22, 2023

I like @s1na's idea to not allow toBlock for subscription queries. The problem with using our subscription model is that there is no way for the server to signal the end of a subscription. We can fix that by changing how subscriptions work, but I think that will be a lot of work for everyone to support. Let's try the simple solution for now.

logChan, errChan := f.rangeLogsAsync(cctx)

// subscribe rmLogs
query := ethereum.FilterQuery{FromBlock: big.NewInt(n), ToBlock: big.NewInt(head), Addresses: crit.Addresses, Topics: crit.Topics}
Contributor:

Took me a minute to get this, but smart trick I love it!

Contributor:

Reality check: it's more complicated.

  • If the reorg happens in a future block that rangeLogsAsync hasn't processed yet, we don't really care about the event, since when rangeLogsAsync reaches that point it will follow the right chain by itself.
  • We can keep track of how far we are in delivery by checking the block number of incoming logs.
  • However, it's harder to say where the point of reorg is, because removed logs are sent in reverse order (from the most recent block backwards).
  • What we can do is compare removed logs as they come in to see whether they reach into the blocks for which we've already delivered logs.
  • At that point we have to stop rangeLogsAsync immediately. Though there's still the question of what should be done if we are mid-block (with some logs of that block remaining).
  • Then the subscription that sends removed logs will also send the replacement logs of the new chain, so we can just tune into that. (A rough sketch of this bookkeeping follows below.)
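
To make the bullet points above concrete, here is a minimal sketch of that bookkeeping. The function and channel names are illustrative, not the PR's actual code:

package filters

import "github.com/ethereum/go-ethereum/core/types"

// deliverWithReorgs is an illustrative sketch: forward historical logs and
// track a delivery watermark; once removed logs reach blocks that were
// already delivered, stop historical delivery and forward the removals.
func deliverWithReorgs(logChan <-chan *types.Log, reorgLogsCh <-chan []*types.Log, notify func(*types.Log)) {
    var (
        delivered uint64 // highest block whose logs were sent to the client
        reorged   bool   // set once a reorg touches the delivered range
    )
    for {
        select {
        case l, ok := <-logChan: // historical logs from the range query
            if !ok {
                return
            }
            if !reorged {
                notify(l)
                delivered = l.BlockNumber
            }
        case removed := <-reorgLogsCh: // removed logs arrive newest-first
            for _, l := range removed {
                if l.BlockNumber <= delivered {
                    reorged = true // the client already saw this block
                    notify(l)      // l.Removed is true, so the client can roll back
                }
                // Removals above the watermark need no action: the range
                // query will read the canonical chain when it gets there.
            }
        }
    }
}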

// We transmit all data that meets the following conditions:
// 1. no reorg has happened
// 2. a reorg has happened, but the log is in the remainder of the currently delivered block
if !reorged || log.BlockNumber < reorgBlock || log.BlockNumber <= delivered {
Contributor:

I don't understand this condition: log.BlockNumber <= delivered. Why did you add this?

Contributor (Author):

This ensures that all logs of the same block are sent out, to avoid delivering only a subset of a block's logs.

if len(logs) == 0 {
    continue
}
if reorgBlock == 0 {
Contributor (Author):

A reorg may have happened more than once between from and to, so I think we need to do this check for each <-reorgLogsCh message.

Contributor (Author):

I was wondering if we should distinguish between two different reorgs and one reorg delivered as a sequence of messages (the received logs may not be contiguous, due to the query filter) 🤔

Contributor (Author):

I came up with a scenario:

  1. from, to is 1 -> 10;
  2. delivered is 6;
  3. a reorg occurred between blocks 5-9; assume the logs are huge and get split into the following messages:
    i. Removed 5-7
    ii. Removed 8-9
    iii. Replaced 5-7
    iv. Replaced 8-9
  4. we receive a message from <-reorgLogsCh, set reorgBlock to 5, and send all removed logs between 5-6 to the subscriber;
  5. then we receive a log from <-logChan; because a reorg was detected, no logs are sent to the subscriber;
  6. next we may receive from <-reorgLogsCh or errChan: if the former comes first (messages ii, iii, iv above), everything works fine; but if the latter comes first, we miss the new logs that follow (Replaced 5-7 and 8-9).

Contributor:

Yes, you're right that considering two different reorgs adds another layer of complexity. I have not accounted for that in the changes I made.

Regarding the scenario: yes, I think we should stop errChan from breaking out of the loop while we're in the middle of processing a reorg (sketched below). But I don't know how to detect the end of a reorg yet either.
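
A hedged sketch of the guard being discussed (names are illustrative): the end-of-range error must not break the loop while removed logs are still being rolled back, otherwise the replacement logs of the new chain are lost. Detecting when the reorg has actually finished remains the open question.

package filters

import "github.com/ethereum/go-ethereum/core/types"

// historyLoop is an illustrative sketch, not the PR's code: when errChan
// fires mid-reorg, the case is disabled instead of returning, so that
// reorgLogsCh can still deliver the replacement logs.
func historyLoop(logChan <-chan *types.Log, reorgLogsCh <-chan []*types.Log, errChan <-chan error, notify func(*types.Log)) error {
    var (
        delivered uint64
        reorged   bool
    )
    for {
        select {
        case l, ok := <-logChan:
            if !ok {
                logChan = nil // range query finished; errChan carries the result
                continue
            }
            if !reorged {
                notify(l)
                delivered = l.BlockNumber
            }
        case logs := <-reorgLogsCh:
            for _, l := range logs {
                if l.BlockNumber <= delivered {
                    reorged = true
                    notify(l) // removed or replacement log for a delivered block
                }
            }
        case err := <-errChan:
            if reorged {
                // The historical range ended mid-reorg: ignore this signal
                // for now and keep draining reorgLogsCh. How to terminate
                // cleanly afterwards is exactly the open question above.
                errChan = nil
                continue
            }
            return err
        }
    }
}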

@jsvisa force-pushed the eth-filter-subscribe-history branch 2 times, most recently from 275897a to ca0c080 on July 25, 2023 13:56
Comment on lines +327 to +335
// histLogs retrieves logs older than current header.
func (api *FilterAPI) histLogs(notifier notifier, rpcSub *rpc.Subscription, from int64, crit FilterCriteria) error {
Contributor:

I think histLogs is a bit wonky. Is the benefit of reducing historicalLogs really worth it?

Comment on lines +282 to +289
if crit.FromBlock == nil {
return api.liveLogs(notifier, rpcSub, crit)
Contributor:

On input you take a ctx, but the ctx is lost: afaict you don't check for ctx timeouts or cancellation. Shouldn't you? What is the lifecycle here?

Contributor (Author):

The ctx will be canceled when the subscription goroutine returns, so we instantiate a new background context to control the workflow (see the sketch after this list).

The lifecycle is as follows:

  1. fetch and push the historical logs;
  2. push the live logs;
  3. terminate when the subscription is canceled or the push channel is broken.
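
A minimal sketch of that pattern, assuming a method shape similar to this PR's (the name subscribeHistoryThenLive is hypothetical): the per-call RPC context is replaced by a detached one whose lifetime is bound to the subscription itself.

package filters

import (
    "context"

    "github.com/ethereum/go-ethereum/rpc"
)

// subscribeHistoryThenLive is an illustrative sketch, not the PR's method:
// the per-call ctx dies when eth_subscribe returns, so a detached context
// drives the long-running work and is cancelled on unsubscribe/disconnect.
func (api *FilterAPI) subscribeHistoryThenLive(rpcSub *rpc.Subscription, crit FilterCriteria) {
    ctx, cancel := context.WithCancel(context.Background())
    defer cancel()
    go func() {
        <-rpcSub.Err() // closed when the client unsubscribes or disconnects
        cancel()
    }()

    // 1. fetch and push historical logs, driven by ctx
    // 2. switch to pushing live logs
    // 3. return once ctx is cancelled or the push channel breaks
    <-ctx.Done()
}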

reorgBlock := logs[0].BlockNumber
if !reorged && reorgBlock <= delivered {
    logger.Info("Reorg detected", "reorgBlock", reorgBlock, "delivered", delivered)
    reorged = true
Contributor:

This seems a bit strange. Once reorged is set, it never becomes unset again, and it discards all histLogs while in that mode. It seems to me that this big switch has three different modes:

  • reorged mode, where it discards historic logs
  • liveOnly mode
  • "normal" non-reorged mode

That makes it a bit complicated to follow, IMO.

Contributor (Author):

Once reorged is set, it never becomes unset again

Here we only need to handle the case where our historical delivery process is catching up with the live blocks.

liveOnly mode

Yeah, we did have another switch, liveOnly, which was set after all historical logs were delivered: https://github.com/ethereum/go-ethereum/blob/633c7ac67a6de35e000bdc72d51f1054f4e743b5/eth/filters/api.go#L373-L375

Contributor:

Seems to me that this big switch has three different modes:

Actually I managed to bring it down to 2 modes (674a888); now only the liveOnly mode is set. It will be set:

  • either when a reorg happens, to discard historical logs
  • or when historical log delivery has finished

But my patch has a problem. In the "reorg" mode we only send removed-log notifications up to the delivered point. So if we finish historical processing (say head = 10, so delivered = 10) and then, after emitting block 11, we get a reorg from block 8, we will only deliver the removed logs up to block 10.

@s1na (Contributor) commented Sep 18, 2023

Ok, I pushed another commit which should fix the problem above, and IMO the code is more readable now.

That said, I noticed the tests fail every once in a while. I believe the edge case they're catching is the following:

  • The blockchain first writes the new (reorged) blocks to the db and only then sends the removed/new logs. This can cause the history processor to accidentally continue onto the new chain before we realize a reorg is happening.
  • So we return oldBlock1, oldBlock2, newBlock3, removeOldBlock3, newBlock3, newBlock4, i.e. the removed logs don't match the ones we emitted for that height.

If my hunch about the issue is right, it will be a nasty one to fix.

Contributor:

I've pushed a fix for this. The subscription now tracks the block hashes it has delivered logs for. This map is used to avoid sending removed logs for blocks we haven't delivered, as well as to avoid sending duplicate logs.
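
A minimal sketch of that bookkeeping (the type and method names here are illustrative, not necessarily what the PR uses): a block hash is recorded once its logs have been pushed, duplicates are skipped, and removed-log notifications are only forwarded for recorded hashes.

package filters

import "github.com/ethereum/go-ethereum/common"

// deliveredSet is an illustrative sketch of the per-subscription map of
// block hashes for which logs have already been delivered.
type deliveredSet map[common.Hash]struct{}

// markDelivered records a block after its logs have been pushed and reports
// whether it was new; a false result means the block was already delivered
// and its logs should not be sent again.
func (d deliveredSet) markDelivered(hash common.Hash) bool {
    if _, seen := d[hash]; seen {
        return false
    }
    d[hash] = struct{}{}
    return true
}

// shouldRemove reports whether removed-log notifications for this block need
// forwarding: only blocks the client actually received logs for.
func (d deliveredSet) shouldRemove(hash common.Hash) bool {
    _, seen := d[hash]
    return seen
}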

jsvisa and others added 22 commits September 27, 2023 03:59
    hashes.Add(batch.hash, struct{}{})
}
for _, log := range logs[batch.start:batch.end] {
    log := log
Member:

Do we need to dereference the log here to copy it, e.g. log := *log?
Otherwise we just copy the pointer, if I understand correctly.
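
For context, a small hedged illustration of the difference (not part of the PR): log := log re-declares the loop variable, which matters for closures before Go 1.22, but both names still point at the same underlying struct; log := *log makes an independent value copy.

package main

import (
    "fmt"

    "github.com/ethereum/go-ethereum/core/types"
)

func main() {
    original := &types.Log{BlockNumber: 1}

    ptrCopy := original  // copies only the pointer
    valCopy := *original // copies the struct value

    original.BlockNumber = 2

    fmt.Println(ptrCopy.BlockNumber) // 2: still shares the underlying struct
    fmt.Println(valCopy.BlockNumber) // 1: an independent copy
}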

}

func (n *mockNotifier) Notify(id rpc.ID, data interface{}) error {
    n.c <- data
Contributor:

log := data.(*types.Log)
n.callback(log)
n.logs = append(n.logs, log)

func (n *mockNotifier) Closed() <-chan interface{} {
    return nil
}

Contributor:

func (n *mockNotifier) Done() {
   close(n.done)
}

and call Done from FilterAPI.
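
Pulling the test-helper pieces above together, a hedged sketch of what the suggested mockNotifier could look like (field and constructor names are illustrative):

package filters

import (
    "github.com/ethereum/go-ethereum/core/types"
    "github.com/ethereum/go-ethereum/rpc"
)

// mockNotifier is an illustrative consolidation of the helper discussed in
// this thread: it records delivered logs and exposes a done channel that
// the API closes through Done() when the subscription terminates.
type mockNotifier struct {
    logs []*types.Log
    done chan struct{}
}

func newMockNotifier() *mockNotifier {
    return &mockNotifier{done: make(chan struct{})}
}

func (n *mockNotifier) Notify(id rpc.ID, data interface{}) error {
    n.logs = append(n.logs, data.(*types.Log))
    return nil
}

func (n *mockNotifier) Closed() <-chan interface{} { return nil }

// Done lets the test wait for the subscription to finish via <-n.done.
func (n *mockNotifier) Done() { close(n.done) }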

defer func() {
    liveLogsSub.Unsubscribe()
    cancel()
}()
Contributor:

defer notifier.Done()

@cong08 commented Feb 1, 2024

Why not merge it?

@cong08 commented Feb 20, 2024

Why not merge it?

@karalabe @rjl493456442
