Replies: 2 comments 21 replies
-
is there an ID to join by? if yes, try session-window with a timeout? |
Beta Was this translation helpful? Give feedback.
-
There is an additional requirement we need to discuss for #2440: Correct handling of "late streams". By "late stream" I mean the scenario where we have two input sources that are individually ordered (or almost ordered, say, max 1s out of order). But the latencies of the two source streams are significantly different. Say stream A has latency of 10s and stream B can sometimes have allowed latency of up to 60s. So stream B events are allowed to be late of stream A events by 50s. My concerns is, that when joined in a vertex, stream A will progress the watermark to an extent that all B events become late and are dropped. With above example values, if we set This should of course generalize to joins with more than 2 inputs. |
Beta Was this translation helpful? Give feedback.
-
In continuation of the "ordered streams" workaround discussed in #2395 (by setting
readBatchSize:1
andscale.max: 1
).I have two ordered source streams that I want to combine into a single ordered stream using a join vertex. Which vertex type could support me writing a UDF that sorts the output events?
Beta Was this translation helpful? Give feedback.
All reactions