Extend fast inequality join #630

anusudarsan · 2017-07-18T14:51:22Z

Follow up PR to extend functionality added in prestodb#7097. The tests result below. @losipiuk Please review.

PR branch

Benchmark                                                     (buckets)  (fastInequalityJoins)  (filterOutCoefficient)  Mode  Cnt     Score     Error  Units
BenchmarkInequalityJoin.benchmarkJoin                               100                   true                      10  avgt   30   234.191 ±  33.999  ms/op
BenchmarkInequalityJoin.benchmarkJoin                               100                  false                      10  avgt   30  2360.016 ± 189.837  ms/op
BenchmarkInequalityJoin.benchmarkJoin                              1000                   true                      10  avgt   30   187.426 ±  24.792  ms/op
BenchmarkInequalityJoin.benchmarkJoin                              1000                  false                      10  avgt   30   414.487 ±  27.297  ms/op
BenchmarkInequalityJoin.benchmarkJoin                             10000                   true                      10  avgt   30   198.977 ±  35.756  ms/op
BenchmarkInequalityJoin.benchmarkJoin                             10000                  false                      10  avgt   30   239.980 ±  18.026  ms/op
BenchmarkInequalityJoin.benchmarkJoin                             60000                   true                      10  avgt   30   173.009 ±   6.956  ms/op
BenchmarkInequalityJoin.benchmarkJoin                             60000                  false                      10  avgt   30   181.165 ±   8.649  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate        100                   true                      10  avgt   30   280.396 ±  17.991  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate        100                  false                      10  avgt   30  2376.180 ± 125.109  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate       1000                   true                      10  avgt   30   210.539 ±  11.744  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate       1000                  false                      10  avgt   30   471.721 ±  45.251  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate      10000                   true                      10  avgt   30   203.669 ±   9.373  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate      10000                  false                      10  avgt   30   259.281 ±  13.036  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate      60000                   true                      10  avgt   30   203.048 ±  10.953  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate      60000                  false                      10  avgt   30   199.349 ±   9.362  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate          100                   true                      10  avgt   30   216.786 ±  12.600  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate          100                  false                      10  avgt   30  2408.483 ±  97.140  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate         1000                   true                      10  avgt   30   195.763 ±  13.215  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate         1000                  false                      10  avgt   30   580.025 ± 120.265  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate        10000                   true                      10  avgt   30   226.885 ±  24.685  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate        10000                  false                      10  avgt   30   274.404 ±  18.313  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate        60000                   true                      10  avgt   30   197.643 ±  11.469  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate        60000                  false                      10  avgt   30   210.390 ±  15.268  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                 100                   true                      10  avgt   30   247.428 ±  11.549  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                 100                  false                      10  avgt   30  2487.442 ±  60.085  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                1000                   true                      10  avgt   30   240.810 ±  14.584  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                1000                  false                      10  avgt   30   527.124 ±  47.483  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin               10000                   true                      10  avgt   30   226.683 ±  11.559  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin               10000                  false                      10  avgt   30   270.130 ±  13.167  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin               60000                   true                      10  avgt   30   226.237 ±   8.305  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin               60000                  false                      10  avgt   30   218.149 ±   9.184  ms/op

sprint-59 branch

Benchmark                                                     (buckets)  (fastInequalityJoins)  (filterOutCoefficient)  Mode  Cnt     Score     Error  Units
BenchmarkInequalityJoin.benchmarkJoin                               100                   true                      10  avgt   30   222.267 ±  36.490  ms/op
BenchmarkInequalityJoin.benchmarkJoin                               100                  false                      10  avgt   30  2409.789 ± 193.371  ms/op
BenchmarkInequalityJoin.benchmarkJoin                              1000                   true                      10  avgt   30   192.559 ±  25.289  ms/op
BenchmarkInequalityJoin.benchmarkJoin                              1000                  false                      10  avgt   30   458.951 ±  56.590  ms/op
BenchmarkInequalityJoin.benchmarkJoin                             10000                   true                      10  avgt   30   186.968 ±  31.170  ms/op
BenchmarkInequalityJoin.benchmarkJoin                             10000                  false                      10  avgt   30   191.897 ±  13.314  ms/op
BenchmarkInequalityJoin.benchmarkJoin                             60000                   true                      10  avgt   30   153.550 ±  12.891  ms/op
BenchmarkInequalityJoin.benchmarkJoin                             60000                  false                      10  avgt   30   168.607 ±  10.826  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate        100                   true                      10  avgt   30   279.125 ±  23.798  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate        100                  false                      10  avgt   30  2375.963 ±  78.662  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate       1000                   true                      10  avgt   30   208.635 ±  12.900  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate       1000                  false                      10  avgt   30   479.930 ±  54.966  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate      10000                   true                      10  avgt   30   191.762 ±  12.959  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate      10000                  false                      10  avgt   30   240.564 ±  18.699  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate      60000                   true                      10  avgt   30   182.319 ±  13.622  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate      60000                  false                      10  avgt   30   190.407 ±   8.862  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate          100                   true                      10  avgt   30   193.858 ±  12.845  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate          100                  false                      10  avgt   30  2288.445 ±  55.931  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate         1000                   true                      10  avgt   30   206.152 ±  10.544  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate         1000                  false                      10  avgt   30   479.045 ±  54.701  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate        10000                   true                      10  avgt   30   207.194 ±  12.368  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate        10000                  false                      10  avgt   30   252.875 ±  15.385  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate        60000                   true                      10  avgt   30   178.251 ±   6.402  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate        60000                  false                      10  avgt   30   201.308 ±  12.452  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                 100                   true                      10  avgt   30  2435.688 ± 143.372  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                 100                  false                      10  avgt   30  2433.708 ±  64.086  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                1000                   true                      10  avgt   30   488.757 ±  33.252  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                1000                  false                      10  avgt   30   507.685 ±  34.516  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin               10000                   true                      10  avgt   30   263.682 ±  14.438  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin               10000                  false                      10  avgt   30   261.671 ±  14.073  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin               60000                   true                      10  avgt   30   216.391 ±  11.229  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin               60000                  false                      10  avgt   30   217.869 ±  11.015  ms/op

The PR extends the functionality to speed up query with range predicates eg: benchmarkRangePredicateJoin . But I added benchmark tests for other queries which were already addressed by the optimization. So you can see the comparison below with and without this optimization.

								(buckets)  (fastInequalityJoins)     (sprint-59)	    (PR branch)
BenchmarkInequalityJoin.benchmarkJoin 				100		true		222.267 ±  36.490  ms/op   234.191 ±  33.999  ms/op
BenchmarkInequalityJoin.benchmarkJoin 				100		false	       2409.789 ± 193.371  ms/op  2360.016 ± 189.837  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate	100		true		279.125 ±  23.798  ms/op   280.396 ±  17.991  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate	100		false	       2375.963 ±  78.662  ms/op  2376.180 ± 125.109  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate	100		true		193.858 ±  12.845  ms/op   216.786 ±  12.600  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithFunctionPredicate	100		false	       2288.445 ±  55.931  ms/op  2408.483 ±  97.140  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin		100		true	      2435.688 ± 143.372  ms/op	   247.428 ±  11.549  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin		100		false	      2433.708 ±  64.086  ms/op   2487.442 ±  60.085  ms/op

kokosing · 2017-07-18T19:54:10Z

@anu can you post what performance results you had before applying your patch?

kokosing

I just skimmed this so far. I need a closer look.

It is awesome that you worked on that!

kokosing · 2017-07-18T19:58:39Z

presto-benchmark/src/test/java/com/facebook/presto/benchmark/BenchmarkInequalityJoin.java

@@ -121,6 +121,13 @@ public void setUp()
                .execute("SELECT count(*) FROM t1 JOIN t2 on (t1.bucket = t2.bucket) AND t1.val1 < sin(t2.val2)");
    }

+    @Benchmark


should not it belong to previous commit?

kokosing · 2017-07-18T20:00:49Z

presto-main/src/main/java/com/facebook/presto/operator/JoinHashSupplier.java

@@ -90,9 +91,13 @@ public JoinHash get()
        // are not thread safe...
        Optional<JoinFilterFunction> filterFunction =
                filterFunctionFactory.map(factory -> factory.create(session.toConnectorSession(), addresses, channels));
+        List<JoinFilterFunction> filterExpressions = filterFunctionFactory.isPresent() ?


Use Optional.map as in the statement above instead of ternary operator

kokosing · 2017-07-18T20:12:10Z

presto-main/src/test/java/com/facebook/presto/sql/planner/TestSortExpressionExtractor.java

@@ -81,6 +82,61 @@ public void testGetSortExpression()
                        ComparisonExpressionType.GREATER_THAN,
                        new FunctionCall(QualifiedName.of("sin"), ImmutableList.of(new SymbolReference("b1"))),
                        new SymbolReference("p1")));
+
+        assertGetSortExpression(
+                new LogicalBinaryExpression(LogicalBinaryExpression.Type.OR,


instead of creating expressions by hand you can use something like:

expression("b1 > p1 OR b2 <= p1")

where expression is:

private Expression expression(String sql) { return rewriteIdentifiersToSymbolReferences(new SqlParser().createExpression(sql)); }

losipiuk · 2017-07-19T08:46:25Z

Why is that targeted for sprint-59 instead master

losipiuk

I can not grasp this PR. Could we meet so you explain to me what is happening here?

losipiuk · 2017-07-19T08:53:37Z

presto-benchmark/src/test/java/com/facebook/presto/benchmark/BenchmarkInequalityJoin.java

@@ -107,6 +107,20 @@ public void setUp()
                .execute("SELECT count(*) FROM t1 JOIN t2 on (t1.bucket = t2.bucket) WHERE t1.val1 < t2.val2");
    }

+    @Benchmark
+    public List<Page> benchmarkJoinWithExpressionPredicate(Context context)


I would explicitly state we are talking about arithmetics here. Maybe call method benchmarkJoinWithArithmeticInPredicate

It seems you renamed the wrong test method. The one with sin(). But left original name for test method with arithmetic.

losipiuk · 2017-07-19T08:55:58Z

presto-tests/src/main/java/com/facebook/presto/tests/AbstractTestQueries.java

@@ -2279,6 +2279,8 @@ public void testJoinWithLessThanInJoinClause()
                "VALUES -1");
        // test with only null value in build side
        assertQuery("SELECT b FROM nation n, (VALUES (0, NULL)) t(a, b) WHERE n.regionkey - 100 < t.b AND n.nationkey = t.a", "SELECT 1 WHERE FALSE");
+        // test with filter expression


Shouldn't comment here be sth like:
// test with function calls in predicate

losipiuk · 2017-07-19T08:56:07Z

presto-tests/src/main/java/com/facebook/presto/tests/AbstractTestQueries.java

@@ -2298,6 +2300,8 @@ public void testJoinWithGreaterThanInJoinClause()
                "VALUES -1");
        // test with only null value in build side
        assertQuery("SELECT b FROM nation n, (VALUES (0, NULL)) t(a, b) WHERE n.regionkey + 100 > t.b AND n.nationkey = t.a", "SELECT 1 WHERE FALSE");
+        // test with filter expression


losipiuk · 2017-07-19T12:19:58Z

presto-main/src/main/java/com/facebook/presto/sql/planner/SortExpressionExtractor.java

+        return new SortExpressionVisitor(buildSymbols).process(filter);
+    }
+
+    private static class SortExpressionVisitor


I pushed the fixup commit 518d7ea to PR branch which simplifies the extractor. See what you think.
I found the visitExpression logic with HashSet hard to follow. You can naturally put logic of supporting just AND with matching sort expressions within visitLogicalBinaryExpression.

Take a look at commit and see what you think.

losipiuk · 2017-07-19T12:21:58Z

presto-main/src/main/java/com/facebook/presto/sql/planner/SortExpressionExtractor.java

+           return new FilterExpressions().process(expression);
+        }
+
+        private static class FilterExpressions


Name it FilterExpressionsVisitor

losipiuk · 2017-07-19T12:28:31Z

presto-main/src/main/java/com/facebook/presto/sql/planner/SortExpressionExtractor.java

+            @Override
+            protected List<Expression> visitExpression(Expression expression, Void context)
+            {
+                List<Expression> filters = new ArrayList<>();


You can do it in more functional way (IMO more a bit more readable) as:

return expression.getChildren().stream() .flatMap(child -> process(child).stream()) .collect(toImmutableList());

losipiuk · 2017-07-19T13:19:49Z

presto-main/src/main/java/com/facebook/presto/operator/SortedPositionLinks.java

@@ -152,13 +153,14 @@ public Factory build()
            return new Factory()
            {
                @Override
-                public PositionLinks create(Optional<JoinFilterFunction> lessThanFunction)
+                public PositionLinks create(Optional<JoinFilterFunction> lessThanFunction, List<JoinFilterFunction> filterFunctions)


I do not understand why we keep old lessThanFunction and add a list of filterFunctions.

Would lessThanFunction not be on of the functions in filterFunctions list?

We use the lessThanFunction (a<b AND b< a+10 in case of range predicates), for the next() method. I can get rid of it and use filterFunctions ({a< b, b< a+10}) in a loop in next() method too (like I do in start()).

I would rename filterFunctions to inequalityFilterConjuncts or inequalityFilterExpressions and then either remove lessThanFunction.
Or rename it to combinedIneqalityFIlterExpressions and ensure that it is clear when this is constructed that it is just a conjunction of function passed in the other parameter.

anusudarsan · 2017-07-19T13:49:34Z

This is not going to sprint-59. I had the PR branch locally rebased on that. I will change it once the review is done

losipiuk · 2017-07-19T14:05:53Z

presto-main/src/main/java/com/facebook/presto/sql/planner/SortExpressionExtractor.java

+            @Override
+            protected List<Expression> visitLogicalBinaryExpression(LogicalBinaryExpression expression, Void context)
+            {
+                return extractConjuncts(expression);


This does not seem good.
It will work correctly for case like this
a < x AND a < x +10

But what if we have other conjuncts in filter function. Which are not in shape of
[sort_symbol] [<=|>=|<|>] [expression using probe side symbols>].

Then those conjuncts will be used as filter functions in SortedPositionLInks.start(). And they will not work fine when passed as lessThanFunction to binary search.

Am I right?
For sure we need tests for that.

as discussed added test in ATQ.

anusudarsan · 2017-07-20T22:10:07Z

addressed comments. @losipiuk @kokosing

kokosing · 2017-07-21T12:15:50Z

Can you please squash fixup commits?

anusudarsan · 2017-07-21T16:21:38Z

@kokosing done

losipiuk

Looks good. Some minor comments. Mostly concerning clarifications in tests.

losipiuk · 2017-07-24T12:22:59Z

presto-benchmark/src/test/java/com/facebook/presto/benchmark/BenchmarkInequalityJoin.java

@@ -107,6 +107,20 @@ public void setUp()
                .execute("SELECT count(*) FROM t1 JOIN t2 on (t1.bucket = t2.bucket) WHERE t1.val1 < t2.val2");
    }

+    @Benchmark
+    public List<Page> benchmarkJoinWithExpressionPredicate(Context context)


It seems you renamed the wrong test method. The one with sin(). But left original name for test method with arithmetic.

losipiuk · 2017-07-24T12:26:27Z

presto-tests/src/main/java/com/facebook/presto/tests/AbstractTestQueries.java

@@ -2279,6 +2279,8 @@ public void testJoinWithLessThanInJoinClause()
                "VALUES -1");
        // test with only null value in build side
        assertQuery("SELECT b FROM nation n, (VALUES (0, NULL)) t(a, b) WHERE n.regionkey - 100 < t.b AND n.nationkey = t.a", "SELECT 1 WHERE FALSE");
+        // test with function calls in predicate


Please explain in the comment what are you testing here. This is not supported case for inequality fast joins.

This case is supported by this optimization since the non-equi join condition occurs in the ON clause. updated the comment.

losipiuk · 2017-07-24T12:34:57Z

presto-tests/src/main/java/com/facebook/presto/tests/AbstractTestQueries.java

@@ -2305,6 +2305,22 @@ public void testJoinWithGreaterThanInJoinClause()
    }

    @Test
+    public void testJoinWithRangePredicatesinJoinClause()


In should be upper case.

losipiuk · 2017-07-24T12:38:12Z

presto-tests/src/main/java/com/facebook/presto/tests/AbstractTestQueries.java

@@ -2305,6 +2305,22 @@ public void testJoinWithGreaterThanInJoinClause()
    }

    @Test
+    public void testJoinWithRangePredicatesinJoinClause()
+    {
+        assertQuery("SELECT COUNT(*)\n" +


Please format the queries so every condition is in separate line. This is hard to read in this shape.

Also add comment what is this test testing.
I understand that it for testing regression in fast inequality join code.
And the code is working for this query because expressions used in equality conditions are rewritten to symbols. So actual expression in join is in shape of build_symbol < probe_expression.

What about seconde query?

losipiuk · 2017-07-24T12:59:26Z

presto-main/src/main/java/com/facebook/presto/operator/SortedPositionLinks.java

@@ -43,7 +42,8 @@
 *
 * filterFunction_1(...) OR filterFunction_2(....) OR ... OR filterFunction_n(...)
 *
- * To use lessThanFunction in this class, it must be an expression in form of:
+ * To use the elements in the list of filters inequalityJoinFilterConjuncts in this class,


Actually this Javadoc is out of sync with the codebase. As we are not supporting g(build_column, ....) but only build_column as sort expression. Maybe add a commit which puts that javadoc in sync with current implementation?

We do support g(build_column) as sort expression, provided the expression is pushed to the Scan node (eg: when the expression appears in the ON clause). Updated the doc accordingly.

Yeah - but in this case it is essentially a symbol when it comes to filter function in join.
I would rather keep in javadoc that g(b_symbol) is not supported. As this javadoc is describing what is supported locally.
It should not need to be updated anytime we add some optimizer in other place which transforms expression which does not have supported shape to one which has.

Then this Javadoc would be unmaintainble (if it is not such already :) )

losipiuk · 2017-07-24T13:05:43Z

presto-main/src/main/java/com/facebook/presto/operator/SortedPositionLinks.java

+        return startingPosition;
+    }
+
+    private int findStartPositionForFunction(int startingPosition, int probePosition, Page allProbeChannelsPage, JoinFilterFunction filterFunction)


Make paremeter ordering consistent with applyLessThanFunction. I.e. make filterFunction first parameter.

losipiuk · 2017-07-24T13:08:08Z

presto-main/src/main/java/com/facebook/presto/operator/SortedPositionLinks.java

    }

    @Override
    public int start(int startingPosition, int probePosition, Page allProbeChannelsPage)
    {
+        int count = inequalityJoinFilterConjuncts.size();
+        while (count > 0) {


Please use for loop here, too.

losipiuk · 2017-07-24T13:13:46Z

presto-main/src/main/java/com/facebook/presto/sql/gen/JoinFilterFunctionCompiler.java

@@ -112,7 +114,14 @@ public JoinFilterFunctionFactory compileJoinFilterFunction(RowExpression filter,
    private JoinFilterFunctionFactory internalCompileFilterFunctionFactory(RowExpression filterExpression, int leftBlocksSize, Optional<SortExpression> sortChannel)
    {
        Class<? extends InternalJoinFilterFunction> internalJoinFilterFunction = compileInternalJoinFilterFunction(filterExpression, leftBlocksSize);
-        return new IsolatedJoinFilterFunctionFactory(internalJoinFilterFunction, sortChannel);
+
+        List<Class<? extends InternalJoinFilterFunction>> internalInequalityJoinFilterConjuncts = new ArrayList<>();


What about using Optional.map()

List<Class<? extends InternalJoinFilterFunction>> internalInequalityJoinFilterConjuncts = sortChannel.map(channel -> channel.getInequalityJoinFilterConjuncts().stream() .map(rowExpression -> compileInternalJoinFilterFunction(rowExpression, leftBlocksSize)) .collect(toImmutableList())) .orElse(ImmutableMap.of());

If you do not like this please at least use ImmutableMap.of() instead new ArrayList(). We use ImmutableMap.of() as empty list by convention.

tried this, but had to change the type of internalInequalityJoinFilterConjuncts to ImmutableList which also needs changing the constructor of IsolatedJoinFilterFunctionFactory. So keeping it as-is.

losipiuk · 2017-07-24T13:24:58Z

presto-main/src/main/java/com/facebook/presto/sql/gen/JoinFilterFunctionCompiler.java

@@ -71,6 +72,7 @@
 import static com.facebook.presto.sql.gen.TryCodeGenerator.defineTryMethod;
 import static com.google.common.base.MoreObjects.toStringHelper;
 import static com.google.common.base.Verify.verify;
+import static com.google.common.collect.ImmutableList.toImmutableList;


JoinFilterCacheKey currently has sortChannel field?
It seems unnecessary to me. Anyway, it should either be removed. Or taken into consideration in equals and hashCode.

It is not related strictly to this PR but maybe you could fix that that along the way as you are working on this class anyway. As a separate commit of course :).

The sortChannel is still being used to call internalCompileFilterFunctionFactory. So added it to equals and hashcode

losipiuk · 2017-07-24T13:28:13Z

presto-main/src/main/java/com/facebook/presto/sql/planner/SortExpressionExtractor.java

 import static com.google.common.collect.ImmutableSet.toImmutableSet;
 import static java.util.Objects.requireNonNull;

 /**
 * Currently this class handles only simple expressions like:
- *
+ * <p>


Not up to date. Probe side expression can be aribtrary one. Not just symbol.

Added a new javadoc commit to reflect what was supported, and later added info about range predicate support.

losipiuk · 2017-07-24T13:31:19Z

And one more question. How did evaluating conjuncts one by one influence performance?

anusudarsan · 2017-07-24T15:32:35Z

no regression after evaluating conjuncts one by one . Here is the result:

Benchmark                                                       (buckets)  (fastInequalityJoins)  (filterOutCoefficient)  Mode  Cnt     Score     Error  Units
BenchmarkInequalityJoin.benchmarkJoin                                 100                   true                      10  avgt   30   187.515 ±   9.338  ms/op
BenchmarkInequalityJoin.benchmarkJoin                                 100                  false                      10  avgt   30  2372.887 ±  75.762  ms/op
BenchmarkInequalityJoin.benchmarkJoin                                1000                   true                      10  avgt   30   174.968 ±  10.317  ms/op
BenchmarkInequalityJoin.benchmarkJoin                                1000                  false                      10  avgt   30   431.399 ±  43.748  ms/op
BenchmarkInequalityJoin.benchmarkJoin                               10000                   true                      10  avgt   30   173.501 ±   7.109  ms/op
BenchmarkInequalityJoin.benchmarkJoin                               10000                  false                      10  avgt   30   209.469 ±  13.789  ms/op
BenchmarkInequalityJoin.benchmarkJoin                               60000                   true                      10  avgt   30   169.522 ±   6.670  ms/op
BenchmarkInequalityJoin.benchmarkJoin                               60000                  false                      10  avgt   30   180.787 ±  18.943  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithArithmeticInPredicate        100                   true                      10  avgt   30   224.066 ±  30.257  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithArithmeticInPredicate        100                  false                      10  avgt   30  2468.211 ± 119.020  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithArithmeticInPredicate       1000                   true                      10  avgt   30   210.219 ±   9.918  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithArithmeticInPredicate       1000                  false                      10  avgt   30   457.773 ±  47.746  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithArithmeticInPredicate      10000                   true                      10  avgt   30   204.778 ±   8.744  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithArithmeticInPredicate      10000                  false                      10  avgt   30   268.531 ±  39.687  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithArithmeticInPredicate      60000                   true                      10  avgt   30   208.116 ±  16.148  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithArithmeticInPredicate      60000                  false                      10  avgt   30   201.309 ±  10.366  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate          100                   true                      10  avgt   30   273.970 ±  19.601  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate          100                  false                      10  avgt   30  2431.654 ±  86.431  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate         1000                   true                      10  avgt   30   216.315 ±  11.381  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate         1000                  false                      10  avgt   30   465.993 ±  28.393  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate        10000                   true                      10  avgt   30   208.250 ±  12.453  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate        10000                  false                      10  avgt   30   248.567 ±  15.304  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate        60000                   true                      10  avgt   30   198.138 ±   9.732  ms/op
BenchmarkInequalityJoin.benchmarkJoinWithExpressionPredicate        60000                  false                      10  avgt   30   201.292 ±   8.834  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                   100                   true                      10  avgt   30   254.792 ±  14.055  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                   100                  false                      10  avgt   30  2528.831 ±  48.746  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                  1000                   true                      10  avgt   30   234.692 ±   9.586  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                  1000                  false                      10  avgt   30   473.772 ±  33.844  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                 10000                   true                      10  avgt   30   226.928 ±   8.116  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                 10000                  false                      10  avgt   30   284.204 ±  18.327  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                 60000                   true                      10  avgt   30   221.052 ±   6.519  ms/op
BenchmarkInequalityJoin.benchmarkRangePredicateJoin                 60000                  false                      10  avgt   30   215.020 ±   9.656  ms/op

I will address the rest of the comments.

anusudarsan

addressed comments @losipiuk

anusudarsan · 2017-07-25T15:39:38Z

presto-main/src/main/java/com/facebook/presto/operator/SortedPositionLinks.java

@@ -43,7 +42,8 @@
 *
 * filterFunction_1(...) OR filterFunction_2(....) OR ... OR filterFunction_n(...)
 *
- * To use lessThanFunction in this class, it must be an expression in form of:
+ * To use the elements in the list of filters inequalityJoinFilterConjuncts in this class,


We do support g(build_column) as sort expression, provided the expression is pushed to the Scan node (eg: when the expression appears in the ON clause). Updated the doc accordingly.

anusudarsan · 2017-07-25T15:39:58Z

presto-benchmark/src/test/java/com/facebook/presto/benchmark/BenchmarkInequalityJoin.java

@@ -107,6 +107,20 @@ public void setUp()
                .execute("SELECT count(*) FROM t1 JOIN t2 on (t1.bucket = t2.bucket) WHERE t1.val1 < t2.val2");
    }

+    @Benchmark
+    public List<Page> benchmarkJoinWithExpressionPredicate(Context context)


anusudarsan · 2017-07-25T16:03:28Z

presto-main/src/main/java/com/facebook/presto/sql/planner/SortExpressionExtractor.java

 import static com.google.common.collect.ImmutableSet.toImmutableSet;
 import static java.util.Objects.requireNonNull;

 /**
 * Currently this class handles only simple expressions like:
- *
+ * <p>


Added a new javadoc commit to reflect what was supported, and later added info about range predicate support.

anusudarsan · 2017-07-25T21:09:49Z

presto-main/src/main/java/com/facebook/presto/sql/gen/JoinFilterFunctionCompiler.java

@@ -71,6 +72,7 @@
 import static com.facebook.presto.sql.gen.TryCodeGenerator.defineTryMethod;
 import static com.google.common.base.MoreObjects.toStringHelper;
 import static com.google.common.base.Verify.verify;
+import static com.google.common.collect.ImmutableList.toImmutableList;


The sortChannel is still being used to call internalCompileFilterFunctionFactory. So added it to equals and hashcode

anusudarsan · 2017-07-25T21:11:19Z

presto-tests/src/main/java/com/facebook/presto/tests/AbstractTestQueries.java

@@ -2279,6 +2279,8 @@ public void testJoinWithLessThanInJoinClause()
                "VALUES -1");
        // test with only null value in build side
        assertQuery("SELECT b FROM nation n, (VALUES (0, NULL)) t(a, b) WHERE n.regionkey - 100 < t.b AND n.nationkey = t.a", "SELECT 1 WHERE FALSE");
+        // test with function calls in predicate


This case is supported by this optimization since the non-equi join condition occurs in the ON clause. updated the comment.

anusudarsan · 2017-07-25T21:28:49Z

presto-main/src/main/java/com/facebook/presto/sql/gen/JoinFilterFunctionCompiler.java

@@ -112,7 +114,14 @@ public JoinFilterFunctionFactory compileJoinFilterFunction(RowExpression filter,
    private JoinFilterFunctionFactory internalCompileFilterFunctionFactory(RowExpression filterExpression, int leftBlocksSize, Optional<SortExpression> sortChannel)
    {
        Class<? extends InternalJoinFilterFunction> internalJoinFilterFunction = compileInternalJoinFilterFunction(filterExpression, leftBlocksSize);
-        return new IsolatedJoinFilterFunctionFactory(internalJoinFilterFunction, sortChannel);
+
+        List<Class<? extends InternalJoinFilterFunction>> internalInequalityJoinFilterConjuncts = new ArrayList<>();


tried this, but had to change the type of internalInequalityJoinFilterConjuncts to ImmutableList which also needs changing the constructor of IsolatedJoinFilterFunctionFactory. So keeping it as-is.

anusudarsan · 2017-07-25T21:39:28Z

presto-main/src/main/java/com/facebook/presto/operator/SortedPositionLinks.java

@@ -211,17 +210,35 @@ public int next(int position, int probePosition, Page allProbeChannelsPage)
            return -1;
        }
        // break a position links chain if next position should be filtered out
-        if (applyLessThanFunction(nextPosition, probePosition, allProbeChannelsPage)) {
-            return nextPosition;
+        int count = inequalityJoinFilterConjuncts.size();


anusudarsan · 2017-07-26T14:36:44Z

presto-main/src/main/java/com/facebook/presto/sql/gen/JoinFilterFunctionCompiler.java

-                    .map(rowExpression -> compileInternalJoinFilterFunction(rowExpression, leftBlocksSize))
-                    .collect(toImmutableList());
-        }
+        List<? extends Class<? extends InternalJoinFilterFunction>> internalInequalityJoinFilterConjuncts = sortChannel.map(channel -> channel.getInequalityJoinFilterConjuncts().stream()


updated. thanks @findepi

losipiuk · 2017-07-26T16:34:32Z

Travis is red. Please change base of the PR to prestodb/master. I think you can do it without creating new one.

Update docs to reflect what is currently supported

anusudarsan · 2017-07-26T20:55:04Z

travis is green. rebased on master and squashed the commits.

losipiuk · 2017-07-26T22:52:49Z

Thanks, I will do one last pass tomorrow and merge if everything looks good.

losipiuk · 2017-07-27T12:34:33Z

presto-main/src/main/java/com/facebook/presto/operator/SortedPositionLinks.java

@@ -151,13 +152,10 @@ public Factory build()
                }
            }

-            return lessThanFunction -> {
-                checkState(lessThanFunction.isPresent(), "Using SortedPositionLinks without lessThanFunction");


please reintroduce the check in new code checking if provided List is not empty.

Or actually it can be moved to SortedPositionLinks constructor.

done. closing this and opening a PR upstream

The sorted position links is searched for each of the expression in the range predicate. Thus this optimization works only for predicates with AND (conjuncts). The iteration over the position links is stopped as soon as each of the filter expression is false.

anusudarsan · 2017-07-27T13:27:39Z

prestodb#8614 created. closing this.

anusudarsan requested a review from losipiuk July 18, 2017 14:52

kokosing suggested changes Jul 18, 2017

View reviewed changes

losipiuk reviewed Jul 19, 2017

View reviewed changes

anusudarsan force-pushed the extend-fast-inequality-join branch 3 times, most recently from cbc96f1 to ce681df Compare July 20, 2017 22:02

anusudarsan force-pushed the extend-fast-inequality-join branch from ce681df to da032be Compare July 21, 2017 15:05

losipiuk suggested changes Jul 24, 2017

View reviewed changes

anusudarsan force-pushed the extend-fast-inequality-join branch 2 times, most recently from f7e97a4 to 1c62815 Compare July 25, 2017 21:35

anusudarsan commented Jul 25, 2017

View reviewed changes

anusudarsan force-pushed the extend-fast-inequality-join branch from 1c62815 to 66ae46c Compare July 26, 2017 14:33

anusudarsan commented Jul 26, 2017

View reviewed changes

anusudarsan added 4 commits July 26, 2017 14:13

Remove unused field from inequality join optimization

9c76d45

Update javadoc for SortExpression in inequality join optimization

e75bb27

Update docs to reflect what is currently supported

Update JoinFilterCacheKey equals and hashCode

db4e6d9

Add tests for non-equi join condition optimization

f31459c

anusudarsan force-pushed the extend-fast-inequality-join branch from 66ae46c to da718de Compare July 26, 2017 19:30

anusudarsan changed the base branch from sprint-59 to master July 26, 2017 19:30

anusudarsan force-pushed the extend-fast-inequality-join branch from da718de to 1f0d3bf Compare July 26, 2017 19:32

Teradata deleted a comment from kokosing Jul 26, 2017

losipiuk reviewed Jul 27, 2017

View reviewed changes

anusudarsan added 2 commits July 27, 2017 09:18

Refactor unit tests to use expression utility

7bf10af

anusudarsan force-pushed the extend-fast-inequality-join branch from 1f0d3bf to 7bf10af Compare July 27, 2017 13:19

anusudarsan closed this Jul 27, 2017

anusudarsan mentioned this pull request Jul 27, 2017

Extend fast inequality join prestodb/presto#8614

Merged

Extend fast inequality join #630

Extend fast inequality join #630

Conversation

anusudarsan commented Jul 18, 2017 • edited Loading

kokosing commented Jul 18, 2017

kokosing left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

losipiuk commented Jul 19, 2017

losipiuk left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

anusudarsan Jul 19, 2017 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

anusudarsan commented Jul 19, 2017

losipiuk Jul 19, 2017 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

anusudarsan commented Jul 20, 2017

kokosing commented Jul 21, 2017

anusudarsan commented Jul 21, 2017

losipiuk left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

losipiuk commented Jul 24, 2017

anusudarsan commented Jul 24, 2017

anusudarsan left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

losipiuk commented Jul 26, 2017 • edited Loading

anusudarsan commented Jul 26, 2017

losipiuk commented Jul 26, 2017 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

anusudarsan commented Jul 27, 2017

anusudarsan commented Jul 18, 2017 •

edited

Loading

anusudarsan Jul 19, 2017 •

edited

Loading

losipiuk Jul 19, 2017 •

edited

Loading

losipiuk commented Jul 26, 2017 •

edited

Loading

losipiuk commented Jul 26, 2017 •

edited

Loading