SHS-NG M4.4: Port JobsTab and StageTab to the new backend. #10

vanzin · 2017-04-17T20:12:20Z

This change is a little larger because there's a whole lot of logic
behind these pages, all really tied to internal types and listeners.
There's also a lot of code that was moved to the new module.

Added missing StageData and ExecutorStageSummary fields which are
used by the UI. Some json golden files needed to be updated to account
for new fields.
Save RDD graph data in the store. This tries to re-use existing types as
much as possible, so that the code doesn't need to be re-written. So it's
probably not very optimal.
Some old classes (e.g. JobProgressListener) still remain, since they're used
in other parts of the code; they're not used by the UI anymore, though, and
will be cleaned up in a separate change.
Save information about active pools in the disk store; this could potentially
be avoided, since it's most probably not much data, but it makes it easier
later to add this kind of information to the API and to history if wanted.
Because the new store sorts things slightly differently from the previous
code, some json golden files had some elements within them shuffled around.
The retention unit test in UISeleniumSuite was disabled because the code
to throw away old stages / tasks hasn't been added yet. It's less of a
problem with the new store since it doesn't use memory, but it will be
added later to avoid a similar issue with unbound disk space usage.
The job description field in the API tries to follow the old behavior, which
makes it be empty most of the time, even though there's information to fill it
in. For stages, a new field was added to hold the description (which is basically
the job description), so that the UI can be rendered in the old way.
A new stage status ("SKIPPED") was added to account for the fact that the API
couldn't represent that state before. Because of the way the new code tracks
stages, they would end up showing up as "PENDING" in the UI.

TODO: add UIListener unit tests for the new fields.

This change is a little larger because there's a whole lot of logic behind these pages, all really tied to internal types and listeners. There's also a lot of code that was moved to the new module. - Added missing StageData and ExecutorStageSummary fields which are used by the UI. Some json golden files needed to be updated to account for new fields. - Save RDD graph data in the store. This tries to re-use existing types as much as possible, so that the code doesn't need to be re-written. So it's probably not very optimal. - Some old classes (e.g. JobProgressListener) still remain, since they're used in other parts of the code; they're not used by the UI anymore, though, and will be cleaned up in a separate change. - Save information about active pools in the disk store; this could potentially be avoided, since it's most probably not much data, but it makes it easier later to add this kind of information to the API and to history if wanted. - Because the new store sorts things slightly differently from the previous code, some json golden files had some elements within them shuffled around. - The retention unit test in UISeleniumSuite was disabled because the code to throw away old stages / tasks hasn't been added yet. - The job description field in the API tries to follow the old behavior, which makes it be empty most of the time, even though there's information to fill it in. For stages, a new field was added to hold the description (which is basically the job description), so that the UI can be rendered in the old way. - A new stage status ("SKIPPED") was added to account for the fact that the API couldn't represent that state before. Because of the way the new code tracks stages, they would end up showing up as "PENDING" in the UI.

…nput of UDF as double in the failed test in udf-aggregate_part1.sql ## What changes were proposed in this pull request? It still can be flaky on certain environments due to float limitation described at apache#25110 . See apache#25110 (comment) - https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test%20(Dashboard)/job/spark-master-test-maven-hadoop-2.7/6584/testReport/org.apache.spark.sql/SQLQueryTestSuite/udf_pgSQL_udf_aggregates_part1_sql___Regular_Python_UDF/ ``` Expected "700000000000[6] 1", but got "700000000000[5] 1" Result did not match for query #33
SELECT CAST(avg(udf(CAST(x AS DOUBLE))) AS long), CAST(udf(var_pop(CAST(x AS DOUBLE))) AS decimal(10,3))
FROM (VALUES (7000000000005), (7000000000007)) v(x) ``` Here;s what's going on: apache#25110 (comment) ``` scala> Seq("7000000000004.999", "7000000000006.999").toDF().selectExpr("CAST(avg(value) AS long)").show() +--------------------------+ |CAST(avg(value) AS BIGINT)| +--------------------------+ | 7000000000005| +--------------------------+ ``` Therefore, this PR just avoid to cast in the specific test. This is a temp fix. We need more robust way to avoid such cases. ## How was this patch tested? It passes with Maven in my local before/after this PR. I believe the problem seems similarly the Python or OS installed in the machine. I should test this against PR builder with `test-maven` for sure.. Closes apache#25128 from HyukjinKwon/SPARK-28270-2. Authored-by: HyukjinKwon <gurwls223@apache.org> Signed-off-by: HyukjinKwon <gurwls223@apache.org>

… Arrow on JDK9+ ### What changes were proposed in this pull request? This PR aims to add `io.netty.tryReflectionSetAccessible=true` to the testing configuration for JDK11 because this is an officially documented requirement of Apache Arrow. Apache Arrow community documented this requirement at `0.15.0` ([ARROW-6206](apache/arrow#5078)). > #### For java 9 or later, should set "-Dio.netty.tryReflectionSetAccessible=true". > This fixes `java.lang.UnsupportedOperationException: sun.misc.Unsafe or java.nio.DirectByteBuffer.(long, int) not available`. thrown by netty. ### Why are the changes needed? After ARROW-3191, Arrow Java library requires the property `io.netty.tryReflectionSetAccessible` to be set to true for JDK >= 9. After apache#26133, JDK11 Jenkins job seem to fail. - https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test%20(Dashboard)/job/spark-master-test-maven-hadoop-3.2-jdk-11/676/ - https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test%20(Dashboard)/job/spark-master-test-maven-hadoop-3.2-jdk-11/677/ - https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test%20(Dashboard)/job/spark-master-test-maven-hadoop-3.2-jdk-11/678/ ```scala Previous exception in task: sun.misc.Unsafe or java.nio.DirectByteBuffer.<init>(long, int) not available
 io.netty.util.internal.PlatformDependent.directBuffer(PlatformDependent.java:473)
 io.netty.buffer.NettyArrowBuf.getDirectBuffer(NettyArrowBuf.java:243)
 io.netty.buffer.NettyArrowBuf.nioBuffer(NettyArrowBuf.java:233)
 io.netty.buffer.ArrowBuf.nioBuffer(ArrowBuf.java:245)
 org.apache.arrow.vector.ipc.message.ArrowRecordBatch.computeBodyLength(ArrowRecordBatch.java:222)
 ``` ### Does this PR introduce any user-facing change? No. ### How was this patch tested? Pass the Jenkins with JDK11. Closes apache#26552 from dongjoon-hyun/SPARK-ARROW-JDK11. Authored-by: Dongjoon Hyun <dhyun@apple.com> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>

vanzin force-pushed the shs-ng/M4.4 branch from c5b98d4 to d53ceae Compare April 17, 2017 21:40

vanzin force-pushed the shs-ng/M4.3 branch from 933d094 to d41f35e Compare April 17, 2017 21:40

vanzin force-pushed the shs-ng/M4.4 branch from d53ceae to 5be07a0 Compare April 25, 2017 17:43

vanzin force-pushed the shs-ng/M4.3 branch from d41f35e to 3a23a38 Compare April 25, 2017 17:43

vanzin force-pushed the shs-ng/M4.4 branch from 5be07a0 to 75f0af5 Compare April 26, 2017 18:11

vanzin force-pushed the shs-ng/M4.3 branch from 3a23a38 to 17d6887 Compare April 26, 2017 18:11

vanzin force-pushed the shs-ng/M4.4 branch from 75f0af5 to ab8e655 Compare April 26, 2017 23:57

vanzin force-pushed the shs-ng/M4.3 branch from 17d6887 to 001c71a Compare April 26, 2017 23:57

vanzin force-pushed the shs-ng/M4.4 branch from ab8e655 to 192b0b9 Compare April 27, 2017 18:14

vanzin force-pushed the shs-ng/M4.3 branch from 001c71a to 409a201 Compare April 27, 2017 18:14

vanzin force-pushed the shs-ng/M4.4 branch from 192b0b9 to 05ad02a Compare April 27, 2017 21:31

vanzin force-pushed the shs-ng/M4.3 branch from 409a201 to 207df90 Compare April 27, 2017 21:31

vanzin force-pushed the shs-ng/M4.4 branch from 05ad02a to 8100137 Compare April 28, 2017 15:08

vanzin force-pushed the shs-ng/M4.3 branch from 207df90 to 69749b9 Compare April 28, 2017 15:08

vanzin force-pushed the shs-ng/M4.4 branch from 8100137 to b0dae26 Compare April 28, 2017 21:35

vanzin force-pushed the shs-ng/M4.3 branch from 69749b9 to 2dd88f6 Compare April 28, 2017 21:35

vanzin force-pushed the shs-ng/M4.4 branch from b0dae26 to f95d3cc Compare May 1, 2017 22:58

vanzin force-pushed the shs-ng/M4.3 branch from 2dd88f6 to 6d8e1b2 Compare May 1, 2017 22:58

vanzin force-pushed the shs-ng/M4.4 branch from f95d3cc to b22a8b7 Compare May 5, 2017 21:19

vanzin force-pushed the shs-ng/M4.3 branch from 6d8e1b2 to dcacfc7 Compare May 5, 2017 21:19

vanzin force-pushed the shs-ng/M4.4 branch from b22a8b7 to e72e7b7 Compare May 5, 2017 22:57

vanzin force-pushed the shs-ng/M4.3 branch from dcacfc7 to ceb9c6b Compare May 5, 2017 22:57

vanzin force-pushed the shs-ng/M4.4 branch from e72e7b7 to f607d5d Compare May 8, 2017 17:25

vanzin force-pushed the shs-ng/M4.3 branch from ceb9c6b to 57627d0 Compare May 8, 2017 17:25

vanzin force-pushed the shs-ng/M4.4 branch from f607d5d to 89def67 Compare May 9, 2017 01:08

vanzin force-pushed the shs-ng/M4.3 branch from 57627d0 to 405294b Compare May 9, 2017 01:08

vanzin force-pushed the shs-ng/M4.4 branch from 89def67 to 185b407 Compare May 15, 2017 20:44

vanzin force-pushed the shs-ng/M4.3 branch from 405294b to ed48cd6 Compare May 15, 2017 20:44

vanzin force-pushed the shs-ng/M4.4 branch from 185b407 to cc90ca8 Compare May 26, 2017 18:53

vanzin force-pushed the shs-ng/M4.3 branch from ed48cd6 to 4df8af1 Compare May 26, 2017 18:53

vanzin force-pushed the shs-ng/M4.4 branch from cc90ca8 to 64f7d76 Compare May 30, 2017 23:03

vanzin force-pushed the shs-ng/M4.3 branch from 4df8af1 to c5a17fd Compare May 30, 2017 23:03

vanzin closed this May 30, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SHS-NG M4.4: Port JobsTab and StageTab to the new backend. #10

SHS-NG M4.4: Port JobsTab and StageTab to the new backend. #10

vanzin commented Apr 17, 2017 •

edited

Loading

SHS-NG M4.4: Port JobsTab and StageTab to the new backend. #10

SHS-NG M4.4: Port JobsTab and StageTab to the new backend. #10

Conversation

vanzin commented Apr 17, 2017 • edited Loading

vanzin commented Apr 17, 2017 •

edited

Loading