Add Reflection to support custom Spark Implementation at Runtime #1362

Merged: 4 commits merged into NVIDIA:dev from rapids-tools-1360 on Oct 7, 2024

Conversation

@amahussein (Collaborator) commented on Sep 27, 2024

Signed-off-by: Ahmed Hussein ahussein@nvidia.com

Fixes #1360

Adds a workaround to run against both open-source Spark and custom Spark implementations that override the constructors of the Graph objects.

This PR should also improve performance in the Databricks environment.
Before the change, we used to try the normal Spark API call, catch the resulting Java exception, and finally load the constructor method via reflection and call it with the correct arguments.
After the change, the tool detects the runtime and uses reflection every time to call the constructor, which removes the exception-handling overhead from each object allocation.
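To illustrate the shift, here is a minimal, self-contained sketch of the pattern (the Widget class and all names below are hypothetical stand-ins for the Spark graph classes, not the PR's actual code): resolve the constructor once via reflection at initialization time, then invoke it on every allocation without any per-call try/catch fallback.

import java.lang.reflect.Constructor

// Hypothetical stand-in for a Spark graph class whose constructor
// signature may differ across runtimes.
class Widget(val id: Long, val name: String)

object WidgetBuilder {
  // Resolve the constructor once, when the runtime is detected...
  private val ctor: Constructor[Widget] =
    classOf[Widget].getConstructor(classOf[Long], classOf[String])

  // ...so each allocation is a single reflective invoke, with no
  // exception handling on the hot path.
  def construct(id: Long, name: String): Widget =
    ctor.newInstance(java.lang.Long.valueOf(id), name)
}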

Signed-off-by: Ahmed Hussein <ahussein@nvidia.com>

Fixes NVIDIA#1360

Adds a workaround to run against open source Spark and custom Spark
implementation that overrides the constructors of Graph objects.
@amahussein added the labels core_tools (Scope the core module (scala)), build, and dependencies (Pull requests that update a dependency file) on Sep 27, 2024
@amahussein amahussein self-assigned this Sep 27, 2024
import scala.annotation.StaticAnnotation
import scala.annotation.meta.{beanGetter, beanSetter, field, getter, param, setter}


Reviewer (Collaborator): nit: extra newline

@@ -0,0 +1,32 @@
/*
Reviewer (Collaborator): This feels like it shouldn't be in the stubs directory; it's a critical class and there is no platform-specific version, correct?

@amahussein (Author): Moved it to the same package.

case _: java.lang.NoSuchMethodError =>
dbRuntimeReflection.constructCluster(id, name, desc, nodes, metrics)
}
GraphReflectionAPIHelper.api.get.constructCluster(id, name, desc, nodes, metrics)
Reviewer (Collaborator): These (GraphReflectionAPIHelper.api) are all Options, and we just call get on them here without an else branch or any check, so is there any reason for this to be an Option?

@amahussein (Author): I refactored the code to define the graphBuilder method as a field whose value depends on the initialization of the APIs.
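A rough, hypothetical sketch of that refactor (illustrative names only; DATABRICKS_RUNTIME_VERSION is a standard environment variable on Databricks clusters, but the detection logic here is an assumption, not the PR's code): the decision is made once at initialization, so callers hold a plain field instead of unwrapping an Option with get at every call site.

trait GraphBuilderAPI {
  def constructCluster(id: Long, name: String): String
}

object OpenSourceBuilder extends GraphBuilderAPI {
  def constructCluster(id: Long, name: String): String = s"oss-cluster($id: $name)"
}

object DatabricksBuilder extends GraphBuilderAPI {
  def constructCluster(id: Long, name: String): String = s"db-cluster($id: $name)"
}

object GraphReflectionAPIHelper {
  // Detect the runtime once, at class initialization.
  private val onDatabricks = sys.env.contains("DATABRICKS_RUNTIME_VERSION")

  // A plain field bound to exactly one implementation; no Option, no .get.
  val graphBuilder: GraphBuilderAPI =
    if (onDatabricks) DatabricksBuilder else OpenSourceBuilder
}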

@parthosa (Collaborator) left a comment:

When using the Python tools CLI, we download open-source Spark and provide it to the JAR tool. This happens even if we are running the tools inside a Databricks environment.

In that case, do we need a follow-up on the Python side to use the custom runtime jars instead of open-source Spark?

Signed-off-by: Ahmed Hussein <ahussein@nvidia.com>
@amahussein (Author) left a comment:

Thanks @tgravescs!
I updated the code and added a code comment pointing to the relevant Scala 2.12 issue.

Signed-off-by: Ahmed Hussein <ahussein@nvidia.com>
@amahussein (Author) commented:

> When using the Python tools CLI, we download open-source Spark and provide it to the JAR tool. This happens even if we are running the tools inside a Databricks environment.
>
> In that case, do we need a follow-up on the Python side to use the custom runtime jars instead of open-source Spark?

We have issue #1359 to allow customers to specify their own dependencies.
The open-source jars work fine on DB, so we can leave it as it is for now.
The trick with setting the dependencies for each individual CSP is that we would have to redefine the YAML configuration assuming the user is running on a cluster node and, for each CSP, set the path of the Spark jars on the local disk.
Currently, we do not need to do all that unless we learn that it affects QualX significantly (which it does not at the moment).

Signed-off-by: Ahmed Hussein <ahussein@nvidia.com>

// for 10.4 it is only one constructor with 3 arguments.
// final java.lang.String name, final long accumulatorId, final java.lang.String metricType
private val isDB104OrOlder: Boolean = constr.paramLists.flatten.size < 4
Reviewer (Collaborator): Question: if arguments are added or removed in future DB versions, should we add a flag or shim it?

@amahussein (Author): Yes, when DB changes its API we will revisit the implementation and write an extension instead of relying on a flag. I did not want to change that part since it was working fine and there was no need to change it.
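For reference, the paramLists check quoted above can be reproduced with plain Scala runtime reflection. A small self-contained sketch (ThreeArg is a hypothetical class mirroring the 3-argument DB 10.4 constructor, not the tool's actual class):

import scala.reflect.runtime.{universe => ru}

// Hypothetical class mirroring the 3-argument constructor mentioned above.
class ThreeArg(name: String, accumulatorId: Long, metricType: String)

object ParamCountExample {
  // Count the parameters of a class's primary constructor.
  def ctorParamCount[T: ru.TypeTag]: Int = {
    val ctor = ru.typeOf[T].decl(ru.termNames.CONSTRUCTOR).asMethod
    ctor.paramLists.flatten.size
  }

  def main(args: Array[String]): Unit = {
    // Fewer than 4 parameters maps to the isDB104OrOlder branch.
    println(ctorParamCount[ThreeArg] < 4) // prints: true
  }
}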

@amahussein amahussein merged commit b053680 into NVIDIA:dev Oct 7, 2024
14 checks passed
@amahussein amahussein deleted the rapids-tools-1360 branch October 7, 2024 14:13
Labels: build, core_tools (Scope the core module (scala)), dependencies (Pull requests that update a dependency file)
Projects: None yet
Development: Successfully merging this pull request may close issue #1360: [FEA] Running Tools on custom Spark raises runtime exception