This repository has been archived by the owner on Mar 30, 2021. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 92
Running the ThriftServer with Sparklinedata components
Jordan Moore edited this page Oct 24, 2017
·
7 revisions
-
Download the latest sparklinedata jar from Releases
-
Copy the scripts from our Repo to your spark sbin directory
-
to start the thriftserver issue:
sbin/start-sparklinedatathriftserver.sh <location of the downloaded jar>
for example:
sbin/start-sparklinedatathriftserver.sh ~/Downloads/spark-druid-olap-0.0.3.jar
for yarn run
./start-sparklinedatathriftserver.sh ~/Downloads/spark-druid-olap-assembly-0.0.3.jar --master yarn
In the following we run thriftserver on yarn and with timezone set to UTC, point to custom spark.properties, and configure the FAIR scheduler
JAVA_TOOL_OPTIONS=-Duser.timezone=UTC sbin/start-sparklinedatathriftserver.sh accelerator_2.10-0.2.1-assembly.jar \
--driver-memory 19g --master yarn --deploy-mode client --conf spark.scheduler.mode=FAIR --properties-file sparkline.properties"
- To stop the thriftserver issue:
sbin/stop-sparklinedatathriftserver.sh
- Overview
- Quick Start
-
User Guide
- [Defining a DataSource on a Flattened Dataset](https://github.com/SparklineData/spark-druid-olap/wiki/Defining-a Druid-DataSource-on-a-Flattened-Dataset)
- Defining a Star Schema
- Sample Queries
- Approximate Count and Spatial Queries
- Druid Datasource Options
- Sparkline SQLContext Options
- Using Tableau with Sparkline
- How to debug a Query Plan?
- Running the ThriftServer with Sparklinedata components
- [Setting up multiple Sparkline ThriftServers - Load Balancing & HA] (https://github.com/SparklineData/spark-druid-olap/wiki/Setting-up-multiple-Sparkline-ThriftServers-(Load-Balancing-&-HA))
- Runtime Views
- Sparkline SQL extensions
- Sparkline Pluggable Modules
- Dev. Guide
- Reference Architectures
- Releases
- Cluster Spinup Tool
- TPCH Benchmark