Running the ThriftServer with Sparklinedata components

Download the latest sparklinedata jar from Releases.
Copy the scripts from our repo to your Spark sbin directory.
To start the thriftserver, issue:

sbin/start-sparklinedatathriftserver.sh <location of the downloaded jar>

For example:

sbin/start-sparklinedatathriftserver.sh ~/Downloads/spark-druid-olap-0.0.3.jar
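
A mistyped jar path is an easy mistake at this step, so it can help to check the file before launching. The wrapper below is a minimal sketch, not part of the Sparklinedata scripts: the function name start_sparkline_thriftserver is hypothetical, and SPARK_HOME is assumed to point at the Spark installation whose sbin directory holds the copied scripts.

```shell
# Hypothetical helper: refuse to start if the jar path is missing or unreadable.
start_sparkline_thriftserver() {
  jar="$1"
  if [ -z "$jar" ] || [ ! -r "$jar" ]; then
    echo "jar not found or not readable: ${jar:-<none>}" >&2
    return 1
  fi
  shift
  # Assumption: SPARK_HOME is set and the start script was copied into its sbin directory.
  # Any remaining arguments (for example --master yarn) are passed through unchanged.
  "${SPARK_HOME:?SPARK_HOME is not set}/sbin/start-sparklinedatathriftserver.sh" "$jar" "$@"
}
```

It would be called the same way as the script itself, for example start_sparkline_thriftserver ~/Downloads/spark-druid-olap-0.0.3.jar.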

To run on YARN:

./start-sparklinedatathriftserver.sh ~/Downloads/spark-druid-olap-assembly-0.0.3.jar --master yarn

The following example runs the thriftserver on YARN in client mode with the JVM timezone set to UTC, points it at a custom Spark properties file, and configures the FAIR scheduler:

JAVA_TOOL_OPTIONS=-Duser.timezone=UTC sbin/start-sparklinedatathriftserver.sh accelerator_2.10-0.2.1-assembly.jar \
--driver-memory 19g --master yarn --deploy-mode client --conf spark.scheduler.mode=FAIR --properties-file sparkline.properties
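
The settings passed as flags above correspond to standard Spark properties, so they could instead be kept in the properties file. The snippet below is a sketch under that assumption. The actual contents of sparkline.properties are not shown on this page, and the file name sparkline-example.properties is hypothetical, chosen so that no existing file is overwritten.

```shell
# Illustrative only: writes a sample properties file with the same settings
# as the command-line flags (--master, --deploy-mode, --driver-memory, spark.scheduler.mode).
# The file name is an assumption, not something shipped with Sparklinedata.
cat > sparkline-example.properties <<'EOF'
spark.master                yarn
spark.submit.deployMode     client
spark.driver.memory         19g
spark.scheduler.mode        FAIR
EOF
```

With a file like this, the start command would only need the jar and --properties-file sparkline-example.properties. The timezone would still be set through JAVA_TOOL_OPTIONS as shown above.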

To stop the thriftserver, issue:

sbin/stop-sparklinedatathriftserver.sh

