# Fast BI using Spark and Druid
This project is aimed at two classes of users (a short usage sketch follows the links below):
- Users of Druid who want SQL access to their indexes and want to use traditional BI tools such as Tableau with Druid
- Spark and Hive users who find the performance of their interactive BI queries painfully slow
- Quick Start
- Using Tableau with Sparkline
- The Druid project
- Spark
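The core idea is that a Druid index is exposed to Spark as an ordinary SQL table, so BI-style aggregations can be pushed down to Druid. The following is a minimal sketch only: the table, column, and option names (`org.sparklinedata.druid`, `sourceDataframe`, `timeDimensionColumn`, `druidDatasource`, `druidHost`) are illustrative; the authoritative DDL and options are documented in the User Guide pages linked below, and the project's own context setup is covered under "Sparkline SQLContext Options".

```scala
// Minimal sketch, using a plain SparkSession; option names are illustrative.
import org.apache.spark.sql.SparkSession

object DruidQuerySketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("sparkline-druid-sketch")
      .getOrCreate()

    // Hypothetical DDL: register a Druid-backed table via the Sparkline data source.
    // See "Defining a DataSource on a Flattened Dataset" and "Druid Datasource Options".
    spark.sql(
      """CREATE TEMPORARY TABLE orderLineItemPartSupplier
        |USING org.sparklinedata.druid
        |OPTIONS (
        |  sourceDataframe "orderLineItemPartSupplierBase",
        |  timeDimensionColumn "l_shipdate",
        |  druidDatasource "tpch",
        |  druidHost "localhost"
        |)""".stripMargin)

    // A typical BI-style aggregation over TPCH lineitem columns; the planner
    // is expected to push this down to Druid rather than scan raw data in Spark.
    spark.sql(
      """SELECT l_returnflag, l_linestatus, SUM(l_extendedprice) AS revenue
        |FROM orderLineItemPartSupplier
        |GROUP BY l_returnflag, l_linestatus""".stripMargin)
      .show()
  }
}
```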
## Indexing
- Indexing TPCH data as an example.
- Setting up Druid.
- Sample data set for TPCH.
## Querying data from Spark
- Set up Thrift Server connections so you can use SQuirreL SQL, RazorSQL, Zeppelin, or Tableau against the datasets.
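As a quick illustration of such a connection, here is a hedged sketch of talking to the Thrift Server over JDBC from Scala. It assumes the server is listening on the default HiveServer2 port 10000 on localhost and that the Hive JDBC driver is on the classpath; the same `jdbc:hive2://` URL is what GUI tools like SQuirreL SQL, RazorSQL, Zeppelin, or Tableau (via the Spark SQL/Hive connectors) would be configured with.

```scala
// Minimal sketch: connect to a (Sparkline) Spark Thrift Server over the
// HiveServer2 JDBC protocol and list the registered tables.
// Host, port, and database are placeholders for your deployment.
import java.sql.DriverManager

object ThriftServerSketch {
  def main(args: Array[String]): Unit = {
    // Hive JDBC driver (org.apache.hive:hive-jdbc must be on the classpath).
    Class.forName("org.apache.hive.jdbc.HiveDriver")

    val conn = DriverManager.getConnection("jdbc:hive2://localhost:10000/default", "", "")
    try {
      val stmt = conn.createStatement()
      val rs = stmt.executeQuery("SHOW TABLES")
      while (rs.next()) {
        println(rs.getString(1))
      }
    } finally {
      conn.close()
    }
  }
}
```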
- Overview
- Quick Start
- User Guide
- [Defining a DataSource on a Flattened Dataset](https://github.com/SparklineData/spark-druid-olap/wiki/Defining-a-Druid-DataSource-on-a-Flattened-Dataset)
- Defining a Star Schema
- Sample Queries
- Approximate Count and Spatial Queries
- Druid Datasource Options
- Sparkline SQLContext Options
- Using Tableau with Sparkline
- How to debug a Query Plan?
- Running the ThriftServer with Sparklinedata components
- [Setting up multiple Sparkline ThriftServers - Load Balancing & HA](https://github.com/SparklineData/spark-druid-olap/wiki/Setting-up-multiple-Sparkline-ThriftServers-(Load-Balancing-&-HA))
- Runtime Views
- Sparkline SQL extensions
- Sparkline Pluggable Modules
- Dev. Guide
- Reference Architectures
- Releases
- Cluster Spinup Tool
- TPCH Benchmark