<!DOCTYPE html>
<html>
<head>
<meta charset="utf-8">
<title>Presto | Overview</title>
<link rel="stylesheet" href="static/presto.css" type="text/css"/>
<meta name="viewport" content="width=device-width">
<meta property="og:type" content="website">
<meta property="og:title" content="Presto | Overview">
<meta property="og:description" content="Distributed SQL Query Engine for Big Data">
<meta property="og:image" content="static/presto-og.png">
</head>
<body>
<header class="topbar clearfix">
<nav class="width">
<a href="."><h1>Presto</h1></a>
<ul class="nav">
<li><a href="overview.html">Overview</a></li>
<li><a href="docs/current/">Docs</a></li>
<li><a href="faq.html">FAQ</a></li>
<li><a href="community.html">Community</a></li>
<li><a href="resources.html">Resources</a></li>
<li><a href="https://github.com/facebook/presto">GitHub</a></li>
</ul>
</nav>
</header>
<header class="hero clearfix">
<div class="width">
<h1>Overview</h1>
</div>
</header>
<div class="content homecontent width clearfix">
<div class="">
<h1>Introduction</h1>
<p>
Presto is a distributed system that runs on a cluster of machines.
A full installation includes a coordinator and multiple workers.
Queries are submitted from a client such as the Presto CLI to the
coordinator. The coordinator parses, analyzes and plans the query
execution, then distributes the processing to the workers.
</p>
<!--suppress CheckImageSize -- larger file for retina -->
<img src="static/presto-overview.png" width="80%" alt="Presto Installation Overview">
<h1>Requirements</h1>
<p>
Presto has a few basic requirements:
</p>
<ul>
<li>Linux or Mac OS X</li>
<li>Java 8, 64-bit</li>
<li>Python 2.4+</li>
</ul>
<h1>Connectors</h1>
<p>
Presto supports pluggable connectors that provide data for queries.
The requirements vary by connector.
</p>
<h2>Hadoop / Hive</h2>
<p>
Presto supports reading Hive data from the following versions of Hadoop:
</p>
<ul>
<li>Apache Hadoop 1.x</li>
<li>Apache Hadoop 2.x</li>
<li>Cloudera CDH 4</li>
<li>Cloudera CDH 5</li>
</ul>
<p>
The following file formats are supported: Text, SequenceFile,
RCFile, ORC and Parquet.
</p>
<p>
Additionally, a remote Hive metastore is required.
Local and embedded metastore modes are not supported.
Presto does not use MapReduce and thus only requires HDFS.
</p>
<h2>Cassandra</h2>
<p>
Cassandra 2.x is required. This connector is completely
independent of the Hive connector and only requires an
existing Cassandra installation.
</p>
<h2>TPC-H</h2>
<p>
The TPC-H connector dynamically generates data that can be used
for experimenting with and testing Presto. This connector has
no external requirements.
</p>
<h1>Deployment</h1>
<p>
See <a href="docs/current/installation/deployment.html">Deploying Presto</a>
for complete deployment instructions.
</p>
<h1>Running Queries</h1>
<p>
You can run queries using the
<a href="docs/current/installation/cli.html">Command Line Interface</a>
after deploying Presto.
</p>
</div>
</div>
<div class="footer">© Copyright 2013-2016, Facebook</div>
</body>
</html>