Should be able to have clusters with dedicated brokers and data nodes #1934

pauldix · 2015-03-12T21:31:11Z

Currently, all servers in a cluster come up as both a broker and a data node. We should be able to spin up servers only as brokers and have other servers come up only as data nodes.

otoolep · 2015-03-25T15:58:05Z

We should consider completely redoing the current code in this area. Using directories to indicate which "role" a node is in has caused lots of problems in the implementation. I think we should consider a simple file on disk that states "I'm a broker, I'm a data node, or I'm both". In other words, write the "role" to disk, and switch on the contents of that file on start-up. If the file does not exist, assume "combined".

Right now the code in this area is pretty unclear. An explicit "role" file may be the answer.
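To make the "role" file idea concrete, here is a minimal sketch in Go. It is not the existing influxd code: the `Role` type, the role names, the `role` file name, and the helper functions are all hypothetical and only illustrate switching on the file contents at start-up, with a missing file treated as "combined".

```go
package main

import (
	"errors"
	"fmt"
	"os"
	"strings"
)

// Role describes which services a node runs. The type and its values are
// hypothetical names chosen for this sketch.
type Role string

const (
	RoleBroker   Role = "broker"
	RoleData     Role = "data"
	RoleCombined Role = "combined"
)

// parseRole validates the contents of a role file, ignoring surrounding
// whitespace such as a trailing newline.
func parseRole(contents string) (Role, error) {
	switch r := Role(strings.TrimSpace(contents)); r {
	case RoleBroker, RoleData, RoleCombined:
		return r, nil
	default:
		return "", fmt.Errorf("unknown role %q", r)
	}
}

// loadRole reads the role file at path. If the file does not exist, the
// node is assumed to be "combined", as suggested above.
func loadRole(path string) (Role, error) {
	b, err := os.ReadFile(path)
	if errors.Is(err, os.ErrNotExist) {
		return RoleCombined, nil
	}
	if err != nil {
		return "", err
	}
	return parseRole(string(b))
}

func main() {
	role, err := loadRole("role") // hypothetical file name
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
	// Switch on the role at start-up.
	switch role {
	case RoleBroker:
		fmt.Println("starting broker only")
	case RoleData:
		fmt.Println("starting data node only")
	case RoleCombined:
		fmt.Println("starting broker and data node")
	}
}
```

An unrecognized value is treated as an error here rather than silently falling back to "combined"; that is a design choice of the sketch, not something decided in this thread.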

jwilder · 2015-04-01T16:26:46Z

Based on #1426, this is a proposed design change that we'd like feedback on.

Currently we have a Data, Broker, Snapshot and Admin port, which are all bound on a common bind address. There is also a mix of public API endpoints (/query, /write, etc.) commingled with cluster communication endpoints (/data_nodes_index, /data_nodes_create, etc.) that are served over the current Data port. This makes it difficult to limit access to cluster communication endpoints while allowing more open access to the public API. In addition, it may be desirable to segregate public API and internal cluster communication on separate network interfaces for performance and additional security. This is currently not possible.

We propose the following changes:

- Bind-Address - Default bind address used by listeners. This can be overridden for each port below to support cluster communication only on an internal interface but allow API traffic on a public interface.
- Cluster Port [bind-address]:8085 - The port used for all inter-node communication. This would include:
  - Broker: /raft, /messaging
  - Data Node: /data_nodes_*, /metastore, /process_continuous_queries
- Admin Port [bind-address]:8083
  - Current Admin UI
  - Snapshot (currently a separate snapshot port)
- API Port [bind-address]:8086
  - Data Node: /write, /query, /ping, /dump

With this proposal, we get the following:

- All cluster communication occurs over a dedicated port (default 8085).
- All public API endpoints are exposed over a dedicated port (default 8086).
- Ability to separate cluster and public API traffic on different network interfaces.
- Ability to run all endpoints on the same interface and port if desired.
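As a rough illustration of the bind-address override, the sketch below resolves the listen address for each port group. The `Config` and `Listener` types and their field names are assumptions made for this example, not the actual influxd configuration structs; only the default port numbers come from the proposal.

```go
package main

import (
	"fmt"
	"net"
	"strconv"
)

// Listener is a hypothetical config entry for one group of endpoints.
type Listener struct {
	BindAddress string // optional override; empty means "use the default"
	Port        int
}

// Config mirrors the proposed layout. Field names are illustrative only.
type Config struct {
	BindAddress string // default bind address used by all listeners
	Cluster     Listener
	Admin       Listener
	API         Listener
}

// defaultConfig returns the default ports listed in the proposal.
func defaultConfig() Config {
	return Config{
		Cluster: Listener{Port: 8085},
		Admin:   Listener{Port: 8083},
		API:     Listener{Port: 8086},
	}
}

// addr resolves the host:port a listener should bind to, falling back to
// the default bind address when the listener has no override.
func (c Config) addr(l Listener) string {
	host := l.BindAddress
	if host == "" {
		host = c.BindAddress
	}
	return net.JoinHostPort(host, strconv.Itoa(l.Port))
}

func main() {
	cfg := defaultConfig()
	cfg.BindAddress = "10.0.0.5"    // internal interface (example value)
	cfg.API.BindAddress = "0.0.0.0" // expose only the public API more widely

	fmt.Println("cluster:", cfg.addr(cfg.Cluster))
	fmt.Println("admin:  ", cfg.addr(cfg.Admin))
	fmt.Println("api:    ", cfg.addr(cfg.API))
}
```

With these example values, cluster and admin traffic stay on the internal interface while the API listens on all interfaces.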

cc @pauldix @benbjohnson @otoolep

benbjohnson · 2015-04-01T16:36:15Z

I'd rather support the simple common use case first and allow users to separate out ports as needed. Most people will probably stand up an influxd server behind a firewall and just let internal servers hit it. In that case it makes more sense to have one single port that users can open to select servers. Then they can break off individual services on different ports as needed. e.g. start everything on :8086 and let them break off from there.
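One way to picture "start everything on :8086 and break off from there" is to group services by their configured address, so that services sharing an address share one mux. This is only a sketch: the `service` type, `defaultServices`, and `buildMuxes` are hypothetical names, and the route lists are abbreviated from the proposal above.

```go
package main

import (
	"fmt"
	"net/http"
)

// service is a hypothetical group of endpoints plus the address it should
// be served on.
type service struct {
	name   string
	addr   string
	routes []string
}

// defaultServices puts every service on a single port, the simple case.
func defaultServices() []service {
	return []service{
		{name: "api", addr: ":8086", routes: []string{"/write", "/query", "/ping"}},
		{name: "cluster", addr: ":8086", routes: []string{"/raft", "/messaging"}},
		{name: "admin", addr: ":8086", routes: []string{"/admin"}},
	}
}

// buildMuxes returns one mux per distinct address. Services configured
// with the same address are mounted on the same mux.
func buildMuxes(services []service) map[string]*http.ServeMux {
	muxes := make(map[string]*http.ServeMux)
	for _, s := range services {
		mux, ok := muxes[s.addr]
		if !ok {
			mux = http.NewServeMux()
			muxes[s.addr] = mux
		}
		name := s.name
		for _, route := range s.routes {
			// Placeholder handler that only reports which service owns the route.
			mux.HandleFunc(route, func(w http.ResponseWriter, r *http.Request) {
				fmt.Fprintf(w, "%s service\n", name)
			})
		}
	}
	return muxes
}

func main() {
	svcs := defaultServices()
	fmt.Println("listeners needed:", len(buildMuxes(svcs)))

	// Break the cluster endpoints off onto their own port.
	svcs[1].addr = ":8085"
	fmt.Println("listeners needed:", len(buildMuxes(svcs)))
}
```

Each resulting mux would then be passed to its own `http.ListenAndServe` call; that part is left out to keep the sketch short.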

I do like the separation of cluster port, admin port, & API port though. That seems like a reasonable separation.

otoolep · 2015-04-01T18:42:44Z

I too see the need for separate ports, and like the ideas above. However I don't see why we have the Admin UI on a different port. It should be on the API port, as far as I can see. I don't see any advantages to having it on a different port.

jwilder · 2015-04-02T16:26:25Z

If we remove the admin port, the admin interface will need to be served from /admin or a similar path. @toddboom any concerns about that?

pauldix · 2015-04-02T17:59:21Z

@jwilder I think this design makes a lot of sense. I've actually had issue #1426 open about this for a while. The only change I'd make is that I'd put the /dump endpoint in the admin group. That one can end up with a ton of data and it's probably not something they'd want in the publicly available API.

This is a prerequisite for #1934. When running separate broker and data nodes, you currently need to know what role a host is performing. This complicates cluster setup in that you must configure separate broker URLs and data node URLs. This change allows a broker-only node to redirect data node endpoints to a valid data node, and a data-only node to redirect broker endpoints to a valid broker.
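The redirection described above could look roughly like the following for a broker-only node. This is a sketch under assumptions, not the code from the referenced change: `redirectTo`, `brokerOnlyMux`, `probe`, and the host name are hypothetical, and the choice of a 307 status (which keeps the request method, useful for POSTs to /write) is an assumption as well.

```go
package main

import (
	"fmt"
	"net/http"
	"net/http/httptest"
	"net/url"
)

// redirectTo returns a handler that sends the client to the same path and
// query string on the target node.
func redirectTo(target *url.URL) http.Handler {
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		u := *target // copy so the shared target is never mutated
		u.Path = r.URL.Path
		u.RawQuery = r.URL.RawQuery
		http.Redirect(w, r, u.String(), http.StatusTemporaryRedirect)
	})
}

// brokerOnlyMux serves a broker endpoint locally and redirects data node
// endpoints to the given data node URL.
func brokerOnlyMux(dataNodeURL string) (http.Handler, error) {
	target, err := url.Parse(dataNodeURL)
	if err != nil {
		return nil, err
	}
	mux := http.NewServeMux()
	mux.HandleFunc("/raft", func(w http.ResponseWriter, r *http.Request) {
		fmt.Fprintln(w, "handled by broker") // placeholder for real broker logic
	})
	for _, p := range []string{"/query", "/write"} {
		mux.Handle(p, redirectTo(target))
	}
	return mux, nil
}

// probe issues an in-memory GET against h and returns the status code and
// Location header, without opening a network socket.
func probe(h http.Handler, target string) (int, string) {
	rec := httptest.NewRecorder()
	h.ServeHTTP(rec, httptest.NewRequest(http.MethodGet, target, nil))
	return rec.Code, rec.Header().Get("Location")
}

func main() {
	mux, err := brokerOnlyMux("http://data1.example:8086") // example host
	if err != nil {
		panic(err)
	}
	code, loc := probe(mux, "/query?q=x")
	fmt.Println(code, loc)
	code, loc = probe(mux, "/raft")
	fmt.Println(code, loc)
}
```

A data-only node would do the mirror image: serve /query and /write locally and redirect /raft and /messaging to a known broker.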

toddboom · 2015-04-03T04:28:41Z

@jwilder no issues for me on the admin interface - i think it would make some people happier

jwilder · 2015-04-03T04:44:51Z

Ok. We'll use this as the target design for this issue.


pauldix added the 1 - Ready label Mar 12, 2015

corylanou self-assigned this Mar 12, 2015

beckettsean modified the milestones: 0.9.0, Next Release Mar 14, 2015

pauldix modified the milestones: 0.9.0, Next Point Release Mar 14, 2015

toddboom modified the milestone: 0.9.0 Mar 14, 2015

pauldix added this to the 0.9.0 milestone Mar 25, 2015

corylanou added 2 - Working and removed 1 - Ready labels Mar 25, 2015

influxdb-denver-pair assigned jwilder and corylanou and unassigned corylanou and jwilder Mar 25, 2015

jwilder mentioned this issue Mar 30, 2015

Initial work to run separate brokers and data nodes #2118

Closed

corylanou mentioned this issue Apr 1, 2015

API handler and broker/data node handlers should be able to run on different ports #1426

Closed

jwilder mentioned this issue Apr 3, 2015

Node redirection #2154

Merged

jwilder mentioned this issue Apr 6, 2015

Separate broker and data nodes #2175

Merged

pauldix closed this as completed in #2175 Apr 7, 2015

pauldix removed the 2 - Working label Apr 7, 2015

jwilder mentioned this issue Apr 9, 2015

Snapshot handler should be moved to admin listener #2231

Closed

mark-rushakoff pushed a commit that referenced this issue Jan 11, 2019

fix(http): op and error keys no longer required in error responses (#…

7d114af

…1934)
