Distributed Queries #2202

otoolep · 2015-04-08T22:47:55Z

No description provided.

Refactored the query engine to use a separate processing pipeline for raw queries. Queries with a large offset no longer have to keep everything in memory, and queries against raw data that have a limit process only up to that limit and then bail out. Raw data queries read only up to a certain point in the map phase before yielding to the engine for further processing. Fixes #2029 and fixes #2030
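To illustrate the offset and limit behavior described above, here is a minimal sketch, not the code from this PR. The `point` type, the `chunkSource` function type, and the `readRaw` and `sliceSource` helpers are all hypothetical names invented for the example. It assumes the mapper hands back raw points in chunks, so the reader can drop skipped points immediately and stop pulling chunks once the limit is met.

```go
package main

import "fmt"

// point is a hypothetical raw data point; it is not the engine's actual type.
type point struct {
	Time  int64
	Value float64
}

// chunkSource is a hypothetical stand-in for a mapper: each call returns
// the next chunk of raw points, or nil when the data is exhausted.
type chunkSource func() []point

// readRaw drains src chunk by chunk, discarding the first offset points and
// returning at most limit points. It stops pulling chunks as soon as the
// limit is reached. In this sketch a limit of 0 means "no limit".
func readRaw(src chunkSource, offset, limit int) []point {
	var out []point
	skipped := 0
	for {
		chunk := src()
		if chunk == nil {
			return out
		}
		for _, p := range chunk {
			if skipped < offset {
				skipped++ // dropped right away, never buffered
				continue
			}
			out = append(out, p)
			if limit > 0 && len(out) == limit {
				return out // bail out early
			}
		}
	}
}

// sliceSource serves pts in chunks of n points and counts, via pulls,
// how many chunks were actually requested.
func sliceSource(pts []point, n int, pulls *int) chunkSource {
	pos := 0
	return func() []point {
		if pos >= len(pts) {
			return nil
		}
		end := pos + n
		if end > len(pts) {
			end = len(pts)
		}
		c := pts[pos:end]
		pos = end
		*pulls++
		return c
	}
}

func main() {
	pts := make([]point, 10)
	for i := range pts {
		pts[i] = point{Time: int64(i), Value: float64(i)}
	}
	pulls := 0
	got := readRaw(sliceSource(pts, 3, &pulls), 4, 2)
	// Prints the number of points returned, their timestamps, and chunks pulled.
	fmt.Println(len(got), got[0].Time, got[1].Time, pulls)
}
```

With ten points served in chunks of three, an offset of 4 and a limit of 2, only the first two chunks are pulled; the remaining data is never read.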


jwilder · 2015-04-11T04:52:02Z

remote_mapper.go

+	}
+
+	// request to start streaming results
+	resp, err := http.Post(m.dataNodes[0].URL.String()+"/run_mapper", "application/json", bytes.NewReader(b))


Should this be m.dataNodes[rand.Intn(len(m.dataNodes))] to avoid a single node handling all the traffic? Or is m.dataNodes already randomized beforehand?

I was thinking exactly the same thing myself.

#2242

jwilder · 2015-04-11T05:00:18Z

Two comments but LGTM otherwise.

otoolep · 2015-04-11T18:34:35Z

#2243 opened to track issue around Limit Reader.

@jwilder -- thanks for the 3rd set of eyes. Merging now.


otoolep added the 2 - Working label Apr 8, 2015

otoolep force-pushed the distributed-queries-m branch 2 times, most recently from 4a39dd6 to dbd33b7 Compare April 9, 2015 22:54

otoolep changed the title ~~Use 64-bit Series IDs~~ Distributed Queries Apr 9, 2015

otoolep force-pushed the distributed-queries-m branch 4 times, most recently from 40b6866 to 2abe5b3 Compare April 10, 2015 22:09

pauldix and others added 16 commits April 10, 2015 16:11

Update server and handler to work with streamed responses

5e82ca5

uncomment raw ordering test

728f5de

WIP: Initial implementation of remote mapper for distributed queries.

1139950

Fix errors on limits and chunked raw queries.

6e8ea9a

Remove the interval setting from NextInterval to make remote mappers …

d41b85a

…work.

Fix the group by multiple dimensions test to be correct.

4a0c468

Fix wildcard group by query with time test to be correct.

f5dfb14

Finish up distributed queries.

7661546

Add change for distributed query engine

b353119

Fix opentsdb integration tests after rebase

8a25683

Use uint64 for Series IDs

bf1a8aa

Fixes issue #1649

Fixes based on feedback.

37d4f2a

Fix compilation errors after parser merge

9282a8a

Use different base port range for DQ testing

559e1d4

'reflect' is not used

350795d

otoolep force-pushed the distributed-queries-m branch from b1a4df1 to b461a36 Compare April 10, 2015 23:11

otoolep added 4 commits April 10, 2015 16:19

Hook up "run_mapper" in top-level handler

2c554f4

No doubt about it, URL routing is definitely brittle and needs work.

Seems like partial replication reads take longer

925de06

Limit max remote response to 1MB

61d7d0e

With a max remote response of 1GB, testing is unable to proceed.

Update CHANGELOG

5882f0b

otoolep force-pushed the distributed-queries-m branch from b461a36 to 5882f0b Compare April 10, 2015 23:26

Make it clearer in tests where numbers come from

5890025

jwilder reviewed Apr 11, 2015

otoolep mentioned this pull request Apr 11, 2015

Distributed Query should balance requests #2242

Closed

otoolep added a commit that referenced this pull request Apr 11, 2015

Merge pull request #2202 from influxdb/distributed-queries-m

30fc6df

Distributed Queries

otoolep merged commit 30fc6df into master Apr 11, 2015

otoolep removed the 2 - Working label Apr 11, 2015

otoolep deleted the distributed-queries-m branch April 11, 2015 18:34

damm mentioned this pull request Apr 12, 2015

panic: unsupported value type during encode fields: int64 #2232

Closed

beckettsean mentioned this pull request Apr 15, 2015

Replace mapper.fn with functor #1365

Closed

toddboom mentioned this pull request Apr 17, 2015

Make queries distributed #1467

Closed
