Fix inconsistent data type #2448

cannium · 2015-04-29T06:53:31Z

Otherwise panic

panic: interface conversion: interface is *influxql.firstLastMapOutput, not influxql.firstLastMapOutput

would occur when using first(), last() or spread().

neonstalwart · 2015-04-29T14:10:46Z

@cannium i guess this isn't covered by any tests. might be a good idea to add something to make sure this doesn't regress. i've been adding integration tests for aggregations to https://github.com/influxdb/influxdb/blob/b0865984452938e77148530c4e10b33a6f8c03c5/cmd/influxd/server_integration_test.go#L581. in general it seems the aggregations are not very well tested.

cannium · 2015-04-30T03:44:29Z

Though I managed to make integration tests passed, I noticed that the returned time of first, last are always 1970-01-01T00:00:00Z. I think that should also be fixed.

neonstalwart · 2015-04-30T13:59:10Z

i'm not sure that the current architecture supports having aggregate functions return anything more than just a value but i could be wrong.

neonstalwart · 2015-04-30T15:15:21Z

@cannium actually, i realized that first and last aggregate functions are for a different purpose than when you would use order by with limit 1 to get the whole first or last row (which would include the time related to the point).

aggregates are applied across intervals of time (determined by group by, where, etc) and the time associated with each aggregated value is the time at the beginning of that interval. your tests are implicitly asking to put all data points in 1 big interval and by default that interval starts at the epoch. to verify what i'm saying, try changing your test like so:

{
    reset: true,
    name:  "last value",
    write: `{"database" : "%DB%", "retentionPolicy" : "%RP%", "points": [
        {"name": "cpu", "timestamp": "2000-01-01T00:00:00Z", "fields": {"value": 2}},
        {"name": "cpu", "timestamp": "2000-01-01T00:01:00Z", "fields": {"value": 7}},
        {"name": "cpu", "timestamp": "2000-01-01T00:01:10Z", "fields": {"value": 9}}
    ]}`,
    query:    `SELECT last(value) FROM cpu WHERE time >= '2000-01-01T00:00:00Z' AND time <= '2000-01-01T00:01:10Z' GROUP BY time(2m)`,
    queryDb:  "%DB%",
    expected: `{"results":[{"series":[{"name":"cpu","columns":["time","last"],"values":[["2000-01-01T00:00:00Z",9]]}]}]}`,
},

you can see that now we've specified an interval and the time showed in the results matches the way i've described it.

otoolep · 2015-04-30T22:49:42Z

cmd/influxd/server_integration_test.go

+			expected: `{"results":[{"series":[{"name":"cpu","columns":["time","last"],"values":[["1970-01-01T00:00:00Z",9]]}]}]}`,
+		},
+		{
+			reset: true,


The data is the same with every write, so there is no need to set the reset flag between each test. It will just slow testing down without reason.

It's by purpose. I think it's better to make every test to stand on its own in case of any future modifications.

在 2015年5月1日，06:49，otoolep notifications@github.com 写道：

In cmd/influxd/server_integration_test.go:

},

{

reset: true,

name: "last value",

write: `{"database" : "%DB%", "retentionPolicy" : "%RP%", "points": [

{"name": "cpu", "timestamp": "2000-01-01T00:00:00Z", "fields": {"value": 2}},

{"name": "cpu", "timestamp": "2000-01-01T00:01:00Z", "fields": {"value": 7}},

{"name": "cpu", "timestamp": "2000-01-01T00:01:10Z", "fields": {"value": 9}}

]}`,

query: `SELECT last(value) FROM cpu`,

queryDb: "%DB%",

// FIXME: returned time should be "2000-01-01T00:01:10Z"

expected: `{"results":[{"series":[{"name":"cpu","columns":["time","last"],"values":[["1970-01-01T00:00:00Z",9]]}]}]}`,

},

{

reset: true,
The data is the same with every write, so there is no need to set the reset flag between each test. It will just slow testing down without reason.

—
Reply to this email directly or view it on GitHub.

I realise that, but test run times are also important to us. Happy to make changes in the future, but when data is the same, we don't use reset.

cannium · 2015-04-30T23:04:25Z

Yes, I've read the code. But I still feel the method name confusing. From a user's perspective, one might expect the corresponding time stamp also returned, like a normal select query.

在 2015年4月30日，23:15，Ben Hockey notifications@github.com 写道：

@cannium actually, i realized that first and last aggregate functions are for a different purpose than when you would use order by with limit 1 to get the whole first or last row (which would include the time related to the point).

aggregates are applied across intervals of time (determined by group by, where, etc) and the time associated with each aggregated value is the time at the beginning of that interval. your tests are implicitly asking to put all data points in 1 big interval and by default that interval starts at the epoch. to verify what i'm saying, try changing your test like so:

{
reset: true,
name: "last value",
write: {"database" : "%DB%", "retentionPolicy" : "%RP%", "points": [ {"name": "cpu", "timestamp": "2000-01-01T00:00:00Z", "fields": {"value": 2}}, {"name": "cpu", "timestamp": "2000-01-01T00:01:00Z", "fields": {"value": 7}}, {"name": "cpu", "timestamp": "2000-01-01T00:01:10Z", "fields": {"value": 9}} ]},
query: SELECT last(value) FROM cpu WHERE time >= '2000-01-01T00:00:00Z' AND time <= '2000-01-01T00:01:10Z' GROUP BY time(2m),
queryDb: "%DB%",
expected: {"results":[{"series":[{"name":"cpu","columns":["time","last"],"values":[["2000-01-01T00:00:00Z",9]]}]}]},
},
you can see that now we've specified an interval and the time showed in the results matches the way i've described it.

—
Reply to this email directly or view it on GitHub.

cannium · 2015-05-06T00:03:51Z

ping?

toddboom · 2015-05-08T18:23:09Z

@cannium would you mind rebasing this? if you can, i'll try to get this merged today.

cannium · 2015-05-09T02:11:17Z

Rebased.

toddboom · 2015-05-11T18:47:37Z

+1 from myself and verbal +1 from @otoolep - thanks!

Fix inconsistent data type

cannium force-pushed the fix-data-type branch 2 times, most recently from 24412f5 to 1861f41 Compare April 30, 2015 03:40

cannium force-pushed the fix-data-type branch from 1861f41 to a192c75 Compare April 30, 2015 03:56

otoolep reviewed Apr 30, 2015
View reviewed changes

cannium force-pushed the fix-data-type branch from a192c75 to 8b260be Compare May 4, 2015 02:49

neonstalwart mentioned this pull request May 8, 2015

select last value crashes nodes #2520

Closed

cannium added 2 commits May 9, 2015 08:51

Fix inconsistent data type

0aff0de

Add integration tests for first(), last() and spread() queries.

55106a6

cannium force-pushed the fix-data-type branch from 8b260be to 55106a6 Compare May 9, 2015 01:02

toddboom added a commit that referenced this pull request May 11, 2015

Merge pull request #2448 from cannium/fix-data-type

6b3bd90

Fix inconsistent data type

toddboom merged commit 6b3bd90 into influxdata:master May 11, 2015

cannium deleted the fix-data-type branch May 14, 2015 02:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix inconsistent data type #2448

Fix inconsistent data type #2448

cannium commented Apr 29, 2015

neonstalwart commented Apr 29, 2015

cannium commented Apr 30, 2015

neonstalwart commented Apr 30, 2015

neonstalwart commented Apr 30, 2015

otoolep Apr 30, 2015

cannium Apr 30, 2015

otoolep Apr 30, 2015

cannium commented Apr 30, 2015

cannium commented May 6, 2015

toddboom commented May 8, 2015

cannium commented May 9, 2015

toddboom commented May 11, 2015

Fix inconsistent data type #2448

Fix inconsistent data type #2448

Conversation

cannium commented Apr 29, 2015

neonstalwart commented Apr 29, 2015

cannium commented Apr 30, 2015

neonstalwart commented Apr 30, 2015

neonstalwart commented Apr 30, 2015

otoolep Apr 30, 2015

Choose a reason for hiding this comment

cannium Apr 30, 2015

Choose a reason for hiding this comment

otoolep Apr 30, 2015

Choose a reason for hiding this comment

cannium commented Apr 30, 2015

cannium commented May 6, 2015

toddboom commented May 8, 2015

cannium commented May 9, 2015

toddboom commented May 11, 2015