add a TCP listener #56

teepark · 2015-02-06T01:17:15Z

Because individual sends via TCP can end up split across packet
boundaries, the parsing code had to be reworked to handle that case.
It now takes io.ReadCloser rather than []byte.

It is used this way in the UDP case as well.

mreiferson · 2015-02-07T17:31:47Z

Hi @teepark thanks for this PR.

It looks like it needs a rebase after a recent merge and there's a bit of code to review but otherwise at a high level I'm 👍 with this.

markrechler · 2015-04-27T19:10:16Z

@teepark, wanted to check in and see if you wanted a stab at rebasing this against master.

Have not gotten to test the TCP functionality yet but it seems with these changes the first set of values don't get sent out.

/statsdaemon --address=:8125 --flush-interval=60 --percent-threshold=50 --percent-threshold=95 --percent-threshold=99 --debug=true
2015/04/27 19:00:03 listening on :8125
2015/04/27 19:00:30 ERROR: unrecognized type code "gapps.v_proxy.dev_dev.v-proxy-1.mem.heap_idle_bytes:2400256|gapps.v_proxy.dev_dev.v-proxy-1.mem.heap_in_use_bytes:2842624|gapps.v_proxy.dev_dev.v-proxy-1.mem.heap_released_bytes:0|gapps.v_proxy.dev_dev.v-proxy-1.mem.gc_pause_usec_100:3644|gapps.v_proxy.dev_dev.v-proxy-1.mem.gc_pause_usec_99:3644|gapps.v_proxy.dev_dev.v-proxy-1.mem.gc_pause_usec_95:1630|gapps.v_proxy.dev_dev.v-proxy-1.mem.next_gc_bytes:3414400|gapps.v_proxy.dev_dev.v-proxy-1.mem.gc_runs:105|capps.v_proxy.dev_dev.proxy.start:27|c"
2015/04/27 19:01:03 DEBUG: apps.v_proxy.dev_dev.proxy.start 1262 1430161263

teepark · 2015-04-27T19:24:22Z

hi @markrechler, I won't have a chance to for a little while.

on that failure, shouldn't there be newlines between the values?

markrechler · 2015-04-27T20:02:25Z

There are:

2015/04/27 19:58:15 line from buf: "apps.v_proxy.dev_dev.proxy.start:57|c", "apps.v_proxy.dev_dev.status_code.301:57|c\napps.v_proxy.dev_dev.proxy.success:57|c\n"

Added some debugging:

line, rest := lineFrom(buf)
log.Printf("line from buf: %q, %q", line, rest)

Looks like rest keeps the newlines. Will dig in some more later, probably something with the parsing as it only happens for the first set of values, works fine after that first error.
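
The body of lineFrom is not shown in this thread. Judging only from the debug output above, it might look roughly like the sketch below; the behavior when no newline is present (returning no line and keeping the input as the remainder) is an assumption.

```go
package main

import (
	"bytes"
	"fmt"
)

// lineFrom is a guess at the helper's shape, not the code from the branch.
// It returns the bytes before the first newline and everything after it.
// Assumption: with no newline in input, no line is returned and the whole
// input is kept as the remainder.
func lineFrom(input []byte) (line, rest []byte) {
	i := bytes.IndexByte(input, '\n')
	if i < 0 {
		return nil, input
	}
	return input[:i], input[i+1:]
}

func main() {
	buf := []byte("apps.v_proxy.dev_dev.proxy.start:57|c\n" +
		"apps.v_proxy.dev_dev.status_code.301:57|c\n" +
		"apps.v_proxy.dev_dev.proxy.success:57|c\n")
	line, rest := lineFrom(buf)
	fmt.Printf("line from buf: %q, %q\n", line, rest)
}
```

Under that assumption, rest keeping its newlines is expected: only the first newline is consumed per call, and the remaining lines are peeled off by later calls.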

markrechler · 2015-04-27T21:11:19Z

Only seems to affect gauges:

2015/04/27 20:16:18 buffer: "apps.v_proxy.dev_dev.proxy.start:58|c\napps.v_proxy.dev_dev.status_code.301:58|c\napps.v_proxy.dev_dev.proxy.success:58|c\n"
2015/04/27 20:16:19 buffer: "apps.v_proxy.dev_dev.v-proxy-1.mem.heap_objects:19532|g"

markrechler · 2015-04-27T21:46:43Z

This is making more sense now. We queue up counter metrics in the library we use and send gauges one at a time.

This is related to line parsing. In master we are able to take in one value at a time, or a few values at a time separated by newlines (https://github.com/etsy/statsd/blob/master/docs/metric_types.md#multi-metric-packets).

teepark · 2015-04-29T17:43:43Z

I do actually have a chance to look at this. This branch is also capable of reading multi-metric packets, and that's actually the paradigm that it uses for long-lived TCP connections: it always pulls in each metric by reading up to a newline. You don't have a test case that illustrates this branch's problem with gauges do you?

markrechler · 2015-04-29T18:38:26Z

Was actually just looking at this. It reproduces if you submit a metric or two via:
echo -ne "deploys.test.myservice:1|c" | nc -w 1 -u localhost 8125

This manifested in the form of gauges for us because we were sending them as one-offs, we send other stuff as multi-metric off the bat.

Adding another condition to lineFrom

if len(input) > 0 {
    return input, []byte{}
}

Sends all metrics with the caveat that Next will never return a false.

teepark · 2015-04-29T18:42:18Z

I'm merging changes with current master right now, give me a minute to get that pushed.

I mostly mean in statsdaemon_test.go -- I'd love a real failing test :)

markrechler · 2015-04-29T19:01:05Z

Will see if I can come up with anything, the buffer effectively becomes "deploys.test.myservice:2|cdeploys.test.myservice:1|c" so it's a bit funkier to test for.

teepark · 2015-04-29T19:05:22Z

with newlines though right?
"deploys.test.myservice:2|c\ndeploys.test.myservice:1|c"

I can stick that in the tests

markrechler · 2015-04-29T19:12:50Z

It has newlines when sending multi-metric, or single with newline, but when sending one-offs w/out newlines, the buffer turns into essentially an invalid multi-metric.

Breaks:

echo -ne "deploys.test.myservice:2|c" | nc -w 1 -u localhost 8125
echo -ne "deploys.test.myservice:1|c" | nc -w 1 -u localhost 8125

Works:

echo -e "deploys.test.myservice:2|c" | nc -w 1 -u localhost 8125
echo -e "deploys.test.myservice:1|c" | nc -w 1 -u localhost 8125

Works:

echo -e "deploys.test.myservice:2|c\ndeploys.test.myservice:1|c" | nc -w 1 -u localhost 8125

markrechler · 2015-04-29T19:53:23Z

To give some more context:
https://github.com/etsy/statsd/blob/master/stats.js#L211
statsd assumes a newline indicates more than one metric

and with the current/master parsing:
https://github.com/bitly/statsdaemon/blob/master/statsdaemon.go#L374

Even without a newline, there is still potentially a valid line

teepark · 2015-04-29T19:59:03Z

the code in those two links is equivalent. if '\n' doesn't appear in the input then bytes.Split(input, []byte{'\n'}) returns a [][]byte of length 1.
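
A quick way to check the bytes.Split behavior described here. `splitMetrics` is a hypothetical wrapper added only for illustration, not a function from statsdaemon.

```go
package main

import (
	"bytes"
	"fmt"
)

// splitMetrics splits a packet on newlines. Hypothetical wrapper around
// bytes.Split, used only to demonstrate its return value.
func splitMetrics(packet []byte) [][]byte {
	return bytes.Split(packet, []byte{'\n'})
}

func main() {
	single := splitMetrics([]byte("deploys.test.myservice:1|c"))
	multi := splitMetrics([]byte("deploys.test.myservice:2|c\ndeploys.test.myservice:1|c"))
	fmt.Println(len(single), len(multi)) // prints "1 2"
}
```

Note that a trailing newline produces an extra empty element at the end of the slice, so callers usually skip zero-length lines.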

markrechler · 2015-04-29T20:32:27Z

Right, was getting caught up in the wrong part of buffer handling, apologies, so one-off metrics work fine with the tcp-handler. UDP being treated as a stream breaks the parsing since one-offs just keep getting bunched together in the buffer. It might be better to have separate UDP parsing that treats each packet as a complete message.

teepark · 2015-04-29T20:38:08Z

I get it now, sorry I've been so dense. Yeah the parser here is set up to read from any io.Reader including what comes out of net.ListenUDP. But in that case I don't want to be joining together all reads into a single buffer.

OK, I'll build a test case around this and then work from there.
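
To illustrate the distinction being drawn, here is a rough sketch of treating each UDP datagram as a complete message versus joining reads into one stream buffer. `metricsFromDatagram` is a hypothetical name; this is not the fix that was pushed to the branch.

```go
package main

import (
	"bytes"
	"fmt"
)

// metricsFromDatagram treats one UDP packet as a complete message: it is
// split on newlines by itself and never joined with a later packet.
// Hypothetical helper for illustration only.
func metricsFromDatagram(packet []byte) [][]byte {
	var out [][]byte
	for _, line := range bytes.Split(packet, []byte{'\n'}) {
		if len(line) > 0 {
			out = append(out, line)
		}
	}
	return out
}

func main() {
	// Two one-off packets sent without trailing newlines.
	packets := [][]byte{
		[]byte("deploys.test.myservice:2|c"),
		[]byte("deploys.test.myservice:1|c"),
	}

	// Per-datagram handling keeps the two metrics apart.
	for _, p := range packets {
		for _, m := range metricsFromDatagram(p) {
			fmt.Printf("datagram metric: %q\n", m)
		}
	}

	// Stream-style handling concatenates them into one unparseable line.
	joined := bytes.Join(packets, nil)
	fmt.Printf("joined buffer: %q\n", joined)
}
```

For TCP the stream-style buffer is still what you want, since a metric can legitimately span reads there; the per-packet rule only makes sense for UDP.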

teepark · 2015-04-29T21:32:42Z

ok, give that a shot now. flipping the boolean here causes the test to pass or fail, so it seems to be doing what it should.

markrechler · 2015-04-29T22:08:12Z

statsdaemon.go

+	)
+
+	if strings.HasPrefix(typeCode, "c") {
+		split = bytes.SplitN([]byte(typeCode), []byte("|@"), 2)


Timers also support sampling rates, may want to generalize this.

Test wise, could add this to TestParseLineTimer:

d = []byte("glork:320|ms|@0.1")
packet = parseLine(d)
assert.NotEqual(t, packet, nil)
assert.Equal(t, "glork", packet.Bucket)
assert.Equal(t, uint64(320), packet.Value.(uint64))
assert.Equal(t, "ms", packet.Modifier)
assert.Equal(t, float32(0.1), packet.Sampling)
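
One possible way to generalize the sampling-rate handling so it is not tied to counters is sketched below. `splitSampling` is a hypothetical helper, not code from this PR, and defaulting the rate to 1 when no "|@" suffix is present is an assumption.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// splitSampling separates a type code such as "ms|@0.1" into the modifier
// ("ms") and the sampling rate (0.1), for any metric type.
// Assumption: the rate defaults to 1 when no "|@" suffix is present.
func splitSampling(typeCode string) (string, float32, error) {
	parts := strings.SplitN(typeCode, "|@", 2)
	if len(parts) == 1 {
		return parts[0], 1, nil
	}
	rate, err := strconv.ParseFloat(parts[1], 32)
	if err != nil {
		return "", 0, fmt.Errorf("bad sampling rate %q: %v", parts[1], err)
	}
	return parts[0], float32(rate), nil
}

func main() {
	for _, tc := range []string{"c", "c|@0.1", "ms|@0.1", "g"} {
		mod, rate, err := splitSampling(tc)
		fmt.Println(mod, rate, err)
	}
}
```

Whether a sampling rate should be accepted on gauges at all is a separate decision; this sketch only parses the suffix and leaves that policy to the caller.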

markrechler · 2015-04-29T22:24:43Z

Thanks for adding the UDP test. Preliminary testing against a real data set is looking good.

markrechler · 2015-05-05T18:14:31Z

statsdaemon.go

+var (
+	serviceAddress    = flag.String("address", ":8125", "UDP service address")
+	tcpServiceAddress = flag.String("tcpaddr", "", "TCP service address, if set")
+	maxUdpPacketSize  = flag.Int64("max-udp-packet-size", 1472, "Maximum UDP packet size")


We can remove the maxUdpPacketSize flag since it's no longer being used.

markrechler · 2015-05-05T18:15:51Z

Sorry for the delayed reply, everything looks good. @teepark could you rebase/squash down the commits to prep for a fast-forward merge whenever you get a chance?

ploxiln · 2015-05-05T18:48:39Z

I think this suffered from a messy rebase. try rebase -i master, and in the rebase commit list, delete the lines for commits that you didn't make

Because individual sends via TCP can end up split across packet boundaries, the parsing code had to be reworked to handle that case. It now takes io.ReadCloser rather than []byte. It is used this way in the UDP case as well, although there it disallows single metrics spanning multiple read()s.

add a TCP listener

teepark force-pushed the master branch from 2c38b33 to 1c87fff Compare February 11, 2015 01:34

markrechler reviewed Apr 29, 2015

markrechler reviewed May 5, 2015

teepark force-pushed the master branch from f9e7793 to 361dd04 Compare May 5, 2015 21:06

teepark added 2 commits May 5, 2015 14:06

re-introduce the use of maxUdpPacketSize for UDP

6fc8f8b

jehiah added the enhancement label May 6, 2015

markrechler added a commit that referenced this pull request May 6, 2015

Merge pull request #56 from teepark/master

c19b47a

add a TCP listener

markrechler merged commit c19b47a into bitly:master May 6, 2015
