Adding the serialization features. #1666

mattdurham · 2024-09-11T14:53:52Z

This adds the serialization side of converting series to the binary format. The binary format is time series where the strings are deduplicated with actual marshalling handled by the msgp library. I tested roughly 6 libraries listed here. Msgp hit the sweetspot of ease of use and size and features. Such as reusing arrays if they were passed in.

mattdurham · 2024-09-11T14:58:15Z

Don't be put off by the lines change count, 80% of that is generated code from msgp

mattdurham · 2024-09-11T14:59:17Z

Tests are failing from a data race in the tests themselves since I am accessing them directly. Lemme see if I can fix that.

wildum

great work, I like the deduplication algo. I only did a first pass, I'm not yet familiar with the full picture

internal/component/prometheus/remote/queue/serialization/seralizer_test.go

internal/component/prometheus/remote/queue/types/serialization.go

mattdurham · 2024-09-11T16:30:55Z

Stats are protocol agnostic, so that if used in prometheus or otel environment they can add their own specific metrics and we dont define protocol specific in the lower level structs. The component in the last PR will expose prometheus compatible ones derived from the callback.

thampiotr

First pass. Looking good :)

thampiotr · 2024-09-13T12:13:58Z

Makefile

@@ -141,7 +141,7 @@ lint: alloylint
 # final command runs tests for all other submodules.
 test:
 	$(GO_ENV) go test $(GO_FLAGS) -race $(shell go list ./... | grep -v /integration-tests/)
-	$(GO_ENV) go test $(GO_FLAGS) ./internal/static/integrations/node_exporter ./internal/static/logs ./internal/component/otelcol/processor/tail_sampling ./internal/component/loki/source/file ./internal/component/loki/source/docker
+	$(GO_ENV) go test $(GO_FLAGS) ./internal/static/integrations/node_exporter ./internal/static/logs ./internal/component/otelcol/processor/tail_sampling ./internal/component/loki/source/file ./internal/component/loki/source/docker ./internal/component/prometheus/remote/queue/serialization


We'd be running these tests twice, second time without -race - I don't see the reason why, is that an accident?

There is one test that will not be ran twice since I am accessing the var directly to test its value. The others will be ran, I could add the //go:build race to the others. Note most of our exclusions above have some tests that run twice.

internal/component/prometheus/remote/queue/serialization/appender.go

thampiotr · 2024-09-13T13:04:44Z

internal/component/prometheus/remote/queue/serialization/appender.go

+	ts.TS = t
+	ts.Value = v
+	ts.Hash = l.Hash()
+	err := a.s.SendSeries(a.ctx, ts)


Is it guaranteed that ts will be returned eventually to the object pool? Would we have a leak if, e.g. the component was removed from Alloy config? I don't see any issues, but would be nice to make this code a bit more clear that this is what's going on, with naming or comments.

It should be a required that all time series are returned. Though not in this PR this is checked in a future test via OutStandingTimeSeriesBinary atomic int. There are end to end tests that ensure at the end of the test this is zero.

internal/component/prometheus/remote/queue/serialization/serializer.go

internal/component/prometheus/remote/queue/types/serialization.go

thampiotr · 2024-09-13T13:38:15Z

internal/component/prometheus/remote/queue/serialization/serializer.go

+	stringsSlice := make([]string, len(strMapToInt))
+	for stringValue, index := range strMapToInt {
+		stringsSlice[index] = stringValue
+	}
+	group.Strings = stringsSlice


Suggested change

stringsSlice := make([]string, len(strMapToInt))

for stringValue, index := range strMapToInt {

stringsSlice[index] = stringValue

}

group.Strings = stringsSlice

dictionary := make([]string, len(strMapToInt))

for stringValue, index := range strMapToInt {

dictionary[index] = stringValue

}

group.dictionary = dictionary

I like to use the concept of dictionary here, or lookup table... it makes it easier to figure out what's going on.

Do you mean to use an actual map? Or a rename like above?

thampiotr · 2024-09-13T13:39:04Z

internal/component/prometheus/remote/queue/serialization/serializer.go

+	}
+	group.Strings = stringsSlice
+
+	buf, err := group.MarshalMsg(s.msgpBuffer)


Sooo... is it worth it to do the dictionary stuff? I guess yes, but on the other hand I know that compression algos would do something similar automatically, snappy can refer to previous part of the data to reduce repetition.

One second had a bug in my test re-evaluating.

Alright back with much more verifiable test.

//go:generate msgp package main import ( "fmt" "math/rand" "reflect" "github.com/golang/snappy" ) // 5 long really random // 5371616 // 4732108 // 5 long half random // 5050060 // 3929185 // 5 long quarter random // 4455979 // 2918973 func main() { metrics := make([]map[string]string, 0) // 100k metrics with 10 labels each for i := 0; i < 100_000; i++ { metrics = append(metrics, getLabels()) } ss := &StringString{Labels: metrics} bb, err := ss.MarshalMsg(nil) if err != nil { panic(err) } out := snappy.Encode(nil, bb) dc, _ := snappy.Decode(nil, out) err = validateStringString(dc, metrics) if err != nil { panic(err) } println(fmt.Printf("dictionary based is %d bytes", len(out))) ib := &IndexBased{ String: make([]string, 0), Names: make([][]uint32, 0), Values: make([][]uint32, 0), } alignIndexBased(ib, metrics) bb, err = ib.MarshalMsg(nil) if err != nil { panic(err) } out = snappy.Encode(nil, bb) dc, _ = snappy.Decode(nil, out) err = validateIndexBased(dc, metrics) println(fmt.Printf("index based is %d bytes", len(out))) } func validateStringString(bb []byte, metrics []map[string]string) error { ss := &StringString{} _, err := ss.UnmarshalMsg(bb) if err != nil { return err } for i, m := range metrics { if !reflect.DeepEqual(ss.Labels[i], m) { return fmt.Errorf("invalid metric at index %d", i) } } return nil } func validateIndexBased(bb []byte, metrics []map[string]string) error { ss := &IndexBased{} _, err := ss.UnmarshalMsg(bb) if err != nil { return err } for i, m := range metrics { if !reflect.DeepEqual(getMetric(ss.Names[i], ss.Values[i], ss.String), m) { return fmt.Errorf("invalid metric at index %d", i) } } return nil } func getMetric(names []uint32, values []uint32, strings []string) map[string]string { metric := make(map[string]string) for i, v := range names { metric[strings[v]] = strings[values[i]] } return metric } func alignIndexBased(ib *IndexBased, strings []map[string]string) { index := 0 stringsList := make(map[string]int) for _, metric := range strings { names := make([]uint32, 0) values := make([]uint32, 0) for k, v := range metric { keyIndex, ok := stringsList[k] if !ok { stringsList[k] = index ib.String = append(ib.String, k) keyIndex = index index++ } valIndex, ok := stringsList[v] if !ok { stringsList[v] = index ib.String = append(ib.String, v) valIndex = index index++ } names = append(names, uint32(keyIndex)) values = append(values, uint32(valIndex)) } ib.Names = append(ib.Names, names) ib.Values = append(ib.Values, values) } ib.String = make([]string, len(stringsList)) for k, v := range stringsList { ib.String[v] = k } } func getLabels() map[string]string { retLbls := make(map[string]string, 0) for i := 0; i < 10; i++ { retLbls[fmt.Sprintf("label_%d", i)] = randString() } return retLbls } var letterRunes = []rune("abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ") var halfRandom = []rune("abcdefghijklmnopqrstuvwxyz") var quarterRandom = []rune("abcdefghijkl") func randString() string { b := make([]rune, rand.Intn(5)) for i := range b { b[i] = letterRunes[rand.Intn(len(letterRunes))] } return string(b) } type IndexBased struct { Names [][]uint32 Values [][]uint32 String []string } type StringString struct { Labels []map[string]string }

In general the index based is never worse, and IMO in many cases is 60% of the size of the pure string based.
Results from the above test, changing out the letterRunes to small sets.

// 5 char long really random
// 5371616 string map
// 4732108 index based

// 5 long half random
// 5050060
// 3929185

// 5 long quarter random
// 4455979
// 2918973

The lower the cardinality the better but even in worse case its not terrible.

internal/component/prometheus/remote/queue/serialization/serializer_bench_test.go

…der.go Co-authored-by: Piotr <17101802+thampiotr@users.noreply.github.com>

…lizer.go Co-authored-by: Piotr <17101802+thampiotr@users.noreply.github.com>

…alization

Co-authored-by: Piotr <17101802+thampiotr@users.noreply.github.com>

…lizer.go Co-authored-by: Piotr <17101802+thampiotr@users.noreply.github.com>

…alization

mattdurham · 2024-09-16T14:07:46Z

Going to merge this and we can revisit any followup in the big merge on specific points.

Adding the serialization features.

0b8e976

mattdurham marked this pull request as ready for review September 11, 2024 14:53

mattdurham requested review from ptodev, thampiotr and wildum September 11, 2024 14:57

mattdurham added 2 commits September 11, 2024 11:13

Dont test this with race condition since we access vars directly.

175aafb

Fix test.

57e6ddd

wildum reviewed Sep 11, 2024

View reviewed changes

internal/component/prometheus/remote/queue/serialization/seralizer_test.go Outdated Show resolved Hide resolved

internal/component/prometheus/remote/queue/types/serialization.go Show resolved Hide resolved

Fix typo in file name and return early in DeserializeToSeriesGroup.

55fe162

thampiotr reviewed Sep 13, 2024

View reviewed changes

mattdurham and others added 9 commits September 13, 2024 10:23

Update internal/component/prometheus/remote/queue/serialization/appen…

843ef50

…der.go Co-authored-by: Piotr <17101802+thampiotr@users.noreply.github.com>

Update internal/component/prometheus/remote/queue/serialization/seria…

c359236

…lizer.go Co-authored-by: Piotr <17101802+thampiotr@users.noreply.github.com>

Rename to indicate that TimeSeries are Put/Get from a pool.

6da1198

Merge remote-tracking branch 'origin/wal_serialization' into wal_seri…

adb047a

…alization

Remove func that was about the same number of lines as inlining.

58f2385

Update internal/component/prometheus/remote/queue/types/serialization.go

2fe259f

Co-authored-by: Piotr <17101802+thampiotr@users.noreply.github.com>

Update internal/component/prometheus/remote/queue/serialization/seria…

dd55897

…lizer.go Co-authored-by: Piotr <17101802+thampiotr@users.noreply.github.com>

Change benchmark to be more specific.

9331c64

Merge remote-tracking branch 'origin/wal_serialization' into wal_seri…

622dbc1

…alization

mattdurham merged commit 626113f into dev.new-wal Sep 16, 2024
17 checks passed

mattdurham deleted the wal_serialization branch September 16, 2024 14:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding the serialization features. #1666

Adding the serialization features. #1666

mattdurham commented Sep 11, 2024

mattdurham commented Sep 11, 2024

mattdurham commented Sep 11, 2024

wildum left a comment

mattdurham commented Sep 11, 2024

thampiotr left a comment

thampiotr Sep 13, 2024

mattdurham Sep 13, 2024

thampiotr Sep 13, 2024

mattdurham Sep 13, 2024

thampiotr Sep 13, 2024

mattdurham Sep 13, 2024

thampiotr Sep 13, 2024

mattdurham Sep 13, 2024 •

edited

Loading

mattdurham Sep 13, 2024

mattdurham Sep 13, 2024 •

edited

Loading

mattdurham Sep 13, 2024

mattdurham commented Sep 16, 2024

Adding the serialization features. #1666

Adding the serialization features. #1666

Conversation

mattdurham commented Sep 11, 2024

mattdurham commented Sep 11, 2024

mattdurham commented Sep 11, 2024

wildum left a comment

Choose a reason for hiding this comment

mattdurham commented Sep 11, 2024

thampiotr left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mattdurham Sep 13, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mattdurham Sep 13, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mattdurham commented Sep 16, 2024

mattdurham Sep 13, 2024 •

edited

Loading

mattdurham Sep 13, 2024 •

edited

Loading