fix: hard limit on field size while parsing line protocol #21843

lesam · 2021-07-13T21:30:46Z

Per https://docs.influxdata.com/enterprise_influxdb/v1.9/write_protocols/line_protocol_reference/
we only support 64KB, but 1MB is a more realistic practical limit. Before this commit there was
no enforcement of field value size.

Closes #21841

Describe your proposed changes here.

CHANGELOG.md updated with a link to the PR (not the Issue)
Well-formatted commit messages
Rebased/mergeable
Tests pass

lesam · 2021-07-13T21:35:38Z

We don't catch everywhere that can build a point, but this covers the write ingress which is the most important one.

gwossum

Implementation and test cases looks good, have test cases for maximum possible field size and 1 greater.

It'd be cool if the test checked the error message in the HTTP body since there's multiple possible errors, but none of the other tests in server_test.go seem to check error messages. I won't hold that against this test.

lesam · 2021-07-14T13:55:42Z

Closing as we want to make this configurable

Per https://docs.influxdata.com/enterprise_influxdb/v1.9/write_protocols/line_protocol_reference/ we only support 64KB, but 1MB is a more realistic practical limit. Before this commit there was no enforcement of field value size. Closes influxdata#21841

gwossum · 2021-07-14T16:06:19Z

tsdb/engine.go

@@ -186,9 +186,8 @@ type EngineOptions struct {
 	// nil will allow all combinations to pass.
 	ShardFilter func(database, rp string, id uint64) bool

-	Config         Config
-	SeriesIDSets   SeriesIDSets
-	FieldValidator FieldValidator


Eliminating Engine.FieldValidator seems reasonable, since I can't find any use other than the default validator. If we get rid of Engine.FieldValidator, should we also get rid of EngineOptions.FieldValidator? There might be confusion in the future if someone thinks they can set that to change the field validation behavior.

This is in EngineOptions - I don't see any FieldValidator remaining in the code?

I'm not sure why I thought there was an Engine.FieldValidator. I thought there was a Shard.FieldValidator, typed Engine.FieldValidator, but it was really setting the value in EngineOptions.FieldValidator. My mistake, carry on!

gwossum · 2021-07-14T16:25:47Z

tsdb/field_validator.go

+	pointSize := point.StringSize()
 	iter := point.FieldIterator()
 	for iter.Next() {
+		if !skipSizeValidation {
+			// Check for size of field too large. Note it is much cheaper to check the whole point size
+			// than checking the StringValue size (StringValue potentially takes an allocation if it must
+			// unescape the string, and must at least parse the string)
+			if pointSize > MaxFieldValueLength && iter.Type() == models.String {


A possible optimization here is to compare pointSize to MaxFieldValueLength before the loop, and set skipSizeValidation = true before starting the loop if pointSize <= MaxFieldValueLength. It's possible the compiler is already performing this loop invariant hoisting, but we can make sure it is. Even if the compiler is already doing the work, this might improve readability. It would look something like:

pointSize := point.StringSize() if pointSize <= MaxFieldValueLength { skipSizeValidation = true } iter := point.FieldIterator() for iter.Next() { if !skipSizeValidation { if iter.Type() == models.String {

I think the code as written is simpler, and I don't want to make it less simple for a single comparison - I doubt that would show up in any profiling.

It is debatable whether I should even be doing the pointSize optimization, but I think it is a clear enough win to not re-allocate a ton of strings during StringValue that it is ok to add the check.

gwossum

Looks good to me. Test cases for both sides of boundary condition.

…#21843) Per https://docs.influxdata.com/enterprise_influxdb/v1.9/write_protocols/line_protocol_reference/ we only support 64KB, but 1MB is a more realistic practical limit. Before this commit there was no enforcement of field value size. Closes influxdata#22094 (cherry picked from commit 6d22e69)

…22095) Per https://docs.influxdata.com/enterprise_influxdb/v1.9/write_protocols/line_protocol_reference/ we only support 64KB, but 1MB is a more realistic practical limit. Before this commit there was no enforcement of field value size. Closes #22094 (cherry picked from commit 6d22e69)

lesam requested a review from gwossum July 13, 2021 21:35

lesam force-pushed the limit-field-size branch from 87a385e to 83b1f0e Compare July 13, 2021 21:37

gwossum previously approved these changes Jul 13, 2021

View reviewed changes

lesam closed this Jul 14, 2021

lesam reopened this Jul 14, 2021

lesam dismissed gwossum’s stale review via 370656a July 14, 2021 14:41

lesam force-pushed the limit-field-size branch from 83b1f0e to 370656a Compare July 14, 2021 14:41

lesam force-pushed the limit-field-size branch from 370656a to efb073a Compare July 14, 2021 15:44

gwossum reviewed Jul 14, 2021

View reviewed changes

gwossum approved these changes Jul 14, 2021

View reviewed changes

lesam merged commit 6d22e69 into influxdata:master-1.x Jul 14, 2021

This was referenced Aug 26, 2021

[forward-port 2.x] Enforce a max field-value size for line protocol #22310

Closed

fix: hard limit on field size while parsing line protocol #22311

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: hard limit on field size while parsing line protocol #21843

fix: hard limit on field size while parsing line protocol #21843

lesam commented Jul 13, 2021 •

edited

Loading

lesam commented Jul 13, 2021

gwossum left a comment

lesam commented Jul 14, 2021

gwossum Jul 14, 2021

lesam Jul 14, 2021

gwossum Jul 14, 2021

gwossum Jul 14, 2021

lesam Jul 14, 2021

gwossum left a comment

fix: hard limit on field size while parsing line protocol #21843

fix: hard limit on field size while parsing line protocol #21843

Conversation

lesam commented Jul 13, 2021 • edited Loading

lesam commented Jul 13, 2021

gwossum left a comment

Choose a reason for hiding this comment

lesam commented Jul 14, 2021

gwossum Jul 14, 2021

Choose a reason for hiding this comment

lesam Jul 14, 2021

Choose a reason for hiding this comment

gwossum Jul 14, 2021

Choose a reason for hiding this comment

gwossum Jul 14, 2021

Choose a reason for hiding this comment

lesam Jul 14, 2021

Choose a reason for hiding this comment

gwossum left a comment

Choose a reason for hiding this comment

lesam commented Jul 13, 2021 •

edited

Loading