Implement resource budgets in dagcbor parsing. #85

warpfork · 2020-09-24T21:51:29Z

This is an alternative fix for #82 .

This diff keeps a single resource budget throughout the entire parse and unmarshal process, decrementing it as resources are consumed. Where possible, it also checks if a length header suggests the resource limit will be exhausted within the next stretch.

This does not yet include a configuration mechanism, but I've set it to "around 10 megs" for now, which is probably a reasonable safe number to roll out today as a default (considering that in many projects that use IPLD, libp2p is also in play, which sets much lower limits for serial message size already). I'd certainly like to make it properly configurable, but perhaps that can be explored later.

rvagg · 2020-09-25T01:31:08Z

codec/dagcbor/unmarshal.go

+)
+
+const (
+	mapEntryGasScore  = 8


how are you deriving these? is it a rough approximation of the overhead?

Yep.

Fairly arbitrarily derived. I figure it takes -- very, very roughly -- at least this many bytes in memory to add another map entry to memory.

Is this number wrong? Absolutely. I have no idea how many bytes it takes to add a map entry in memory. It's a property of the golang native map implementation, and varies based on the size of the map, which is not something I'm particularly interested in predicting or making specific claims about... and then, even that is only in some Nodes. It's actually zero in some nodes (e.g. codegen'd structs). Or who knows what in some other Node implementation: it's an interface users can supply their own of, after all.

It's sufficient to have a limit; I'm not sure it's necessarily important for the limit to be easy to intuit.

rvagg · 2020-09-25T01:36:14Z

sgtm, we'd just better make sure that all instances of allocation, especially large allocation, are accounted for into the future! it's going to be easy to overlook this.

Also, "gas" used like this really is an Americanism. Not a major objection because it's for internal use, just a note that non-americans have to do a double cognitive jump to get to what it's trying to mean when we read "gas" (I have the same problem with Filecoin; it goes like this: "gas? ohhh, they're Americans, there's a fuel tank involved with a fixed capacity, ok").

mvdan

Generally SGTM, though I would like more docs around the gas scores like Rod says.

If we ever expose this as an option, I do think it should be pretty much around byte sizes, because the user shouldn't have to learn what a made up term like "allocation gas" means.

warpfork · 2020-10-20T06:49:22Z

FWIW, I'm mostly pulling "gas" from Ethereum. It's the first digital project I'm aware of that coined a term for this concept and made it popularized. I'm pretty sure my automata course in college introduced the same core concept right around where it discussed solutions to the halting problem, but whatever the heck it was called in that literature, it sure wasn't memorable. "Gas" is.

I agree with the need for more documentation around this. I don't actually know how to reduce it directly to byte sizes though, and I'm not even sure that's possible, due to all the variations that are potentially involved, discussed in #85 (comment) .

warpfork · 2020-10-20T06:52:08Z

I'm going to merge this for now because it's been out long enough and I'm pretty sure it's better than things were before. More PRs in the future to improve docs and configurability will be highly welcome. We'll also probably want to port this to other codecs -- either in shared code if possible, or at least in shared pattern.

Implement resource budgets in dagcbor parsing.

344457a

rvagg reviewed Sep 25, 2020

View reviewed changes

mvdan reviewed Sep 29, 2020

View reviewed changes

warpfork merged commit 6428f6b into master Oct 20, 2020

warpfork deleted the resource-budget-for-dagcbor-parser branch October 20, 2020 06:52

This was referenced Oct 20, 2020

Security fix: patch crashing inputs #82

Closed

[fuzzing] dagcbor Decode/Encode out of memory error LeastAuthority/go-ipld-prime#6

Open

[fuzzing] dagcbor.Unmarshal panic LeastAuthority/go-ipld-prime#4

Open

aschmahmann mentioned this pull request Feb 18, 2021

Release v0.8.0 ipfs/kubo#7707

Closed

73 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement resource budgets in dagcbor parsing. #85

Implement resource budgets in dagcbor parsing. #85

warpfork commented Sep 24, 2020

rvagg Sep 25, 2020

warpfork Oct 20, 2020

rvagg commented Sep 25, 2020

mvdan left a comment

warpfork commented Oct 20, 2020

warpfork commented Oct 20, 2020

Implement resource budgets in dagcbor parsing. #85

Implement resource budgets in dagcbor parsing. #85

Conversation

warpfork commented Sep 24, 2020

rvagg Sep 25, 2020

Choose a reason for hiding this comment

warpfork Oct 20, 2020

Choose a reason for hiding this comment

rvagg commented Sep 25, 2020

mvdan left a comment

Choose a reason for hiding this comment

warpfork commented Oct 20, 2020

warpfork commented Oct 20, 2020