Should validate UTF-8 multi-byte validity for short decode path too #239

cowtowncoder · 2021-01-30T03:19:07Z

(note: follow-up to #236)

Looks like "long / slow" decoding path for UTF-8 Strings checks that multi-byte characters do not invalid encoding patterns, as expected (and what JSON parser does), but the quick/short pass (when String value is guaranteed to fit in buffer without bounds checks) does not necessarily similarly verify that -- the first byte is checked as expected, but 2nd - 4th are not. Check should be performed for these cases as well, and we should have basic tests as well.

I also think that since this may uncover existing invalid usage, change should go in 2.13 and not in 2.12 patch: that way we can get bit more testing.

cowtowncoder · 2021-01-30T03:24:37Z

Methods to check in CBORParser:

_finishShortText(): short String values
_decodeShortName(): short property names

cowtowncoder added the 2.13 label Jan 30, 2021

cowtowncoder added the cbor label Jan 30, 2021

cowtowncoder added this to the 2.13.0 milestone Jan 31, 2021

cowtowncoder closed this as completed in f64a886 Jan 31, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Should validate UTF-8 multi-byte validity for short decode path too #239

Should validate UTF-8 multi-byte validity for short decode path too #239

cowtowncoder commented Jan 30, 2021

cowtowncoder commented Jan 30, 2021

Should validate UTF-8 multi-byte validity for short decode path too #239

Should validate UTF-8 multi-byte validity for short decode path too #239

Comments

cowtowncoder commented Jan 30, 2021

cowtowncoder commented Jan 30, 2021