protobuf

Commit Graph

Author	SHA1	Message	Date
Eric Salo	a77b9665e1	move lua/ up to the top level directory where python/ lives PiperOrigin-RevId: 486786325	2 years ago
Mike Kruskal	17b6451684	Bumping protobuf dependency to newer commit PiperOrigin-RevId: 460811319	2 years ago
Protobuf Team Bot	97993b219d	rename upb::SymbolTable as upb::DefPool PiperOrigin-RevId: 456147709	2 years ago
Joshua Haberman	11b6df0c46	Moved tests into the main source tree.	3 years ago
Joshua Haberman	032400a03e	Fixed data corruption when total hasbits are a power of two.	3 years ago
Joshua Haberman	5c28ab6b2c	Implemented upb_enumvaldef, for storing information about enumvals.	3 years ago
Joshua Haberman	e8ba2a1899	Added a fix for locales that output ',' as decimal separator.	4 years ago
Joshua Haberman	9482957425	Enforce that filenames are unique when loaded into symtab. This brings upb into line with C++. PHP already checks this internally, so this should not be an issue there. Ruby on the other hand does not currently check this, so this change will cause our Ruby implementation to reject some programs that would otherwise have been accepted.	4 years ago
Joshua Haberman	823eb09694	Update all 2011 dates to 2021.	4 years ago
Joshua Haberman	e59d2c8fa7	Added license headers to all files.	4 years ago
Joshua Haberman	add9b12f18	Fixed quadratic memory usage in upb_array_append(). We were erroneously calling realloc() instead of resize(), forcing the entire array to be reallocated for every array append.	4 years ago
Joshua Haberman	f7ed1f27a3	Support non-zero minutes in the timestamp offset for JSON.	4 years ago
Joshua Haberman	6c30b5fe73	Fixed upb encoder for field numbers > 2**28. The encoder was improperly sign-extending the tag to 64 bits.	4 years ago
Joshua Haberman	e9551022c1	Added depth limit checking to upb_encode(). This can catch infinite recursion due to loops, or just excessively deep message trees. The depth limit is configurable, but defaults to 64.	4 years ago
Joshua Haberman	7a17493269	Removed print debugging.	4 years ago
Joshua Haberman	695b7f4617	Added code to test UPB_JSONENC_EMITDEFAULTS.	4 years ago
Joshua Haberman	ee49a8d7df	Added an accessor to get the symtab from a filedef. This matches an API already present in proto2 (const DescriptorPool* FileDescriptor::pool()). However there is a slightly subtle implication here. In proto2, the relationship between Descriptor and MessageFactory is 1:many. You can create as many DynamicMessageFactory instances as you want, and each one will have its own independent DynamicMessage prototype and computed layout for the same underlying Descriptor. In practice the layouts will all be the same, but one thing that could be distinct is that each can have its own extension pool, which is a DescriptorPool that will be searched for extensions when parsing. In contrast, upb does not have a separate "message factory" abstraction. That means that each upb_msgdef has a single distinct layout, in other words a 1:1 correspondence between descriptor and layout. This means that there is no way to create multiple message types for the same descriptor that have distinct extension pools. If you want a different set of extensions, you must create a separate upb_symtab with a distinct set of descriptors. This change further entrenches that upb_filedef:upb_symtab is a 1:1 relationship. A single upb_filedef cannot be a member of multiple symbol tables. In practice this was already true (there is no way to add a single filedef to multiple symbol tables) but this change codifies this 1:1 relationship.	4 years ago
Joshua Haberman	871ff96252	Test SKIPUNKNOWN on regular fields.	4 years ago
Joshua Haberman	0569c22a1e	Removed debug print.	4 years ago
Joshua Haberman	76764643ac	Added option to binary encoder to skip unknown fields.	4 years ago
Joshua Haberman	a04627abc8	Added map sorting to binary and text encoders. For the binary encoder, sorting is off by default. For the text encoder, sorting is on by default. Both defaults can be explicitly overridden. This grows code size a bit. I think we could potentially shave this (and other map-related code size) by having the generated code inject a function pointer to the map-related parsing/serialization code if maps are present. FILE SIZE VM SIZE -------------- -------------- +86% +1.07Ki +71% +768 upb/msg.c [NEW] +391 [NEW] +344 _upb_mapsorter_pushmap [NEW] +158 [NEW] +112 _upb_mapsorter_cmpstr [NEW] +111 [NEW] +64 _upb_mapsorter_cmpbool [NEW] +110 [NEW] +64 _upb_mapsorter_cmpi32 [NEW] +110 [NEW] +64 _upb_mapsorter_cmpi64 [NEW] +110 [NEW] +64 _upb_mapsorter_cmpu32 [NEW] +110 [NEW] +64 _upb_mapsorter_cmpu64 -3.6% -8 -4.3% -8 _upb_map_new +9.5% +464 +9.2% +424 upb/text_encode.c [NEW] +656 [NEW] +616 txtenc_mapentry +15% +32 +20% +32 upb_text_encode -20.1% -224 -20.7% -224 txtenc_msg +5.7% +342 +5.3% +296 upb/encode.c [NEW] +344 [NEW] +304 encode_mapentry [NEW] +246 [NEW] +208 upb_encode_ex [NEW] +41 [NEW] +16 upb_encode_ex.ch +0.7% +8 +0.7% +8 encode_scalar -1.0% -32 -1.0% -32 encode_message [DEL] -38 [DEL] -16 upb_encode.ch [DEL] -227 [DEL] -192 upb_encode +2.0% +152 +2.2% +152 upb/decode.c +44% +128 +44% +128 [section .rodata] +3.4% +24 +3.4% +24 _GLOBAL_OFFSET_TABLE_ +0.6% +107 +0.3% +48 upb/def.c [NEW] +100 [NEW] +48 upb_fielddef_descriptortype +7.1% +7 [ = ] 0 upb_fielddef_defaultint32 +2.9% +24 +2.9% +24 [section .dynsym] +1.2% +24 [ = ] 0 [section .symtab] +3.2% +16 +3.2% +16 [section .plt] [NEW] +16 [NEW] +16 memcmp@plt +0.5% +16 +0.6% +16 tests/conformance_upb.c +1.5% +16 +1.6% +16 DoTestIo +0.1% +16 +0.1% +16 upb/json_decode.c +0.4% +16 +0.4% +16 jsondec_wellknown +3.0% +8 +3.0% +8 [section .got.plt] +3.0% +8 +3.0% +8 _GLOBAL_OFFSET_TABLE_ +1.6% +7 +1.6% +7 [section .dynstr] +1.8% +4 +1.8% +4 [section .hash] +0.5% +3 +0.5% +3 [LOAD #2 [RX]] +2.8% +2 +2.8% +2 [section .gnu.version] -60.0% -1.74Ki [ = ] 0 [Unmapped] +0.3% +496 +1.4% +1.74Ki TOTAL	4 years ago
Joshua Haberman	64abb5eb11	Amalgamation no longer bundles wyhash, but #includes it. Also fixed a few spelling mistakes.	4 years ago
Joshua Haberman	c9f9668234	symtab: use longjmp() for errors and avoid intermediate table. We used to use a separate "add table" during the upb_symtab_addfile() operation to make it easier to back out the file if it contained errors. But this created unnecessary work of re-adding the same symbols to the main symtab once everything was validated. Instead we directly add symbols to the main symbols table. If there is an error in validation, we remove precisely the set of symbols that were already added. This also requires using a separate arena for each file. We can fuse it with the symtab's main arena if the operation is successful. LoadDescriptor_Upb 61.2µs ± 4% 53.5µs ± 1% -12.50% (p=0.000 n=12+12) LoadAdsDescriptor_Upb 4.43ms ± 1% 3.06ms ± 0% -31.00% (p=0.000 n=12+12) LoadDescriptor_Proto2 257µs ± 0% 259µs ± 0% +1.00% (p=0.000 n=12+12) LoadAdsDescriptor_Proto2 13.9ms ± 1% 13.9ms ± 1% ~ (p=0.128 n=12+12)	4 years ago
Joshua Haberman	154f2c25f4	Added UTF-8 validation for proto3 string fields.	4 years ago
Joshua Haberman	86d9908c55	Fastdecode support for packed fields. This is not very optimized yet. There is a lot of room to optimize it further.	4 years ago
Joshua Haberman	e3f41de6c7	Split monolithic BUILD file into many build files.	4 years ago
Joshua Haberman	d5096f9ee8	Fixed bug in addunknown and added ASAN poisoning.	4 years ago
Joshua Haberman	5aa5b77b41	Added simple offset-based accessors for defs, and deprecated old iterators.	4 years ago
Joshua Haberman	8e26a33bcb	Added a test for UTF-8 parse checking and added missing error reporting.	4 years ago
Joshua Haberman	a1c2caeb25	More arena tests. (#279)	5 years ago
Joshua Haberman	6c4acba610	Implemented upb_arena_fuse() (#278) * WIP. * WIP. * Tests are passing. * Recover some perf: LIKELY doesn't propagate through functions. :( * Added some more benchmarks. * Simplify & optimize upb_arena_realloc(). * Only add owned blocks to the freelist. * More optimization/simplification. * Re-fixed the bug. * Revert unintentional changes to parser.rl. * Revert Lua changes for now. * Revert the arena fuse changes for now. * Added last_size to the arena representation. * Re-applied Lua changes. * Implemented upb_arena_fuse(). * Fix the compile by re-ordering statements. * Improve comments.	5 years ago
Joshua Haberman	4c6dcc3c6b	[textformat]: added missing newline when a message opens. (#245) * [textformat]: added missing newline when a message opens. * Added tostring() support to Lua that prints to text format. Also fixed a gnarly bug that this exposed.	5 years ago
Joshua Haberman	ca512852f3	Fixed parsing for string->double maps. (#243) Map parsing/serializing relies on map entries always having a predictable order. The code that generates layout was not respecting this in the case of string keys and primitive values.	5 years ago
Joshua Haberman	2a85bef825	Generated code interface for maps is complete, though not yet tested.	5 years ago
Joshua Haberman	382f92a87f	Maps encode and decode successfully!	5 years ago
Joshua Haberman	4c57b1fefd	More progress on Lua extension.	5 years ago
Joshua Haberman	d6c3152c0b	Added more Lua tests that are passing. Also ripped out the ctype checking in upb_table, it was not helpful (didn't help catch bugs) but was causing problems.	5 years ago
Joshua Haberman	ae66e571d4	Fixed some bugs and added a few more tests.	5 years ago
Joshua Haberman	bfc86d3577	Fixed many bugs, basic Lua test passes!	5 years ago
Joshua Haberman	b518b06d75	Lua test program is loaded successfully.	5 years ago
Josh Haberman	340bd01338	Removed default instance and oneof array from tables.	6 years ago
Joshua Haberman	1b9d37a00e	Start migrating upb_msglayout to be suitable for generated code. This involves: - remove upb_msglayout -> upb_msgfactory dependency. - remove upb_msglayout -> upb_msgdef dependency (in progress). - make upb_msglayout use a representation that can be statically initialized by generated code. The goal here is that upb_msglayout becomes a kind of "descriptor lite": it contains enough data to parse and serialize protobufs and manipulate a upb_msg in memory, while being far smaller and simpler than a full descriptor. It also does not include field names, which can be a benefit for applications that do not want to leak field names. Generated code can then create a upb_msglayout, and do most things without ever needing to construct full descriptors/defs if they don't want to.	8 years ago
Josh Haberman	693b841ec6	Removed all code for adding extensions to upb_symtab. This means extensions can't be used until we implement the replacement APIs for accessing extensions from a symtab.	8 years ago
Josh Haberman	47da2afd52	Make upb::SymbolTable no longer reference-counted. This transitions it from shared ownership to unique ownership.	8 years ago
Josh Haberman	949aeee3f1	Changes for PR comments.	8 years ago
Josh Haberman	4b0c4ca7fb	New upb_msg code and Lua bindings around it. There are still some things that are unfinished, but we are at parity with what Lua had before.	8 years ago
Joshua Haberman	ac2689cec7	Put oneofs in the same table as fields. (#60) * Put oneofs in the same table as fields. Oneofs and fields are not allowed to have names that conflict, so we might as well put them all in the same table. This also allows an efficient operation that looks for both fields and oneofs in a single lookup. Added support for OneofDef to Lua to allow testing of this. * Addressed PR comments.	9 years ago
Josh Haberman	66a74a4fd5	Added lua and core32 Travis builds, and rewrote README.md	10 years ago
Josh Haberman	2d10fa3307	Sync from internal Google development.	11 years ago

1 Commits (b399576ccdb97ed37afc045327852b298e32eff3)