protobuf

Commit Graph

Author	SHA1	Message	Date
Joshua Haberman	f7ed1f27a3	Support non-zero minutes in the timestamp offset for JSON.	4 years ago
Joshua Haberman	6c30b5fe73	Fixed upb encoder for field numbers > 2**28. The encoder was improperly sign-extending the tag to 64 bits.	4 years ago
Joshua Haberman	e9551022c1	Added depth limit checking to upb_encode(). This can catch infinite recursion due to loops, or just excessively deep message trees. The depth limit is configurable, but defaults to 64.	4 years ago
Joshua Haberman	7a17493269	Removed print debugging.	4 years ago
Joshua Haberman	695b7f4617	Added code to test UPB_JSONENC_EMITDEFAULTS.	4 years ago
Joshua Haberman	ee49a8d7df	Added an accessor to get the symtab from a filedef. This matches an API already present in proto2 (const DescriptorPool* FileDescriptor::pool()). However there is a slightly subtle implication here. In proto2, the relationship between Descriptor and MessageFactory is 1:many. You can create as many DynamicMessageFactory instances as you want, and each one will have its own independent DynamicMessage prototype and computed layout for the same underlying Descriptor. In practice the layouts will all be the same, but one thing that could be distinct is that each can have its own extension pool, which is a DescriptorPool that will be searched for extensions when parsing. In contrast, upb does not have a separate "message factory" abstraction. That means that each upb_msgdef has a single distinct layout, in other words a 1:1 correspondence between descriptor and layout. This means that there is no way to create multiple message types for the same descriptor that have distinct extension pools. If you want a different set of extensions, you must create a separate upb_symtab with a distinct set of descriptors. This change further entrenches that upb_filedef:upb_symtab is a 1:1 relationship. A single upb_filedef cannot be a member of multiple symbol tables. In practice this was already true (there is no way to add a single filedef to multiple symbol tables) but this change codifies this 1:1 relationship.	4 years ago
Joshua Haberman	871ff96252	Test SKIPUNKNOWN on regular fields.	4 years ago
Joshua Haberman	0569c22a1e	Removed debug print.	4 years ago
Joshua Haberman	76764643ac	Added option to binary encoder to skip unknown fields.	4 years ago
Joshua Haberman	a04627abc8	Added map sorting to binary and text encoders. For the binary encoder, sorting is off by default. For the text encoder, sorting is on by default. Both defaults can be explicitly overridden. This grows code size a bit. I think we could potentially shave this (and other map-related code size) by having the generated code inject a function pointer to the map-related parsing/serialization code if maps are present. FILE SIZE VM SIZE -------------- -------------- +86% +1.07Ki +71% +768 upb/msg.c [NEW] +391 [NEW] +344 _upb_mapsorter_pushmap [NEW] +158 [NEW] +112 _upb_mapsorter_cmpstr [NEW] +111 [NEW] +64 _upb_mapsorter_cmpbool [NEW] +110 [NEW] +64 _upb_mapsorter_cmpi32 [NEW] +110 [NEW] +64 _upb_mapsorter_cmpi64 [NEW] +110 [NEW] +64 _upb_mapsorter_cmpu32 [NEW] +110 [NEW] +64 _upb_mapsorter_cmpu64 -3.6% -8 -4.3% -8 _upb_map_new +9.5% +464 +9.2% +424 upb/text_encode.c [NEW] +656 [NEW] +616 txtenc_mapentry +15% +32 +20% +32 upb_text_encode -20.1% -224 -20.7% -224 txtenc_msg +5.7% +342 +5.3% +296 upb/encode.c [NEW] +344 [NEW] +304 encode_mapentry [NEW] +246 [NEW] +208 upb_encode_ex [NEW] +41 [NEW] +16 upb_encode_ex.ch +0.7% +8 +0.7% +8 encode_scalar -1.0% -32 -1.0% -32 encode_message [DEL] -38 [DEL] -16 upb_encode.ch [DEL] -227 [DEL] -192 upb_encode +2.0% +152 +2.2% +152 upb/decode.c +44% +128 +44% +128 [section .rodata] +3.4% +24 +3.4% +24 _GLOBAL_OFFSET_TABLE_ +0.6% +107 +0.3% +48 upb/def.c [NEW] +100 [NEW] +48 upb_fielddef_descriptortype +7.1% +7 [ = ] 0 upb_fielddef_defaultint32 +2.9% +24 +2.9% +24 [section .dynsym] +1.2% +24 [ = ] 0 [section .symtab] +3.2% +16 +3.2% +16 [section .plt] [NEW] +16 [NEW] +16 memcmp@plt +0.5% +16 +0.6% +16 tests/conformance_upb.c +1.5% +16 +1.6% +16 DoTestIo +0.1% +16 +0.1% +16 upb/json_decode.c +0.4% +16 +0.4% +16 jsondec_wellknown +3.0% +8 +3.0% +8 [section .got.plt] +3.0% +8 +3.0% +8 _GLOBAL_OFFSET_TABLE_ +1.6% +7 +1.6% +7 [section .dynstr] +1.8% +4 +1.8% +4 [section .hash] +0.5% +3 +0.5% +3 [LOAD #2 [RX]] +2.8% +2 +2.8% +2 [section .gnu.version] -60.0% -1.74Ki [ = ] 0 [Unmapped] +0.3% +496 +1.4% +1.74Ki TOTAL	4 years ago
Joshua Haberman	64abb5eb11	Amalgamation no longer bundles wyhash, but #includes it. Also fixed a few spelling mistakes.	4 years ago
Joshua Haberman	c9f9668234	symtab: use longjmp() for errors and avoid intermediate table. We used to use a separate "add table" during the upb_symtab_addfile() operation to make it easier to back out the file if it contained errors. But this created unnecessary work of re-adding the same symbols to the main symtab once everything was validated. Instead we directly add symbols to the main symbols table. If there is an error in validation, we remove precisely the set of symbols that were already added. This also requires using a separate arena for each file. We can fuse it with the symtab's main arena if the operation is successful. LoadDescriptor_Upb 61.2µs ± 4% 53.5µs ± 1% -12.50% (p=0.000 n=12+12) LoadAdsDescriptor_Upb 4.43ms ± 1% 3.06ms ± 0% -31.00% (p=0.000 n=12+12) LoadDescriptor_Proto2 257µs ± 0% 259µs ± 0% +1.00% (p=0.000 n=12+12) LoadAdsDescriptor_Proto2 13.9ms ± 1% 13.9ms ± 1% ~ (p=0.128 n=12+12)	4 years ago
Joshua Haberman	154f2c25f4	Added UTF-8 validation for proto3 string fields.	4 years ago
Joshua Haberman	e86541ac1d	Fixed the build after the merge.	4 years ago
Joshua Haberman	a0d16e7073	Added a few missing copts, and made some functions proper prototypes.	4 years ago
Joshua Haberman	86d9908c55	Fastdecode support for packed fields. This is not very optimized yet. There is a lot of room to optimize it further.	4 years ago
Joshua Haberman	2c1664906a	Removed license comments and upb_amalgamation for google3.	4 years ago
Joshua Haberman	b7dc77415a	Added licenses() to all BUILD files.	4 years ago
Joshua Haberman	e3f41de6c7	Split monolithic BUILD file into many build files.	4 years ago
Joshua Haberman	d5096f9ee8	Fixed bug in addunknown and added ASAN poisoning.	4 years ago
Joshua Haberman	5aa5b77b41	Added simple offset-based accessors for defs, and deprecated old iterators.	4 years ago
Joshua Haberman	8e26a33bcb	Added a test for UTF-8 parse checking and added missing error reporting.	5 years ago
Joshua Haberman	a1c2caeb25	More arena tests. (#279)	5 years ago
Joshua Haberman	6c4acba610	Implemented upb_arena_fuse() (#278) * WIP. * WIP. * Tests are passing. * Recover some perf: LIKELY doesn't propagate through functions. :( * Added some more benchmarks. * Simplify & optimize upb_arena_realloc(). * Only add owned blocks to the freelist. * More optimization/simplification. * Re-fixed the bug. * Revert unintentional changes to parser.rl. * Revert Lua changes for now. * Revert the arena fuse changes for now. * Added last_size to the arena representation. * Re-applied Lua changes. * Implemented upb_arena_fuse(). * Fix the compile by re-ordering statements. * Improve comments.	5 years ago
Joshua Haberman	4c6dcc3c6b	[textformat]: added missing newline when a message opens. (#245) * [textformat]: added missing newline when a message opens. * Added tostring() support to Lua that prints to text format. Also fixed a gnarly bug that this exposed.	5 years ago
Joshua Haberman	3d955e684c	Added "extern C" blocks to textencode. (#244) * Added "extern C" blocks to textencode. * Added accidentally-deleted test_upb.lua, deleted unneeded test.proto.	5 years ago
Joshua Haberman	ca512852f3	Fixed parsing for string->double maps. (#243) Map parsing/serializing relies on map entries always having a predictable order. The code that generates layout was not respecting this in the case of string keys and primitive values.	5 years ago
Joshua Haberman	806c8c9c6e	Removed obsolete testing files.	5 years ago
Joshua Haberman	2a85bef825	Generated code interface for maps is complete, though not yet tested.	5 years ago
Joshua Haberman	382f92a87f	Maps encode and decode successfully!	5 years ago
Joshua Haberman	4c57b1fefd	More progress on Lua extension.	5 years ago
Joshua Haberman	d6c3152c0b	Added more Lua tests that are passing. Also ripped out the ctype checking in upb_table, it was not helpful (didn't help catch bugs) but was causing problems.	5 years ago
Joshua Haberman	ae66e571d4	Fixed some bugs and added a few more tests.	5 years ago
Joshua Haberman	bfc86d3577	Fixed many bugs, basic Lua test passes!	5 years ago
Joshua Haberman	b518b06d75	Lua test program is loaded successfully.	5 years ago
Joshua Haberman	88d996132e	Added Lua main.c test driver program.	5 years ago
Josh Haberman	b290a5dd65	Disabled another Lua test for the time being.	7 years ago
Josh Haberman	340bd01338	Removed default instance and oneof array from tables.	7 years ago
Joshua Haberman	c8f6a27e6b	Enforced that upb_msg lives in an Arena only, and other simplifying. upb_msg was trying to be general enough that it could either live in an arena or be allocated with malloc()/free(). This was too much complexity for too little benefit. We should commit to just saying that upb_msg is arena-only. I also ripped out the code to glue upb_msg to the existing handlers-based encoder/decoder. upb_msg has its own, small, simple encoder/decoder. I'm trying to whittle down upb_msg to a small and simple core. I updated the Lua extension for these changes. Lua needs some more work to properly create arenas per message. For now I just created a single global arena.	7 years ago
Joshua Haberman	1b9d37a00e	Start migrating upb_msglayout to be suitable for generated code. This involves: - remove upb_msglayout -> upb_msgfactory dependency. - remove upb_msglayout -> upb_msgdef dependency (in progress). - make upb_msglayout use a representation that can be statically initialized by generated code. The goal here is that upb_msglayout becomes a kind of "descriptor lite": it contains enough data to parse and serialize protobufs and manipulate a upb_msg in memory, while being far smaller and simpler than a full descriptor. It also does not include field names, which can be a benefit for applications that do not want to leak field names. Generated code can then create a upb_msglayout, and do most things without ever needing to construct full descriptors/defs if they don't want to.	8 years ago
Josh Haberman	693b841ec6	Removed all code for adding extensions to upb_symtab. This means extensions can't be used until we implement the replacement APIs for accessing extensions from a symtab.	8 years ago
Josh Haberman	47da2afd52	Make upb::SymbolTable no longer reference-counted. This transitions it from shared ownership to unique ownership.	8 years ago
Josh Haberman	15c388b819	Basic serialization for upb_msg and Lua. Doesn't yet include strings, submessages, maps, or repeated fields.	8 years ago
Josh Haberman	949aeee3f1	Changes for PR comments.	8 years ago
Josh Haberman	4b0c4ca7fb	New upb_msg code and Lua bindings around it. There are still some things that are unfinished, but we are at parity with what Lua had before.	8 years ago
Joshua Haberman	ac2689cec7	Put oneofs in the same table as fields. (#60) * Put oneofs in the same table as fields. Oneofs and fields are not allowed to have names that conflict, so we might as well put them all in the same table. This also allows an efficient operation that looks for both fields and oneofs in a single lookup. Added support for OneofDef to Lua to allow testing of this. * Addressed PR comments.	9 years ago
Josh Haberman	49dab06e03	Brought into compliance with Google open-source policies. - removed myself from Author headers in source files. - removed copyright notices from source file headers. - added CONTRIBUTING.md	10 years ago
Josh Haberman	016587ea33	Moved lunit to third_party for Google compliance.	10 years ago
Josh Haberman	eace8e3295	Enable Travis for Clang, and enable -Werror for all Travis builds. Also added an extra Clang-only warning flag.	10 years ago
Josh Haberman	51cf616dab	Changes to Lua module loading, and file generation. This change has several parts: 1. Resurrected tools/upbc. The code was all there but the build was broken for open-source. Now you can type "make tools/upbc" and it will build all necessary Lua modules and create a robust shell script for running upbc. 2. Changed Lua module loading to no longer rely on OS-level .so dependencies. The net effect of this is that you now only need to set LUA_PATH and LUA_CPATH; setting LD_LIBRARY_PATH or rpaths is no longer required. Downside: this drops compatibility with Lua 5.1, since it depends on a feature that only exists in Lua >=5.2 (and LuaJIT). 3. Since upbc works again, I fixed the re-generation of the descriptor files (descriptor.upb.h, descriptor.upb.c). "make genfiles" will re-generate these as well as the JIT code generator. 4. Added a Travis test target that ensures that the checked-in generated files are not out of date. I would do this for the Ragel generated file also, but we can't count on all versions of Ragel to necessarily generate identical output. 5. Changed Makefile to no longer automatically run Ragel to regenerate the JSON parser. This is unfortunate, because it's convenient when you're developing the JSON parser. However, "git clone" sometimes skews the timestamps a little bit so that "make" thinks it needs to regenerate these files for a fresh "git clone." This would normally be harmless, but if the user doesn't have Ragel installed, it makes the build fail completely. So now you have to explicitly regenerate the Ragel output. If you want to you can uncomment the auto-generation during development.	10 years ago

62 Commits (ed5b4108e0a50556e304368d77b3b656fad35fe9)