protobuf

Commit Graph

Author	SHA1	Message	Date
Joshua Haberman	d22d6d71ed	Refactored message accessors to share a common set of functions instead of duplicating logic. Prior to this CL, there were several different code paths for reading/writing message data. Generated code, MiniTable accessors, and reflection all performed direct manipulation of the bits and bytes in a message, but they all had distinct implementations that did not share much of any code. This divergence meant that they could easily have different behavior, bugs could creep into one but not another, and we would need three different sets of tests to get full test coverage. This also made it very difficult to change the internal representation in any way, since it would require updating many places in the code. With this CL, the three different APIs for accessing message data now all share a common set of functions. The common functions all take a `upb_MiniTableField` as the canonical description of a field's type and layout. The lowest-level functions are very branchy, as they must test for every possible variation in the field type (field vs oneof, hasbit vs no-hasbit, different field sizes, whether a nonzero default value exists, extension vs. regular field), however these functions are declared inline and designed to be very optimizable when values are known at compile time. In generated accessors, for example, we can declare constant `upb_MiniTableField` instances so that all values can constant-propagate, and we can get fully specialized code even though we are calling a generic function. On the other hand, when we use the generic functions from reflection, we get runtime branches since values are not known at compile time. But even the function is written to still be as efficient as possible even when used from reflection. For example, we use memcpy() calls with constant length so that the compiler can optimize these into inline loads/stores without having to make an out-of-line call to memcpy(). In this way, this CL should be a benefit to both correctness and performance. It will also make it easier to change the message representation, for example to optimize the encoder by giving hasbits to all fields. Note that we have not completely consolidated all access in this CL: 1. Some functions outside of get/set such as clear and hazzers are not yet unified. 2. The encoder and decoder still touch the message without going through the common functions. The encoder and decoder require a bit more specialized code to get good performance when reading/writing fields en masse. PiperOrigin-RevId: 490016095	2 years ago
Eric Salo	384ffc0af8	implement reserved names and ranges for messages and enums https://github.com/protocolbuffers/protobuf/issues/10158 PiperOrigin-RevId: 489285657	2 years ago
Eric Salo	75907f7af9	rename the upb_MiniTable subtypes to follow the upb style guide: upb_MiniTable_Enum -> upb_MiniTableEnum upb_MiniTable_Extension -> upb_MiniTableExtension upb_MiniTable_Field -> upb_MiniTableField upb_MiniTable_File -> upb_MiniTableFile upb_MiniTable_Sub -> upb_MiniTableSub PiperOrigin-RevId: 486712960	2 years ago
Eric Salo	f6307877d3	move portability stuff into upb/port/ Also delete redundant system #includes that are already pulled in by port/def.inc PiperOrigin-RevId: 486398989	2 years ago
Eric Salo	d9b6f13cde	remove upb_MtDataEncoder from the public surface PiperOrigin-RevId: 485928803	2 years ago
Eric Salo	c033eff26f	split apart mini_table.c into a new subdir PiperOrigin-RevId: 484352293	2 years ago
Eric Salo	40998462d6	add upb_MtDataEncoder_EncodeExtension() We need to sharpen the distinction between messages and extensions in the mini descriptor encoder, so split the code paths for each. PiperOrigin-RevId: 480675339	2 years ago
Joshua Haberman	185d4f09d9	Simplified extension table building slightly by avoiding direct mutation PiperOrigin-RevId: 478953396	2 years ago
Eric Salo	44916d7d27	create reflection_internal library internal declarations are now physically removed from the public headers PiperOrigin-RevId: 478921131	2 years ago
Joshua Haberman	5d0833f48c	Removed obsolete exemption for closed enums The referenced bug was fixed long ago: https://github.com/protocolbuffers/upb/issues/541 PiperOrigin-RevId: 478870590	2 years ago
Eric Salo	0f585c69af	stop requiring extension fields to have a synthetic oneof https://github.com/protocolbuffers/upb/issues/812 PiperOrigin-RevId: 478527073	2 years ago
Joshua Haberman	128ac1c935	Bugfix for when UPB_TREAT_PROTO2_ENUMS_LIKE_PROTO3 is being used This is not easy to test in google3 since we do not use the flag in Google3. It is only used for Ruby. PiperOrigin-RevId: 478071539	2 years ago
Eric Salo	b8bec58e01	pull the mini descriptor encoders into their proper .c files Performance neutral but it simplifies the code, shrinks the public surface, and makes logical sense PiperOrigin-RevId: 478038589	2 years ago
Eric Salo	1e3deb013d	move message/field modifiers functions out of mini_descriptor_encode.c PiperOrigin-RevId: 477486367	2 years ago
Eric Salo	efd06e46a4	use mini descriptors to build message defs and extension defs PiperOrigin-RevId: 477332937	2 years ago
Eric Salo	38d8430923	simplify makejsonname() PiperOrigin-RevId: 474577074	2 years ago
Eric Salo	00765002ff	- All of reflection now lives in upb/reflection/ - Each def type has its own .c file and its own .h file - Functions that require a builder context are declared in def_builder.h - The mini descriptor encoders have also been pulled into upb/reflection/ - upb/def.h, upb/def.hpp, upb/reflection.h, and upb/reflection.hpp are now deprecated stubs that point to the new headers PiperOrigin-RevId: 474459500	2 years ago

17 Commits (2272970a94874b5efb967bc44588868fe1881620)