protobuf

Commit Graph

Author	SHA1	Message	Date
Joshua Haberman	d0c8bb84f4	WIP.	6 years ago
Joshua Haberman	380558922b	test_encoder passes! Other tests still need to be fixed.	6 years ago
Joshua Haberman	aa2d5a609b	Fixed generated code for C++.	6 years ago
Joshua Haberman	10e682cf2a	Added hazzers.	6 years ago
Joshua Haberman	6bcdaa1352	Changed generated array accessors to be more convenient.	6 years ago
Joshua Haberman	336402b4d7	WIP, core library compiles now.	6 years ago
Josh Haberman	b79fd65a83	WIP.	6 years ago
Josh Haberman	950d7a9530	Fixed warnings.	6 years ago
Josh Haberman	a105c015b1	Added support for unknown fields to upb_msg. After this CL, upb passes all existing proto3 conformance tests. However the conformance suite is missing a lot of cases and should be fleshed out.	6 years ago
Josh Haberman	340bd01338	Removed default instance and oneof array from tables.	6 years ago
Josh Haberman	e94ac4f757	Moved upb_msg parts that depend on def to a separate msgfactory.{c,h}. Also got rid of the premature "v1" business that was attempting to create a binary compatibility story. Also added an in-progress CMakeLists.txt file.	6 years ago
Joshua Haberman	7059be68ae	Re-add message handlers to upb/handlers.*. These are still being used by the proto2 bindings.	6 years ago
Joshua Haberman	c8f6a27e6b	Enforced that upb_msg lives in an Arena only, and other simplifying. upb_msg was trying to be general enough that it could either live in an arena or be allocated with malloc()/free(). This was too much complexity for too little benefit. We should commit to just saying that upb_msg is arena-only. I also ripped out the code to glue upb_msg to the existing handlers-based encoder/decoder. upb_msg has its own, small, simple encoder/decoder. I'm trying to whittle down upb_msg to a small and simple core. I updated the Lua extension for these changes. Lua needs some more work to properly create arenas per message. For now I just created a single global arena.	6 years ago
Bo Yang	1080117f2b	Revert "Prepare upb_value for encoding/decoding map." This reverts commit `f30dd0ff0c`.	7 years ago
Bo Yang	f30dd0ff0c	Prepare upb_value for encoding/decoding map.	7 years ago
Bo Yang	cafebf6bee	For encoding upb needs descriptor type instead of type.	7 years ago
Joshua Haberman	be9094d91a	New encode/decode: most (171 / 192) conformance tests pass.	7 years ago
Joshua Haberman	1278ff8994	Responded to PR comments.	7 years ago
Joshua Haberman	c0a660f474	Added upb_stringview, the string representation for upb_msg.	7 years ago
Josh Haberman	1aafd4111b	A good start on upb_encode and upb_decode.	7 years ago
Joshua Haberman	9cb10577fc	First version of a real C codegen for upb. Also includes an implementation of the conformance tests to display what the API usage will be like. There is still a lot to do, and things that are broken (oneofs, repeated fields, etc), but it's a good start.	8 years ago
Joshua Haberman	76fcdd2ee9	Removed all upb_msgdef/upb_fielddef from upb_msg.	8 years ago
Joshua Haberman	1b9d37a00e	Start migrating upb_msglayout to be suitable for generated code. This involves: - remove upb_msglayout -> upb_msgfactory dependency. - remove upb_msglayout -> upb_msgdef dependency (in progress). - make upb_msglayout use a representation that can be statically initialized by generated code. The goal here is that upb_msglayout becomes a kind of "descriptor lite": it contains enough data to parser and serialize protobufs and manipulate a upb_msg in memory, while being far smaller and simpler than a full descriptor. It also does not include field names, which can be a benefit for applications that do not want to leak field names. Generated code can then create a upb_msglayout, and do most things without ever needing to construct full descriptors/defs if they don't want to.	8 years ago
Josh Haberman	c850bc0a4e	Moved upb_symtab to def.h/def.c. This is in anticipation of removing refcounting and making upb_symtab (soon to be upb_defpool) the unique owner of all defs inside.	8 years ago
Josh Haberman	15c388b819	Basic serialization for upb_msg and Lua. Doesn't yet include strings, submessages, maps, or repeated fields.	8 years ago
Josh Haberman	2b77da3da8	Update for final PR comments.	8 years ago
Josh Haberman	949aeee3f1	Changes for PR comments.	8 years ago
Josh Haberman	3122535726	Fleshed out comments and removed some dead code.	8 years ago
Josh Haberman	e977c0af03	Fixed more bugs surfaced by Travis.	8 years ago
Josh Haberman	4b0c4ca7fb	New upb_msg code and Lua bindings around it. There are still some things that are unfinished, but we are at parity with what Lua had before.	8 years ago
Josh Haberman	7d3e2bd2c4	Sync with 8 months of Google-internal development. Many things have changed and been simplified. The memory-management story for upb_def and upb_handlers is much more robust; upb_def and upb_handlers should be fairly stable interfaces now. There is still much work to do for the runtime component (upb_sink).	12 years ago
Joshua Haberman	86bad61b76	Sync from internal Google development. Many improvements, too many to mention. One significant perf regression warrants investigation: omitfp.parsetoproto2_googlemessage1.upb_jit: 343 -> 252 (-26.53) plain.parsetoproto2_googlemessage1.upb_jit: 334 -> 251 (-24.85) 25% regression for this benchmark is bad, but since I don't think there's any fundamental design issue that caused it I'm going to go ahead with the commit anyway. Can investigate and fix later. Other benchmarks were neutral or showed slight improvement.	13 years ago
Joshua Haberman	887abe669f	Added an example, constified some more methods.	13 years ago
Joshua Haberman	621c0cdcb5	Const invasion: large parts of upb made const-correct.	13 years ago
Joshua Haberman	51d4e295a4	Python: fleshed out accessors.	13 years ago
Joshua Haberman	6981e468a3	More work on Lua extension, and consequent core refactoring.	14 years ago
Joshua Haberman	56984e8db8	Significant work on Lua extension. Also changes in core library to accommodate.	14 years ago
Joshua Haberman	10265aa56b	Directory restructure. Includes are now via upb/foo.h. Files specific to the protobuf format are now in upb/pb (the core library is concerned with message definitions, handlers, and byte streams, but knows nothing about any particular serializationf format).	14 years ago
Joshua Haberman	6a1f3a6693	Major refactoring: upb_string is gone in favor of upb_strref.	14 years ago
Joshua Haberman	559e23c796	Major refactoring: abandon upb_msg, add upb_accessors. Next on the chopping block is upb_string.	14 years ago
Joshua Haberman	a503b8859c	Make all handlers objects refcounted. I'm realizing that basically all upb objects will need to be refcounted to be sharable across languages, but not messages which are on their way out so we can get out of the business of data representations. Things which must be refcounted: - encoders, decoders - handlers objects - defs	14 years ago
Joshua Haberman	0941664215	Add startseq/endseq handlers. Startseq/endseq handlers are called at the beginning and end of a sequence of repeated values. Protobuf does not really have direct support for this (repeated primitive fields do not delimit "begin" and "end" of the sequence) but we can infer them from the bytestream. The benefit of supporting them explicitly is that they get their own stack frame and closure, so we can avoid having to find the array's address over and over and deciding if we need to initialize it. This will also pave the way for better support of JSON, which does have explicit "startseq/endseq" markers: [].	14 years ago
Joshua Haberman	d619852e06	Change dispatcher error handling model. Now the dispatcher will call error handlers instaed of returning statuses that the caller has to constantly check.	14 years ago
Joshua Haberman	3231fd0fdd	Vastly improved/simplified the upb_handlers API.	14 years ago
Joshua Haberman	eb622c0531	Split upb_stream -> upb_bytestream/upb_handlers.	14 years ago
Joshua Haberman	7cf5893dcc	Revise/clarify comment about clear() implementation.	14 years ago
Joshua Haberman	066d1e024c	Speed up parsetostruct by using type-specialized callbacks.	14 years ago
Josh Haberman	b796c1b317	Update copyright to be Google Inc. This doesn't reflect any material change in how I will be working on upb, and I have no problem making this change. It's still open source under the BSD license, and I'll still be working on it well beyond the hours that constitute a normal job.	14 years ago
Josh Haberman	8ef6873e0e	upb_stream: all callbacks registered ahead-of-time. This is a significant change to the upb_stream protocol, and should hopefully be the last significant change. All callbacks are now registered ahead-of-time instead of having delegated callbacks registered at runtime, which makes it much easier to aggressively optimize ahead-of-time (like with a JIT). Other impacts of this change: - You no longer need to have loaded descriptor.proto as a upb_def to load other descriptors! This means the special-case code we used for bootstrapping is no longer necessary, and we no longer need to link the descriptor for descriptor.proto into upb. - A client can now register any upb_value as what will be delivered to their value callback, not just a upb_fielddef*. This should allow for other clients to get more bang out of the streaming decoder. This change unfortunately causes a bit of a performance regression -- I think largely due to highly suboptimal code that GCC generates when structs are returned by value. See: http://blog.reverberate.org/2011/03/19/when-a-compilers-slow-code-actually-bites-you/ On the other hand, once we have a JIT this should no longer matter. Performance numbers: plain.parsestream_googlemessage1.upb_table: 374 -> 396 (5.88) plain.parsestream_googlemessage2.upb_table: 616 -> 449 (-27.11) plain.parsetostruct_googlemessage1.upb_table_byref: 268 -> 269 (0.37) plain.parsetostruct_googlemessage1.upb_table_byval: 215 -> 204 (-5.12) plain.parsetostruct_googlemessage2.upb_table_byref: 307 -> 281 (-8.47) plain.parsetostruct_googlemessage2.upb_table_byval: 297 -> 272 (-8.42) omitfp.parsestream_googlemessage1.upb_table: 423 -> 410 (-3.07) omitfp.parsestream_googlemessage2.upb_table: 679 -> 483 (-28.87) omitfp.parsetostruct_googlemessage1.upb_table_byref: 287 -> 282 (-1.74) omitfp.parsetostruct_googlemessage1.upb_table_byval: 226 -> 219 (-3.10) omitfp.parsetostruct_googlemessage2.upb_table_byref: 315 -> 298 (-5.40) omitfp.parsetostruct_googlemessage2.upb_table_byval: 297 -> 287 (-3.37)	14 years ago
Joshua Haberman	a75a305c77	Implemented upb_stringsink, upb_msgtotext, and exposed the latter to Lua.	14 years ago

43 Commits (48863ea0be94ea3d3d61206ad7ce9ead206770fa)