protobuf

Commit Graph

Author	SHA1	Message	Date
Josh Haberman	932753d91e	WIP.	6 years ago
Joshua Haberman	cf35baa1ad	Moved macros from upb.h to port_def.inc to avoid leaking them to users. (#160 ) * Use port_def.inc to prevent macros from leaking to users. * Added helpful comments to port_def.inc/port_undef.inc.	6 years ago
Joshua Haberman	928ef7f2c0	Removed reflection and other extraneous things from the core library. (#158 ) * Removed reflection and other extraneous things from the core library. * Added missing files and ran buildifier. * New CMakeLists.txt. * Made table its own cc_library() for internal usage.	6 years ago
Joshua Haberman	f4532ab273	Properly align the arena.	6 years ago
Josh Haberman	9ea6bb4678	Renamed upb_stringview -> upb_strview for C terseness.	6 years ago
Josh Haberman	b79fd65a83	WIP.	6 years ago
Josh Haberman	950d7a9530	Fixed warnings.	6 years ago
Josh Haberman	a105c015b1	Added support for unknown fields to upb_msg. After this CL, upb passes all existing proto3 conformance tests. However the conformance suite is missing a lot of cases and should be fleshed out.	6 years ago
Josh Haberman	340bd01338	Removed default instance and oneof array from tables.	6 years ago
Josh Haberman	e94ac4f757	Moved upb_msg parts that depend on def to a separate msgfactory.{c,h}. Also got rid of the premature "v1" business that was attempting to create a binary compatibility story. Also added an in-progress CMakeLists.txt file.	6 years ago
Joshua Haberman	c8f6a27e6b	Enforced that upb_msg lives in an Arena only, and other simplifying. upb_msg was trying to be general enough that it could either live in an arena or be allocated with malloc()/free(). This was too much complexity for too little benefit. We should commit to just saying that upb_msg is arena-only. I also ripped out the code to glue upb_msg to the existing handlers-based encoder/decoder. upb_msg has its own, small, simple encoder/decoder. I'm trying to whittle down upb_msg to a small and simple core. I updated the Lua extension for these changes. Lua needs some more work to properly create arenas per message. For now I just created a single global arena.	6 years ago
Bo Yang	1080117f2b	Revert "Prepare upb_value for encoding/decoding map." This reverts commit `f30dd0ff0c`.	7 years ago
Bo Yang	f30dd0ff0c	Prepare upb_value for encoding/decoding map.	7 years ago
Bo Yang	0833cf29b3	Bytes type should return size of stringview	7 years ago
Bo Yang	bc7f1eaca0	In case of circular dependency, layout has to be inserted first.	7 years ago
Bo Yang	719f644232	Field missing submsg and hasbit information.	7 years ago
Bo Yang	cafebf6bee	For encoding upb needs descriptor type instead of type.	7 years ago
Joshua Haberman	be9094d91a	New encode/decode: most (171 / 192) conformance tests pass.	7 years ago
Joshua Haberman	1278ff8994	Responded to PR comments.	7 years ago
Joshua Haberman	c0a660f474	Added upb_stringview, the string representation for upb_msg.	7 years ago
Joshua Haberman	3e8acc3f4e	Removed incorrect assert and added comments.	7 years ago
Joshua Haberman	af43ea72b5	Removed incorrect assertion. Internal members aren't initialized by default_msg.	7 years ago
Joshua Haberman	2826811367	Responded to PR comments.	7 years ago
Josh Haberman	1aafd4111b	A good start on upb_encode and upb_decode.	7 years ago
Joshua Haberman	9cb10577fc	First version of a real C codegen for upb. Also includes an implementation of the conformance tests to display what the API usage will be like. There is still a lot to do, and things that are broken (oneofs, repeated fields, etc), but it's a good start.	8 years ago
Joshua Haberman	76fcdd2ee9	Removed all upb_msgdef/upb_fielddef from upb_msg.	8 years ago
Joshua Haberman	1b9d37a00e	Start migrating upb_msglayout to be suitable for generated code. This involves: - remove upb_msglayout -> upb_msgfactory dependency. - remove upb_msglayout -> upb_msgdef dependency (in progress). - make upb_msglayout use a representation that can be statically initialized by generated code. The goal here is that upb_msglayout becomes a kind of "descriptor lite": it contains enough data to parser and serialize protobufs and manipulate a upb_msg in memory, while being far smaller and simpler than a full descriptor. It also does not include field names, which can be a benefit for applications that do not want to leak field names. Generated code can then create a upb_msglayout, and do most things without ever needing to construct full descriptors/defs if they don't want to.	8 years ago
Josh Haberman	3b7dc27fb5	Fixed amalgamated build and added test.	8 years ago
Josh Haberman	47da2afd52	Make upb::SymbolTable no longer reference-counted. This transitions it from shared ownership to unique ownership.	8 years ago
Josh Haberman	6cccfe1649	Addressed PR comments.	8 years ago
Josh Haberman	15c388b819	Basic serialization for upb_msg and Lua. Doesn't yet include strings, submessages, maps, or repeated fields.	8 years ago
Josh Haberman	2b77da3da8	Update for final PR comments.	8 years ago
Josh Haberman	949aeee3f1	Changes for PR comments.	8 years ago
Josh Haberman	62472c1161	Suppress warnings on 32-bit for this dead code for now.	8 years ago
Josh Haberman	16ca9309b3	Removed some temporary code and fixed a few tests.	8 years ago
Josh Haberman	4b0c4ca7fb	New upb_msg code and Lua bindings around it. There are still some things that are unfinished, but we are at parity with what Lua had before.	8 years ago
Josh Haberman	7d3e2bd2c4	Sync with 8 months of Google-internal development. Many things have changed and been simplified. The memory-management story for upb_def and upb_handlers is much more robust; upb_def and upb_handlers should be fairly stable interfaces now. There is still much work to do for the runtime component (upb_sink).	12 years ago
Joshua Haberman	86bad61b76	Sync from internal Google development. Many improvements, too many to mention. One significant perf regression warrants investigation: omitfp.parsetoproto2_googlemessage1.upb_jit: 343 -> 252 (-26.53) plain.parsetoproto2_googlemessage1.upb_jit: 334 -> 251 (-24.85) 25% regression for this benchmark is bad, but since I don't think there's any fundamental design issue that caused it I'm going to go ahead with the commit anyway. Can investigate and fix later. Other benchmarks were neutral or showed slight improvement.	13 years ago
Joshua Haberman	1bcab1377d	Sync with internal Google development. This breaks the open-source build, will follow up with a change to fix it.	13 years ago
Joshua Haberman	b5f5ee867e	Refinement of upb_bytesrc interface. Added a upb_byteregion that tracks a region of the input buffer; decoders use this instead of using a upb_bytesrc directly. upb_byteregion is also used as the way of passing a string to a upb_handlers callback. This symmetry makes decoders compose better; if you want to take a parsed string and decode it as something else, you can take the string directly from the callback and feed it as input to another parser. A commented-out version of a pinning interface is present; I decline to actually implement it (and accept its extra complexity) until/unless it is clear that it is actually a win. But it is included as a proof-of-concept, to show that it fits well with the existing interface.	13 years ago
Joshua Haberman	887abe669f	Added an example, constified some more methods.	13 years ago
Joshua Haberman	621c0cdcb5	Const invasion: large parts of upb made const-correct.	13 years ago
Joshua Haberman	adb6580d97	Let the JIT emit hasbit-setting code in addition to calling a callback. This leads to a major (20-40%) improvement in the parsetoproto2 benchmark with small messages. We now are faster than proto2 in all apples-to-apples comparisons, at least given the (admittedly limited) set of benchmarks in this source tree.	13 years ago
Joshua Haberman	06b8181f97	Benchmark to parse into proto2 messages.	13 years ago
Joshua Haberman	6981e468a3	More work on Lua extension, and consequent core refactoring.	14 years ago
Joshua Haberman	10265aa56b	Directory restructure. Includes are now via upb/foo.h. Files specific to the protobuf format are now in upb/pb (the core library is concerned with message definitions, handlers, and byte streams, but knows nothing about any particular serializationf format).	14 years ago
Joshua Haberman	6a1f3a6693	Major refactoring: upb_string is gone in favor of upb_strref.	14 years ago
Joshua Haberman	559e23c796	Major refactoring: abandon upb_msg, add upb_accessors. Next on the chopping block is upb_string.	14 years ago
Joshua Haberman	a503b8859c	Make all handlers objects refcounted. I'm realizing that basically all upb objects will need to be refcounted to be sharable across languages, but not messages which are on their way out so we can get out of the business of data representations. Things which must be refcounted: - encoders, decoders - handlers objects - defs	14 years ago
Joshua Haberman	0941664215	Add startseq/endseq handlers. Startseq/endseq handlers are called at the beginning and end of a sequence of repeated values. Protobuf does not really have direct support for this (repeated primitive fields do not delimit "begin" and "end" of the sequence) but we can infer them from the bytestream. The benefit of supporting them explicitly is that they get their own stack frame and closure, so we can avoid having to find the array's address over and over and deciding if we need to initialize it. This will also pave the way for better support of JSON, which does have explicit "startseq/endseq" markers: [].	14 years ago

50 Commits (56779f09eb993bcde5b7bc1c7de9ad943d6cd5ff)