protobuf

Commit Graph

Author	SHA1	Message	Date
Josh Haberman	49dab06e03	Brought into compliance with Google open-source policies. - removed myself from Author headers in source files. - removed copyright notices from source file headers. - added CONTRIBUTING.md	10 years ago
Josh Haberman	919fea438a	Ported upb to C89, for greater portability. A large part of this change contains surface-level porting, like moving variable declarations to the top of the block. However there are a few more substantial things too: - moved internal-only struct definitions to a separate file (structdefs.int.h), for greater encapsulation and ABI compatibility. - removed the UPB_UPCAST macro, since it requires access to the internal-only struct definitions. Replaced uses with calls to inline, type-safe casting functions. - removed the UPB_DEFINE_CLASS/UPB_DEFINE_STRUCT macros. Class and struct definitions are now more explicit -- you get to see the actual class/struct keywords in the source. The casting convenience functions have been moved into UPB_DECLARE_DERIVED_TYPE() and UPB_DECLARE_DERIVED_TYPE2(). - the new way that we duplicate base methods in derived types is also more convenient and requires less duplication. It is also less greppable, but hopefully that is not too big a problem. Compiler flags (-std=c89 -pedantic) should help to rigorously enforce that the code is free of C99-isms. A few functions are not available in C89 (strtoll). There are temporary, hacky solutions in place.	10 years ago
Josh Haberman	3bd691a497	Google-internal development.	10 years ago
Josh Haberman	7d565f1e7a	Sync from Google development.	11 years ago
Josh Haberman	ce9bba3cb5	Sync from Google-internal development.	11 years ago
Josh Haberman	26d98ca94f	Merge from Google-internal development: - rewritten decoder; interpreted decoder is bytecode-based, JIT decoder no longer falls back to the interpreter. - C++ improvements: C++11-compatible iterators, upb::reffed_ptr for RAII refcounting, better upcast/downcast support. - removed the gross upb_value abstraction from public upb.h.	11 years ago
Josh Haberman	90bb4246c3	Synced with Google-internal development. C++ handlers are now type-safe; SinkFrame is gone. Various other changes.	12 years ago
Josh Haberman	cfdb9907cb	Synced with 3 months of Google-internal development. Major changes: - Got rid of all bytestream interfaces in favor of using regular handlers. - new Pipeline object represents a upb pipeline, does bump allocation internally to manage memory. - proto2 support now can handle extensions.	12 years ago
Josh Haberman	7d3e2bd2c4	Sync with 8 months of Google-internal development. Many things have changed and been simplified. The memory-management story for upb_def and upb_handlers is much more robust; upb_def and upb_handlers should be fairly stable interfaces now. There is still much work to do for the runtime component (upb_sink).	12 years ago
Joshua Haberman	86bad61b76	Sync from internal Google development. Many improvements, too many to mention. One significant perf regression warrants investigation: omitfp.parsetoproto2_googlemessage1.upb_jit: 343 -> 252 (-26.53) plain.parsetoproto2_googlemessage1.upb_jit: 334 -> 251 (-24.85) 25% regression for this benchmark is bad, but since I don't think there's any fundamental design issue that caused it I'm going to go ahead with the commit anyway. Can investigate and fix later. Other benchmarks were neutral or showed slight improvement.	13 years ago
Joshua Haberman	1bcab1377d	Sync with internal Google development. This breaks the open-source build, will follow up with a change to fix it.	13 years ago
Joshua Haberman	b5f5ee867e	Refinement of upb_bytesrc interface. Added a upb_byteregion that tracks a region of the input buffer; decoders use this instead of using a upb_bytesrc directly. upb_byteregion is also used as the way of passing a string to a upb_handlers callback. This symmetry makes decoders compose better; if you want to take a parsed string and decode it as something else, you can take the string directly from the callback and feed it as input to another parser. A commented-out version of a pinning interface is present; I decline to actually implement it (and accept its extra complexity) until/unless it is clear that it is actually a win. But it is included as a proof-of-concept, to show that it fits well with the existing interface.	13 years ago
Joshua Haberman	887abe669f	Added an example, constified some more methods.	13 years ago
Joshua Haberman	621c0cdcb5	Const invasion: large parts of upb made const-correct.	13 years ago
Joshua Haberman	4a8b9be46c	Header cleanup, clarify/correct comments for interfaces.	13 years ago
Joshua Haberman	521ac7a89a	Refined upb_status.	13 years ago
Joshua Haberman	06b8181f97	Benchmark to parse into proto2 messages.	14 years ago
Joshua Haberman	6981e468a3	More work on Lua extension, and consequent core refactoring.	14 years ago
Joshua Haberman	57abebaaf9	Fixed "make descriptorgen".	14 years ago
Joshua Haberman	daf36f0747	Get rid of upb_symtabtxn. This type was nothing but a map of defs. We can as easily just pass an array of defs into upb_symtab_add().	14 years ago
Joshua Haberman	10265aa56b	Directory restructure. Includes are now via upb/foo.h. Files specific to the protobuf format are now in upb/pb (the core library is concerned with message definitions, handlers, and byte streams, but knows nothing about any particular serializationf format).	14 years ago
Joshua Haberman	6a1f3a6693	Major refactoring: upb_string is gone in favor of upb_strref.	14 years ago
Joshua Haberman	559e23c796	Major refactoring: abandon upb_msg, add upb_accessors. Next on the chopping block is upb_string.	14 years ago
Joshua Haberman	a503b8859c	Make all handlers objects refcounted. I'm realizing that basically all upb objects will need to be refcounted to be sharable across languages, but not messages which are on their way out so we can get out of the business of data representations. Things which must be refcounted: - encoders, decoders - handlers objects - defs	14 years ago
Joshua Haberman	3231fd0fdd	Vastly improved/simplified the upb_handlers API.	14 years ago
Joshua Haberman	9eb4d695c4	First rough version of the JIT. It can successfully parse SpeedMessage1. Preliminary results: 750MB/s on Core2 2.4GHz. This number is 2.5x proto2. This isn't apples-to-apples, because proto2 is parsing to a struct and we are just doing stream parsing, but for apps that are currently using proto2, this is the improvement they would see if they could move to stream-based processing. Unfortunately perf-regression-test.py is broken, and I'm not 100% sure why. It would be nice to fix it first (to ensure that there are no performance regressions for the table-based decoder) but I'm really impatient to get the JIT checked in.	14 years ago
Josh Haberman	b796c1b317	Update copyright to be Google Inc. This doesn't reflect any material change in how I will be working on upb, and I have no problem making this change. It's still open source under the BSD license, and I'll still be working on it well beyond the hours that constitute a normal job.	14 years ago
Josh Haberman	8ef6873e0e	upb_stream: all callbacks registered ahead-of-time. This is a significant change to the upb_stream protocol, and should hopefully be the last significant change. All callbacks are now registered ahead-of-time instead of having delegated callbacks registered at runtime, which makes it much easier to aggressively optimize ahead-of-time (like with a JIT). Other impacts of this change: - You no longer need to have loaded descriptor.proto as a upb_def to load other descriptors! This means the special-case code we used for bootstrapping is no longer necessary, and we no longer need to link the descriptor for descriptor.proto into upb. - A client can now register any upb_value as what will be delivered to their value callback, not just a upb_fielddef*. This should allow for other clients to get more bang out of the streaming decoder. This change unfortunately causes a bit of a performance regression -- I think largely due to highly suboptimal code that GCC generates when structs are returned by value. See: http://blog.reverberate.org/2011/03/19/when-a-compilers-slow-code-actually-bites-you/ On the other hand, once we have a JIT this should no longer matter. Performance numbers: plain.parsestream_googlemessage1.upb_table: 374 -> 396 (5.88) plain.parsestream_googlemessage2.upb_table: 616 -> 449 (-27.11) plain.parsetostruct_googlemessage1.upb_table_byref: 268 -> 269 (0.37) plain.parsetostruct_googlemessage1.upb_table_byval: 215 -> 204 (-5.12) plain.parsetostruct_googlemessage2.upb_table_byref: 307 -> 281 (-8.47) plain.parsetostruct_googlemessage2.upb_table_byval: 297 -> 272 (-8.42) omitfp.parsestream_googlemessage1.upb_table: 423 -> 410 (-3.07) omitfp.parsestream_googlemessage2.upb_table: 679 -> 483 (-28.87) omitfp.parsetostruct_googlemessage1.upb_table_byref: 287 -> 282 (-1.74) omitfp.parsetostruct_googlemessage1.upb_table_byval: 226 -> 219 (-3.10) omitfp.parsetostruct_googlemessage2.upb_table_byref: 315 -> 298 (-5.40) omitfp.parsetostruct_googlemessage2.upb_table_byval: 297 -> 287 (-3.37)	14 years ago
Joshua Haberman	a75a305c77	Implemented upb_stringsink, upb_msgtotext, and exposed the latter to Lua.	14 years ago
Joshua Haberman	61e5d367ff	Change the API for getting the bootstrapped defs. The symtab that contains them is now hidden, and you can look them up by name but there is no access to the symtab itself, so there is no risk of mutating it (by extending it, adding other defs to it, etc).	14 years ago
Joshua Haberman	4dce5ab709	Fix upbc and descriptorgen, and update descriptor.	14 years ago
Joshua Haberman	6bdbb45e88	Merged core/ and stream/ -> src/. The split wasn't worth it.	14 years ago
Joshua Haberman	4667ed4be9	All tests pass again, valgrind-clean! Next up: benchmarks.	14 years ago
Joshua Haberman	3affb31926	Tons of work: we're close to passing test_vs_proto2 again.	14 years ago

21 Commits (458da2563fa674bbcfd32ade2254b3915f751491)