protobuf

Commit Graph

Author	SHA1	Message	Date
Joshua Haberman	cca4818eb7	Sync from internal Google development.	13 years ago
Joshua Haberman	86bad61b76	Sync from internal Google development. Many improvements, too many to mention. One significant perf regression warrants investigation: omitfp.parsetoproto2_googlemessage1.upb_jit: 343 -> 252 (-26.53) plain.parsetoproto2_googlemessage1.upb_jit: 334 -> 251 (-24.85) 25% regression for this benchmark is bad, but since I don't think there's any fundamental design issue that caused it I'm going to go ahead with the commit anyway. Can investigate and fix later. Other benchmarks were neutral or showed slight improvement.	13 years ago
Joshua Haberman	db59a5198f	Fixes to un-break "make descriptorgen"	13 years ago
Joshua Haberman	1bcab1377d	Sync with internal Google development. This breaks the open-source build, will follow up with a change to fix it.	13 years ago
Joshua Haberman	b5f5ee867e	Refinement of upb_bytesrc interface. Added a upb_byteregion that tracks a region of the input buffer; decoders use this instead of using a upb_bytesrc directly. upb_byteregion is also used as the way of passing a string to a upb_handlers callback. This symmetry makes decoders compose better; if you want to take a parsed string and decode it as something else, you can take the string directly from the callback and feed it as input to another parser. A commented-out version of a pinning interface is present; I decline to actually implement it (and accept its extra complexity) until/unless it is clear that it is actually a win. But it is included as a proof-of-concept, to show that it fits well with the existing interface.	13 years ago
Joshua Haberman	99ae0ed397	Changes to get upb compiling inside Google.	13 years ago
Joshua Haberman	bda3269a42	Fleshed out fielddef default functionality. Fixes unit test submitted by Hunter Morris (thanks!).	13 years ago
Joshua Haberman	621c0cdcb5	Const invasion: large parts of upb made const-correct.	13 years ago
Joshua Haberman	4a8b9be46c	Header cleanup, clarify/correct comments for interfaces.	13 years ago
Joshua Haberman	521ac7a89a	Refined upb_status.	13 years ago
Joshua Haberman	adb6580d97	Let the JIT emit hasbit-setting code in addition to calling a callback. This leads to a major (20-40%) improvement in the parsetoproto2 benchmark with small messages. We now are faster than proto2 in all apples-to-apples comparisons, at least given the (admittedly limited) set of benchmarks in this source tree.	13 years ago
Joshua Haberman	25cdf1e6f7	Fixed overzealous assert().	13 years ago
Joshua Haberman	336268b3d7	Fixed a few memory leaks and Makefile tweaks.	13 years ago
Joshua Haberman	a1bb3dc448	Makefile target for running Python tests.	14 years ago
Joshua Haberman	57abebaaf9	Fixed "make descriptorgen".	14 years ago
Joshua Haberman	56984e8db8	Significant work on Lua extension. Also changes in core library to accommodate.	14 years ago
Joshua Haberman	daf36f0747	Get rid of upb_symtabtxn. This type was nothing but a map of defs. We can as easily just pass an array of defs into upb_symtab_add().	14 years ago
Joshua Haberman	10265aa56b	Directory restructure. Includes are now via upb/foo.h. Files specific to the protobuf format are now in upb/pb (the core library is concerned with message definitions, handlers, and byte streams, but knows nothing about any particular serialization format).	14 years ago
Joshua Haberman	6a1f3a6693	Major refactoring: upb_string is gone in favor of upb_strref.	14 years ago
Joshua Haberman	559e23c796	Major refactoring: abandon upb_msg, add upb_accessors. Next on the chopping block is upb_string.	14 years ago
Joshua Haberman	a503b8859c	Make all handlers objects refcounted. I'm realizing that basically all upb objects will need to be refcounted to be sharable across languages, but not messages which are on their way out so we can get out of the business of data representations. Things which must be refcounted: - encoders, decoders - handlers objects - defs	14 years ago
Joshua Haberman	2ccebb74c3	Add proof-of-concept C++ wrapper header.	14 years ago
Joshua Haberman	d619852e06	Change dispatcher error handling model. Now the dispatcher will call error handlers instead of returning statuses that the caller has to constantly check.	14 years ago
Joshua Haberman	3231fd0fdd	Vastly improved/simplified the upb_handlers API.	14 years ago
Joshua Haberman	9eb4d695c4	First rough version of the JIT. It can successfully parse SpeedMessage1. Preliminary results: 750MB/s on Core2 2.4GHz. This number is 2.5x proto2. This isn't apples-to-apples, because proto2 is parsing to a struct and we are just doing stream parsing, but for apps that are currently using proto2, this is the improvement they would see if they could move to stream-based processing. Unfortunately perf-regression-test.py is broken, and I'm not 100% sure why. It would be nice to fix it first (to ensure that there are no performance regressions for the table-based decoder) but I'm really impatient to get the JIT checked in.	14 years ago
Josh Haberman	b796c1b317	Update copyright to be Google Inc. This doesn't reflect any material change in how I will be working on upb, and I have no problem making this change. It's still open source under the BSD license, and I'll still be working on it well beyond the hours that constitute a normal job.	14 years ago
Josh Haberman	8ef6873e0e	upb_stream: all callbacks registered ahead-of-time. This is a significant change to the upb_stream protocol, and should hopefully be the last significant change. All callbacks are now registered ahead-of-time instead of having delegated callbacks registered at runtime, which makes it much easier to aggressively optimize ahead-of-time (like with a JIT). Other impacts of this change: - You no longer need to have loaded descriptor.proto as a upb_def to load other descriptors! This means the special-case code we used for bootstrapping is no longer necessary, and we no longer need to link the descriptor for descriptor.proto into upb. - A client can now register any upb_value as what will be delivered to their value callback, not just a upb_fielddef*. This should allow for other clients to get more bang out of the streaming decoder. This change unfortunately causes a bit of a performance regression -- I think largely due to highly suboptimal code that GCC generates when structs are returned by value. See: http://blog.reverberate.org/2011/03/19/when-a-compilers-slow-code-actually-bites-you/ On the other hand, once we have a JIT this should no longer matter. Performance numbers: plain.parsestream_googlemessage1.upb_table: 374 -> 396 (5.88) plain.parsestream_googlemessage2.upb_table: 616 -> 449 (-27.11) plain.parsetostruct_googlemessage1.upb_table_byref: 268 -> 269 (0.37) plain.parsetostruct_googlemessage1.upb_table_byval: 215 -> 204 (-5.12) plain.parsetostruct_googlemessage2.upb_table_byref: 307 -> 281 (-8.47) plain.parsetostruct_googlemessage2.upb_table_byval: 297 -> 272 (-8.42) omitfp.parsestream_googlemessage1.upb_table: 423 -> 410 (-3.07) omitfp.parsestream_googlemessage2.upb_table: 679 -> 483 (-28.87) omitfp.parsetostruct_googlemessage1.upb_table_byref: 287 -> 282 (-1.74) omitfp.parsetostruct_googlemessage1.upb_table_byval: 226 -> 219 (-3.10) omitfp.parsetostruct_googlemessage2.upb_table_byref: 315 -> 298 (-5.40) omitfp.parsetostruct_googlemessage2.upb_table_byval: 297 -> 287 (-3.37)	14 years ago
Joshua Haberman	20b2a6bd0d	Default to -O3 if user doesn't specify opt. However if the user does specify a -O flag, don't override the optimization setting for upb_def.o to -Os like we usually do.	14 years ago
Joshua Haberman	abfc897b50	Pass the upb_fielddef* to the endmsg callback.	14 years ago
Joshua Haberman	3a758132b4	Added proper support for enum default values.	14 years ago
Joshua Haberman	fd184f0df2	Major work on Lua extension and default values. Default values are now supported, and the Lua extension can now create and modify individual protobuf objects.	14 years ago
Joshua Haberman	61e5d367ff	Change the API for getting the bootstrapped defs. The symtab that contains them is now hidden, and you can look them up by name but there is no access to the symtab itself, so there is no risk of mutating it (by extending it, adding other defs to it, etc).	14 years ago
Joshua Haberman	d8b2154862	First version of an assembly language decoder. It is slower than the C decoder for now because it falls off the fast path too often. But it can successfully decode varints, fixed32 and fixed64.	14 years ago
Joshua Haberman	f1e1cc4695	Split inttable into a hash part and an array part. upb_inttable() now supports a "compact" operation that will decide on an array size and put all entries with small enough keys into the array part for faster lookup. Also exposed the upb_itof_ent structure and put a few useful values there, so they are one fewer pointer chase away.	14 years ago
Joshua Haberman	4f9aeee6c7	More completely fixed the 0-key thing. Unfortunately this degrades hash table lookup performance by about 8%, which affects the streaming benchmark for googlemessage1 by about 5%. We could get this back at the cost of some memory, but it would be nice to avoid that.	14 years ago
Joshua Haberman	6881b2c5cb	Added proper error about broken 0-values for enums.	14 years ago
Joshua Haberman	4dce5ab709	Fix upbc and descriptorgen, and update descriptor.	14 years ago
Joshua Haberman	6bdbb45e88	Merged core/ and stream/ -> src/. The split wasn't worth it.	14 years ago
Joshua Haberman	f858a8f287	Precompute bit offset and bitmask for a small perf improvement.	14 years ago
Joshua Haberman	4667ed4be9	All tests pass again, valgrind-clean! Next up: benchmarks.	14 years ago
Joshua Haberman	806ba1c80d	Another round of fixes. test_vs_proto2.googlemessage1 passes again, with no memory leaks!	14 years ago
Joshua Haberman	3affb31926	Tons of work: we're close to passing test_vs_proto2 again.	14 years ago
Joshua Haberman	93381f1411	Decoder compiles again! But probably doesn't work.	14 years ago
Joshua Haberman	c9df91b04a	upb bootstraps again! and with no memory leaks!	14 years ago
Joshua Haberman	a695b92cce	Debugging test_def, it's close to working again!	14 years ago
Joshua Haberman	1dea81b1c2	Interface refinement: rename some constants. * UPB_STOP -> UPB_BREAK, better represents breaking out of a parsing loop. * UPB_STATUS_OK -> UPB_OK, for all status codes, more concise at no readability cost (perhaps an improvement).	14 years ago
Joshua Haberman	a38742bbe1	A few minor changes to the streaming protocol. 1. the start and end callbacks can now return a upb_flow_t and set a status message. 2. clarified some semantics around passing an error status back from the callbacks.	14 years ago
Joshua Haberman	bcc688a303	upb_def compiles again!	14 years ago
Joshua Haberman	4559918090	More work on upb_src.	14 years ago
Joshua Haberman	db512df98e	A bunch of work on upb_def and upb_value.	14 years ago

18 Commits (ea198bdcf947ba4bd51474bdd4f7b82b5e4cf41d)