protobuf

Commit Graph

Author	SHA1	Message	Date
Joshua Haberman	0941664215	Add startseq/endseq handlers. Startseq/endseq handlers are called at the beginning and end of a sequence of repeated values. Protobuf does not really have direct support for this (repeated primitive fields do not delimit "begin" and "end" of the sequence) but we can infer them from the bytestream. The benefit of supporting them explicitly is that they get their own stack frame and closure, so we can avoid having to find the array's address over and over and deciding if we need to initialize it. This will also pave the way for better support of JSON, which does have explicit "startseq/endseq" markers: [].	14 years ago
Joshua Haberman	d619852e06	Change dispatcher error handling model. Now the dispatcher will call error handlers instead of returning statuses that the caller has to constantly check.	14 years ago
Joshua Haberman	a5506318aa	Fix JIT for new interface.	14 years ago
Joshua Haberman	3231fd0fdd	Vastly improved/simplified the upb_handlers API.	14 years ago
Joshua Haberman	1782f28c86	Documentation, some type renaming, nix unknown handler for now.	14 years ago
Joshua Haberman	eb622c0531	Split upb_stream -> upb_bytestream/upb_handlers.	14 years ago
Joshua Haberman	f74534b42a	Decoder redesign in preparation for packed fields and start/endseq.	14 years ago
Joshua Haberman	4a99abba12	Refactor varint encoding/decoding.	14 years ago
Joshua Haberman	6955dfb302	Calculate and print string sizes in test messages.	14 years ago
Joshua Haberman	9eb4d695c4	First rough version of the JIT. It can successfully parse SpeedMessage1. Preliminary results: 750MB/s on Core2 2.4GHz. This number is 2.5x proto2. This isn't apples-to-apples, because proto2 is parsing to a struct and we are just doing stream parsing, but for apps that are currently using proto2, this is the improvement they would see if they could move to stream-based processing. Unfortunately perf-regression-test.py is broken, and I'm not 100% sure why. It would be nice to fix it first (to ensure that there are no performance regressions for the table-based decoder) but I'm really impatient to get the JIT checked in.	14 years ago
Joshua Haberman	19517cc6f3	Switch to non-branching varint decoder.	14 years ago
Josh Haberman	b796c1b317	Update copyright to be Google Inc. This doesn't reflect any material change in how I will be working on upb, and I have no problem making this change. It's still open source under the BSD license, and I'll still be working on it well beyond the hours that constitute a normal job.	14 years ago
Josh Haberman	8ef6873e0e	upb_stream: all callbacks registered ahead-of-time. This is a significant change to the upb_stream protocol, and should hopefully be the last significant change. All callbacks are now registered ahead-of-time instead of having delegated callbacks registered at runtime, which makes it much easier to aggressively optimize ahead-of-time (like with a JIT). Other impacts of this change: - You no longer need to have loaded descriptor.proto as a upb_def to load other descriptors! This means the special-case code we used for bootstrapping is no longer necessary, and we no longer need to link the descriptor for descriptor.proto into upb. - A client can now register any upb_value as what will be delivered to their value callback, not just a upb_fielddef*. This should allow for other clients to get more bang out of the streaming decoder. This change unfortunately causes a bit of a performance regression -- I think largely due to highly suboptimal code that GCC generates when structs are returned by value. See: http://blog.reverberate.org/2011/03/19/when-a-compilers-slow-code-actually-bites-you/ On the other hand, once we have a JIT this should no longer matter. 
Performance numbers: plain.parsestream_googlemessage1.upb_table: 374 -> 396 (5.88); plain.parsestream_googlemessage2.upb_table: 616 -> 449 (-27.11); plain.parsetostruct_googlemessage1.upb_table_byref: 268 -> 269 (0.37); plain.parsetostruct_googlemessage1.upb_table_byval: 215 -> 204 (-5.12); plain.parsetostruct_googlemessage2.upb_table_byref: 307 -> 281 (-8.47); plain.parsetostruct_googlemessage2.upb_table_byval: 297 -> 272 (-8.42); omitfp.parsestream_googlemessage1.upb_table: 423 -> 410 (-3.07); omitfp.parsestream_googlemessage2.upb_table: 679 -> 483 (-28.87); omitfp.parsetostruct_googlemessage1.upb_table_byref: 287 -> 282 (-1.74); omitfp.parsetostruct_googlemessage1.upb_table_byval: 226 -> 219 (-3.10); omitfp.parsetostruct_googlemessage2.upb_table_byref: 315 -> 298 (-5.40); omitfp.parsetostruct_googlemessage2.upb_table_byval: 297 -> 287 (-3.37)	14 years ago
Joshua Haberman	abfc897b50	Pass the upb_fielddef* to the endmsg callback.	14 years ago
Joshua Haberman	fd184f0df2	Major work on Lua extension and default values. Default values are now supported, and the Lua extension can now create and modify individual protobuf objects.	14 years ago
Joshua Haberman	0c6786c6fa	Split varint decoders into separate .h file. This makes it easier to benchmark and test the multiple possible implementations of varint decoding.	14 years ago
Joshua Haberman	61e5d367ff	Change the API for getting the bootstrapped defs. The symtab that contains them is now hidden, and you can look them up by name but there is no access to the symtab itself, so there is no risk of mutating it (by extending it, adding other defs to it, etc).	14 years ago
Joshua Haberman	f1e1cc4695	Split inttable into a hash part and an array part. upb_inttable() now supports a "compact" operation that will decide on an array size and put all entries with small enough keys into the array part for faster lookup. Also exposed the upb_itof_ent structure and put a few useful values there, so they are one fewer pointer chase away.	14 years ago
Joshua Haberman	ee84a7da16	Add (but do not activate) an SSE varint decoder.	14 years ago
Joshua Haberman	4667ed4be9	All tests pass again, valgrind-clean! Next up: benchmarks.	14 years ago
Joshua Haberman	806ba1c80d	Another round of fixes. test_vs_proto2.googlemessage1 passes again, with no memory leaks!	14 years ago
Joshua Haberman	3affb31926	Tons of work: we're close to passing test_vs_proto2 again.	14 years ago
Joshua Haberman	e170259e4a	Improved table benchmark accuracy and output formatting.	14 years ago
Joshua Haberman	9aa7e559d6	Fixes to decoder and textprinter: it works (for some input)! A protobuf -> text stream for descriptor.proto now outputs the same text as proto2.	14 years ago
Joshua Haberman	02a8cdfff2	Fixes to decoder, stdio, textprinter.	14 years ago
Joshua Haberman	2ea9737e5d	Added test_stream.c for testing upb_stream.h.	14 years ago
Joshua Haberman	c9df91b04a	upb bootstraps again! and with no memory leaks!	14 years ago
Joshua Haberman	a695b92cce	Debugging test_def, it's close to working again!	14 years ago
Joshua Haberman	a9e998159c	Fleshed out upb_msg: test_vs_proto2 compiles but fails.	15 years ago
Joshua Haberman	21ee24a730	Updated Lua extension to handle fielddefs.	15 years ago
Joshua Haberman	79de3ca9e4	Add forgotten test_decoder.c.	15 years ago
Joshua Haberman	7a6a702792	Allow static upb_strings. This can allow strings to reference static data, and reduced the memory footprint of test_def by about 10% (3k).	15 years ago
Joshua Haberman	c7a95061a7	Successfully bootstraps!!	15 years ago
Joshua Haberman	ae0beee285	Fixed upb_string error with strange vsnprintf() behavior.	15 years ago
Joshua Haberman	db6c7387bc	Incremental progress towards getting upb_def to bootstrap.	15 years ago
Joshua Haberman	2ef013126c	Fleshed out upb_string further. Now upb_def's only unresolved references are upb_src.	15 years ago
Joshua Haberman	e29bf964d1	Tests for string and fleshed out implementation.	15 years ago
Joshua Haberman	be5ddd8a64	Tweaks to upb_src/upb_sink interfaces.	15 years ago
Joshua Haberman	611afe9c69	Removed union tag from types.	15 years ago
Joshua Haberman	d5566c6038	Remove struct keyword from all types, use typedef instead.	15 years ago
Joshua Haberman	9116c697f8	upb_parser -> upb_decoder	15 years ago
Joshua Haberman	d751973758	Ported/fixed tests to new data types.	15 years ago
Joshua Haberman	ece08710a6	Bugfixes: descriptorgen works without leaks!	15 years ago
Joshua Haberman	0a6fc5fad3	Truly fixed type cyclic refcounting.	15 years ago
Joshua Haberman	e15f834a91	Circular references truly work now, along with a test. One simplification to come.	15 years ago
Joshua Haberman	08b4a91204	Add a test for circularly-linked descriptors. The test currently triggers valgrind-detected memory errors.	15 years ago
Joshua Haberman	18291eedc3	Make defs refcounted, rename upb_context->upbsymtab. There is currently a memory leak when type definitions form cycles. This will need to be dealt with.	15 years ago
Joshua Haberman	a95ab58e79	Overhaul defs to derive from a common base.	15 years ago
Joshua Haberman	9e3f5e343b	Make upb_msgdef own all its data. This is in anticipation of making upb_msgdef's easy to dup. This involved removing all traces of any descriptors from the defs.	15 years ago
Joshua Haberman	868f118797	Changed parse API to know about msgdefs. This should make it both easier to use and easier to optimize, in exchange for a small amount of generality. In practice, any remotely normal case is still very natural.	15 years ago

231 Commits (638d114a1ac20a79bf7490eef6625f15bc61d52b)