protobuf

Commit Graph

Author	SHA1	Message	Date
Joshua Haberman	1bcab1377d	Sync with internal Google development. This breaks the open-source build, will follow up with a change to fix it.	13 years ago
Joshua Haberman	b5f5ee867e	Refinement of upb_bytesrc interface. Added a upb_byteregion that tracks a region of the input buffer; decoders use this instead of using a upb_bytesrc directly. upb_byteregion is also used as the way of passing a string to a upb_handlers callback. This symmetry makes decoders compose better; if you want to take a parsed string and decode it as something else, you can take the string directly from the callback and feed it as input to another parser. A commented-out version of a pinning interface is present; I decline to actually implement it (and accept its extra complexity) until/unless it is clear that it is actually a win. But it is included as a proof-of-concept, to show that it fits well with the existing interface.	13 years ago
Joshua Haberman	878fc9c362	Small typo fix.	13 years ago
Wink Saville	0606476cb6	Fix typo in handler.h Signed-off-by: Wink Saville <wink@saville.com>	13 years ago
Joshua Haberman	621c0cdcb5	Const invasion: large parts of upb made const-correct.	13 years ago
Joshua Haberman	8eb2b2a216	Revised upb_bytesink, refactored upb_textprinter (untested).	13 years ago
Joshua Haberman	48fedab345	Add packed field support (untested).	13 years ago
Joshua Haberman	adb6580d97	Let the JIT emit hasbit-setting code in addition to calling a callback. This leads to a major (20-40%) improvement in the parsetoproto2 benchmark with small messages. We now are faster than proto2 in all apples-to-apples comparisons, at least given the (admittedly limited) set of benchmarks in this source tree.	13 years ago
Joshua Haberman	282b34529f	Some source cleanup/commenting.	13 years ago
Joshua Haberman	40f271b854	x86 JIT: add callback specializations for a 10% speedup when parsing to struct.	13 years ago
Joshua Haberman	fe3df2c9bc	Python: basic SymbolTable support and empty accessors.	13 years ago
Joshua Haberman	10265aa56b	Directory restructure. Includes are now via upb/foo.h. Files specific to the protobuf format are now in upb/pb (the core library is concerned with message definitions, handlers, and byte streams, but knows nothing about any particular serializationf format).	14 years ago
Joshua Haberman	6a1f3a6693	Major refactoring: upb_string is gone in favor of upb_strref.	14 years ago
Joshua Haberman	a503b8859c	Make all handlers objects refcounted. I'm realizing that basically all upb objects will need to be refcounted to be sharable across languages, but not messages which are on their way out so we can get out of the business of data representations. Things which must be refcounted: - encoders, decoders - handlers objects - defs	14 years ago
Joshua Haberman	2ccebb74c3	Add proof-of-concept C++ wrapper header.	14 years ago
Joshua Haberman	0941664215	Add startseq/endseq handlers. Startseq/endseq handlers are called at the beginning and end of a sequence of repeated values. Protobuf does not really have direct support for this (repeated primitive fields do not delimit "begin" and "end" of the sequence) but we can infer them from the bytestream. The benefit of supporting them explicitly is that they get their own stack frame and closure, so we can avoid having to find the array's address over and over and deciding if we need to initialize it. This will also pave the way for better support of JSON, which does have explicit "startseq/endseq" markers: [].	14 years ago
Joshua Haberman	d619852e06	Change dispatcher error handling model. Now the dispatcher will call error handlers instaed of returning statuses that the caller has to constantly check.	14 years ago
Joshua Haberman	3231fd0fdd	Vastly improved/simplified the upb_handlers API.	14 years ago
Joshua Haberman	1782f28c86	Documentation, some type renaming, nix unknown handler for now.	14 years ago
Joshua Haberman	eb622c0531	Split upb_stream -> upb_bytestream/upb_handlers.	14 years ago
Joshua Haberman	f74534b42a	Decoder redesign in preparation for packed fields and start/endseq.	14 years ago
Joshua Haberman	d6cebc329b	JIT passes all tests!	14 years ago
Joshua Haberman	9eb4d695c4	First rough version of the JIT. It can successfully parse SpeedMessage1. Preliminary results: 750MB/s on Core2 2.4GHz. This number is 2.5x proto2. This isn't apples-to-apples, because proto2 is parsing to a struct and we are just doing stream parsing, but for apps that are currently using proto2, this is the improvement they would see if they could move to stream-based processing. Unfortunately perf-regression-test.py is broken, and I'm not 100% sure why. It would be nice to fix it first (to ensure that there are no performance regressions for the table-based decoder) but I'm really impatient to get the JIT checked in.	14 years ago
Josh Haberman	2c86e7eddb	Small semantics changes in the decoder. Simplified some of the semantics around the decoder's data structures, in anticipation of sharing them between the regular C decoder and a JIT-ted decoder.	14 years ago
Joshua Haberman	484809c272	Key dispatch table by (num x type), for modest perf improvement. This allows us to remove one type check in the critical path.	14 years ago
Josh Haberman	b796c1b317	Update copyright to be Google Inc. This doesn't reflect any material change in how I will be working on upb, and I have no problem making this change. It's still open source under the BSD license, and I'll still be working on it well beyond the hours that constitute a normal job.	14 years ago
Josh Haberman	8ef6873e0e	upb_stream: all callbacks registered ahead-of-time. This is a significant change to the upb_stream protocol, and should hopefully be the last significant change. All callbacks are now registered ahead-of-time instead of having delegated callbacks registered at runtime, which makes it much easier to aggressively optimize ahead-of-time (like with a JIT). Other impacts of this change: - You no longer need to have loaded descriptor.proto as a upb_def to load other descriptors! This means the special-case code we used for bootstrapping is no longer necessary, and we no longer need to link the descriptor for descriptor.proto into upb. - A client can now register any upb_value as what will be delivered to their value callback, not just a upb_fielddef*. This should allow for other clients to get more bang out of the streaming decoder. This change unfortunately causes a bit of a performance regression -- I think largely due to highly suboptimal code that GCC generates when structs are returned by value. See: http://blog.reverberate.org/2011/03/19/when-a-compilers-slow-code-actually-bites-you/ On the other hand, once we have a JIT this should no longer matter. Performance numbers: plain.parsestream_googlemessage1.upb_table: 374 -> 396 (5.88) plain.parsestream_googlemessage2.upb_table: 616 -> 449 (-27.11) plain.parsetostruct_googlemessage1.upb_table_byref: 268 -> 269 (0.37) plain.parsetostruct_googlemessage1.upb_table_byval: 215 -> 204 (-5.12) plain.parsetostruct_googlemessage2.upb_table_byref: 307 -> 281 (-8.47) plain.parsetostruct_googlemessage2.upb_table_byval: 297 -> 272 (-8.42) omitfp.parsestream_googlemessage1.upb_table: 423 -> 410 (-3.07) omitfp.parsestream_googlemessage2.upb_table: 679 -> 483 (-28.87) omitfp.parsetostruct_googlemessage1.upb_table_byref: 287 -> 282 (-1.74) omitfp.parsetostruct_googlemessage1.upb_table_byval: 226 -> 219 (-3.10) omitfp.parsetostruct_googlemessage2.upb_table_byref: 315 -> 298 (-5.40) omitfp.parsetostruct_googlemessage2.upb_table_byval: 297 -> 287 (-3.37)	14 years ago
Joshua Haberman	abfc897b50	Pass the upb_fielddef* to the endmsg callback.	14 years ago
Joshua Haberman	f9a6f67e27	Track buffer end instead of buffer length, for a small perf improvement.	14 years ago
Joshua Haberman	b2d66287d9	Add warning about upcoming delegation changes.	14 years ago
Joshua Haberman	6bdbb45e88	Merged core/ and stream/ -> src/. The split wasn't worth it.	14 years ago
Joshua Haberman	806ba1c80d	Another round of fixes. test_vs_proto2.googlemessage1 passes again, with no memory leaks!	14 years ago
Joshua Haberman	d98db7cb56	Textprinter is compiling again.	14 years ago
Joshua Haberman	fbb9fd35e0	Improve comments in headers, to better explain core interfaces.	14 years ago
Joshua Haberman	2c24cbb108	More work on decoder and stdio bytesrc/bytesink.	14 years ago
Joshua Haberman	fe659c8c93	Getting closer to a decoder that could actually compile and work.	14 years ago
Joshua Haberman	58a70b55c6	Decoder code structure is mostly in-place.	14 years ago
Joshua Haberman	a695b92cce	Debugging test_def, it's close to working again!	14 years ago
Joshua Haberman	1dea81b1c2	Interface refinement: rename some constants. * UPB_STOP -> UPB_BREAK, better represents breaking out of a parsing loop. * UPB_STATUS_OK -> UPB_OK, for all status codes, more concise at no readability cost (perhaps an improvement).	14 years ago
Joshua Haberman	a38742bbe1	A few minor changes to the streaming protocol. 1. the start and end callbacks can now return a upb_flow_t and set a status message. 2. clarified some semantics around passing an error status back from the callbacks.	14 years ago
Joshua Haberman	bcc688a303	upb_def compiles again!	14 years ago
Joshua Haberman	4559918090	More work on upb_src.	14 years ago
Joshua Haberman	b471ca6b81	The last major revision to the upb_stream protocol. Sources and sinks communicate by means of a upb_handlers object, which encapsulates a set of handler callbacks and will possibly offer richer semantics in the future like giving specific fields different callbacks. The upb_handlers protocol supports delegation, so sets of handlers can be written in reusable ways. For example, if a set of handlers is written to handle a specific .proto type, those handlers can be used whether that type is at the top level or whether it is a sub-message of a higher-level type. Delegation allows the streaming protocol to properly compose.	14 years ago
Joshua Haberman	2a7f51f3fd	Change upb_src to use push-based interface. Unfortunately my previous detailed commit message was lost somehow by git or vi. Will have to explain in more detail at a later date the rationale for this change. The build will be broken until I port the old decoder to this new interface.	14 years ago
Joshua Haberman	678799082b	Stream decoding benchmark.	15 years ago
Joshua Haberman	87b2c69c15	Fleshed out upb_stdio and upb_textprinter. test_decoder now compiles and links! But it doesn't work yet.	15 years ago
Joshua Haberman	8e138c4687	Added more comments for upb_src interface.	15 years ago
Joshua Haberman	28ec9a1fa0	Split src/ into core/ and stream/.	15 years ago
Joshua Haberman	be5ddd8a64	Tweaks to upb_src/upb_sink interfaces.	15 years ago
Joshua Haberman	229fcf7119	upb_def compiles again, though with lots of #if 0.	15 years ago

12 Commits (db59a5198f890ecdcac1227b0bb998160acac5c6)