protobuf

Commit Graph

Author	SHA1	Message	Date
Joshua Haberman	336402b4d7	WIP, core library compiles now.	6 years ago
Joshua Haberman	fa338b70a6	Added UPB_ASSERT() that helps avoid unused var warnings. * Added UPB_ASSERT() that helps avoid unused var warnings. * Addressed PR comments. * Fixed assert in the JIT.	9 years ago
Joshua Haberman	68bc62a7fa	Split upb::Arena/upb::Allocator from upb::Environment. (#58 ) * Split upb::Arena/upb::Allocator from upb::Environment. This will allow arenas and allocators to be used independently of environments, which will be important for an upcoming change (a message representation). Overall this design feels cleaner that the previous Environment/SeededAllocator design. As part of this change, moved all allocations in upb to use a global allocator instead of hard-coding malloc/free. This will allow injecting OOM faults for more robust testing. One place that doesn't use the global allocator is the tracked ref code. Instead of its previous approach of CHECK_OOM() after every malloc() or table insert, it simply uses an allocator that does this automatically. I moved Allocator/Arena/Environment into upb.h. This seems principled since these are the only types in upb whose size is directly exposed to users, since they form the basis of memory allocation strategy. * Cleaned up some header includes and fixed more malloc -> upb_gmalloc(). * Changes from PR review. * Don't use UINTPTR_MAX or UINT64_MAX. * Punt on adding line/file for now. * We actually can't store (uint64_t)-1, update comment and test.	9 years ago
Josh Haberman	49dab06e03	Brought into compliance with Google open-source policies. - removed myself from Author headers in source files. - removed copyright notices from source file headers. - added CONTRIBUTING.md	10 years ago
Josh Haberman	d9485c28ed	Fix for va_copy.	10 years ago
Josh Haberman	19a973a85e	Fixes from Google-internal.	10 years ago
Josh Haberman	919fea438a	Ported upb to C89, for greater portability. A large part of this change contains surface-level porting, like moving variable declarations to the top of the block. However there are a few more substantial things too: - moved internal-only struct definitions to a separate file (structdefs.int.h), for greater encapsulation and ABI compatibility. - removed the UPB_UPCAST macro, since it requires access to the internal-only struct definitions. Replaced uses with calls to inline, type-safe casting functions. - removed the UPB_DEFINE_CLASS/UPB_DEFINE_STRUCT macros. Class and struct definitions are now more explicit -- you get to see the actual class/struct keywords in the source. The casting convenience functions have been moved into UPB_DECLARE_DERIVED_TYPE() and UPB_DECLARE_DERIVED_TYPE2(). - the new way that we duplicate base methods in derived types is also more convenient and requires less duplication. It is also less greppable, but hopefully that is not too big a problem. Compiler flags (-std=c89 -pedantic) should help to rigorously enforce that the code is free of C99-isms. A few functions are not available in C89 (strtoll). There are temporary, hacky solutions in place.	10 years ago
Josh Haberman	3bd691a497	Google-internal development.	10 years ago
Chris Fallin	87a18f3774	Support oneof defs in upb. This change adds support for a OneofDef (upb_oneofdef), which represents a 'oneof' as introduced by Protocol Buffers. This is semantically a union type that contains fields and in turn may be added to a MessageDef. This change does not alter parsing or the handler abstraction in any way, because a oneof has impact only at a higher semantic level (i.e., any sort of storage of the fields in a message object), which is user-specific with respect to upb.	10 years ago
Chris Fallin	b3f6daf83d	Amalgamated distribution (upb.c/upb.h) tool. There are a number of tweaks to get this to work: - The #include dependence graph wasn't quite complete, and I had to add a few #includes to get the tool to work. - I had to change a number of symbol names to avoid conflicts between 'static' definitions in different .c files. This could be avoided if the tool were smart enough to rename static symbols to have unique prefixes instead, but (i) this requires semantic understanding of C, and (ii) the macro-defined static functions (e.g., handlers for primitive types in several places) would probably trip this up. Verified that the resulting upb.h/upb.c compiles and doesn't have any unresolved references.	10 years ago
Josh Haberman	f447370f80	Fixed build and added Travis CI support.	10 years ago
Josh Haberman	2d10fa3307	Sync from internal Google development.	11 years ago
Josh Haberman	0fd2f83088	Sync to internal Google development.	11 years ago
Josh Haberman	ce9bba3cb5	Sync from Google-internal development.	11 years ago
Josh Haberman	90bb4246c3	Synced with Google-internal development. C++ handlers are now type-safe; SinkFrame is gone. Various other changes.	12 years ago
Josh Haberman	cfdb9907cb	Synced with 3 months of Google-internal development. Major changes: - Got rid of all bytestream interfaces in favor of using regular handlers. - new Pipeline object represents a upb pipeline, does bump allocation internally to manage memory. - proto2 support now can handle extensions.	12 years ago
Josh Haberman	7d3e2bd2c4	Sync with 8 months of Google-internal development. Many things have changed and been simplified. The memory-management story for upb_def and upb_handlers is much more robust; upb_def and upb_handlers should be fairly stable interfaces now. There is still much work to do for the runtime component (upb_sink).	12 years ago
Joshua Haberman	9a7037a2fa	Got decoder & textprinter compiling in kernel mode.	13 years ago
Joshua Haberman	86bad61b76	Sync from internal Google development. Many improvements, too many to mention. One significant perf regression warrants investigation: omitfp.parsetoproto2_googlemessage1.upb_jit: 343 -> 252 (-26.53) plain.parsetoproto2_googlemessage1.upb_jit: 334 -> 251 (-24.85) 25% regression for this benchmark is bad, but since I don't think there's any fundamental design issue that caused it I'm going to go ahead with the commit anyway. Can investigate and fix later. Other benchmarks were neutral or showed slight improvement.	13 years ago
Joshua Haberman	b5f5ee867e	Refinement of upb_bytesrc interface. Added a upb_byteregion that tracks a region of the input buffer; decoders use this instead of using a upb_bytesrc directly. upb_byteregion is also used as the way of passing a string to a upb_handlers callback. This symmetry makes decoders compose better; if you want to take a parsed string and decode it as something else, you can take the string directly from the callback and feed it as input to another parser. A commented-out version of a pinning interface is present; I decline to actually implement it (and accept its extra complexity) until/unless it is clear that it is actually a win. But it is included as a proof-of-concept, to show that it fits well with the existing interface.	13 years ago
Joshua Haberman	c0a08a6827	Fixes to get upb to compile inside Google.	13 years ago
Joshua Haberman	621c0cdcb5	Const invasion: large parts of upb made const-correct.	13 years ago
Joshua Haberman	8eb2b2a216	Revised upb_bytesink, refactored upb_textprinter (untested).	13 years ago
Joshua Haberman	521ac7a89a	Refined upb_status.	13 years ago
Joshua Haberman	10265aa56b	Directory restructure. Includes are now via upb/foo.h. Files specific to the protobuf format are now in upb/pb (the core library is concerned with message definitions, handlers, and byte streams, but knows nothing about any particular serializationf format).	14 years ago
Joshua Haberman	6a1f3a6693	Major refactoring: upb_string is gone in favor of upb_strref.	14 years ago
Joshua Haberman	0941664215	Add startseq/endseq handlers. Startseq/endseq handlers are called at the beginning and end of a sequence of repeated values. Protobuf does not really have direct support for this (repeated primitive fields do not delimit "begin" and "end" of the sequence) but we can infer them from the bytestream. The benefit of supporting them explicitly is that they get their own stack frame and closure, so we can avoid having to find the array's address over and over and deciding if we need to initialize it. This will also pave the way for better support of JSON, which does have explicit "startseq/endseq" markers: [].	14 years ago
Joshua Haberman	d619852e06	Change dispatcher error handling model. Now the dispatcher will call error handlers instaed of returning statuses that the caller has to constantly check.	14 years ago
Joshua Haberman	3231fd0fdd	Vastly improved/simplified the upb_handlers API.	14 years ago
Joshua Haberman	ea2a80840e	More renaming.	14 years ago
Joshua Haberman	f74534b42a	Decoder redesign in preparation for packed fields and start/endseq.	14 years ago
Josh Haberman	b796c1b317	Update copyright to be Google Inc. This doesn't reflect any material change in how I will be working on upb, and I have no problem making this change. It's still open source under the BSD license, and I'll still be working on it well beyond the hours that constitute a normal job.	14 years ago
Josh Haberman	8ef6873e0e	upb_stream: all callbacks registered ahead-of-time. This is a significant change to the upb_stream protocol, and should hopefully be the last significant change. All callbacks are now registered ahead-of-time instead of having delegated callbacks registered at runtime, which makes it much easier to aggressively optimize ahead-of-time (like with a JIT). Other impacts of this change: - You no longer need to have loaded descriptor.proto as a upb_def to load other descriptors! This means the special-case code we used for bootstrapping is no longer necessary, and we no longer need to link the descriptor for descriptor.proto into upb. - A client can now register any upb_value as what will be delivered to their value callback, not just a upb_fielddef*. This should allow for other clients to get more bang out of the streaming decoder. This change unfortunately causes a bit of a performance regression -- I think largely due to highly suboptimal code that GCC generates when structs are returned by value. See: http://blog.reverberate.org/2011/03/19/when-a-compilers-slow-code-actually-bites-you/ On the other hand, once we have a JIT this should no longer matter. Performance numbers: plain.parsestream_googlemessage1.upb_table: 374 -> 396 (5.88) plain.parsestream_googlemessage2.upb_table: 616 -> 449 (-27.11) plain.parsetostruct_googlemessage1.upb_table_byref: 268 -> 269 (0.37) plain.parsetostruct_googlemessage1.upb_table_byval: 215 -> 204 (-5.12) plain.parsetostruct_googlemessage2.upb_table_byref: 307 -> 281 (-8.47) plain.parsetostruct_googlemessage2.upb_table_byval: 297 -> 272 (-8.42) omitfp.parsestream_googlemessage1.upb_table: 423 -> 410 (-3.07) omitfp.parsestream_googlemessage2.upb_table: 679 -> 483 (-28.87) omitfp.parsetostruct_googlemessage1.upb_table_byref: 287 -> 282 (-1.74) omitfp.parsetostruct_googlemessage1.upb_table_byval: 226 -> 219 (-3.10) omitfp.parsetostruct_googlemessage2.upb_table_byref: 315 -> 298 (-5.40) omitfp.parsetostruct_googlemessage2.upb_table_byval: 297 -> 287 (-3.37)	14 years ago
Joshua Haberman	65b57a2813	Added escaping for text output.	14 years ago
Joshua Haberman	a75a305c77	Implemented upb_stringsink, upb_msgtotext, and exposed the latter to Lua.	14 years ago
Joshua Haberman	abfc897b50	Pass the upb_fielddef* to the endmsg callback.	14 years ago
Joshua Haberman	6bdbb45e88	Merged core/ and stream/ -> src/. The split wasn't worth it.	14 years ago
Joshua Haberman	9aa7e559d6	Fixes to decoder and textprinter: it works (for some input)! A protobuf -> text stream for descriptor.proto now outputs the same text as proto2.	14 years ago
Joshua Haberman	02a8cdfff2	Fixes to decoder, stdio, textprinter.	14 years ago
Joshua Haberman	d98db7cb56	Textprinter is compiling again.	14 years ago
Joshua Haberman	5af1ade543	More work on textprinter.	14 years ago
Joshua Haberman	2c24cbb108	More work on decoder and stdio bytesrc/bytesink.	14 years ago
Joshua Haberman	4b6c8b6b23	Fixed bugs in textoutput. Text output from descriptor.proto is now identical to protoc!	15 years ago
Joshua Haberman	0fcfeab521	Bugfixes, test_decoder successfully stream-decodes a stream!	15 years ago
Joshua Haberman	b77db14646	Fixed broken submsg support in upb_streamdata.	15 years ago
Joshua Haberman	af9d691a34	Added Xcode project.	15 years ago
Joshua Haberman	87b2c69c15	Fleshed out upb_stdio and upb_textprinter. test_decoder now compiles and links! But it doesn't work yet.	15 years ago

24 Commits (31e0997c1abaa531505d28e36473f1c972ca0849)