protobuf

Commit Graph

Author	SHA1	Message	Date
Bo Yang	0a9681874e	Modify TODO	7 years ago
Bo Yang	6a6e192375	Remove unused declaration.	7 years ago
Bo Yang	5aa27d91c6	Use upb_sink_putunknown for reserve unknown	7 years ago
Bo Yang	dc9d15084f	Remove upb_addunknown_handlerfunc and upb_handlers_setaddunknown	7 years ago
Bo Yang	0b7904e18c	Reserve unknown fields in upb 1. For decoding, an unknownfields will be lazily created on message, which contains bytes of unknown fields. 2. For encoding, if the unknownfields is present on message, all bytes contained in it will be serialized.	7 years ago
Joshua Haberman	fa338b70a6	Added UPB_ASSERT() that helps avoid unused var warnings. * Added UPB_ASSERT() that helps avoid unused var warnings. * Addressed PR comments. * Fixed assert in the JIT.	9 years ago
Mattia Barbon	e943fc6e7a	Make sure upb_pbdecoder.status is initialized Otherwhise the end message callback is passed a garbage value.	9 years ago
Josh Haberman	146a9c22ef	Added lots of decoder tests and fixed lots of bugs.	9 years ago
Josh Haberman	85440108e5	More decoder fixes, and slightly changed parse call semantics. Prior to this change, if an error was returned, it would be guaranteed to always return a short byte count. Now the two concepts are a bit more orthogonal. There are cases where the entire input is consumed even though an error was encountered.	9 years ago
Josh Haberman	fe427341f2	Decoder fix: skipped data at end of submessage.	9 years ago
Josh Haberman	7dcd017f4e	Fixed PR for JIT-enabled builds.	9 years ago
Josh Haberman	abcb6428ad	Changed parser semantics around skipping. Prior to this change: parse(buf, len) -> len + N ...would indicate that the next N bytes of the input are not needed, and would advance the decoding position by this much. After this change: parse(buf, len) -> len + N parse(NULL, N) -> N ...can be used to achieve the same thing. But skipping the N bytes is not explicitly performed by the user. A user that doesn't want/need to skip can just say: parsed = parse(buf, len); if (parsed < len) { // Handle suspend, advance stream by "parsed". } else { // Stream was advanced by "len" (even if parsed > len). } Updated unit tests to test this new behavior, and refactored test utility code a bit to support it.	9 years ago
Josh Haberman	49dab06e03	Brought into compliance with Google open-source policies. - removed myself from Author headers in source files. - removed copyright notices from source file headers. - added CONTRIBUTING.md	10 years ago
Josh Haberman	919fea438a	Ported upb to C89, for greater portability. A large part of this change contains surface-level porting, like moving variable declarations to the top of the block. However there are a few more substantial things too: - moved internal-only struct definitions to a separate file (structdefs.int.h), for greater encapsulation and ABI compatibility. - removed the UPB_UPCAST macro, since it requires access to the internal-only struct definitions. Replaced uses with calls to inline, type-safe casting functions. - removed the UPB_DEFINE_CLASS/UPB_DEFINE_STRUCT macros. Class and struct definitions are now more explicit -- you get to see the actual class/struct keywords in the source. The casting convenience functions have been moved into UPB_DECLARE_DERIVED_TYPE() and UPB_DECLARE_DERIVED_TYPE2(). - the new way that we duplicate base methods in derived types is also more convenient and requires less duplication. It is also less greppable, but hopefully that is not too big a problem. Compiler flags (-std=c89 -pedantic) should help to rigorously enforce that the code is free of C99-isms. A few functions are not available in C89 (strtoll). There are temporary, hacky solutions in place.	10 years ago
Josh Haberman	e087947c84	Enabled asserts() and verbosity for most Travis builds. Also added a separate ndebug build for testing that -DNDEBUG builds still work. Also disabled reference debugging by default, since it requires either a global lock or -DUPB_THREAD_UNSAFE.	10 years ago
Josh Haberman	37cffddc5d	Decoder bugfix. Don't back up decoder after skipunknown() unless we actually successfully consumed input data.	10 years ago
Josh Haberman	838009ba2b	Fixes for the open-source build.	10 years ago
Josh Haberman	3bd691a497	Google-internal development.	10 years ago
Martin Maly	508c39ee13	Resolve compilation errors if compiled with more stringent semantic checks. Adding Travis test to build with strict warnings. Fixing a warning in a test which used signed/unsigned integer comparison.	10 years ago
Josh Haberman	87fc2c516b	Changes from Google-internal development. * JSON parser expanded to handle split buffers. * bugfix to the protobuf decoder.	10 years ago
Chris Fallin	b3f6daf83d	Amalgamated distribution (upb.c/upb.h) tool. There are a number of tweaks to get this to work: - The #include dependence graph wasn't quite complete, and I had to add a few #includes to get the tool to work. - I had to change a number of symbol names to avoid conflicts between 'static' definitions in different .c files. This could be avoided if the tool were smart enough to rename static symbols to have unique prefixes instead, but (i) this requires semantic understanding of C, and (ii) the macro-defined static functions (e.g., handlers for primitive types in several places) would probably trip this up. Verified that the resulting upb.h/upb.c compiles and doesn't have any unresolved references.	10 years ago
Josh Haberman	d493500abc	Sync from Google-internal development.	11 years ago
Josh Haberman	2d10fa3307	Sync from internal Google development.	11 years ago
Josh Haberman	7d565f1e7a	Sync from Google development.	11 years ago
Josh Haberman	0fd2f83088	Sync to internal Google development.	11 years ago
Josh Haberman	ce9bba3cb5	Sync from Google-internal development.	11 years ago
Josh Haberman	ccb2f8ab87	Fixes to make the open-source build compile on Linux.	11 years ago
Josh Haberman	26d98ca94f	Merge from Google-internal development: - rewritten decoder; interpreted decoder is bytecode-based, JIT decoder no longer falls back to the interpreter. - C++ improvements: C++11-compatible iterators, upb::reffed_ptr for RAII refcounting, better upcast/downcast support. - removed the gross upb_value abstraction from public upb.h.	11 years ago
Josh Haberman	90bb4246c3	Synced with Google-internal development. C++ handlers are now type-safe; SinkFrame is gone. Various other changes.	12 years ago
Joshua Haberman	70293f5faa	Open source fixes: builds on OS X again.	12 years ago
Josh Haberman	cfdb9907cb	Synced with 3 months of Google-internal development. Major changes: - Got rid of all bytestream interfaces in favor of using regular handlers. - new Pipeline object represents a upb pipeline, does bump allocation internally to manage memory. - proto2 support now can handle extensions.	12 years ago
Josh Haberman	7d3e2bd2c4	Sync with 8 months of Google-internal development. Many things have changed and been simplified. The memory-management story for upb_def and upb_handlers is much more robust; upb_def and upb_handlers should be fairly stable interfaces now. There is still much work to do for the runtime component (upb_sink).	12 years ago
Joshua Haberman	9a7037a2fa	Got decoder & textprinter compiling in kernel mode.	13 years ago
Joshua Haberman	86bad61b76	Sync from internal Google development. Many improvements, too many to mention. One significant perf regression warrants investigation: omitfp.parsetoproto2_googlemessage1.upb_jit: 343 -> 252 (-26.53) plain.parsetoproto2_googlemessage1.upb_jit: 334 -> 251 (-24.85) 25% regression for this benchmark is bad, but since I don't think there's any fundamental design issue that caused it I'm going to go ahead with the commit anyway. Can investigate and fix later. Other benchmarks were neutral or showed slight improvement.	13 years ago
Joshua Haberman	1b9b6bd1ad	Fixed the open-source build.	13 years ago
Joshua Haberman	1bcab1377d	Sync with internal Google development. This breaks the open-source build, will follow up with a change to fix it.	13 years ago
Joshua Haberman	b5f5ee867e	Refinement of upb_bytesrc interface. Added a upb_byteregion that tracks a region of the input buffer; decoders use this instead of using a upb_bytesrc directly. upb_byteregion is also used as the way of passing a string to a upb_handlers callback. This symmetry makes decoders compose better; if you want to take a parsed string and decode it as something else, you can take the string directly from the callback and feed it as input to another parser. A commented-out version of a pinning interface is present; I decline to actually implement it (and accept its extra complexity) until/unless it is clear that it is actually a win. But it is included as a proof-of-concept, to show that it fits well with the existing interface.	13 years ago
Joshua Haberman	64e199d18b	Small bugfix for x86->x64 rename.	13 years ago
Joshua Haberman	4a8b9be46c	Header cleanup, clarify/correct comments for interfaces.	13 years ago
Joshua Haberman	521ac7a89a	Refined upb_status.	13 years ago
Joshua Haberman	48fedab345	Add packed field support (untested).	13 years ago
Joshua Haberman	8bdc6d233e	Prime the decoder buf for modest perf improvement on small messages.	13 years ago
Joshua Haberman	7935b702c5	More cleanup.	13 years ago
Joshua Haberman	282b34529f	Some source cleanup/commenting.	13 years ago
Josh Haberman	3387ccaffd	Avoid longjmp() in successful case. Speeds up short messages by 15-25%.	13 years ago
Joshua Haberman	10265aa56b	Directory restructure. Includes are now via upb/foo.h. Files specific to the protobuf format are now in upb/pb (the core library is concerned with message definitions, handlers, and byte streams, but knows nothing about any particular serializationf format).	14 years ago
Joshua Haberman	6a1f3a6693	Major refactoring: upb_string is gone in favor of upb_strref.	14 years ago
Joshua Haberman	559e23c796	Major refactoring: abandon upb_msg, add upb_accessors. Next on the chopping block is upb_string.	14 years ago
Joshua Haberman	a503b8859c	Make all handlers objects refcounted. I'm realizing that basically all upb objects will need to be refcounted to be sharable across languages, but not messages which are on their way out so we can get out of the business of data representations. Things which must be refcounted: - encoders, decoders - handlers objects - defs	14 years ago
Joshua Haberman	0941664215	Add startseq/endseq handlers. Startseq/endseq handlers are called at the beginning and end of a sequence of repeated values. Protobuf does not really have direct support for this (repeated primitive fields do not delimit "begin" and "end" of the sequence) but we can infer them from the bytestream. The benefit of supporting them explicitly is that they get their own stack frame and closure, so we can avoid having to find the array's address over and over and deciding if we need to initialize it. This will also pave the way for better support of JSON, which does have explicit "startseq/endseq" markers: [].	14 years ago

46 Commits (d2f9bec5c6f3c34362cf13e35e11d3dbc7888a32)