protobuf

Commit Graph

Author	SHA1	Message	Date
Joshua Haberman	2a85bef825	Generated code interface for maps is complete, though not yet tested.	5 years ago
Joshua Haberman	7f5fe52dfa	Fixes for non-C89 code.	5 years ago
Joshua Haberman	d6c3152c0b	Added more Lua tests that are passing. Also ripped out the ctype checking in upb_table, it was not helpful (didn't help catch bugs) but was causing problems.	5 years ago
Joshua Haberman	23825332e1	WIP.	5 years ago
Alan Wu	a73fd86c13	Use memcpy to perform unaligned reads Creating and reading from unaligned pointers is UB and I'm trying to run upb on a platform (GraalVM) that is sensitive to that unfortunately. Recent compilers are smart enough to fold the memcpy down to a simple memory load on platforms that support it, so this should mostly be a aesthetic change.	5 years ago
Joshua Haberman	555b60b062	A memory safety fix, found by ASAN. We cannot assume that the input string is NULL-terminated, or read past "len." Instead we manually NULL-terminate it.	5 years ago
Esun Kim	6f9a9fb2fa	Rename MurmurHash2 to upb_murmur_hash2	5 years ago
Esun Kim	a8bb192fa4	Fixed -Wshorten-64-to-32	5 years ago
Joshua Haberman	cf35baa1ad	Moved macros from upb.h to port_def.inc to avoid leaking them to users. (#160 ) * Use port_def.inc to prevent macros from leaking to users. * Added helpful comments to port_def.inc/port_undef.inc.	6 years ago
Shahid	e223001916	Update table.c	6 years ago
Josh Haberman	865876895d	Fixed tests and code.	6 years ago
Sakala Venkata Krishna Rohit	898f640e65	Bugfix on bigendianess by casting size_t to unint32_t The reason for typecasting size_t to unint32_t is that size_t is 8 bytes and uint32_t is only 4 bytes. If not typecasted Memcpy fails to copy the correct four bytes in big endian platforms.	6 years ago
Sakala Venkata Krishna Rohit	6522ae4fb3	Bugfix on bigendianess by casting size_t to unint32_t The reason for typecasting size_t to unint32_t is that size_t is 8 bytes and uint32_t is only 4 bytes. If not typecasted Memcpy fails to copy the correct four bytes in big endian platforms.	6 years ago
Josh Haberman	b09c59cc05	A small bugfix to upb_table and simplified some code.	8 years ago
Joshua Haberman	fa338b70a6	Added UPB_ASSERT() that helps avoid unused var warnings. * Added UPB_ASSERT() that helps avoid unused var warnings. * Addressed PR comments. * Fixed assert in the JIT.	9 years ago
Joshua Haberman	68bc62a7fa	Split upb::Arena/upb::Allocator from upb::Environment. (#58 ) * Split upb::Arena/upb::Allocator from upb::Environment. This will allow arenas and allocators to be used independently of environments, which will be important for an upcoming change (a message representation). Overall this design feels cleaner that the previous Environment/SeededAllocator design. As part of this change, moved all allocations in upb to use a global allocator instead of hard-coding malloc/free. This will allow injecting OOM faults for more robust testing. One place that doesn't use the global allocator is the tracked ref code. Instead of its previous approach of CHECK_OOM() after every malloc() or table insert, it simply uses an allocator that does this automatically. I moved Allocator/Arena/Environment into upb.h. This seems principled since these are the only types in upb whose size is directly exposed to users, since they form the basis of memory allocation strategy. * Cleaned up some header includes and fixed more malloc -> upb_gmalloc(). * Changes from PR review. * Don't use UINTPTR_MAX or UINT64_MAX. * Punt on adding line/file for now. * We actually can't store (uint64_t)-1, update comment and test.	9 years ago
Josh Haberman	ae8d257985	Added small explanatory comment.	9 years ago
Josh Haberman	0d18e1f7e3	Optimized upb_inttable_compact(): it shrinks inttables more now.	9 years ago
Josh Haberman	49dab06e03	Brought into compliance with Google open-source policies. - removed myself from Author headers in source files. - removed copyright notices from source file headers. - added CONTRIBUTING.md	10 years ago
Josh Haberman	19a973a85e	Fixes from Google-internal.	10 years ago
Josh Haberman	6f30032183	Sync from Google-internal development.	10 years ago
Josh Haberman	919fea438a	Ported upb to C89, for greater portability. A large part of this change contains surface-level porting, like moving variable declarations to the top of the block. However there are a few more substantial things too: - moved internal-only struct definitions to a separate file (structdefs.int.h), for greater encapsulation and ABI compatibility. - removed the UPB_UPCAST macro, since it requires access to the internal-only struct definitions. Replaced uses with calls to inline, type-safe casting functions. - removed the UPB_DEFINE_CLASS/UPB_DEFINE_STRUCT macros. Class and struct definitions are now more explicit -- you get to see the actual class/struct keywords in the source. The casting convenience functions have been moved into UPB_DECLARE_DERIVED_TYPE() and UPB_DECLARE_DERIVED_TYPE2(). - the new way that we duplicate base methods in derived types is also more convenient and requires less duplication. It is also less greppable, but hopefully that is not too big a problem. Compiler flags (-std=c89 -pedantic) should help to rigorously enforce that the code is free of C99-isms. A few functions are not available in C89 (strtoll). There are temporary, hacky solutions in place.	10 years ago
Josh Haberman	e2840a4aa1	Restructure tables for C89 port and smaller size. Changes the data layout of tables slightly so that string keys are prefixed with their size, rather than the size being inline in the table itself. This has a few benefits: 1. inttables shrink a bit, because there is no longer a wasted and unused size field sitting in them. 2. This avoids the need to have a union in the table. This is important for an impending C89 port of upb, since C89 has literally no way of statically initializing a non-first union member.	10 years ago
Josh Haberman	3bd691a497	Google-internal development.	10 years ago
Martin Maly	508c39ee13	Resolve compilation errors if compiled with more stringent semantic checks. Adding Travis test to build with strict warnings. Fixing a warning in a test which used signed/unsigned integer comparison.	10 years ago
Chris Fallin	fb58504569	Support maps in JSON parsing and serialization. This is a sync of our internal developing of JSON parsing and serialization. It implements native understanding of MapEntry submessages, so that map fields with (key, value) pairs are serialized as JSON maps (objects) natively rather than as arrays of objects with 'key' and 'value' fields. The parser also now understands how to emit handler calls corresponding to MapEntry objects when processing a map field. This sync also picks up a bugfix in `table.c` to handle an alloc-failed case.	10 years ago
Chris Fallin	fd1cc56625	Modified strtable to support length-delimited string keys. Allows for arbitrary binary data, e.g., to support strings from other languages as key values.	10 years ago
Josh Haberman	3d0c7c45da	Sync to Google-internal development.	10 years ago
Josh Haberman	d869097400	Make the absence of perf-cppflags give a good default build. Defaults are now: - thread-safe with GCC/Clang - Debugging not enabled (enable with -UNDEBUG)	10 years ago
Josh Haberman	d493500abc	Sync from Google-internal development.	10 years ago
Josh Haberman	2d10fa3307	Sync from internal Google development.	11 years ago
Josh Haberman	26d98ca94f	Merge from Google-internal development: - rewritten decoder; interpreted decoder is bytecode-based, JIT decoder no longer falls back to the interpreter. - C++ improvements: C++11-compatible iterators, upb::reffed_ptr for RAII refcounting, better upcast/downcast support. - removed the gross upb_value abstraction from public upb.h.	11 years ago
Joshua Haberman	70293f5faa	Open source fixes: builds on OS X again.	12 years ago
Josh Haberman	cfdb9907cb	Synced with 3 months of Google-internal development. Major changes: - Got rid of all bytestream interfaces in favor of using regular handlers. - new Pipeline object represents a upb pipeline, does bump allocation internally to manage memory. - proto2 support now can handle extensions.	12 years ago
Josh Haberman	7d3e2bd2c4	Sync with 8 months of Google-internal development. Many things have changed and been simplified. The memory-management story for upb_def and upb_handlers is much more robust; upb_def and upb_handlers should be fairly stable interfaces now. There is still much work to do for the runtime component (upb_sink).	12 years ago
Joshua Haberman	cca4818eb7	Sync from internal Google development.	13 years ago
Joshua Haberman	86bad61b76	Sync from internal Google development. Many improvements, too many to mention. One significant perf regression warrants investigation: omitfp.parsetoproto2_googlemessage1.upb_jit: 343 -> 252 (-26.53) plain.parsetoproto2_googlemessage1.upb_jit: 334 -> 251 (-24.85) 25% regression for this benchmark is bad, but since I don't think there's any fundamental design issue that caused it I'm going to go ahead with the commit anyway. Can investigate and fix later. Other benchmarks were neutral or showed slight improvement.	13 years ago
Joshua Haberman	c0a08a6827	Fixes to get upb to compile inside Google.	13 years ago
Joshua Haberman	621c0cdcb5	Const invasion: large parts of upb made const-correct.	13 years ago
Joshua Haberman	10265aa56b	Directory restructure. Includes are now via upb/foo.h. Files specific to the protobuf format are now in upb/pb (the core library is concerned with message definitions, handlers, and byte streams, but knows nothing about any particular serializationf format).	14 years ago
Joshua Haberman	6a1f3a6693	Major refactoring: upb_string is gone in favor of upb_strref.	14 years ago
Joshua Haberman	9eb4d695c4	First rough version of the JIT. It can successfully parse SpeedMessage1. Preliminary results: 750MB/s on Core2 2.4GHz. This number is 2.5x proto2. This isn't apples-to-apples, because proto2 is parsing to a struct and we are just doing stream parsing, but for apps that are currently using proto2, this is the improvement they would see if they could move to stream-based processing. Unfortunately perf-regression-test.py is broken, and I'm not 100% sure why. It would be nice to fix it first (to ensure that there are no performance regressions for the table-based decoder) but I'm really impatient to get the JIT checked in.	14 years ago
Josh Haberman	b796c1b317	Update copyright to be Google Inc. This doesn't reflect any material change in how I will be working on upb, and I have no problem making this change. It's still open source under the BSD license, and I'll still be working on it well beyond the hours that constitute a normal job.	14 years ago
Josh Haberman	8ef6873e0e	upb_stream: all callbacks registered ahead-of-time. This is a significant change to the upb_stream protocol, and should hopefully be the last significant change. All callbacks are now registered ahead-of-time instead of having delegated callbacks registered at runtime, which makes it much easier to aggressively optimize ahead-of-time (like with a JIT). Other impacts of this change: - You no longer need to have loaded descriptor.proto as a upb_def to load other descriptors! This means the special-case code we used for bootstrapping is no longer necessary, and we no longer need to link the descriptor for descriptor.proto into upb. - A client can now register any upb_value as what will be delivered to their value callback, not just a upb_fielddef*. This should allow for other clients to get more bang out of the streaming decoder. This change unfortunately causes a bit of a performance regression -- I think largely due to highly suboptimal code that GCC generates when structs are returned by value. See: http://blog.reverberate.org/2011/03/19/when-a-compilers-slow-code-actually-bites-you/ On the other hand, once we have a JIT this should no longer matter. Performance numbers: plain.parsestream_googlemessage1.upb_table: 374 -> 396 (5.88) plain.parsestream_googlemessage2.upb_table: 616 -> 449 (-27.11) plain.parsetostruct_googlemessage1.upb_table_byref: 268 -> 269 (0.37) plain.parsetostruct_googlemessage1.upb_table_byval: 215 -> 204 (-5.12) plain.parsetostruct_googlemessage2.upb_table_byref: 307 -> 281 (-8.47) plain.parsetostruct_googlemessage2.upb_table_byval: 297 -> 272 (-8.42) omitfp.parsestream_googlemessage1.upb_table: 423 -> 410 (-3.07) omitfp.parsestream_googlemessage2.upb_table: 679 -> 483 (-28.87) omitfp.parsetostruct_googlemessage1.upb_table_byref: 287 -> 282 (-1.74) omitfp.parsetostruct_googlemessage1.upb_table_byval: 226 -> 219 (-3.10) omitfp.parsetostruct_googlemessage2.upb_table_byref: 315 -> 298 (-5.40) omitfp.parsetostruct_googlemessage2.upb_table_byval: 297 -> 287 (-3.37)	14 years ago
Joshua Haberman	d8b2154862	First version of an assembly language decoder. It is slower than the C decoder for now because it falls off the fast path too often. But it can successfully decode varints, fixed32 and fixed64.	14 years ago
Joshua Haberman	f1e1cc4695	Split inttable into a hash part and an array part. upb_inttable() now supports a "compact" operation that will decide on an array size and put all entries with small enough keys into the array part for faster lookup. Also exposed the upb_itof_ent structure and put a few useful values there, so they are one fewer pointer chase away.	14 years ago
Joshua Haberman	4f9aeee6c7	More completely fixed the 0-key thing. Unfortunately this degrades hash table lookup performance by about 8%, which affects the streaming benchmark for googlemessage1 by about 5%. We could get this back at the cost of some memory, but it would be nice to avoid that.	14 years ago
Joshua Haberman	6117730c85	Remove the restriction that 0 cannot be a table key. This fixes issue: http://code.google.com/p/upb/issues/detail?id=1	14 years ago
Joshua Haberman	6bdbb45e88	Merged core/ and stream/ -> src/. The split wasn't worth it.	14 years ago
Joshua Haberman	21ee24a730	Updated Lua extension to handle fielddefs.	15 years ago

39 Commits (a202c5f84dd10dcb39bad5122b5aafa531ba4196)