protobuf

Commit Graph

Author	SHA1	Message	Date
Protobuf Team Bot	b174d9d5a0	split out the internal arena defs from internal/upb.h PiperOrigin-RevId: 456584853	2 years ago
Protobuf Team Bot	7b05f25b98	split out the alloc code from the arena code PiperOrigin-RevId: 456531002	2 years ago
Protobuf Team Bot	e4635f223e	match file names to type names Lots of changes but it's all just moving things around. Backward-compatible stub #include's have been provided for now. upb_Arena/upb_Status have been split out from upb/upb.? upb_Array/upb_Map/upb_MessageValue have been split out from upb/collections.? upb_ExtensionRegistry has been split out from upb/msg.? upb/decode_internal.h is now upb/internal/decode.h upb/mini_table_accessors_internal.h is now upb/internal/mini_table_accessors.h upb/table_internal.h is now upb/internal/table.h upb/upb_internal.h is now upb/internal/upb.h PiperOrigin-RevId: 456297617	2 years ago
Joshua Haberman	1cf8214e4d	Changed upb's arena alignment from 16 to 8. upb has traditionally returned 16-byte-aligned pointers from arena allocation. This was out of an abundance of caution, since users could theoretically be using upb arenas to allocate memory that is then used for SSE/AVX values (eg. [`__m128`](https://docs.microsoft.com/en-us/cpp/cpp/m128?view=msvc-170), which require 16-byte alignment. In practice, the protobuf C++ arena has used 8-byte alignment for 8 years with no significant problems I know of arising from SSE etc. Reducing the alignment requirement to 8 will save memory. It will also help with compatibility on 32-bit architectures where `malloc()` only returns 8-byte aligned memory. The immediate motivation is to fix the win32 build for Python protobuf. PiperOrigin-RevId: 448331777	3 years ago
Joshua Haberman	be7dfeba6b	Added GitHub Action to test for clang-format.	3 years ago
Joshua Haberman	bc7b5dcadf	Ported protobuf's dtoa() function for text format and JSON.	3 years ago
Joshua Haberman	1c955f37ce	Mass API rename and clang-reformat (#485 ) * Wave 1: upb_fielddef. * upb_fielddef itself. * upb_oneofdef. * upb_msgdef. * ExtensionRange. * upb_enumdef * upb_enumvaldef * upb_filedef * upb_methoddef * upb_servicedef * upb_symtab * upb_defpool_init * upb_wellknown and upb_syntax_t * Some constants. * upb_status * upb_strview * upb_arena * upb.h constants * reflection * encode * JSON decode. * json encode. * msg_internal. * Formatted with clang-format. * Some naming fixups and comment reformatting. * More refinements. * A few more stragglers. * Fixed PyObject_HEAD with semicolon. Removed TODO entries.	3 years ago
Joshua Haberman	a669587817	Fixed the edge case where rounding up causes overflow.	3 years ago
Joshua Haberman	e83aeba595	Align arena initial block to ensure allocations are aligned.	3 years ago
Joshua Haberman	3881393907	Renamed .int.h to _internal.h, for greater clarity.	4 years ago
Joshua Haberman	823eb09694	Update all 2011 dates to 2021.	4 years ago
Joshua Haberman	e59d2c8fa7	Added license headers to all files.	4 years ago
Matt Kulukundis	5b97df91dd	Restrict fuse to matching block_alloc	4 years ago
Matt Kulukundis	e74d6c23de	Small renames and use uintptr_t instead of void*	4 years ago
Matt Kulukundis	d9a0c58108	Allow arena fuse to fail Track initial blocks to avoid having fuse operate on arenas that cannot be fused.	4 years ago
Joshua Haberman	9df96874e9	Start arena block doubling at initial block size. If an initial block is provided, we should start our block doubling at the size of the initial block, not 128. This saves us from unnecessary overhead when we overflow the initial block.	4 years ago
Joshua Haberman	8f3ee80d46	Drop C89/C90 support and MSVC prior to Visual Studio 2015. upb previously attempted to support C89 and pre-2015 versions of Visual Studio. This was to support older compilers with limited C99 support (particularly MSVC). But as of last August, even gRPC has dropped support for MSVC prior to 2015 `c87276d058` Therefore it seems safe for upb to no longer attempt C89 support (we were already not truly C89 compliant, with our use of "bool"). We now explicitly require C99 or greater and MSVC 2015 or greater. This cleaned up port_def.inc a fair bit. I took the chance to also remove some obsolete macros.	4 years ago
Joshua Haberman	e2c709e047	Repeated string and primitive support. Much of the code was adapted from Gerben's code in: `6333031195`	4 years ago
Joshua Haberman	d5096f9ee8	Fixed bug in addunknown and added ASAN poisoning.	4 years ago
Joshua Haberman	ebe53f8590	Fixed compile error.	4 years ago
Joshua Haberman	b37f82b58b	Fixed compile error.	4 years ago
Joshua Haberman	c25d895adf	Shrunk the arena state that needs to be synced.	4 years ago
Joshua Haberman	7f67f68c1c	Shrunk the arena state that needs to be synced.	4 years ago
Joshua Haberman	746f64692c	Moved arena inline for decoder.	4 years ago
Joshua Haberman	7363b91ac3	Moved arena inline for decoder.	4 years ago
Joshua Haberman	086a68d191	Fixed memory leak that could occur after upb_arena_fuse(). Also added valgrind testing for Kokoro.	5 years ago
Joshua Haberman	b717575cef	Added -Wextra and -Wshorten-64-to-32 and fixed resulting errors. (#289 ) * Added -Wextra and -Wshorten-64-to-32 and fixed resulting errors. * Disable -Wshorten-32-to-64 since Kokoro is missing Clang. * Fixed -Wextra warnings for gcc. * Reordered UPB_UNUSED() to come after declarations. * Added another -pedantic fix and log CC version. * Fix compile error and conditionally run use_bazel.sh. * Moved set -e after use_bazel.sh. * Fixed typo in conditional.	5 years ago
Joshua Haberman	634d37515c	Bugfix for oneofs and added line/col info to JSON.	5 years ago
Joshua Haberman	6c4acba610	Implemented upb_arena_fuse() (#278 ) * WIP. * WIP. * Tests are passing. * Recover some perf: LIKELY doesn't propagate through functions. :( * Added some more benchmarks. * Simplify & optimize upb_arena_realloc(). * Only add owned blocks to the freelist. * More optimization/simplification. * Re-fixed the bug. * Revert unintentional changes to parser.rl. * Revert Lua changes for now. * Revert the arena fuse changes for now. * Added last_size to the arena representation. * Re-applied Lua changes. * Implemented upb_arena_fuse(). * Fix the compile by re-ordering statements. * Improve comments.	5 years ago
Joshua Haberman	2b1e7dc1cc	Arena refactor: moves cleanup list into regular blocks (#277 ) * WIP. * WIP. * Tests are passing. * Recover some perf: LIKELY doesn't propagate through functions. :( * Added some more benchmarks. * Simplify & optimize upb_arena_realloc(). * Only add owned blocks to the freelist. * More optimization/simplification. * Re-fixed the bug. * Revert unintentional changes to parser.rl. * Revert Lua changes for now. * Revert the arena fuse changes for now. * Added last_size to the arena representation. * Fixed compile errors. * Fixed compile error and changed benchmarks to do one allocation.	5 years ago
Joshua Haberman	a0ae30bd16	Remove bytes allocated measurement functions. (#276 )	5 years ago
Joshua Haberman	9bd23dab42	Changed upb status to suit GCC10's warning about strncpy(). (#268 ) Added tests for all cases. Also removed ellipses from truncated messages, they were more trouble than they are worth.	5 years ago
Joshua Haberman	08b6d2d6fd	Rewrite of the decoder (#263 ) New code is smaller (in both source size and compiled size) and faster. # Speed The decoder speeds up on all machines I tested, though the amount of speedup varies. I was only able to test Intel CPUs. ### Linux Desktop ``` CPU: Intel(R) Core(TM) i7-8700K CPU @ 3.70GHz OS: Linux name old time/op new time/op delta CreateArena 4.72ns ± 0% 4.93ns ± 0% +4.47% (p=0.000 n=11+11) ParseDescriptor 12.4µs ± 1% 9.1µs ± 1% -26.65% (p=0.000 n=11+11) ``` ### Mac Laptop ``` CPU: Intel(R) Core(TM) i7-8850H CPU @ 2.60GHz OS: macOS name old time/op new time/op delta CreateArena 5.33ns ± 3% 5.58ns ± 2% +4.69% (p=0.000 n=12+12) ParseDescriptor 15.0µs ± 2% 11.9µs ± 2% -20.20% (p=0.000 n=12+12) ``` ### Linux Workstation ``` CPU: Intel(R) Xeon(R) Gold 6154 CPU @ 3.00GHz OS: Linux name old time/op new time/op delta CreateArena 5.29ns ± 0% 5.52ns ± 0% +4.37% (p=0.000 n=10+12) ParseDescriptor 18.6µs ± 0% 16.4µs ± 0% -11.54% (p=0.000 n=12+12) ``` # Size A few source files grow marginally because of some arena functionality moved inline. But `upb/decode.c` shrinks by 30% on Linux: ``` VM SIZE -------------- +2.1% +283 upb/json_decode.c +24% +205 upb/msg.c +8.4% +115 upb/upb.c +0.9% +28 upb/reflection.c [ = ] 0 upb/def.c [ = ] 0 upb/encode.c [ = ] 0 upb/json_encode.c [ = ] 0 upb/table.c -30.3% -1.51Ki upb/decode.c -0.7% -738 TOTAL ```	5 years ago
Joshua Haberman	cf35baa1ad	Moved macros from upb.h to port_def.inc to avoid leaking them to users. (#160 ) * Use port_def.inc to prevent macros from leaking to users. * Added helpful comments to port_def.inc/port_undef.inc.	6 years ago
Joshua Haberman	f4532ab273	Properly align the arena.	6 years ago
Joshua Haberman	315c167bed	Some more fixes for PHP.	6 years ago
Joshua Haberman	2c26f60dbb	Added some comments and reversed upb_arena_cleanup() args.	6 years ago
Joshua Haberman	cb26d883d1	WIP.	6 years ago
Joshua Haberman	a9c375f8ea	Partway through refactoring of Arena.	6 years ago
Joshua Haberman	380558922b	test_encoder passes! Other tests still need to be fixed.	6 years ago
Josh Haberman	3b7dc27fb5	Fixed amalgamated build and added test.	8 years ago
Josh Haberman	4b0c4ca7fb	New upb_msg code and Lua bindings around it. There are still some things that are unfinished, but we are at parity with what Lua had before.	8 years ago
Joshua Haberman	fa338b70a6	Added UPB_ASSERT() that helps avoid unused var warnings. * Added UPB_ASSERT() that helps avoid unused var warnings. * Addressed PR comments. * Fixed assert in the JIT.	9 years ago
Joshua Haberman	d0c2479920	Fixed small omission: upb_env_init2(). (#61 )	9 years ago
Joshua Haberman	68bc62a7fa	Split upb::Arena/upb::Allocator from upb::Environment. (#58 ) * Split upb::Arena/upb::Allocator from upb::Environment. This will allow arenas and allocators to be used independently of environments, which will be important for an upcoming change (a message representation). Overall this design feels cleaner that the previous Environment/SeededAllocator design. As part of this change, moved all allocations in upb to use a global allocator instead of hard-coding malloc/free. This will allow injecting OOM faults for more robust testing. One place that doesn't use the global allocator is the tracked ref code. Instead of its previous approach of CHECK_OOM() after every malloc() or table insert, it simply uses an allocator that does this automatically. I moved Allocator/Arena/Environment into upb.h. This seems principled since these are the only types in upb whose size is directly exposed to users, since they form the basis of memory allocation strategy. * Cleaned up some header includes and fixed more malloc -> upb_gmalloc(). * Changes from PR review. * Don't use UINTPTR_MAX or UINT64_MAX. * Punt on adding line/file for now. * We actually can't store (uint64_t)-1, update comment and test.	9 years ago
Josh Haberman	49dab06e03	Brought into compliance with Google open-source policies. - removed myself from Author headers in source files. - removed copyright notices from source file headers. - added CONTRIBUTING.md	10 years ago
Josh Haberman	919fea438a	Ported upb to C89, for greater portability. A large part of this change contains surface-level porting, like moving variable declarations to the top of the block. However there are a few more substantial things too: - moved internal-only struct definitions to a separate file (structdefs.int.h), for greater encapsulation and ABI compatibility. - removed the UPB_UPCAST macro, since it requires access to the internal-only struct definitions. Replaced uses with calls to inline, type-safe casting functions. - removed the UPB_DEFINE_CLASS/UPB_DEFINE_STRUCT macros. Class and struct definitions are now more explicit -- you get to see the actual class/struct keywords in the source. The casting convenience functions have been moved into UPB_DECLARE_DERIVED_TYPE() and UPB_DECLARE_DERIVED_TYPE2(). - the new way that we duplicate base methods in derived types is also more convenient and requires less duplication. It is also less greppable, but hopefully that is not too big a problem. Compiler flags (-std=c89 -pedantic) should help to rigorously enforce that the code is free of C99-isms. A few functions are not available in C89 (strtoll). There are temporary, hacky solutions in place.	10 years ago
Josh Haberman	87fc2c516b	Changes from Google-internal development. * JSON parser expanded to handle split buffers. * bugfix to the protobuf decoder.	10 years ago
Josh Haberman	ce9bba3cb5	Sync from Google-internal development.	11 years ago
Josh Haberman	26d98ca94f	Merge from Google-internal development: - rewritten decoder; interpreted decoder is bytecode-based, JIT decoder no longer falls back to the interpreter. - C++ improvements: C++11-compatible iterators, upb::reffed_ptr for RAII refcounting, better upcast/downcast support. - removed the gross upb_value abstraction from public upb.h.	11 years ago

3 Commits (46e306bead6e99385a5895640974fb36a59ce67e)