There's already path compression, which guarantees amortized fast lookups (halving the cost of subsequent lookups, though not achieving the inverse-Ackermann bound), but there's still no need to redo work and perform acquire/release atomic operations the whole way along the path. This change also takes advantage of the fast-path relaxed-only read when querying the root of a node that is itself a root.
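As a rough sketch of the resulting lookup (simplified: a plain `parent` pointer stands in for upb's real tagged parent-or-refcount word, so names and details are illustrative, not the actual implementation):
```
#include <stdatomic.h>
#include <stddef.h>

// Hypothetical simplified node: parent == NULL means "I am a root".
typedef struct arena {
  _Atomic(struct arena*) parent;
} arena;

static arena* arena_root(arena* a) {
  // Fast path: a relaxed load is enough here because any answer can be
  // invalidated by a concurrent fuse anyway; callers must already
  // tolerate staleness.
  arena* parent = atomic_load_explicit(&a->parent, memory_order_relaxed);
  if (parent == NULL) return a;

  // Slow path: walk to the root. The acquire loads pair with the
  // release store performed by the fuse that published each parent.
  arena* root = parent;
  for (;;) {
    arena* next = atomic_load_explicit(&root->parent, memory_order_acquire);
    if (next == NULL) break;
    root = next;
  }

  // Path compression: repoint each node on the path at the root so
  // later lookups are short. Storing an ancestor is safe even under
  // racing fuses, and relaxed suffices because the root's contents
  // were already published with release semantics.
  while (a != root) {
    arena* next = atomic_load_explicit(&a->parent, memory_order_acquire);
    if (next == NULL) break;
    atomic_store_explicit(&a->parent, root, memory_order_relaxed);
    a = next;
  }
  return root;
}
```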
PiperOrigin-RevId: 712770023
* Add acquire/release where necessary for all atomic ops
* Add a sentinel member to ensure safe publication when tsan is active; tsan will not catch the previous errors without this member (see the sketch below)
* For all operations using relaxed memory order, comment why relaxed order is safe
* Add a test that exercises racy fuses and space allocated checks without mutexes or other memory barriers from the test harness. This test proved the existence of several races not caught by the existing tests, including one with a confident comment about why relaxed memory order was safe.
* Add a test that exercises racing allocation and destruction among fused arenas, which doesn't use locks and substitutes a custom allocator that verifies its memory blocks.
Test coverage and assert/tsan instrumentation are now sufficient to cause test failures if any call site is further relaxed.
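To illustrate the sentinel idea (a sketch with hypothetical names, not upb's actual layout): a plain, non-atomic member gives tsan a concrete location to race on, so weakened publication shows up as a reportable data race instead of a silent bug.
```
#include <stdatomic.h>

typedef struct {
  _Atomic(int) refcount;   // stand-in for the arena's real atomic state
  char tsan_sentinel;      // plain member that tsan can watch for races
} arena_internal;

static _Atomic(arena_internal*) g_published;

static void publish(arena_internal* a) {
  a->tsan_sentinel = 1;  // plain write; must happen-before publication
  atomic_store_explicit(&g_published, a, memory_order_release);
}

static char read_sentinel(void) {
  arena_internal* a =
      atomic_load_explicit(&g_published, memory_order_acquire);
  // With the release/acquire pair this read is race-free; if any hop
  // is weakened to relaxed, tsan reports a data race right here.
  return a ? a->tsan_sentinel : 0;
}
```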
PiperOrigin-RevId: 712751905
This fixes:
* MSVC with `/std:c11 /experimental:c11atomics` on recent versions now emits atomics
* Clang with `-std=c11 -fgnuc-version=0` now emits atomics
* Clang and GCC 14 when built with `-std=c99 -pedantic-errors` will now compile, and not emit atomics
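All three cases fall out of gating on the standard feature-test macros rather than on compiler identity - roughly (guard macro name hypothetical):
```
// Gate on the standard feature-test macros: the language version must
// be C11 or later AND the implementation must not opt out of atomics.
#if defined(__STDC_VERSION__) && __STDC_VERSION__ >= 201112L && \
    !defined(__STDC_NO_ATOMICS__)
#define MY_USE_C11_ATOMICS 1
#include <stdatomic.h>
#endif
```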
PiperOrigin-RevId: 712538312
We no longer need to traverse the linked list of blocks to check allocated space, which means we also no longer need atomics in the linked list or even its head. This is especially beneficial because the previous implementation contained a race where we could dereference uninitialized memory: the stores to the `next` pointers did not use release semantics, and the loads in `SpaceAllocated` used relaxed order, so there was no guarantee that `size` had actually been initialized - but worse, *there was also no guarantee that `next` had been!* Simplified:
```
AddBlock:
1  ptr = malloc();
2  ptr->size = 123;
3  ptr->next = ai->blocks;
4  ai->blocks = ptr;      (release order)
```
```
SpaceAllocated:
5  block = ai->blocks     (relaxed order)
6  read block->size       (acquire, but probably by accident)
7  block = block->next    (relaxed order)
```
So I think a second thread calling SpaceAllocated could see the order 1, 4, 5, 6, 7, 2, 3 and read uninitialized memory - there is no data-dependency relationship or happens-before edge that this order violates, so it would be valid for a compiler and hardware to produce.
In reality, operation 4 will produce an `stlr` on ARM (forcing an ordering of 1, 2, 3 before 4), and `block->next` has a data dependency on `ai->blocks`, which forces an ordering in the hardware between 5->6 and 5->7 even for regular `ldr` instructions.
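The actual fix removes these atomics entirely by not traversing the block list at all, but for illustration, here is a minimal sketch (names hypothetical) of what correct release/acquire publication of such a list would look like:
```
#include <stdatomic.h>
#include <stdlib.h>

typedef struct block {
  size_t size;
  struct block* next;  // plain field, protected by the publication below
} block;

static _Atomic(block*) blocks;

static void add_block(size_t size) {
  block* b = malloc(sizeof(block));
  if (!b) return;
  b->size = size;
  // Acquire here chains happens-before through older nodes when blocks
  // are added from multiple threads.
  b->next = atomic_load_explicit(&blocks, memory_order_acquire);
  // Release guarantees steps 1-3 are visible before the new head is.
  atomic_store_explicit(&blocks, b, memory_order_release);
}

static size_t space_allocated(void) {
  size_t total = 0;
  // Acquire pairs with the release store above; a relaxed load here is
  // exactly what permits the 1, 4, 5, 6, 7, 2, 3 interleaving.
  for (block* b = atomic_load_explicit(&blocks, memory_order_acquire);
       b != NULL; b = b->next) {
    total += b->size;
  }
  return total;
}
```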
Delete the arena `Contains` helper; it's private and its only user is its own test.
PiperOrigin-RevId: 709918443
Previously, extensions and unknown fields were stored on opposite ends of a growing buffer:
```
|------unknown fields-------|---------unallocated space------|--extensions---|
```
Unknown fields were appended and extensions were prepended during parse. When either side ran into the other, the buffer was reallocated to fit, rounding up to the nearest power of 2. This meant that for a proto with 70,000 bytes of unknown fields, the total memory consumed could be up to 128+256+512+1024+2048+4096+8192+16384+32768+65536+131072=262016 bytes allocated in the arena. In the more common case of a single large, length-delimited field it'd be just 131072 bytes; but whether that's a 3.74x or a 1.87x multiple of the 70,000-byte payload, it's a lot of extra memory.
The new representation still does exponential reallocation, but only for pointers to normal arena allocations. We exploit the fact that arena allocations are aligned: the low bits of each stored pointer record whether it points to an extension or to a `upb_StringView` of unknown fields. This costs three pointers of overhead per unknown field and one pointer of overhead per extension, but that's a fixed overhead - we won't over-allocate large buffers for large unknown fields. If this overhead proves to be a problem, more compact representations could be implemented.
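A minimal sketch of the low-bit tagging trick (constants and helpers hypothetical, not upb's actual encoding):
```
#include <assert.h>
#include <stdint.h>

// Hypothetical one-bit discriminator stored in the low bit.
#define TAG_EXTENSION 0u
#define TAG_UNKNOWN 1u

static uintptr_t tag_ptr(void* p, unsigned tag) {
  assert(((uintptr_t)p & 1u) == 0);  // alignment guarantees a free bit
  return (uintptr_t)p | tag;
}

static unsigned ptr_tag(uintptr_t v) { return (unsigned)(v & 1u); }

static void* untag_ptr(uintptr_t v) { return (void*)(v & ~(uintptr_t)1); }
```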
In addition, because unknown field bytes are now in their own allocations, they are pointer-stable - in the future, this will allow us to exploit aliasing (when enabled during parse) for both unknown fields and lazy extensions (parsed from unknown fields), which can greatly reduce memory use for messages with many unknown, string, or bytes fields.
PiperOrigin-RevId: 708058272
Rename upb_Log2CeilingSize() to upb_RoundUpToPowerOfTwo() to minimize the chance of this confusion happening in the future.
In practice this condition could never be hit by descriptors generated by protoc, since FeatureSet is small and its fields are 1 byte on the wire.
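For reference, the rename distinguishes returning a size (70,000 -> 131,072) from returning an exponent (70,000 -> 17); a sketch of the classic bit-smearing implementation of the size version (not necessarily upb's exact code):
```
#include <stdint.h>

// Assumes 0 < v <= 2^31; exact powers of two map to themselves.
static uint32_t round_up_to_power_of_two(uint32_t v) {
  v--;           // so that exact powers of two are preserved
  v |= v >> 1;   // smear the highest set bit into every lower bit...
  v |= v >> 2;
  v |= v >> 4;
  v |= v >> 8;
  v |= v >> 16;
  return v + 1;  // ...then carry into the next power of two
}
```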
PiperOrigin-RevId: 702381123
Some qsort implementations will allocate buffers rather than sorting purely in place; this new algorithm avoids that and also runs in O(n) time rather than O(n log n).
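For illustration only - the algorithm in this change may differ - one common way to sort in O(n) with no heap allocation, assuming small integer keys, is a counting sort:
```
#include <stdint.h>
#include <string.h>

// Counting sort: O(n) time, fixed-size on-stack table, no heap use.
static void counting_sort_u8(uint8_t* v, size_t n) {
  size_t count[256] = {0};
  for (size_t i = 0; i < n; i++) count[v[i]]++;
  size_t out = 0;
  for (size_t k = 0; k < 256; k++) {
    memset(v + out, (int)k, count[k]);  // emit count[k] copies of key k
    out += count[k];
  }
}
```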
PiperOrigin-RevId: 702081436
Empty repeated/map extensions are semantically equivalent to an extension that is not present at all. We had code paths that treated them differently, which led to incorrect results.
In particular, we were considering `{.repeated_ext = []}` to be different from `{}` when comparing with `upb_Message_IsEqual()`. This change fixes this bug so that they will be considered equivalent.
PiperOrigin-RevId: 702072912
Before, the upb_ExtensionRegistry_AddArray API would just return a boolean
indicating whether the operation succeeded or failed. This is not descriptive
enough in some cases and made the error output confusing.
Specifically, when trying to register an extension with a duplicate extension
number, AddArray first performs a map lookup before inserting the extension
entry into the registry. The code handled lookup failure (due to a duplicate)
the same way as insertion failure (due to OOM), and printed an error message
claiming OOM when the actual problem was a duplicate entry.
This was acknowledged in a TODO in the AddArray code comment -- which is now
fixed. :)
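A sketch of the more descriptive return shape (names hypothetical, not the actual upb API):
```
// Hypothetical status enum; the point is that callers can now
// distinguish a duplicate entry from an out-of-memory failure.
typedef enum {
  kExtRegistry_Ok,
  kExtRegistry_DuplicateEntry,  // map lookup found an existing number
  kExtRegistry_OutOfMemory,     // inserting into the registry failed
} ExtRegistryStatus;
```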
PiperOrigin-RevId: 700764584
We previously hit this for NAN, but it appears the latest version of MSVC used in Windows GitHub runners has the same issue for INFINITY. The same strategy can be applied here.
This will be backported to fix broken release branches.
PiperOrigin-RevId: 700042372
After using upb I noticed that field accessors were not getting inlined properly under Clang. The cause appears to be the UPB_FORCEINLINE macro falling back to plain `static` under Clang, despite Clang fully supporting the required attributes.
Closes #17433
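A sketch of the macro shape after the fix (the exact conditions in upb's port_def.inc may differ) - the point is that Clang takes the attribute branch instead of falling through to plain `static`:
```
#if defined(__GNUC__) || defined(__clang__)
#define UPB_FORCEINLINE __inline__ __attribute__((always_inline)) static
#elif defined(_MSC_VER)
#define UPB_FORCEINLINE static __forceinline
#else
#define UPB_FORCEINLINE static
#endif
```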
COPYBARA_INTEGRATE_REVIEW=https://github.com/protocolbuffers/protobuf/pull/17433 from Vadmeme:main 57204e78cd
PiperOrigin-RevId: 699195477
Before, we were trying to work around the fact that we don't know the default depth limit. The logic is simpler and more robust if we take the default into account.
PiperOrigin-RevId: 698856552
Prior to this CL, the fuzz tests only checked that the code does not crash; they did not check any correctness properties. This CL adds correctness checking, verifying that we can round-trip through the wire format without losing or corrupting data.
This highlighted a minor bug in the encoder where the depth limit check was off by one (too strict). This CL makes the encoder more accepting when checking the depth limit.
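A sketch of the round-trip property now being checked (harness names are illustrative, not the real fuzzer API):
```
#include <stdbool.h>
#include <stddef.h>

// Hypothetical harness API; the real test drives upb's decoder/encoder.
typedef struct Msg Msg;
Msg* decode(const char* buf, size_t len);
char* encode(const Msg* m, size_t* out_len);
bool msg_equal(const Msg* a, const Msg* b);

// Decoding, re-encoding, and decoding again must yield an equal
// message: nothing lost, nothing corrupted.
static bool round_trips(const char* in, size_t in_len) {
  Msg* m1 = decode(in, in_len);
  if (!m1) return true;  // input didn't parse; nothing to round-trip
  size_t out_len;
  char* out = encode(m1, &out_len);
  Msg* m2 = decode(out, out_len);
  return m2 != NULL && msg_equal(m1, m2);
}
```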
PiperOrigin-RevId: 698527429