protobuf

Commit Graph

Author	SHA1	Message	Date
Protobuf Team Bot	05c9bbd41d	Auto-generate files after cl/691951661	4 months ago
Tony Liao	79ccb8fac9	Maintain an invariant that hasbit is set iff string is nondefault. As an optimization, string fields are initialized with a pointer to a global immutable std::string instance and create a local std::string only when "set". If a field has hasbits, it presents a possibility that the hasbit is set but the string field is still pointing to the global empty string instance. This can happen, for example, when the field is implicit-presence but hasbit has been generated for it. Maintaining an invariant that hasbit is set iff string is nondefault can simplify the implementation of destructors and message.Clear(). The code would not need to branch further after scanning hasbits, instead it can always assume that a local std::string object exists as soon as it sees that the hasbit is set. However, this does require an else block in the merge implementation of implicit-presence string fields. When hasbits are implemented for implicit-presence string fields, merging from a non-present (i.e. empty) string field requires a nondefault std::string instance to be created. On the other hand, branches in Clear() can be eliminated. We think this is the right tradeoff because: 1. The allocation of nondefault string instance can only happen when the source proto has hasbit set but the field is empty. This is a relatively rare scenario. 2. Clear() is called every time a protobuf object is "overwritten" via an assignment operator or ParseFrom(). This happens probably more frequently than 1. PiperOrigin-RevId: 691951661	4 months ago
Protobuf Team Bot	7bfe237b45	Auto-generate files after cl/691945237	4 months ago
Tony Liao	3e82ed436b	Generate internal hasbits for singular proto3 implicit presence fields. N.B.: - This change is not intended to affect any well-defined protobuf behaviour in an observable way. - The wire parsing codepath is not affected. - This change only affects the C++ protobuf implementation (other languages are not affected). - sizeof proto3 message objects may increase in 32-bit increments to accommodate hasbits. - When profiled on some of Google's largest binaries, we have seen a code size increase of ~0.1%, which we consider to be a reasonable increase. There are quite a few terminologies in the title: - singular: a field that is not repeated, not oneof, not extension, not lazy, just a field with a simple primitive type (number or boolean), or string/bytes. - proto3: describes behaviour consistent to the "proto3" syntax. This is equivalent to `edition = "2023"` with `option features.field_presence = IMPLICIT;`. - implicit presence: describes behaviour consistent with "non-optional" fields in proto3. This is described in more detail in https://protobuf.dev/programming-guides/field_presence/#presence-in-proto3-apis This change enables C++ proto3 objects to generate hasbits for regular proto3 (i.e. non-`optional`) fields. This code change might make certain codepaths negligibly more efficient, but large improvement or regression is unlikely. A larger performance improvement is expected from generating hasbits for repeated fields -- this change will pave the way for future work there. Hasbits in C++ will have slightly different semantics for implicit presence fields. In the past, all hasbits are true field presence indicators. If the hasbit is set, the field is guaranteed to be present; if the hasbit is unset, the field is guaranteed to be missing. This change introduces a new hasbit mode that I will call "hint hasbits", denoted by a newly-introduced enum, `internal::cpp::HasbitMode::kHintHasbit`. For implicit presence fields, it may be possible to mutate the field and have it end up as a zero field, especially with `mutable_foo` APIs. To handle those cases correctly, we unconditionally set the hasbit when `mutable_foo` is called, then we must do an additional check for field emptiness before serializing the field onto the wire. PiperOrigin-RevId: 691945237	4 months ago
Tony Liao	56580bd0d4	Remove implementation related to lazy fields. PiperOrigin-RevId: 691936133	4 months ago
Protobuf Team Bot	f2cf85c941	Add name mangling to nested names that collide with known generated names, like the `New` function under messages. This allows C++ codegen to compile on .proto files where it did not work before. PiperOrigin-RevId: 691852276	4 months ago
Protobuf Team Bot	f27532d0bd	Check that there are unknown fields before calling MutableUnknownFields to clear it. Otherwise, we end up creating the unknown field set just to call Clear on it, which wastes memory. PiperOrigin-RevId: 691844650	4 months ago
Protobuf Team Bot	31fae84678	Move fixed_address_empty_string to rodata section when possible. In C++20 take advantage of constexpr support to do so. PiperOrigin-RevId: 691522919	4 months ago
Tony Liao	86e767f7ce	Fix header includes on compiler/java/helpers.h. PiperOrigin-RevId: 691514037	4 months ago
Protobuf Team Bot	adc8718150	Auto-generate files after cl/691426487	4 months ago
Protobuf Team Bot	0c0cdb3f85	Change namespace scope constants from internal linkage to `inline` linkage to avoid potential ODR violations and code bloat. Also, remove the now unnecessary redundant definition of class-scope `static constexpr` variables. PiperOrigin-RevId: 691426487	4 months ago
Tony Liao	774a10798c	Remove dead branch in protobuf reflection for message fields. Message fields can never have implicit presence, but we have logic in ClearField that deallocates the message field and reassigns nullptr if the field is a "proto3" field. This snippet is the remnants of an old implementation of message field reflection when proto3 was first introduced (when the initial idea is to use open structs for everything). During implementation however, we ended up preserving explicit presence behavior for message fields. PiperOrigin-RevId: 691199008	4 months ago
Protobuf Team Bot	9a13553f59	Internal change PiperOrigin-RevId: 691173359	4 months ago
Ilya Tokar	a850a5c92a	Add prefetch to RepeatedPtrFieldBase::MergeFrom PiperOrigin-RevId: 691136548	4 months ago
Protobuf Team Bot	6cb7140294	Auto-generate files after cl/691046942	4 months ago
Protobuf Team Bot	083bbd4fcd	Fix bug where enums only got traits if they are used in a field. Now they always get traits, even if unused. PiperOrigin-RevId: 691046942	4 months ago
Protobuf Team Bot	116edcefe3	Create some tests for GetClassName for edition < 2024 so that we can make sure these invariants won't be changed/broken when merging to the edition 2024 feature implementations. PiperOrigin-RevId: 691010930	4 months ago
Protobuf Team Bot	1aff4adfc6	Internal change PiperOrigin-RevId: 690798605	4 months ago
Sandy Zhang	e8e3253f63	Breaking Change: Remove deprecated RepeatedPtrField::ClearedCount(). These have been marked ABSL_DEPRECATED as of v4.22.x and was intended for removal in v5.26.x: https://protobuf.dev/news/2023-12-13/#remove-deprecated-clear-apis-on-repeated-fields. Users of this API should consider migrating to arenas for better memory reuse. See https://protobuf.dev/news/2024-10-02/#repeatedptrfieldclearedcount and https://protobuf.dev/support/migration/#cleared-elements PiperOrigin-RevId: 690652745	4 months ago
Protobuf Team Bot	24ef3d685a	Move the `throw` statement to the template in the header. PiperOrigin-RevId: 690650749	4 months ago
Protobuf Team Bot	dcd10a0f68	Auto-generate files after cl/689943588	4 months ago
Mike Kruskal	e98a263d22	Add some java plugin integration tests. PiperOrigin-RevId: 689943588	4 months ago
Chris Kennelly	9f6c3d5f39	Enforce SetNonZero invariant with an assertion. PiperOrigin-RevId: 689833707	4 months ago
Chris Kennelly	03ed5d970b	Directly test calling ByteSizeLong() on default instances. These instances may be in `.rodata`, so ByteSizeLong() cannot write to the _cached_size_. The call itself should succeed. PiperOrigin-RevId: 689822451	4 months ago
Protobuf Team Bot	b5fca3e1b5	Add a consistency check to the parser to verify that has bits are valid on parse failure, and fix the issues found by tests. Keeping the invariant can help performance and future changes. PiperOrigin-RevId: 689799465	4 months ago
Tony Liao	54d068e11c	Breaking Change: Add ASAN poisoning after clearing oneof messages on arena. Note: This change primarily affects debug + ASAN builds using protobuf arenas. If this change causes a crash in your debug build, it probably means that there is a use-after-free bug in your program. This change has already been implemented and battle-tested within Google for some time. Oneof messages on the regular heap should not be affected because the memory they hold are already deleted. Users will already see use-after-free errors if they attempt to access heap-allocated oneof messages after calling Clear(). When a protobuf message is cleared, all raw pointers should be invalidated because undefined things may happen to any of the fields pointed to by mutable_foo() APIs. While destructors may not necessarily be invoked, Clear() should be considered a pointer invalidation event. #test-continuous PiperOrigin-RevId: 689569669	4 months ago
Sandy Zhang	d83a5365d1	Breaking Change: Remove deprecated Arena::CreateMessage. These have been marked ABSL_DEPRECATED as of v5.27.x and is replaced with Arena::Create which has been available since v3.0.0. See https://engdoc.corp.google.com/eng/doc/devguide/proto/news/2024-10-02.md#arenacreatemessage PiperOrigin-RevId: 689557672	4 months ago
Protobuf Team Bot	33bbbebf20	Change DynamicCastMessage to throw a `std::bad_cast` exception when exceptions are enabled. This makes the function a drop-in replacement for `dynamic_cast` when the user is expecting exceptions to be thrown. PiperOrigin-RevId: 689419852	4 months ago
Protobuf Team Bot	f549fc3ccc	Simplify proto2::(anonymous namespace)::FlatAllocatorImpl::GetFieldNameCase and reuse absl's ascii lower/uppercasing. By just checking for upper-cased characters, rather than digits and lower-cased characters, we have to perform fewer comparisons. This should be safe because absl::AsciiStrToLower only operates on upper-case characters. PiperOrigin-RevId: 688453016	4 months ago
Joshua Haberman	2ac862f36c	Fixed a missing check in wire format verification. Also added some extra DCHECKs to more easily catch issues like this in the future. PiperOrigin-RevId: 688347347	5 months ago
Protobuf Team Bot	f92335b36d	Comment change: clarifies that the field/value order is based on textual order in the file, not the order of the enums. Clarifies that reordering `enum` fields (even without changing their IDs) will change the order of the `value` indices. (This means that a seemingly "no-op" change to reorganize enums may affect code that (incorrectly) relied on the order of `value()`. PiperOrigin-RevId: 688297400	5 months ago
Evan Brown	f971ed3f36	Use Layout::WithStaticSizes in SerialArenaChunk to improve performance of Layout computations. PiperOrigin-RevId: 688155852	5 months ago
Protobuf Team Bot	ba85b2003f	Auto-generate files after cl/688151072	5 months ago
Protobuf Team Bot	8f7aab29b2	Migrate enum extensions to data based validation instead of function pointer based one. This reduces binary size and runtime dispatch costs. Also, since we are changing the declaration of the type trait, take the opportunity to remove the validator from the template parameters. It can be inferred directly from the type if we add traits for the enum. PiperOrigin-RevId: 688151072	5 months ago
Protobuf Team Bot	a98b0bec43	Improve ImplicitWeakMessage to allocate the internal string in the arena, and allow skipping the destructor. PiperOrigin-RevId: 688142492	5 months ago
Tony Liao	fe535930d3	Bump minimum C++ version to C++17 after branch cut for v29. Branch cut for v29 is done on 2024-09-30: https://github.com/protocolbuffers/protobuf/releases/tag/v29.0-rc1 The next version v30 will be a breaking release. The release date is scheduled after the EOL of C++14 support on 2024-12-15 for Google open source projects generally: https://github.com/google/oss-policies-info/blob/main/foundational-cxx-support-matrix.md This commit allows us to start taking advantage of C++17 features now. Some issues I ran into while upgrading: Two GCC 9.5 bugs related to -Wunused-but-set-parameter: - https://godbolt.org/z/qo51cKe7b - https://godbolt.org/z/65qW3vGhP Another GCC warning related to -Wself-assign in a template. There is a custom ASAN check that is not yet open sourced. I'll see if I can open source them in a subsequent commit. #test-continuous PiperOrigin-RevId: 687435042	5 months ago
Protobuf Team Bot	18903c4a4c	Implement Java feature `use_old_outer_classname_default` for edition 2024. PiperOrigin-RevId: 687389326	5 months ago
Adam Cozzette	cbb3edd86d	Rust C++: get all map fields onto a common implementation of ProxiedInMapValue This CL migrates messages, enums, and primitive types all onto the same blanket implementation of the `ProxiedInMapValue` trait. This gets us to the point where messages and enums no longer need to generate any significant amount of extra code just in case they might be used as a map value. There are a few big pieces to this: - I generalized the message-specific FFI endpoints in `rust/cpp_kernel/map.cc` to be able to additionally handle enums and primitive types as values. This mostly consisted of replacing `MessageLite*` parameters with a new `MapValue` tagged union. - On the Rust side, I added a new blanket implementation of `ProxiedInMapValue` in rust/cpp.rs. It relies on its value type to implement a new `CppMapTypeConversions` trait so that it can convert to and from the `MapValue` tagged union used for FFI. - In the Rust generated code, I deleted the generated `ProxiedInMapValue` implementations for messages and enums and replaced them with implementations of the `CppMapTypeConversions` trait. PiperOrigin-RevId: 687355817	5 months ago
Protobuf Team Bot	dcc1c08ef7	Internal changes PiperOrigin-RevId: 687342496	5 months ago
Adam Cozzette	7d619e8974	Rust: fix extra copy in map setters When you call a map field setter, we currently make an unnecessary extra copy, so this CL fixes that problem. I followed the example of how we already handle this for repeated field setters. This required adding a new move setter thunk for map fields with the C++ kernel. Originally I tried instead to add an FFI endpoint that could swap two `RawMap` pointers, but it turned out to be difficult to implement this in a way that worked correctly when the two maps are not on the same arena. PiperOrigin-RevId: 687334655	5 months ago
Protobuf Team Bot	f58849b603	Automated rollback of commit `bd03560d9f`. PiperOrigin-RevId: 687328386	5 months ago
Protobuf Team Bot	3fe0c75fc9	Internal change PiperOrigin-RevId: 686989033	5 months ago
Protobuf Team Bot	12eadfdfd9	Handle `PROTOBUF_ENABLE_DEBUG_LOGGING_MAY_LEAK_PII` being undefined Check if `defined(PROTOBUF_ENABLE_DEBUG_LOGGING_MAY_LEAK_PII)` before trying to use it. PiperOrigin-RevId: 686945589	5 months ago
Tony Liao	571ff6f2b2	Uncouple Java hasbits from C++ hasbits. There is really no reason for the Java compiler code to call into the internal C++ implementation of HasHasbit. In the future, the two implementations may evolve separately and this decoupling can make it easier. PiperOrigin-RevId: 686672397	5 months ago
Zoey Greer	fbe6168a02	Address warning regarding incorrectly-terminated heredoc in `src/google/protobuf/compiler/test_plugin_injection.bzl` (#18238 ) Closes #18238 COPYBARA_INTEGRATE_REVIEW=https://github.com/protocolbuffers/protobuf/pull/18238 from tempoz:tempoz-fix-heredoc `87644ab5e9` PiperOrigin-RevId: 686625159	5 months ago
Protobuf Team Bot	ecec105916	Add `DynamicCastMessage` overloads for `std::shared_ptr` to replace uses of `std::dynamic_pointer_cast`. PiperOrigin-RevId: 686603280	5 months ago
Mark Hansen	f63b0ece75	Combine dead-code gencode switch clauses for smaller codegen For messages with no required fields, SET_MEMOIZED_IS_INITIALIZED is never called, see the early returns above http://google3/third_party/java_src/protobuf/current/java/com/google/protobuf/GeneratedMessageLite.java;l=1526;rcl=684621860 if we return '1'. So we can put whatever logic in here we want, and we can avoid codegenning the 'return null' instructions. This should generate slightly smaller dex and oat code. https://godbolt.org/z/bGWaf68xv PiperOrigin-RevId: 686323338	5 months ago
Mark Hansen	b8557ebbd4	Fix indentation on closing brace in gencode PiperOrigin-RevId: 686304907	5 months ago
Mark Hansen	0a419d4ce6	Fix indentation in lite message clear by adding newline PiperOrigin-RevId: 686300005	5 months ago
Mark Hansen	1286f4d931	Tidy up lite gencode by adding newline PiperOrigin-RevId: 686293878	5 months ago

1 2 3 4 5 ...

5621 Commits (a422268dda2d73a1b47e4559e3ee66b9385aa677)