protobuf

Commit Graph

Author	SHA1	Message	Date
Eric Salo	34c692f0f9	upb: cast gencode enums in bitwise ops to make C++20 happy Reference: https://github.com/protocolbuffers/upb/issues/1340 PiperOrigin-RevId: 538332530	1 year ago
Protobuf Team Bot	73ee41cbb2	license changes PiperOrigin-RevId: 534600788	2 years ago
Joshua Haberman	6c10ce248d	Removed old ResizeArray() API per TODO PiperOrigin-RevId: 530375932	2 years ago
Adam Cozzette	7bd972db76	Internal change PiperOrigin-RevId: 530370607	2 years ago
Joshua Haberman	18b98a8ec1	Give sub-messages and sub-enums distinct initialization code paths. Prior to this change, all sub-messages and sub-enums were initialized to NULL. Going forward, sub-messages will need to be initialized differently than sub-enums. To facilitate this, we change the order of subs, so that sub-messages always come before sub-enums. Then when we allocate the subs, we can initialize them in two separate loops. This unfortunately requires one extra iteration over the fields if any closed enums are present, to adjust the `submsg_index` according to how many sub-messages were seen. This CL is the first step towards changing how we handle unlinked sub-messages. PiperOrigin-RevId: 529100966	2 years ago
Thomas Van Lenten	2282505327	Move to proto_common for all upb aspects to fix numerous tricky edge cases and simplify the code PiperOrigin-RevId: 527937369	2 years ago
Joshua Haberman	88d5b91810	Move to proto_common for all upb aspects to fix numerous tricky edge cases and simplify the code PiperOrigin-RevId: 527904449	2 years ago
Eric Salo	4c6931f22a	upb: clean up the Dart ffi build targets PiperOrigin-RevId: 527669656	2 years ago
Eric Salo	8ad3a76d8f	upb: Dart pb_runtime now uses the google3 ffigen everywhere PiperOrigin-RevId: 527599631	2 years ago
Eric Salo	3475ebec94	upb: move the split64 accessors out of upbc/ PiperOrigin-RevId: 527396510	2 years ago
Eric Salo	3a0e3f22cd	upb: use google3 ffigen to build the upbdev function wrappers PiperOrigin-RevId: 527372793	2 years ago
Protobuf Team Bot	30af08f511	Add private accessors for repeated array get to be used by C++ RepeatedField implementation. PiperOrigin-RevId: 526656160	2 years ago
Mike Kruskal	9fa51d0bc9	Enable Windows CI PiperOrigin-RevId: 526638532	2 years ago
Protobuf Team Bot	0fa12dce2a	Fix field name conflict resolution for `has_` prefix. Note: Code looks duplicated but in C case, it is for performance. For C++, C++ and C may diverge in the future for certain methods. PiperOrigin-RevId: 525826831	2 years ago
Joshua Haberman	339fdb5e7b	Hide `upb_MiniTableField.descriptortype` with `UPB_PRIVATE()` macro PiperOrigin-RevId: 524371449	2 years ago
Joshua Haberman	df93cf65a2	Hide upb_MiniTableField.submsg_index with new `UPB_PRIVATE()` macro The fields of upb_MiniTableField are intended to be internal-only, accessed only through public functions like `upb_MiniTable_GetSubMessageTable()`. But over time, clients have started accessing many of these fields directly. This is an easy mistake to make, as there is no clear signal that the fields should not be used in applications. This makes the implementation difficult to change without breaking users. The new `UPB_PRIVATE()` macro appends an unpredictable string to each private symbol. This makes it very difficult to accidentally use a private symbol, since users would need to write something like `field->submsg_index_dont_copy_me__upb_internal_use_only`. This is still possible to do, but it leaves a clear wart in the code showing that an an encapsulation break has occurred. The `UPB_PRIVATE()` macro itself is defined in `port/def.inc`, which users cannot include directly. Once we land this, more such CLs will follow for the other fields of `upb_MiniTable*`. We will add inline functions as needed to provide the semantic functionality needed by users. PiperOrigin-RevId: 523166901	2 years ago
Mike Kruskal	d260ab343e	Add windows CI PiperOrigin-RevId: 520478558	2 years ago
Joshua Haberman	8562ccc1f0	Use _fileno(stdin) instead of STDIN_FILENO on Windows The `*_FILENO` constants don't exist on Windows. PiperOrigin-RevId: 518553512	2 years ago
Deanna Garcia	b7437a1b0e	Update UPB main's protobuf dependency. This also requires some editing in the rewrites since importing plugin.proto shouldn't require the src prefix after https://github.com/protocolbuffers/protobuf/pull/11991. PiperOrigin-RevId: 518387501	2 years ago
Joshua Haberman	2e9278de50	Fix for win32 binary i/o PiperOrigin-RevId: 516951027	2 years ago
Joshua Haberman	b9fb58bba5	Emit upbdev JSON using numeric representation for enums. PiperOrigin-RevId: 516273336	2 years ago
Joshua Haberman	bdee30b0a6	Added special case for INT64_MIN in the codegen PiperOrigin-RevId: 516236492	2 years ago
Eric Salo	a6ce73370f	upb: implement unsigned Int64 list PiperOrigin-RevId: 514312150	2 years ago
Joshua Haberman	56c4a42cdd	Added new APIs for linking a MiniTable all at one time The new API upb_MiniTable_Link() links all sub-messages and sub-enums at a single time, by accepting an array of sub-tables and sub-enums. The order of these sub-tables can be queried using a separate function `upb_MiniTable_GetSubList()`, and this information is added to `CodeGeneratorRequest` as part of the upb-specific info. PiperOrigin-RevId: 513970874	2 years ago
Eric Salo	dfd5f176f4	implement Dart Int64 repeated fields PiperOrigin-RevId: 513854105	2 years ago
Eric Salo	a7a097d443	remove generated hazzer for map fields PiperOrigin-RevId: 510449677	2 years ago
Mike Kruskal	662497f1d3	Removing non-deterministic pointer sort. We're already doing a proper string sort in SortedEnums as of cl/503574792, but then we follow it up with a sort on the char* pointers. PiperOrigin-RevId: 506778694	2 years ago
Joshua Haberman	150847d56e	Removed unnecessary includes. PiperOrigin-RevId: 506751620	2 years ago
Protobuf Team Bot	2a7e743a15	Avoid automatic variables in functions using setjmp. According to https://en.cppreference.com/w/c/program/setjmp automatic variables modified in a function calling setjmp can have indeterminate values. Instead, refactor all functions calling setjmp so that the function calling setjmp doesn’t have any local variables. Part VI: Code generator. PiperOrigin-RevId: 504563663	2 years ago
Mike Kruskal	a1abf835d2	Removed upb dependencies on absl/log. absl/log is not yet released in any ABSL LTS. PiperOrigin-RevId: 503575398	2 years ago
Joshua Haberman	a780ffae65	Fixed non-determinism in the upb compiler. PiperOrigin-RevId: 503574792	2 years ago
Joshua Haberman	aa68739fa7	Fixed a bug with field numbers >255 in a oneof. The compiler should not assert-fail in this case, it should merely decline to emit a fast parser for that field. PiperOrigin-RevId: 502919928	2 years ago
Mike Kruskal	a77cec4f4f	Remove C++17 feature that's incompatible with C++14 support PiperOrigin-RevId: 501989602	2 years ago
Joshua Haberman	4f02fc4790	Removed upb dependencies on absl/log. absl/log is not yet released in any ABSL LTS. PiperOrigin-RevId: 501965150	2 years ago
Joshua Haberman	e41a2d7ba0	upb is self-hosting! This CL changes the upb compiler to no longer depend on C++ protobuf libraries. upb now uses its own reflection libraries to implement its code generator. # Key Benefits 1. upb can now use its own reflection libraries throughout the compiler. This makes upb more consistent and principled, and gives us more chances to dogfood our own C++ reflection API. This highlighted several parts of the C++ reflection API that were incomplete. 2. This CL removes code duplication that previously existed in the compiler. The upb reflection library has code to build MiniDescriptors and MiniTables out of descriptors, but prior to this CL the upb compiler could not use it. The upb compiler had a separate copy of this logic, and the compiler's copy of this logic was especially tricky and hard to maintain. This CL removes the separate copy of that logic. 3. This CL (mostly) removes upb's dependency on the C++ protobuf library. We still depend on `protoc` (the binary), but the runtime and compiler no longer link against C++'s libraries. This opens up the possibility of speeding up some builds significantly if we can use a prebuilt `protoc` binary. # Bootstrap Stages To bootstrap, we check in a copy of our generated code for `descriptor.proto` and `plugin.proto`. This allows the compiler to depend on the generated code for these two protos without creating a circular dependency. This code is checked in to the `stage0` directory. The bootstrapping process is divided into a few stages. All `cc_library()`, `upb_proto_library()`, and `cc_binary()` targets that would otherwise be circular participate in this staging process. That currently includes: * `//third_party/upb:descriptor_upb_proto` * `//third_party/upb:plugin_upb_proto` * `//third_party/upb:reflection` * `//third_party/upb:reflection_internal` * `//third_party/upbc:common` * `//third_party/upbc:file_layout` * `//third_party/upbc:plugin` * `//third_party/upbc:protoc-gen-upb` For each of these targets, we produce a rule for each stage (the logic for this is nicely encapsulated in Blaze/Bazel macros like `bootstrap_cc_library()` and `bootstrap_upb_proto_library()`, so the `BUILD` file remains readable). For example: * `//third_party/upb:descriptor_upb_proto_stage0` * `//third_party/upb:descriptor_upb_proto_stage1` * `//third_party/upb:descriptor_upb_proto` The stages are: 1. `stage0`: This uses the checked-in version of the generated code. The stage0 compiler is correct and outputs the same code as all other compilers, but it is unnecessarily slow because its protos were compiled in bootstrap mode. The stage0 compiler is used to generate protos for stage1. 2. `stage1`: The stage1 compiler is correct and fast, and therefore we use it in almost all cases (eg. `upb_proto_library()`). However its own protos were not generated using `upb_proto_library()`, so its `cc_library()` targets cannot be safely mixed with `upb_proto_library()`, as this would lead to duplicate symbols. 3. final (no stage): The final compiler is identical to the `stage1` compiler. The only difference is that its protos were built with `upb_proto_library()`. This doesn't matter very much for the compiler binary, but for the `cc_library()` targets like `//third_party/upb:reflection`, only the final targets can be safely linked in by other applications. # "Bootstrap Mode" Protos The checked-in generated code is generated in a special "bootstrap" mode that is a bit different than normal generated code. Bootstrap mode avoids depending on the internal representation of MiniTables or the messages, at the cost of slower runtime performance. Bootstrap mode only interacts with MiniTables and messages using public APIs such as `upb_MiniTable_Build()`, `upb_Message_GetInt32()`, etc. This is very important as it allows us to change the internal representation without needing to regenerate our bootstrap protos. This will make it far easier to write CLs that change the internal representation, because it avoids the awkward dance of trying to regenerate the bootstrap protos when the compiler itself is broken due to bootstrap protos being out of date. The bootstrap generated code does have two downsides: 1. The accessors are less efficient, because they look up MiniTable fields by number instead of hard-coding the MiniTableField into the generated code. 2. It requires runtime initialization of the MiniTables, which costs CPU cycles at startup, and also allocates memory which is never freed. Per google3 rules this is not really a leak, since this memory is still reachable via static variables, but it is undesirable in many contexts. We could fix this part by introducing the equivalent of `google::protobuf::ShutdownProtobufLibrary()`). These downsides are fine for the bootstrapping process, but they are reason enough not to enable bootstrap mode in general for all protos. # Bootstrapping Always Uses OSS Protos To enable smooth syncing between Google3 and OSS, we always use an OSS version of the checked in generated code for `stage0`, even in google3. This requires that the google3 code can be switched to reference the OSS proto names using a preprocessor define. We introduce the `UPB_DESC(xyz)` macro for this, which will expand into either `proto2_xyz` or `google_protobuf_xyz`. Any libraries used in `stage0` must use `UPB_DESC(xyz)` rather than refer to the symbol names directly. PiperOrigin-RevId: 501458451	2 years ago
Eric Salo	0e286b4037	hook up Dart Int64 scalars PiperOrigin-RevId: 500017090	2 years ago
Eric Salo	c628e53dde	upb_MiniTable_GetMutableMap() -> upb_Message_GetOrCreateMutableMap() PiperOrigin-RevId: 499371815	2 years ago
Joshua Haberman	143132fa27	Make upb's generated code agnostic to fasttable. This simplifies the code generation by making output agnostic to whether fasttables will be used or not. This grows the generated code in the common case, but when fasttables are not being used the preprocessor will strip away the unused tables. PiperOrigin-RevId: 499340805	2 years ago
Joshua Haberman	003ecb3125	Rolling forward: expanded GCC diagnostic pragmas to cover all of accessors.h. PiperOrigin-RevId: 497983863	2 years ago
Protobuf Team Bot	565b28c0d1	Unified accessor API for repeated field getters and setters. This CL eliminates the last remaining callers of GetFieldOffset(), therefore opening the door to a more principled bootstrapping process. PiperOrigin-RevId: 497871886	2 years ago
Joshua Haberman	48ad4bbd0f	Unified accessor API for repeated field getters and setters. This CL eliminates the last remaining callers of GetFieldOffset(), therefore opening the door to a more principled bootstrapping process. PiperOrigin-RevId: 497864910	2 years ago
Eric Salo	00623c3cde	delete obsolete array wrapper funcs from upbc_so PiperOrigin-RevId: 497607419	2 years ago
Joshua Haberman	0f938ec4df	Unified accessor API for map getters and setters. This is part of the ongoing effort to remove any hard-coding of layout offsets into the generated code (except via `upb_MiniTableField` values). PiperOrigin-RevId: 497281306	2 years ago
Joshua Haberman	e34e3bd328	Unified accessor API for map getters and setters. This is part of the ongoing effort to remove any hard-coding of layout offsets into the generated code (except via `upb_MiniTableField` values). PiperOrigin-RevId: 497266785	2 years ago
Joshua Haberman	45dfd77d87	Unified accessor API for map getters and setters. This is part of the ongoing effort to remove any hard-coding of layout offsets into the generated code (except via `upb_MiniTableField` values). PiperOrigin-RevId: 497238313	2 years ago
Eric Salo	abed1995a9	implement some simple array accessors for Dart add upb_Message_GetOrCreateMutableArray() add some array wrappers to upbc_so.c PiperOrigin-RevId: 495976356	2 years ago
Eric Salo	d9ee7f5b10	implement end-to-end dart unit test PiperOrigin-RevId: 495384181	2 years ago
Joshua Haberman	fa2481ddeb	Unified accessor for WhichOneof(). PiperOrigin-RevId: 495220998	2 years ago
Eric Salo	3bac8780cb	normalize most of the message accessors: - Rename the accessors from upb_MiniTable_Foo() to upb_Message_Foo() - delete _upb_Message_Clearext() which is now redundant - Allow the getters and setters to accept both extension and non-extension fields - Add a (upb_Arena*) param to setters (only needed for extensions) - Change setters from void to bool (since extensions may require allocations) PiperOrigin-RevId: 493760399	2 years ago
Eric Salo	f458e05718	add some UPB_API / UPB_API_INTERNAL declarations PiperOrigin-RevId: 492551734	2 years ago

1 2 3 4 5 ...

296 Commits (cbacdf152a04abc7e849b68ba6c4eaeb3c3669e3)