protobuf

Commit Graph

Author	SHA1	Message	Date
Adam Cozzette	e02b6a9335	Add new search path for utf8_range in CMake build This will allow us to run the CMake test from within the protobuf repo's Bazel workspace. The directory structure is a little bit different depending on which workspace the test is invoked in. PiperOrigin-RevId: 557275295	1 year ago
Protobuf Team Bot	c552102d66	No public description PiperOrigin-RevId: 556849883	1 year ago
Adam Cozzette	5aca728f72	Reformat copyright headers PiperOrigin-RevId: 554509301	1 year ago
Joshua Haberman	ba500734c3	Split the JSON rules out of the main BUILD file and removed obsolete forwarding headers PiperOrigin-RevId: 541016462	1 year ago
Protobuf Team Bot	e83207ab8b	cleanup: generate a cleaner CMake file This simplifies the CMake code to ask for the minimum required version and also allow newer policies. It also uses `target_include_directories()` to set the header search path in each library (and their downstream dependencies). Using `include_directories()` is not idiomatic in CMake >= 3.0. It sets the include path for all targets and one may need to have a few targets with a different search path. PiperOrigin-RevId: 538450541	2 years ago
Thomas Van Lenten	2282505327	Move to proto_common for all upb aspects to fix numerous tricky edge cases and simplify the code PiperOrigin-RevId: 527937369	2 years ago
Joshua Haberman	88d5b91810	Move to proto_common for all upb aspects to fix numerous tricky edge cases and simplify the code PiperOrigin-RevId: 527904449	2 years ago
Mike Kruskal	f4d045aa92	Only include utf8_range if it hasn't been included already See #1201 PiperOrigin-RevId: 518333273	2 years ago
Deanna Garcia	1043eee891	Update dependency on com_google_googletest to use the newly added googletest_deps to install transitive dependencies. PiperOrigin-RevId: 517217973	2 years ago
Joshua Haberman	e41a2d7ba0	upb is self-hosting! This CL changes the upb compiler to no longer depend on C++ protobuf libraries. upb now uses its own reflection libraries to implement its code generator. # Key Benefits 1. upb can now use its own reflection libraries throughout the compiler. This makes upb more consistent and principled, and gives us more chances to dogfood our own C++ reflection API. This highlighted several parts of the C++ reflection API that were incomplete. 2. This CL removes code duplication that previously existed in the compiler. The upb reflection library has code to build MiniDescriptors and MiniTables out of descriptors, but prior to this CL the upb compiler could not use it. The upb compiler had a separate copy of this logic, and the compiler's copy of this logic was especially tricky and hard to maintain. This CL removes the separate copy of that logic. 3. This CL (mostly) removes upb's dependency on the C++ protobuf library. We still depend on `protoc` (the binary), but the runtime and compiler no longer link against C++'s libraries. This opens up the possibility of speeding up some builds significantly if we can use a prebuilt `protoc` binary. # Bootstrap Stages To bootstrap, we check in a copy of our generated code for `descriptor.proto` and `plugin.proto`. This allows the compiler to depend on the generated code for these two protos without creating a circular dependency. This code is checked in to the `stage0` directory. The bootstrapping process is divided into a few stages. All `cc_library()`, `upb_proto_library()`, and `cc_binary()` targets that would otherwise be circular participate in this staging process. That currently includes: * `//third_party/upb:descriptor_upb_proto` * `//third_party/upb:plugin_upb_proto` * `//third_party/upb:reflection` * `//third_party/upb:reflection_internal` * `//third_party/upbc:common` * `//third_party/upbc:file_layout` * `//third_party/upbc:plugin` * `//third_party/upbc:protoc-gen-upb` For each of these targets, we produce a rule for each stage (the logic for this is nicely encapsulated in Blaze/Bazel macros like `bootstrap_cc_library()` and `bootstrap_upb_proto_library()`, so the `BUILD` file remains readable). For example: * `//third_party/upb:descriptor_upb_proto_stage0` * `//third_party/upb:descriptor_upb_proto_stage1` * `//third_party/upb:descriptor_upb_proto` The stages are: 1. `stage0`: This uses the checked-in version of the generated code. The stage0 compiler is correct and outputs the same code as all other compilers, but it is unnecessarily slow because its protos were compiled in bootstrap mode. The stage0 compiler is used to generate protos for stage1. 2. `stage1`: The stage1 compiler is correct and fast, and therefore we use it in almost all cases (eg. `upb_proto_library()`). However its own protos were not generated using `upb_proto_library()`, so its `cc_library()` targets cannot be safely mixed with `upb_proto_library()`, as this would lead to duplicate symbols. 3. final (no stage): The final compiler is identical to the `stage1` compiler. The only difference is that its protos were built with `upb_proto_library()`. This doesn't matter very much for the compiler binary, but for the `cc_library()` targets like `//third_party/upb:reflection`, only the final targets can be safely linked in by other applications. # "Bootstrap Mode" Protos The checked-in generated code is generated in a special "bootstrap" mode that is a bit different than normal generated code. Bootstrap mode avoids depending on the internal representation of MiniTables or the messages, at the cost of slower runtime performance. Bootstrap mode only interacts with MiniTables and messages using public APIs such as `upb_MiniTable_Build()`, `upb_Message_GetInt32()`, etc. This is very important as it allows us to change the internal representation without needing to regenerate our bootstrap protos. This will make it far easier to write CLs that change the internal representation, because it avoids the awkward dance of trying to regenerate the bootstrap protos when the compiler itself is broken due to bootstrap protos being out of date. The bootstrap generated code does have two downsides: 1. The accessors are less efficient, because they look up MiniTable fields by number instead of hard-coding the MiniTableField into the generated code. 2. It requires runtime initialization of the MiniTables, which costs CPU cycles at startup, and also allocates memory which is never freed. Per google3 rules this is not really a leak, since this memory is still reachable via static variables, but it is undesirable in many contexts. We could fix this part by introducing the equivalent of `google::protobuf::ShutdownProtobufLibrary()`). These downsides are fine for the bootstrapping process, but they are reason enough not to enable bootstrap mode in general for all protos. # Bootstrapping Always Uses OSS Protos To enable smooth syncing between Google3 and OSS, we always use an OSS version of the checked in generated code for `stage0`, even in google3. This requires that the google3 code can be switched to reference the OSS proto names using a preprocessor define. We introduce the `UPB_DESC(xyz)` macro for this, which will expand into either `proto2_xyz` or `google_protobuf_xyz`. Any libraries used in `stage0` must use `UPB_DESC(xyz)` rather than refer to the symbol names directly. PiperOrigin-RevId: 501458451	2 years ago
Joshua Haberman	143132fa27	Make upb's generated code agnostic to fasttable. This simplifies the code generation by making output agnostic to whether fasttables will be used or not. This grows the generated code in the common case, but when fasttables are not being used the preprocessor will strip away the unused tables. PiperOrigin-RevId: 499340805	2 years ago
Deanna Garcia	68d3ae7586	Fix cmake	2 years ago
Deanna Garcia	6f17e81048	Add pkg_files to cmake defs	2 years ago
Deanna Garcia	9880136636	Add bazel target for source distribution	2 years ago
Deanna Garcia	92dbe4b8bb	Add license file to pypi wheels. Addresses https://github.com/protocolbuffers/protobuf/issues/10936. This requires updating to the newest version of rules_python to use the new py_wheel API that includes a parameter for extra distinfo files PiperOrigin-RevId: 493060514	2 years ago
Mike Kruskal	4069649ecd	Switch to cmake fetch instead of git submodules	2 years ago
Mike Kruskal	3e078f5fe4	Add CMake+Bazel dependencies on utf8_range repo	2 years ago
Mike Kruskal	248ed86f2b	Add better handling for systems without python3 installed. The current behavior will crash any Bazel command immediately, due to our declared pip dependencies in WORKSPACE, if python3 can't be found. The new behavior will mock out these workspace dependencies and allow any non-python targets to run. Python targets will be skipped by wildcard expressions if there's no system python3, and will fail when run directly, due to compatibility mismatch. PiperOrigin-RevId: 492085254	2 years ago
Protobuf Team Bot	04363f7bae	Update workspace_deps.bzl protobuf main commit PiperOrigin-RevId: 479635174	2 years ago
Adam Cozzette	7189539610	Rename generated_file_staleness_test() to just staleness_test() This renaming is something we have been planning on doing, and I would like to do it now because I'm getting ready to rely on this staleness_test() macro from the main protobuf repo.	2 years ago
Joshua Haberman	125db89ff5	Added fuzz tests for mini table building and binary format parsing/serialization. PiperOrigin-RevId: 458240180	2 years ago
Joshua Haberman	6df5517d25	Consolidate upb visibility into a single visibility list. PiperOrigin-RevId: 450733238	3 years ago
Protobuf Team	ee6b1abb35	Create targets for UPB release PiperOrigin-RevId: 441496547	3 years ago
Joshua Haberman	7ff1662f97	Removed pre-generated CMake files from the main branch. From now on, these files will live in the "generated" branch only, and a GitHub action will regenerate these files whenever there is a commit to the main branch. PiperOrigin-RevId: 438879338	3 years ago
Joshua Haberman	a5243ff6d9	Restructure our file syncing so GitHub only files are tracked separately in Piper. PiperOrigin-RevId: 438395194	3 years ago
Joshua Haberman	56c59c10ed	Call protobuf_deps() ourselves instead of from upb_deps().	3 years ago
Joshua Haberman	11b6df0c46	Moved tests into the main source tree.	3 years ago
Joshua Haberman	3921e02990	Fixed make_cmakelists.py.	3 years ago
Joshua Haberman	7183780b60	Added a Valgrind test that works for Python!	3 years ago
Joshua Haberman	5d8c3db94f	Added copyright header and docs for python_headers().	3 years ago
Joshua Haberman	f098230df8	Exclude fuzz test from non-Clang compilers.	3 years ago
Joshua Haberman	fa4d70fad6	Restore CMake files, we're not ready to delete them yet.	3 years ago
Joshua Haberman	173554146f	Updated some docs and removed/rearranged some obsolete stuff.	3 years ago
Joshua Haberman	91d506ac32	Ported ABSL's wyhash to C.	3 years ago
Joshua Haberman	823eb09694	Update all 2011 dates to 2021.	4 years ago
Joshua Haberman	e59d2c8fa7	Added license headers to all files.	4 years ago
Joshua Haberman	e9b79542ad	Added a BUILD file for wyhash. This will make the build more closely resemble the google3 build. The CMake output from this is a bit busted, but the build does succeed.	4 years ago
Joshua Haberman	43c207ea7e	Added CMake dummy rule.	4 years ago
Joshua Haberman	7e5bd65098	Plumbed copts (including the crucial -std=c99) to upb_proto_library() aspect.	4 years ago
Joshua Haberman	8f3ee80d46	Drop C89/C90 support and MSVC prior to Visual Studio 2015. upb previously attempted to support C89 and pre-2015 versions of Visual Studio. This was to support older compilers with limited C99 support (particularly MSVC). But as of last August, even gRPC has dropped support for MSVC prior to 2015 `c87276d058` Therefore it seems safe for upb to no longer attempt C89 support (we were already not truly C89 compliant, with our use of "bool"). We now explicitly require C99 or greater and MSVC 2015 or greater. This cleaned up port_def.inc a fair bit. I took the chance to also remove some obsolete macros.	4 years ago
Joshua Haberman	a274ad786a	Plumbed copts (including the crucial -std=c99) to upb_proto_library() aspect.	4 years ago
Joshua Haberman	a345af9883	Added a codegen parameter for whether fasttables are generated or not. Example: $ CC=clang bazel build -c opt --copt=-g benchmarks:benchmark --//:fasttable_enabled=false INFO: Build option --//:fasttable_enabled has changed, discarding analysis cache. INFO: Analyzed target //benchmarks:benchmark (0 packages loaded, 913 targets configured). INFO: Found 1 target... Target //benchmarks:benchmark up-to-date: bazel-bin/benchmarks/benchmark INFO: Elapsed time: 0.760s, Critical Path: 0.58s INFO: 7 processes: 1 internal, 6 linux-sandbox. INFO: Build completed successfully, 7 total actions $ bazel-bin/benchmarks/benchmark --benchmark_filter=BM_Parse_Upb ------------------------------------------------------------------------------ Benchmark Time CPU Iterations ------------------------------------------------------------------------------ BM_Parse_Upb_FileDesc_WithArena 10985 ns 10984 ns 63567 651.857MB/s BM_Parse_Upb_FileDesc_WithInitialBlock 10556 ns 10554 ns 66138 678.458MB/s $ CC=clang bazel build -c opt --copt=-g benchmarks:benchmark --//:fasttable_enabled=true INFO: Build option --//:fasttable_enabled has changed, discarding analysis cache. INFO: Analyzed target //benchmarks:benchmark (0 packages loaded, 913 targets configured). INFO: Found 1 target... Target //benchmarks:benchmark up-to-date: bazel-bin/benchmarks/benchmark INFO: Elapsed time: 0.744s, Critical Path: 0.58s INFO: 7 processes: 1 internal, 6 linux-sandbox. INFO: Build completed successfully, 7 total actions $ bazel-bin/benchmarks/benchmark --benchmark_filter=BM_Parse_Upb ------------------------------------------------------------------------------ Benchmark Time CPU Iterations ------------------------------------------------------------------------------ BM_Parse_Upb_FileDesc_WithArena 3284 ns 3284 ns 213495 2.1293GB/s BM_Parse_Upb_FileDesc_WithInitialBlock 2882 ns 2882 ns 243069 2.4262GB/s Biggest unknown is whether this parameter should default to true or false.	4 years ago
Joshua Haberman	e3f41de6c7	Split monolithic BUILD file into many build files.	4 years ago
Joshua Haberman	bdd1a516e8	Fixed other tests.	4 years ago
Joshua Haberman	543a0ce8f2	Fixes for PHP. (#286 ) - A new PHP-specific upb amalgamation. It contains everything related to upb_msg, but leaves out all of the old handlers-related interfaces and encoders/decoders. # Schema/Defs Changes - Changed `upb_fielddef_msgsubdef()` and `upb_fielddef_enumsubdef()` to return `NULL` instead of assert-failing if the field is not a message or enum. - Added `upb_msgdef_iswrapper()`, to test whether this is a wrapper well-known type. # Decoder - Decoder bugfix: when we parse a submessage inside a oneof, we need to clear out any previous data, so we don't misinterpret it as a pointer to an existing submessage. # JSON Decoder - Allowed well-known types at the top level to have their special processing. - Fixed a bug that could occur when parsing nested empty lists/objects, eg `[[]]`. - Made the "ignore unknown" option also be permissive about unknown enumerators by setting them to 0. # JSON Encoder - Allowed well-known types at the top level to have their special processing. - Removed all spaces after `:` and `,` characters, to match the old encoder and pass goldenfile tests. # Message / Reflection - Changed `upb_msg_hasoneof()` -> `upb_msg_whichoneof()`. The new function returns the `upb_fielddef*` of whichever oneof is set. - Implemented `upb_msg_clearfield()` and added/implemented `upb_msg_clear()`. - Added `upb_msg_discardunknown()`. Part of me thinks this should go in a util library instead of core reflection since it is a recursive algorithm. # Compiler - Always emit descriptors as an array instead of as a string, to avoid exceeding maximum string lengths. If this becomes a speed issue later we can go back to two separate paths.	5 years ago
Joshua Haberman	16facab490	Created an amalgamation without handlers, and fixed some bugs. (#283 ) * Created amalgamation with upb_msg but no handlers. * Bugfix for upb_array_resize(). * Renamed "lite" amalgamation to "core", to avoid confusion. Traditionally "lite" has meant "without reflection", but here we mean it as "without handlers-based code." * Build fixes from CI tests. * Removed some more C++-style comments. * Fix for out-of-order statements.	5 years ago
Joshua Haberman	4c57b1fefd	More progress on Lua extension.	5 years ago
Joshua Haberman	626ec4bfcf	Everything builds, test pass except test_decoder.	5 years ago
Joshua Haberman	493e9b2614	Build fixes from fuzz target.	6 years ago
Joshua Haberman	f74cb51f11	Refactored workspace deps into a separate file.	6 years ago

45 Commits (db88465fb5460b379bfa21eb6fbd26f090141bc4)