Chiebot-Mirror/grpc - grpc - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Mark D. Roth	e022a3dfa9	xDS fault injection e2e test: fix flakes caused by processing queued calls in parallel (#32429 ) The `XdsFaultInjectionMaxFault` test has seen a few flakes since #32326 was merged. I believe the flakiness is caused by the fact that when a large number of RPCs are queued up before the resolver result comes in, those RPCs are now re-processed in parallel instead of sequentially, which can cause us to delay more RPCs than we should due to the `max_faults` setting. To fix this, we change the test to ensure that the channel is connected (i.e., the resolver result has already been returned) before we start sending a large number of concurrent RPCs. Although this is the only test that I've seen flakes in, I've made this same change consistently to all fault injection tests that are creating a large number of concurrent RPCs, since the same flake could affect any of them.	2 years ago
Paulo Castello da Costa	758f1c2b11	Exclude php7 from OSS benchmarks CI. (#32420 ) PHP7 build is failing, removing from CI while investigating the failure. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Yash Tibrewal	04e3a8e73d	GCP Observability : Framework for detecting the environment (#32294 ) This code is not plumbed through yet, but it provides the core infrastructure needed to detect the proper GCP environment resources needed to set up the labels/attributes/resources for stats, tracing and logging. Details on how the various environment resources are setup has been derived by looking at java's cloud logging library and OpenTelemetry's future plans. (Could be better explained in an offline review since some links are internal). Requesting @veblush for a full review and @markdroth for a structural review. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Mark D. Roth	3013c7e9b3	client channel: refactor call objects in preparation for promise conversion (#32223 ) This is a prerequisite for converting the client_channel filter to promises. This refactors two objects: - `ClientChannel::CallData`, which is primarily responsible for applying the service config to the call - `ClientChannel::LoadBalancedCall`, which is responsible for doing the LB pick for the call attempt Each of those classes has been split into two pieces: - a base class with the functionality to be shared between the legacy filter stack implementation and the new promise-based implementation - a subclass providing the legacy filter stack implementation A subsequent PR will add another subclass that provides the promise-based implementation.	2 years ago
ericsalo	d98edb20ab	grpc: replace has_ methods for upb map fields with _size methods (#32410 ) The upb team wants to remove this particular bit of syntactic sugar from the generated code. So instead of calling has_foo() when foo is a map field, we call foo_size() and test the result against zero. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Craig Tiller	8d5559d713	Revert "[EventEngine] WindowsEventEngine Endpoint and Socket fixes" (#32419 ) Reverts grpc/grpc#32385 Co-authored-by: Yijie Ma <yijiem.main@gmail.com>	2 years ago
Mark D. Roth	8249fc10a9	Second attempt: client channel: don't hold mutexes while calling the ConfigSelector or the LB picker (#32326 ) Original attempt was #31973, reverted in #32324 due to test flakiness. There were two problems causing test flakiness here. The first problem was that, upon resolver error, we were dispatching an async callback to re-process each of the queued picks before we updated the channel's connectivity state, which meant that the queued picks might be re-processed in another thread before the new connectivity state was set, so tests that expected the state to be TRANSIENT_FAILURE once RPCs failed might not see the expected state. The second problem affected the xDS ring hash tests, and it's a bit more involved to explain. We have an e2e test that simulates an aggregate cluster failover from a primary cluster using ring_hash at startup. The primary cluster has two addresses, both of which are unreachable when the client starts up, so the client should immediately fail over to the secondary cluster, which does have reachable endpoints. The test requires that no RPCs are failed while this failover occurs. The original PR made this test flaky. The problem here was caused by a combination of two factors: 1. Prior to the original PR, when the picker was updated (which happens inside the WorkSerializer), we re-processed previously queued picks synchronously, so it was not possible for another subchannel connectivity state update (which also happens in the WorkSerializer) to be processed between the time that we updated the picker and the time that we re-processed the previously queued picks. The original PR changed this such that the queued picks are re-processed asynchronously (outside of the WorkSerializer), so it is now possible for a subchannel connectivity state update to be processed between when the picker is updated and when we re-process the previously queued picks. 2. Unlike most LB policies, where the picker does not see updated subchannel connectivity states until a new picker is created, the ring_hash picker gets the subchannel connectivity states from the LB policy via a lock, so it can wind up seeing the new states before it gets updated. This means that when a subchannel connectivity state update is processed by the ring_hash policy in the WorkSerializer, it will immediately be seen by the existing picker, even without a picker update. With those two points in mind, the sequence of events in the failing test were as follows: 1. The pick is attempted in the ring_hash picker for the primary cluster. This causes the first subchannel to attempt to connect. 2. The subchannel transitions from IDLE to CONNECTING. A new picker is returned due to the subchannel connectivity state change, and the channel retries the queued pick. The retried pick is done asynchronously, but in this case it does not matter: the call will be re-queued. 3. The connection attempt fails, and the subchannel reports TRANSIENT_FAILURE. A new picker is again returned, and the channel retries the queued pick. The retried pick is done asynchronously, but in this case it does not matter: this causes the picker to trigger a connection attempt for the second subchannel. 4. The second subchannel transitions from IDLE to CONNECTING. A new picker is again returned, and the channel retries the queued pick. The retried pick is done asynchronously, and in this case it does matter. 5. The second subchannel now transitions to TRANSIENT_FAILURE. The ring_hash policy will now report TRANSIENT_FAILURE, but before it can finish that... 6. ...In another thread, the channel now tries to re-process the queued pick using the CONNECTING picker from step 4. However, because the ring_hash policy has already seen the TRANSIENT_FAILURE report from the second subchannel, that picker will now fail the pick instead of queuing it. After discussion with @ejona86 and @dfawley (since this bug actually exists in Java and Go as well), we agreed that the right solution is to change the ring_hash picker to contain its own copy of the subchannel connectivity state information, rather than sharing that information with the LB policy using synchronization.	2 years ago
Mark D. Roth	6589340efc	Bump core version 202302161703 (#32416 )	2 years ago
Craig Tiller	d49e151306	[backoff] Add random early detection classifier (#32354 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. --> --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
Mark D. Roth	7fab06b923	Revert "filter stack: pass peer name up via recv_initial_metadata batch" (#32415 ) Reverts grpc/grpc#31933	2 years ago
AJ Heller	1c5db3404b	[codegen] Escape '$' delimiter in proto comments (#32411 ) This applies to all wrapped languages. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
gankineri	bab3e0ff42	Remove unused references to absl::Status (aka grpc_error_handle) and absl::StatusOr<> (#32233 ) Removing a number of unused variables. This has no behaviour change. These types are not considered "unused variables" by normal `-Wunused-variable` flags because they have nontrivial destructors, but these types' destructors are not used for their side effects, so unused variables of these types should be considered bug-prone. This PR removes all unused `absl::Status` and `absl::StatusOr<>` variables I could find in grpc.	2 years ago
sanjaypujare	7139bb1d2e	xds-interop-testing: move TESTING_VERSION check to after calling kokoro_setup_test_driver func and fix script_dir (#32405 )	2 years ago
Mark D. Roth	3a94e50e78	filter stack: pass peer name up via recv_initial_metadata batch (#31933 ) Currently, the peer name is returned with the completion of the send_initial_metadata op, which does not make sense, because with retries, we don't actually know the peer name until we complete the recv_initial_metadata op. This PR changes our code to return the peer string as an attribute of the recv_initial_metadata op, so that it is not available to the application until that point. This change may be user-visible, but since our API docs don't seem to guarantee exactly when this data will be available, it's not technically a breaking change. Note that in the promise-based stack, we were already assuming that the peer string would be returned as part of the recv_initial_metadata batch, so this PR helps reduce risk for the promise conversion by making this semantic change now, thus decoupling it from the promise conversion. I have also changed the representation of the string in the metadata batch to be a `grpc_core::Slice` instead of a `std::string`, so that we can just take a ref to the string held in the transport instead of having to copy the whole string for every call.	2 years ago
Sergii Tkachenko	01d1f30571	PSM interop: update python kubernetes client from 12.0.1 to 25.3.0 (#32372 )	2 years ago
Sergii Tkachenko	78cbe0ff1c	doc: Document when xds v2 support removed in grpc_xds_features.md (#32406 )	2 years ago
AJ Heller	a879544a65	[EventEngine] WindowsEventEngine Endpoint and Socket fixes (#32385 ) A handful of problems were identified while writing the WindowsEventEngine Listener. To make the listener review easier, these fixes can be landed separately. This is built upon https://github.com/grpc/grpc/pull/32376 Problems that are fixed in this PR: * `OnConnectCompleted` held a Mutex while calling the user callback, which can deadlock. * The WinSocket and some associated data needs to remain alive after the Endpoint destroyed, since Windows IOCP still needs to use some of that data. Endpoint destruction and socket shutdown are now decoupled, with the socket managed by a shared_ptr. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. --> --------- Co-authored-by: drfloob <drfloob@users.noreply.github.com>	2 years ago
Matthew Stevenson	49b5dfc14f	Add missing dependency for tsi_alts_credentials. (#32340 ) While creating an internal CL that depends directly on tsi_alts_credentials, I was getting linker errors saying ` error: backward reference detected: grpc_channel_credentials_release`, because `alts_tsi_handshaker.cc` uses the `grpc_channel_credentials_release` API, which is defined in the `grpc_security_base` target.	2 years ago
AJ Heller	ffe3968d0b	[EventEngine] Add advice against blocking work in callbacks (#32397 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Mark D. Roth	77d2475074	Upmerge previous release 202302151940 (#32407 ) Co-authored-by: Jan Tattermusch <jtattermusch@google.com> Co-authored-by: Sergii Tkachenko <sergiitk@google.com> Co-authored-by: apolcyn <apolcyn@google.com> Co-authored-by: Daniel Azuma <dazuma@gmail.com> Co-authored-by: Yash Tibrewal <yashkt@google.com> Co-authored-by: Craig Tiller <ctiller@google.com> Co-authored-by: Esun Kim <veblush@google.com>	2 years ago
Yijie Ma	b1619f1dc8	Revert "Revert "[EventEngine] RunAfter migration: grpc_chttp2_transport"" (#32341 ) Reverts grpc/grpc#32339 Rolling this forward after https://github.com/grpc/grpc/pull/32350.	2 years ago
AJ Heller	007c4073c8	[documentation] Fix documentation build script by mocking python dependencies (#32398 ) cc @markdroth <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Ivo List	653fb79676	Remove returning struct in cc_grpc_library (#32360 ) Struct providers were deprecated https://github.com/bazelbuild/bazel/issues/7347 release notes: no	2 years ago
Craig Tiller	241e8ed417	Revert "[promises] Rollforward: Finish of server side calls (#32347 )" (#32394 ) There were some rollback conflicts, so this isn't a pure rollback. This reverts commit `ba0e55f539`. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
apolcyn	4777db3003	[ruby testing]: experimental change to grpc_class_init_test (#32337 ) Let's see if this fixes the "Bus error" flakes that have been happening in CI. If it does, then we can narrow things down a bit. If flakes continue, then we can revert this PR.	2 years ago
AJ Heller	dd07fd8669	[EventEngine] Add more granular trace flags (#32376 ) The set of trace flags is now: * event_engine * event_engine_endpoint * event_engine_endpoint_data: additionally log all sent/received data, similar to what the shims do. * event_engine_poller <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
sanjaypujare	9dd4b7757d	xds-interop-tests: add cross branch testing to the script (#31569 ) Co-authored-by: Sergii Tkachenko <hi@sergii.org>	2 years ago
Sergii Tkachenko	811510145c	PSM interop: bump minor pip dependencies (#32371 )	2 years ago
Hannah Shi	2ac1b1708d	[ObjC] run cpp ios cronet test with bazel (#31808 ) Cleanup and remove ios cpp test cronet To test manually: ./tools/bazel test //src/objective-c/tests:CppCronetTests @sampajano <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
AJ Heller	cfb05a9945	[EventEngine] Fix the shims for iOS (#32363 ) The shims assumed that all platforms with Posix socket support would use the PosixEventEngine. This is not the case for iOS. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Mark D. Roth	d6bb5157a4	xds_wrr_locality: add trace log for generated child config (#32351 )	2 years ago
Esun Kim	faa80eaa10	Updated clang 15 images (#32317 ) This is to get the latest version of clang 15 (15.0.7) for our docker images based on that. By doing so, I had to address this new git security enforcement so I added a new file to tame it. In a nutshell, this PR is about polishing docker images based on clang 15.	2 years ago
AJ Heller	054f3c62e5	[EventEngine] Elaborate on GetDefaultEventEngine intended usage (#32358 ) Based on a discussion with @yashykt , I outlined the intended use of `GetDefaultEventEngine`, with a bit of explanation around why we chose to make it work the way it does. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Esun Kim	ea4204bb51	Update boringssl to grpc-202302 (#32353 ) This is to address [CVE-2023-0286](https://www.trellix.com/en-us/about/newsroom/stories/research/cve-2023-0286-the-openssl-who-cried-severity-high.html)	2 years ago
Craig Tiller	0ecc18ef0f	[promises] Party: an activity with many participant promises (#32308 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. --> --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
Yash Tibrewal	b58b5cf3a8	OpenCensus server filter: Convert to promises (#32318 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Yash Tibrewal	f2d5c47ff3	xDS Interop: Update tracers (#32352 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Yijie Ma	82f7405595	Disable testKeepaliveWithV2API test case for objc interop test (#32350 ) This test has a race and is flaky, see [b/268379869](https://b.corp.google.com/issues/268379869). It sends an RPC to the interop server with 0s `keepaliveTimeout` and expects the keepalive watchdog timer to fire immediately before the server acks the ping. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Mark D. Roth	625b3f8385	resolver registry: require URI schemes to be lower-case (#32348 ) As per https://www.rfc-editor.org/rfc/rfc3986#section-3.1.	2 years ago
Craig Tiller	ba0e55f539	[promises] Rollforward: Finish of server side calls (#32347 ) Rollforward #32346 with some fixes in `1e88193edd` <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Lucas Abel	d5fcbce4b4	grpcio: fix `AioRpcError` self recursion (#32305 ) See https://github.com/google/mobly/pull/870#issuecomment-1419629154 for more details.	2 years ago
Esun Kim	b1f1fa1e64	Use WaitWithTimeout for gpr_cv_wait (#32274 ) This is a prerequisite for upcoming Abseil change using a monotonic clock for `WaitWithTimeout`. This change allows gRPC gpr_cv_wait to use `WaitWithTimeout` when it's given monotonic clock or timespan. Caveat: This won't change the actual gRPC behavior until Abseil gets new change regarding this.	2 years ago
AJ Heller	290af3a3e5	[codehealth] Teach core_banned_functions.py to check headers as well (#32342 ) Fixes the banned function checker to include header files. Some things had slipped through, such as `absl::make_unique` `033d55ffd3/tools/run_tests/sanity/core_banned_functions.py (L64-L65)` <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. --> --------- Co-authored-by: drfloob <drfloob@users.noreply.github.com>	2 years ago
AJ Heller	f5334db300	Add gRPC-core v1.52.0 to the interop matrix (#32338 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Craig Tiller	d5685d34dc	Revert "[promises] Finishing off the server stack" (#32346 ) Reverts grpc/grpc#32158	2 years ago
Craig Tiller	98caaaefbd	[promises] Finishing off the server stack (#32158 ) To be merged after #31448 #32110 #32094 <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. --> --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
Craig Tiller	033d55ffd3	[arena] Fix ABA problem in pooled allocation (#32336 ) The pooled allocator currently has an ABA issue in the allocation path. This change should fix that - algorithm is described reasonably well in the PR. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Arvind Bright	9862cfdf05	remove go version v1.52.0 and keep only v1.52.3 (#32252 ) As a reactive from #32205, we need to remove the 52.0 release and only keep v1.52.3 which is the latest patch version for that minor release	2 years ago
Yijie Ma	af83803811	Revert "[EventEngine] RunAfter migration: grpc_chttp2_transport" (#32339 ) Reverts grpc/grpc#32240 Breaks `grpc/core/master/macos/grpc_objc_bazel_test`, sample run: https://source.cloud.google.com/results/invocations/04e5a95b-5f06-4b3e-95fe-5e1895068475/targets Seems like the `testKeepaliveWithV2API` test case failed: ``` Test Case '-[InteropTestsLocalCleartext testKeepaliveWithV2API]' started. <unknown>:0: error: -[InteropTestsLocalCleartext testKeepaliveWithV2API] : Asynchronous wait failed: Exceeded timeout of 5 seconds, with unfulfilled expectations: "Keepalive". Test Case '-[InteropTestsLocalCleartext testKeepaliveWithV2API]' failed (5.547 seconds). ...... Test Case '-[InteropTestsLocalSSL testKeepaliveWithV2API]' started. <unknown>:0: error: -[InteropTestsLocalSSL testKeepaliveWithV2API] : Asynchronous wait failed: Exceeded timeout of 5 seconds, with unfulfilled expectations: "Keepalive". Test Case '-[InteropTestsLocalSSL testKeepaliveWithV2API]' failed (5.011 seconds). ```	2 years ago
AJ Heller	51f276e7be	[EventEngine] ThreadPool: manage fork and shutdown bits separately (#32329 ) The previous implementation assumed that shutdown and fork events could not occur at the same time, but that's not the case. This change adds separate tracking for fork and shutdown bits. cc @gnossen	2 years ago

1 2 3 4 5 ...

52628 Commits (e022a3dfa93bdcd8af587560d6db76d8cacd6176) All Branches Search

52628 Commits (e022a3dfa93bdcd8af587560d6db76d8cacd6176)

All Branches