Chiebot-Mirror/grpc - grpc - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Craig Tiller	86b931c354	[work-serializer] Dispatch on run experiment (relanding) (#34372 ) Reverts grpc/grpc#34371	1 year ago
Craig Tiller	d589caa679	Revert "[work-serializer] Dispatch on run experiment" (#34371 ) Reverts grpc/grpc#34274 (needs some changes internally)	1 year ago
Craig Tiller	1705470950	[work-serializer] Dispatch on run experiment (#34274 ) Co-authored-by: ctiller <ctiller@users.noreply.github.com> Co-authored-by: Mark D. Roth <roth@google.com>	1 year ago
nanahpang	a4ac80c394	Revert "[Security] Move ownership of tsi_ssl_client_handshaker_factory to grpc_ssl_credentials." (#34355 ) Reverts grpc/grpc#34180	1 year ago
Gregory Cooke	36dc5e7391	[Security] Move ownership of tsi_ssl_client_handshaker_factory to grpc_ssl_credentials. (#34180 ) Move the SSL_CTX to the level of the credentials rather than the subchannel. The SSL_CTX should only get created once per credential rather than once per subchannel. We should observe no behavior change with this PR, only efficiency gains.	1 year ago
Gregory Cooke	8d62fc2b0b	[Test] Add concurrent test for session reuse (#34293 ) Add a test that runs concurrent requests using session caching.	1 year ago
Mark D. Roth	1986007e1e	[round_robin] 4th attempt: delegate to pick_first as per dualstack design (#34337 ) Most recent attempt was #34320, reverted in #34335. The first commit here is a pure revert. The second commit fixes the outlier_detection unit test to pass both with and without the experiment.	1 year ago
Eugene Ostroukhov	77f80f3de5	[ssa test] Test TTL attribute on cookie (#34326 )	1 year ago
Mark D. Roth	6534f0a6bf	Revert "[round_robin] third attempt: delegate to pick_first as per dualstack design" (#34335 ) Reverts grpc/grpc#34320	1 year ago
Eugene Ostroukhov	59bab7f27f	[ssa test] Add test for per-route SSA configuration (#34313 )	1 year ago
Mark D. Roth	d713427cec	[round_robin] third attempt: delegate to pick_first as per dualstack design (#34320 ) Previous attempt was #34241, reverted in #34317. The second commit here makes the experiment disablable, so that we can roll it out slowly internally.	1 year ago
Craig Tiller	e6bf7c12cf	Revert "[round_robin] delegate to pick_first as per dualstack design" (#34317 ) Reverts grpc/grpc#34241	1 year ago
Mark D. Roth	97571ebf81	[round_robin] delegate to pick_first as per dualstack design (#34241 ) Rolls forward the remaining changes from #32692, which were rolled back in #33718.	1 year ago
Eugene Ostroukhov	3d1f242abe	[Session Affinity] Update validation and add a test case (#34277 )	1 year ago
Mark D. Roth	a315171880	[ring_hash] delegate to pick_first as per dualstack design (#34244 ) Rolls forward the changes from #33093 and some from #33568, which were rolled back in #33718.	1 year ago
Mark D. Roth	6dea42c874	[XdsClient] replace e2e test with unit test (#34258 ) This should address one of the failures we're seeing in #34224. The test failure is caused by the changes in timing triggering a race condition. In the code at head, we delay sending out the subscription for the first CDS watch until we've already seen the other two CDS watches, because the previous send_message op has not yet completed, and by the time it does, we've seen all 3 watches, so we can send a subscription for all 3 at the same time. With the WorkSerializer change, the send_message op is complete by the time we see the first CDS watch, so we subscribe to only that resource, and then later add the other two. The result is that we'll NACK twice with two different messages, the first one including only the error about the first resource, and the second one including all three. I suspect this same race condition would have been triggered eventually by the EventEngine migration anyway; the current test basically depends on the single-thread timing of the iomgr approach. So I'm addressing it by replacing the e2e test with a unit test that covers the same cases without the timing issue.	1 year ago
Mark D. Roth	b7e680ad46	[health checking] move to generic health watch for dualstack design (#34222 ) Rolls forward part of the dualstack changes, mostly from #33427 and a little bit from #32692, both of which were reverted in #33718. Specifically: - For petiole policies, unconditionally start health watch on subchannels, even if client side health checking is not enabled; in this case, the health watch will report the subchannel's raw connectivity state. - Fix edge cases in health check reporting that occur when a watcher is started before the initial state is reported. - When client-side health checking fails, add the subchannel's address to the RPC failure status message. - Outlier detection now works only via the health checking watch, not via the raw connectivity state watch. - Remove now-unnecessary hack to ensure that outlier detection does not work for pick_first.	1 year ago
Mark D. Roth	1c54662866	[xDS] improve RPC failure status message when aggregate cluster graph has no leaf clusters (#34201 ) Old message: new_cluster_1: UNAVAILABLE: errors validating xds_cluster_resolver LB policy config: [field:discoveryMechanisms error:must be non-empty] New message: new_cluster_1: FAILED_PRECONDITION: aggregate cluster graph has no leaf clusters	1 year ago
Mark D. Roth	b980f62ca6	[pick_first] adjust threshold on e2e test to address flake (#34157 )	1 year ago
jrandolf	3489b6304e	[OpenSSL] Support for OpenSSL 3 (#31256 ) Update from gtcooke94: This PR adds support to build gRPC and it's tests with OpenSSL3. There were some hiccups with tests as the tests with openssl haven't been built or exercised in a few months, so they needed some work to fix. Right now I expect all test files to pass except the following: - h2_ssl_cert_test - ssl_transport_security_utils_test I confirmed locally that these tests fail with OpenSSL 1.1.1 as well, thus we are at least not introducing regressions. Thus, I've added compiler directives around these tests so they only build when using BoringSSL. --------- Co-authored-by: Gregory Cooke <gregorycooke@google.com> Co-authored-by: Esun Kim <veblush@google.com>	1 year ago
Mark D. Roth	72e791402f	[pick_first] fix test flake (#34098 ) CNR the flake, but I've changed the test (which is very old) to use some of our more modern helper functions that have saner timeouts. Also re-add a `return` statement that was accidentally removed in #33753, which I noticed while working on this. Its absence doesn't cause a real problem, but it does cause us to needlessly trigger a duplicate connection attempt or report a duplicate CONNECTING update in some cases.	1 year ago
Mohan Li	ab024624da	[pick_first] de-experiment pick first (#34054 ) De-experiment pick first since we have both affinity and randomness E2E test running successfully. --------- Co-authored-by: Yash Tibrewal <yashkt@google.com>	1 year ago
Mark D. Roth	64a318acd4	[pick_first] fix sticky-TF and handling of subchannels in TRANSIENT_FAILURE (#33753 ) Fix sticky-TF behavior such that once we enter TRANSIENT_FAILURE, we do not leave that state if we get a new address list. Also, fix handling of subchannels in state TRANSIENT_FAILURE. Previously, if a subchannel was already in state TRANSIENT_FAILURE when we wanted to start a connection attempt on it (e.g., because the subchannel already existed from a different channel, or because it already existed in the previous subchannel list), we would wait for it to report IDLE before attempting to connect. This PR changes pick_first to instead immediately skip the subchannel and move on to the next one. Now, the only time we wait for a subchannel in TRANSIENT_FAILURE is when we wrap back around to the first subchannel in the list.	1 year ago
Yousuk Seung	4acb7d38b9	[xds] Apply the slowdown factor only once to LRS load reporting period (#34042 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Luwei Ge	a5f1121982	[xDS] Remove filter name from GenerateServiceConfig (#33915 ) We decided to not populate `policy_name` with the HTTP filter name in xDS case. So removing it from `GenerateServiceConfig`. This will be consistent across languages. The gRFC [PR](https://github.com/grpc/proposal/pull/346) has been updated.	1 year ago
Vignesh Babu	0616c8b838	[xds] Regex fix in test (#33981 )	1 year ago
Craig Tiller	91e7f223d3	[server] Remove `Notification` from shutdown path (#33953 ) I'm fairly certain that this path should be non-blocking (and making it so makes the promise based code far more tractable). This moves the blocking behavior into the blocking server_cc.cc function that calls `grpc_server_shutdown_and_notify` instead of in that non-blocking function. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Vignesh Babu	67f4e4e4c2	[resource quota] Reduce stress test size to prevent OOMs (#33776 )	1 year ago
Mark D. Roth	083bbee480	[LB policies] revert changes for dualstack design (#33718 ) This reverts the following PRs: #32692 #33087 #33093 #33427 #33568 These changes seem to have introduced some flaky crashes. Reverting while I investigate.	1 year ago
Mark D. Roth	ec39600872	[WRR] fix bugs that caused us to re-enter blackout period upon updates (#33694 ) As per gRFC A58, when WRR sees a subchannel report READY, it reset the non_empty_since value, thus restarting the blackout period. However, there were two cases where we were incorrectly triggering this code: 1. When WRR got an updated address list that contained addresses that were already present on the old list and whose subchannels were already in READY state, the initial notification for those subchannels on the new list was READY, which incorrectly triggered resetting the non_empty_since value. 2. Due to a bug in the outlier_detection policy, whenever an update was propagated down through the OD policy without actually enabling OD, it would incorrectly send a duplicate connectivity state notification for the subchannels. This meant that a subchannel that was already in state READY would report READY again, which would also incorrectly trigger resetting the non_empty_since value. This PR makes two changes: 1. Fix the bug in outlier_detection that caused it to generate the spurious duplicate READY updates. 2. Fix WRR to reset the non_empty_since value when a subchannel goes READY only if the subchannel has seen a previous state update and only if that previous state was not READY. (The duplicate READY notifications should not actually happen anymore now that the OD policy has been fixed, but better to be defensive.) Fixes b/290983884.	1 year ago
Eugene Ostroukhov	e0bc8a2c85	[xDS LB] xDS pick first support (#33540 )	1 year ago
Mark D. Roth	51e54ed636	[outlier detection] remove support for ejection via raw connectivity state (#33427 ) More work on the dualstack backend design: - Now that all petiole policies have been changed to delegate to pick_first, outlier detection no longer needs to eject via the subchannel's raw connectivity state; it can now eject only via the health state. See #33340. - This also removes the now-unnecessary hack to explicitly disable outlier detection in pick_first. See #33336.	1 year ago
Mark D. Roth	f09357ccb4	[ring hash] delegate to pick_first instead of creating subchannels directly (#33093 ) More work on the dualstack backend design: - Change ring_hash policy to delegate to pick_first instead of creating subchannels directly. - Note that, as mentioned in the WIP gRFC, because we lazily create the pick_first child policies, so there's no need to swap over to a new list as an atomic whole. As a result, we don't use the endpoint_list library in this policy; instead, we just update a map in-place. - Remove now-unused subchannel_list library.	1 year ago
Mark D. Roth	27a778fece	[round robin] delegate to pick_first instead of creating subchannels directly (#32692 ) More work on the dualstack backend design: - Change round_robin to delegate to pick_first instead of creating subchannels directly. - Change pick_first such that when it is the child of a petiole policy, it will unconditionally start a health watch. - Change the client-side health checking code such that if client-side health checking is not enabled, it will return the subchannel's raw connectivity state. - As part of this, we introduce a new endpoint_list library to be used by petiole policies, which is intended to replace the existing subchannel_list library. The only policy that will still directly interact with subchannels is pick_first, so the relevant parts of the subchannel_list functionality have been copied directly into that policy. The subchannel_list library will be removed after all petiole policies are updated to delegate to pick_first.	1 year ago
Mark D. Roth	8427bacaea	[resolver API] remove address attribute interface (#33514 ) The address attribute interface was intended to provide a mechanism to pass attributes separately from channel args, for values that do not affect subchannel behavior and therefore do not need to be present in the subchannel key, which does include channel args. However, the mechanism as currently designed is fairly clunky and is probably not the direction we will want to go in the long term. Eventually, we will want some mechanism for registering channel args, which would provide a cleaner way to indicate that a given channel arg should not be used in the subchannel key, so that we don't need a completely different mechanism. For now, this PR is just doing an interim step, which is to establish a special channel arg key prefix to indicate that an arg is not needed in the subchannel key.	1 year ago
Vignesh Babu	cd4ff81b3f	Revert "Revert "[resource quota] Fix bugs in iomgr and event engine endpoint interactions with resource quota"" (#33499 ) Reverts grpc/grpc#33417 Deadlock https://fusion2.corp.google.com/invocations/99834386-79ff-4707-86eb-52e604774ea9/details fixed in the `c9a1bdc3dc` commit.	1 year ago
Eugene Ostroukhov	6451beba8e	Revert "Revert "Revert "Revert "[xDS LB] Override cluster with value … (#33424 ) Previous attempt: #33416 This reverts commit `19460ea82f`.	1 year ago
Mark D. Roth	f34a39af74	[xDS e2e tests] add 1K RPCs to try to work around statistical problem (#33409 )	1 year ago
Craig Tiller	80dbe90c18	Revert "[resource quota] Fix bugs in iomgr and event engine endpoint interactions with resource quota" (#33417 ) Reverts grpc/grpc#33375 Breaks import	2 years ago
Craig Tiller	19460ea82f	Revert "Revert "Revert "[xDS LB] Override cluster with value from cookie"" (#33379 )" (#33416 ) Reverts grpc/grpc#33388 Breaks import	2 years ago
Vignesh Babu	e6c1b13aed	[resource quota] Fix bugs in iomgr and event engine endpoint interactions with resource quota (#33375 ) The following bugs are fixed: * Missing ExecCtx in event engine endpoints and listeners * Ref counting issue with iomgr endpoint which causes crashes in overloaded situations The PR includes a test which triggers these bugs by simulating an overloaded system.	2 years ago
Eugene Ostroukhov	57bb6fb65c	Revert "Revert "[xDS LB] Override cluster with value from cookie"" (#33388 ) Reapplying #32973	2 years ago
Eugene Ostroukhov	6f685274a1	Revert "[xDS LB] Override cluster with value from cookie (#32973 )" (#33379 ) This reverts #32973 that causes breakages in internal CI.	2 years ago
Eugene Ostroukhov	8fe3472e53	[xDS LB] Override cluster with value from cookie (#32973 )	2 years ago
Yousuk Seung	c03cd744b2	[WRR] Prefer application_utilization to cpu_utilization (#33355 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Mark D. Roth	6b4a1e4243	[outlier detection] hack to prevent OD from working with pick_first (#33336 ) As per discussion in #32967.	2 years ago
apolcyn	889412c416	[Rls] de-experimentalize RLS in XDS (#33290 ) Integration tests are passing, so we should be ready to de-experimentalize. Related: internal bug b/265209578	2 years ago
Luwei Ge	d1c0dc58cc	[Audit Logging] xDS e2e test for audit logging. (#33252 ) Added tests involve: 1. Checking the # of logger invocations with multiple RBACs in the chain. 2. Verifying content in audit context with action and audit condition permutations. 3. Confirm custom logger and built-in logger configurations are working. 4. Confirm the feature is protected by the environment variable. --------- Co-authored-by: rockspore <rockspore@users.noreply.github.com>	2 years ago
Mark D. Roth	52d687ad42	[xDS] second attempt: clean up cert provider factory and registry APIs (#33249 ) Original was #33226, reverted in #33248.	2 years ago
Craig Tiller	9faa39d88b	Revert "[xDS] clean up cert provider factory and registry APIs" (#33248 ) Reverts grpc/grpc#33226 (looks to be creating some import problems)	2 years ago

1 2 3 4 5 ...

2388 Commits (b038da5072ad36b2cd244209fee2dc6f4ed94fb8)