Chiebot-Mirror/grpc

Commit Graph

Author	SHA1	Message	Date
Mark D. Roth	b980f62ca6	[pick_first] adjust threshold on e2e test to address flake (#34157 )	1 year ago
jrandolf	3489b6304e	[OpenSSL] Support for OpenSSL 3 (#31256 ) Update from gtcooke94: This PR adds support to build gRPC and its tests with OpenSSL3. There were some hiccups with tests as the tests with openssl haven't been built or exercised in a few months, so they needed some work to fix. Right now I expect all test files to pass except the following: - h2_ssl_cert_test - ssl_transport_security_utils_test I confirmed locally that these tests fail with OpenSSL 1.1.1 as well, thus we are at least not introducing regressions. Thus, I've added compiler directives around these tests so they only build when using BoringSSL. --------- Co-authored-by: Gregory Cooke <gregorycooke@google.com> Co-authored-by: Esun Kim <veblush@google.com>	1 year ago
Mark D. Roth	72e791402f	[pick_first] fix test flake (#34098 ) CNR the flake, but I've changed the test (which is very old) to use some of our more modern helper functions that have saner timeouts. Also re-add a `return` statement that was accidentally removed in #33753, which I noticed while working on this. Its absence doesn't cause a real problem, but it does cause us to needlessly trigger a duplicate connection attempt or report a duplicate CONNECTING update in some cases.	1 year ago
Mohan Li	ab024624da	[pick_first] de-experiment pick first (#34054 ) De-experiment pick first since we have both affinity and randomness E2E test running successfully. --------- Co-authored-by: Yash Tibrewal <yashkt@google.com>	1 year ago
Mark D. Roth	64a318acd4	[pick_first] fix sticky-TF and handling of subchannels in TRANSIENT_FAILURE (#33753 ) Fix sticky-TF behavior such that once we enter TRANSIENT_FAILURE, we do not leave that state if we get a new address list. Also, fix handling of subchannels in state TRANSIENT_FAILURE. Previously, if a subchannel was already in state TRANSIENT_FAILURE when we wanted to start a connection attempt on it (e.g., because the subchannel already existed from a different channel, or because it already existed in the previous subchannel list), we would wait for it to report IDLE before attempting to connect. This PR changes pick_first to instead immediately skip the subchannel and move on to the next one. Now, the only time we wait for a subchannel in TRANSIENT_FAILURE is when we wrap back around to the first subchannel in the list.	1 year ago
Yousuk Seung	4acb7d38b9	[xds] Apply the slowdown factor only once to LRS load reporting period (#34042 )	1 year ago
Luwei Ge	a5f1121982	[xDS] Remove filter name from GenerateServiceConfig (#33915 ) We decided to not populate `policy_name` with the HTTP filter name in xDS case. So removing it from `GenerateServiceConfig`. This will be consistent across languages. The gRFC [PR](https://github.com/grpc/proposal/pull/346) has been updated.	1 year ago
Vignesh Babu	0616c8b838	[xds] Regex fix in test (#33981 )	1 year ago
Craig Tiller	91e7f223d3	[server] Remove `Notification` from shutdown path (#33953 ) I'm fairly certain that this path should be non-blocking (and making it so makes the promise based code far more tractable). This moves the blocking behavior into the blocking server_cc.cc function that calls `grpc_server_shutdown_and_notify` instead of in that non-blocking function. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Vignesh Babu	67f4e4e4c2	[resource quota] Reduce stress test size to prevent OOMs (#33776 )	1 year ago
Mark D. Roth	083bbee480	[LB policies] revert changes for dualstack design (#33718 ) This reverts the following PRs: #32692 #33087 #33093 #33427 #33568 These changes seem to have introduced some flaky crashes. Reverting while I investigate.	1 year ago
Mark D. Roth	ec39600872	[WRR] fix bugs that caused us to re-enter blackout period upon updates (#33694 ) As per gRFC A58, when WRR sees a subchannel report READY, it resets the non_empty_since value, thus restarting the blackout period. However, there were two cases where we were incorrectly triggering this code: 1. When WRR got an updated address list that contained addresses that were already present on the old list and whose subchannels were already in READY state, the initial notification for those subchannels on the new list was READY, which incorrectly triggered resetting the non_empty_since value. 2. Due to a bug in the outlier_detection policy, whenever an update was propagated down through the OD policy without actually enabling OD, it would incorrectly send a duplicate connectivity state notification for the subchannels. This meant that a subchannel that was already in state READY would report READY again, which would also incorrectly trigger resetting the non_empty_since value. This PR makes two changes: 1. Fix the bug in outlier_detection that caused it to generate the spurious duplicate READY updates. 2. Fix WRR to reset the non_empty_since value when a subchannel goes READY only if the subchannel has seen a previous state update and only if that previous state was not READY. (The duplicate READY notifications should not actually happen anymore now that the OD policy has been fixed, but better to be defensive.) Fixes b/290983884.	1 year ago
Eugene Ostroukhov	e0bc8a2c85	[xDS LB] xDS pick first support (#33540 )	1 year ago
Mark D. Roth	51e54ed636	[outlier detection] remove support for ejection via raw connectivity state (#33427 ) More work on the dualstack backend design: - Now that all petiole policies have been changed to delegate to pick_first, outlier detection no longer needs to eject via the subchannel's raw connectivity state; it can now eject only via the health state. See #33340. - This also removes the now-unnecessary hack to explicitly disable outlier detection in pick_first. See #33336.	1 year ago
Mark D. Roth	f09357ccb4	[ring hash] delegate to pick_first instead of creating subchannels directly (#33093 ) More work on the dualstack backend design: - Change ring_hash policy to delegate to pick_first instead of creating subchannels directly. - Note that, as mentioned in the WIP gRFC, because we lazily create the pick_first child policies, there's no need to swap over to a new list as an atomic whole. As a result, we don't use the endpoint_list library in this policy; instead, we just update a map in-place. - Remove now-unused subchannel_list library.	1 year ago
Mark D. Roth	27a778fece	[round robin] delegate to pick_first instead of creating subchannels directly (#32692 ) More work on the dualstack backend design: - Change round_robin to delegate to pick_first instead of creating subchannels directly. - Change pick_first such that when it is the child of a petiole policy, it will unconditionally start a health watch. - Change the client-side health checking code such that if client-side health checking is not enabled, it will return the subchannel's raw connectivity state. - As part of this, we introduce a new endpoint_list library to be used by petiole policies, which is intended to replace the existing subchannel_list library. The only policy that will still directly interact with subchannels is pick_first, so the relevant parts of the subchannel_list functionality have been copied directly into that policy. The subchannel_list library will be removed after all petiole policies are updated to delegate to pick_first.	1 year ago
Mark D. Roth	8427bacaea	[resolver API] remove address attribute interface (#33514 ) The address attribute interface was intended to provide a mechanism to pass attributes separately from channel args, for values that do not affect subchannel behavior and therefore do not need to be present in the subchannel key, which does include channel args. However, the mechanism as currently designed is fairly clunky and is probably not the direction we will want to go in the long term. Eventually, we will want some mechanism for registering channel args, which would provide a cleaner way to indicate that a given channel arg should not be used in the subchannel key, so that we don't need a completely different mechanism. For now, this PR is just doing an interim step, which is to establish a special channel arg key prefix to indicate that an arg is not needed in the subchannel key.	1 year ago
Vignesh Babu	cd4ff81b3f	Revert "Revert "[resource quota] Fix bugs in iomgr and event engine endpoint interactions with resource quota"" (#33499 ) Reverts grpc/grpc#33417 Deadlock https://fusion2.corp.google.com/invocations/99834386-79ff-4707-86eb-52e604774ea9/details fixed in the `c9a1bdc3dc` commit.	1 year ago
Eugene Ostroukhov	6451beba8e	Revert "Revert "Revert "Revert "[xDS LB] Override cluster with value … (#33424 ) Previous attempt: #33416 This reverts commit `19460ea82f`.	1 year ago
Mark D. Roth	f34a39af74	[xDS e2e tests] add 1K RPCs to try to work around statistical problem (#33409 )	1 year ago
Craig Tiller	80dbe90c18	Revert "[resource quota] Fix bugs in iomgr and event engine endpoint interactions with resource quota" (#33417 ) Reverts grpc/grpc#33375 Breaks import	1 year ago
Craig Tiller	19460ea82f	Revert "Revert "Revert "[xDS LB] Override cluster with value from cookie"" (#33379 )" (#33416 ) Reverts grpc/grpc#33388 Breaks import	1 year ago
Vignesh Babu	e6c1b13aed	[resource quota] Fix bugs in iomgr and event engine endpoint interactions with resource quota (#33375 ) The following bugs are fixed: * Missing ExecCtx in event engine endpoints and listeners * Ref counting issue with iomgr endpoint which causes crashes in overloaded situations The PR includes a test which triggers these bugs by simulating an overloaded system.	1 year ago
Eugene Ostroukhov	57bb6fb65c	Revert "Revert "[xDS LB] Override cluster with value from cookie"" (#33388 ) Reapplying #32973	1 year ago
Eugene Ostroukhov	6f685274a1	Revert "[xDS LB] Override cluster with value from cookie (#32973 )" (#33379 ) This reverts #32973 that causes breakages in internal CI.	2 years ago
Eugene Ostroukhov	8fe3472e53	[xDS LB] Override cluster with value from cookie (#32973 )	2 years ago
Yousuk Seung	c03cd744b2	[WRR] Prefer application_utilization to cpu_utilization (#33355 )	2 years ago
Mark D. Roth	6b4a1e4243	[outlier detection] hack to prevent OD from working with pick_first (#33336 ) As per discussion in #32967.	2 years ago
apolcyn	889412c416	[Rls] de-experimentalize RLS in XDS (#33290 ) Integration tests are passing, so we should be ready to de-experimentalize. Related: internal bug b/265209578	2 years ago
Luwei Ge	d1c0dc58cc	[Audit Logging] xDS e2e test for audit logging. (#33252 ) Added tests involve: 1. Checking the # of logger invocations with multiple RBACs in the chain. 2. Verifying content in audit context with action and audit condition permutations. 3. Confirm custom logger and built-in logger configurations are working. 4. Confirm the feature is protected by the environment variable. --------- Co-authored-by: rockspore <rockspore@users.noreply.github.com>	2 years ago
Mark D. Roth	52d687ad42	[xDS] second attempt: clean up cert provider factory and registry APIs (#33249 ) Original was #33226, reverted in #33248.	2 years ago
Craig Tiller	9faa39d88b	Revert "[xDS] clean up cert provider factory and registry APIs" (#33248 ) Reverts grpc/grpc#33226 (looks to be creating some import problems)	2 years ago
Mark D. Roth	eb2b1edd1c	[xDS] clean up cert provider factory and registry APIs (#33226 ) - switch to json_object_loader for config parsing - use `absl::string_view` instead of `const char*` for cert provider names - change cert provider registry to use a map instead of a vector - remove unused mesh_ca cert provider factory	2 years ago
Luwei Ge	de9d398e8f	[Audit Logging] End2end test for audit logging in authorization policy (#33196 ) I generated a new client key and cert where a Spiffe ID is added as the URI SAN. As such, we are able to test the audit log contains the principal correctly. Update: I switched to use the test logger to verify the log content and removed stdout logger here because of the failure of [RBE Windows Debug C/C++](https://source.cloud.google.com/results/invocations/c3187f41-bb1f-44b3-b2b1-23f38e47386d). Update again: Refactored the test logger in a util such that the authz engine test also uses the same logger. Subsequently, xDS e2e test will also use it. --------- Co-authored-by: rockspore <rockspore@users.noreply.github.com>	2 years ago
Mark D. Roth	a78001a087	[resolver] remove unused ctor for ServerAddress (#33148 ) Co-authored-by: markdroth <markdroth@users.noreply.github.com>	2 years ago
Mark D. Roth	1fcaccdf5f	[client channel] Second attempt: use ChunkedVector for call attributes (#33015 ) Original was #33002, reverted in #33014. The second commit here adds a build visibility tag necessary to fix the internal build problems.	2 years ago
AJ Heller	18aab6ffb5	Revert "[client channel] use ChunkedVector for call attributes" (#33014 ) Reverts grpc/grpc#33002. Breaks internal builds: `.../privacy_context:filters does not depend on a module exporting '.../src/core/lib/channel/context.h'`	2 years ago
Mark D. Roth	2f89fd5528	[client channel] use ChunkedVector for call attributes (#33002 ) Change call attributes to be stored in a `ChunkedVector` instead of `std::map<>`, so that the storage can be allocated on the arena. This means that we're now doing a linear search instead of a map lookup, but the total number of attributes is expected to be low enough that that should be okay. Also, we now hide the actual data structure inside of the `ServiceConfigCallData` object, which required some changes to the `ConfigSelector` API. Previously, the `ConfigSelector` would return a `CallConfig` struct, and the client channel would then use the data in that struct to populate the `ServiceConfigCallData`. This PR changes that such that the client channel creates the `ServiceConfigCallData` before invoking the `ConfigSelector`, and it passes the `ServiceConfigCallData` into the `ConfigSelector` so that the `ConfigSelector` can populate it directly.	2 years ago
Craig Tiller	65a2a895af	[chttp2] Fix some fuzzer found bugs. (#33005 )	2 years ago
Yousuk Seung	8b02295e58	[xDS] Accept cpu_utilization over 100% (#32954 )	2 years ago
Mark D. Roth	020e9b4dd6	[WRR] Remove env var guard for WRR policy (#32936 ) - remove the `_experimental` suffix from the gRPC policy name - remove the env var guard for the xDS policy config	2 years ago
apolcyn	017d9943ef	[XDS] Revert "Revert "XDS: enable XDS federation by default (#32711 )" (#32814 ) (#32902 ) Previous lack-of-load-reporting issue has been fixed (b/276944116)	2 years ago
Eugene Ostroukhov	e59a3e25ca	[xds] Remove variable protection from custom LB policies (#32888 )	2 years ago
Mark D. Roth	26df3d14e2	[XDS] fix federation bug that prevented load reports from being sent (#32826 ) This bug occurred when the same xDS server was configured twice in the same bootstrap config, once in an authority and again as the top-level server. In that case, we were incorrectly failing to de-dup them and were creating a separate channel for the LRS stream than the one that already existed for the ADS stream. We fix this by canonicalizing the server keys the same way in both cases. As a separate follow-up item, I will work on trying to find a better way to key these maps that does not suffer from this kind of fragility.	2 years ago
Craig Tiller	63c094cf5b	[promises] Run C++ end to end tests with server promises (#32537 ) Expand server promises to run with C++ end2end tests. Across connected_channel/call/batch_builder/pipe/transport: - fix a bug where read errors weren't propagated from transport to call so that we can populate failed_before_recv_message for the c++ bindings - ensure those errors are not, however, used to populate the returned call status Add a new latch call arg to lazily propagate the bound CQ for a server call (and client call, but here it's used degenerately - it's always populated). This allows server calls to be properly bound to pollsets.(1)/(2) In call.cc: - move some profiling code from FilterStackCall to Call, and then use it in PromiseBasedCall (this should be cleaned up with tracing work) - implement GetServerAuthority In server.cc: - use an RAII pattern on `MatchResult` to avoid a bug whereby a tag could be dropped if we cancel a request after it's been matched but before it's published - fix deadline export to ServerContext In resource_quota_server.cc: - fix some long standing flakes (that were finally obvious with the new test code) - it's legal here to have client calls not arrive at the server due to resource starvation, work through that (includes adding expectations during a `Step` call, which required some small tweaks to cq_verifier) In the C++ end2end_test.cc: - strengthen a flaky test so it passes consistently (it's likely we'll revisit this with the fuzzing efforts to strengthen it into an actually robust test) (1) It's time to remove this concept (2) Surprisingly the only test that reliably demonstrates this not being done is time_change_test --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
apolcyn	b0636e7a23	Revert "XDS: enable XDS federation by default (#32711 )" (#32814 ) This reverts commit `4b46dbc19e`. Reason: this seems to be breaking load reports in certain cases, b/276944116 Let's revert so this doesn't accidentally get released.	2 years ago
BrandonY	bc6a2ee918	[RLS] Change case of RLS 'x-google-rls-data' header to lowercase. (#32760 ) "X-Google-RLS-Data" does not work as gRPC metadata key.	2 years ago
Yousuk Seung	c02b3e695c	xDS: Include orca named_metrics in LRS load reports (#32690 )	2 years ago
apolcyn	4b46dbc19e	XDS: enable XDS federation by default (#32711 ) Integration tests have been green so let's enable this (verification of test results in https://b.corp.google.com/issues/262593165#comment30).	2 years ago
Craig Tiller	175ccc3a90	Reland global config changes (#32661 ) --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago

2420 Commits (9303b86010387c2e27bfc4c2f045a907967cb2b5)