Chiebot-Mirror/grpc - grpc - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Mark D. Roth	64a318acd4	[pick_first] fix sticky-TF and handling of subchannels in TRANSIENT_FAILURE (#33753 ) Fix sticky-TF behavior such that once we enter TRANSIENT_FAILURE, we do not leave that state if we get a new address list. Also, fix handling of subchannels in state TRANSIENT_FAILURE. Previously, if a subchannel was already in state TRANSIENT_FAILURE when we wanted to start a connection attempt on it (e.g., because the subchannel already existed from a different channel, or because it already existed in the previous subchannel list), we would wait for it to report IDLE before attempting to connect. This PR changes pick_first to instead immediately skip the subchannel and move on to the next one. Now, the only time we wait for a subchannel in TRANSIENT_FAILURE is when we wrap back around to the first subchannel in the list.	1 year ago
Yash Tibrewal	462a2cae35	Revert "[GSM Observability] Add bazel dependency on Google Cloud C++ OTel library" (#34069 ) Reverts grpc/grpc#34043 This caused bazel distribtest failures	1 year ago
Yijie Ma	67ad297e61	[EventEngine] Port GrpcPolledFdFactoryPosix fix to EE (#34025 ) Port https://github.com/grpc/grpc/pull/33871 to EE's GrpcPolledFdFactoryPosix. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Yousuk Seung	4acb7d38b9	[xds] Apply the slowdown factor only once to LRS load reporting period (#34042 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Yash Tibrewal	bd343fd51d	[GSM Observability] Add bazel dependency on Google Cloud C++ OTel library (#34043 ) Not adding CMake support yet <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Eugene Ostroukhov	44de3ab221	[PSM Interop] Restore "Report per-RPC metadata if requested. (#33939 )" (#34037 )	1 year ago
Mohan Li	66f60aa763	[test] Allow set request/response size in interop soak test (#34010 ) Internal bug: b/289109827	1 year ago
Eugene Ostroukhov	fc9a1ccaed	[PSM Interop] Revert "Report per-RPC metadata if requested. (#33939 )" (#34028 ) This reverts commit `6fadb994ef`.	1 year ago
Eugene Ostroukhov	6fadb994ef	[PSM Interop] Report per-RPC metadata if requested. (#33939 )	1 year ago
Luwei Ge	a5f1121982	[xDS] Remove filter name from GenerateServiceConfig (#33915 ) We decided to not populate `policy_name` with the HTTP filter name in xDS case. So removing it from `GenerateServiceConfig`. This will be consistent across languages. The gRFC [PR](https://github.com/grpc/proposal/pull/346) has been updated.	1 year ago
Eugene Ostroukhov	18be986e3b	[XDS Interop] Move XdsStatsWatcher to a separate file. (#34000 ) This will help with introducing test coverage as the logic becomes more complex.	1 year ago
Mario Jones Vimal	1c0f5d32a0	[core/gpr] Move subprocess to gpr and add subprocess creation using execve (#33983 ) Move subprocess util to gpr. Add support for communication with the subprocess. This is required to support authentication using an executable.	1 year ago
Vignesh Babu	0616c8b838	[xds] Regex fix in test (#33981 )	1 year ago
Craig Tiller	5325b65d84	Revert "[core/gpr] move subprocess to gpr" (#33972 ) Reverts grpc/grpc#33870 - since it breaks memory usage tooling.	1 year ago
Yash Tibrewal	7e63a2f382	[GSM] Some initial structure (#33952 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Mario Jones Vimal	f10a8e3418	[core/gpr] move subprocess to gpr (#33870 ) Move subprocess util to gpr. Add support for communication with the subprocess. This is required to support authentication using an executable.	1 year ago
Craig Tiller	91e7f223d3	[server] Remove `Notification` from shutdown path (#33953 ) I'm fairly certain that this path should be non-blocking (and making it so makes the promise based code far more tractable). This moves the blocking behavior into the blocking server_cc.cc function that calls `grpc_server_shutdown_and_notify` instead of in that non-blocking function. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Yash Tibrewal	860167a7d0	[OTel] Add target on client-rpc metrics, and authority on server-rpc metrics (#33946 )	1 year ago
Yijie Ma	7f332ef69d	[Deps] Update pyyaml to 6.0.1 for bazel build system (#33932 ) The previous version (`3.12`) is 7 years old and does not support the newest Python 3 versions. This causes issues to move certain test targets (which depends on `pyyaml`) to Python 3 when some CI environment (e.g. `arm64v8/debian:11`) does not have Python 2 installed. And in general, we should move away from Python 2. Thus, updated `pyyaml` to the latest version. This hopefully should also fix the `prod:grpc/core/master/linux/arm64/grpc_bazel_test_c_cpp` job breakage. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
apolcyn	e923706d6f	[c-ares DNS resolver] Revert "Revert "[c-ares DNS resolver] Fix file descriptor use-after-close bug when c-ares writes succeed but subsequent read fails" (#33934 )" (#33942 ) Rolls forward https://github.com/grpc/grpc/pull/33871 Second and third commits here fix internal build issues In particular, add a `// IWYU pragma: no_include <ares_build.h>` since `ares.h` [includes that anyways](`bad62225b7/include/ares.h (L23)`) (and seems unlikely for that to change since it would be breaking)	1 year ago
Alisha Nanda	f7fc3fbed4	[tracing] Add annotation with metadata sizes and limits (#33910 ) Only create annotation when call is sampled for cost reasons. --------- Co-authored-by: ananda1066 <ananda1066@users.noreply.github.com>	1 year ago
Alisha Nanda	9aca06d38a	Revert "[c-ares DNS resolver] Fix file descriptor use-after-close bug when c-ares writes succeed but subsequent read fails" (#33934 ) Reverts grpc/grpc#33871 due to build failures in google3. Co-authored-by: Yijie Ma <yijiem@google.com>	1 year ago
apolcyn	76203ba589	[c-ares DNS resolver] Fix file descriptor use-after-close bug when c-ares writes succeed but subsequent read fails (#33871 ) Normally, c-ares related fds are destroyed after all DNS resolution is finished in [this code path](`c82d31677a/src/core/ext/filters/client_channel/resolver/dns/c_ares/grpc_ares_wrapper.cc (L210)`). Also there are some fds that c-ares may fail to open or write to initially, and c-ares will close them internally before grpc ever knows about them. But if: 1) c-ares opens a socket and successfully writes a request on it 2) then a subsequent read fails Then c-ares will close the fd in [this code path](`bad62225b7/src/lib/ares_process.c (L740)`), but gRPC will have a reference on the fd and will still use it afterwards. Fix here is to leverage the c-ares socket-override API to properly track fd ownership between c-ares and grpc. Related: internal issue b/292203138	1 year ago
Yash Tibrewal	9ea30fa9fd	[OTel] Add an OpenTelemetryPluginBuilder (#33895 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Yijie Ma	2bb9aea332	[CI breakage] Fix health_check.py permission (#33815 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Yijie Ma	a7bf07e86a	[EventEngine] PosixEventEngine DNS Resolver (#32701 ) This PR implements a c-ares based DNS resolver for EventEngine with the reference from the original [grpc_ares_wrapper.h](../blob/master/src/core/ext/filters/client_channel/resolver/dns/c_ares/grpc_ares_wrapper.h). The PosixEventEngine DNSResolver is implemented on top of that. Tests which use the client channel resolver API ([resolver.h](../blob/master/src/core/lib/resolver/resolver.h#L54)) are ported, namely the [resolver_component_test.cc](../blob/master/test/cpp/naming/resolver_component_test.cc) and the [cancel_ares_query_test.cc](../blob/master/test/cpp/naming/cancel_ares_query_test.cc). The WindowsEventEngine DNSResolver will use the same EventEngine's grpc_ares_wrapper and will be worked on next. The [resolve_address_test.cc](https://github.com/grpc/grpc/blob/master/test/core/iomgr/resolve_address_test.cc) which uses the iomgr [DNSResolver](../blob/master/src/core/lib/iomgr/resolve_address.h#L44) API has been ported to EventEngine's dns_test.cc. That leaves only 2 tests which use iomgr's API, notably the [dns_resolver_cooldown_test.cc](../blob/master/test/core/client_channel/resolvers/dns_resolver_cooldown_test.cc) and the [goaway_server_test.cc](../blob/master/test/core/end2end/goaway_server_test.cc) which probably need to be restructured to use EventEngine DNSResolver (for one thing they override the original grpc_ares_wrapper's free functions). I will try to tackle these in the next step. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Craig Tiller	801f106992	[promises] Add logging_test to promise_based_server_call testing (#33774 )	1 year ago
Mark D. Roth	38da78e416	[test] delete client_channel_stress_test (#33763 ) This test has been disabled for a long time now due to flakiness, but it's now causing problems with the import. And stress tests don't provide positive ROI anyway, so let's just get rid of it.	1 year ago
Vignesh Babu	67f4e4e4c2	[resource quota] Reduce stress test size to prevent OOMs (#33776 )	1 year ago
Yash Tibrewal	d2f37b8b45	[OTel] Basic C++ OTel Stats Functionality (#33650 ) Note that the plugin is still under `grpc::internal` namespace and not under `experimental` intentionally. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Mark D. Roth	083bbee480	[LB policies] revert changes for dualstack design (#33718 ) This reverts the following PRs: #32692 #33087 #33093 #33427 #33568 These changes seem to have introduced some flaky crashes. Reverting while I investigate.	1 year ago
Mark D. Roth	ec39600872	[WRR] fix bugs that caused us to re-enter blackout period upon updates (#33694 ) As per gRFC A58, when WRR sees a subchannel report READY, it reset the non_empty_since value, thus restarting the blackout period. However, there were two cases where we were incorrectly triggering this code: 1. When WRR got an updated address list that contained addresses that were already present on the old list and whose subchannels were already in READY state, the initial notification for those subchannels on the new list was READY, which incorrectly triggered resetting the non_empty_since value. 2. Due to a bug in the outlier_detection policy, whenever an update was propagated down through the OD policy without actually enabling OD, it would incorrectly send a duplicate connectivity state notification for the subchannels. This meant that a subchannel that was already in state READY would report READY again, which would also incorrectly trigger resetting the non_empty_since value. This PR makes two changes: 1. Fix the bug in outlier_detection that caused it to generate the spurious duplicate READY updates. 2. Fix WRR to reset the non_empty_since value when a subchannel goes READY only if the subchannel has seen a previous state update and only if that previous state was not READY. (The duplicate READY notifications should not actually happen anymore now that the OD policy has been fixed, but better to be defensive.) Fixes b/290983884.	1 year ago
Craig Tiller	af257b8a39	[hpack] Fix benchmarking timeout (#33675 ) Disable uninteresting sanitizers for this benchmark	1 year ago
Craig Tiller	b7077f4bbf	[hpack] Rollforward huffman read optimization (#33657 ) Rollforward in first commit, fixes in subsequent.	1 year ago
Craig Tiller	57c697d8ae	Revert "[hpack] Huffman read optimization" (#33655 ) Reverts grpc/grpc#33269	1 year ago
Craig Tiller	4ce51fe45d	[hpack] Huffman read optimization (#33269 ) In real services most of our time ends up in the `Read1()` function, which populates one byte into the bit buffer. Change this to read in as many as possible bytes at a time into that buffer. Additionally, generate all possible (to some depth) parser geometries, and add a benchmark for them. Run that benchmark and select the best geometry for decoding base64 strings (since this is the main use-case). (gives about a 30% speed boost parsing base64 then huffman encoded random binary strings) --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Eugene Ostroukhov	e0bc8a2c85	[xDS LB] xDS pick first support (#33540 )	1 year ago
Mark D. Roth	51e54ed636	[outlier detection] remove support for ejection via raw connectivity state (#33427 ) More work on the dualstack backend design: - Now that all petiole policies have been changed to delegate to pick_first, outlier detection no longer needs to eject via the subchannel's raw connectivity state; it can now eject only via the health state. See #33340. - This also removes the now-unnecessary hack to explicitly disable outlier detection in pick_first. See #33336.	1 year ago
Mark D. Roth	f09357ccb4	[ring hash] delegate to pick_first instead of creating subchannels directly (#33093 ) More work on the dualstack backend design: - Change ring_hash policy to delegate to pick_first instead of creating subchannels directly. - Note that, as mentioned in the WIP gRFC, because we lazily create the pick_first child policies, so there's no need to swap over to a new list as an atomic whole. As a result, we don't use the endpoint_list library in this policy; instead, we just update a map in-place. - Remove now-unused subchannel_list library.	1 year ago
Yash Tibrewal	98417f3bd0	Revert "Revert "[otel] Add bazel dependency"" (#33560 ) Reverts grpc/grpc#33559	1 year ago
Mark D. Roth	017153a0c5	Revert "[otel] Add bazel dependency" (#33559 ) Reverts grpc/grpc#33548	1 year ago
Mark D. Roth	27a778fece	[round robin] delegate to pick_first instead of creating subchannels directly (#32692 ) More work on the dualstack backend design: - Change round_robin to delegate to pick_first instead of creating subchannels directly. - Change pick_first such that when it is the child of a petiole policy, it will unconditionally start a health watch. - Change the client-side health checking code such that if client-side health checking is not enabled, it will return the subchannel's raw connectivity state. - As part of this, we introduce a new endpoint_list library to be used by petiole policies, which is intended to replace the existing subchannel_list library. The only policy that will still directly interact with subchannels is pick_first, so the relevant parts of the subchannel_list functionality have been copied directly into that policy. The subchannel_list library will be removed after all petiole policies are updated to delegate to pick_first.	1 year ago
Yash Tibrewal	875b7fdcff	[otel] Add bazel dependency (#33548 ) Add bazel dependency on opentelemetry-cpp. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Esun Kim	16a11fadff	[Test] Explicitly cast `enum class` to `int` before passing it with a format string (#33554 ) Corresponding internal cl/542804880 > Explicitly cast `enum class` to `int` before passing it with a format string. > > Next version of the crosstool will start warning about this (see: > [https://github.com/llvm/llvm-project/issues/38717](https://www.google.com/url?sa=D&q=https%3A%2F%2Fgithub.com%2Fllvm%2Fllvm-project%2Fissues%2F38717))	1 year ago
Mark D. Roth	8427bacaea	[resolver API] remove address attribute interface (#33514 ) The address attribute interface was intended to provide a mechanism to pass attributes separately from channel args, for values that do not affect subchannel behavior and therefore do not need to be present in the subchannel key, which does include channel args. However, the mechanism as currently designed is fairly clunky and is probably not the direction we will want to go in the long term. Eventually, we will want some mechanism for registering channel args, which would provide a cleaner way to indicate that a given channel arg should not be used in the subchannel key, so that we don't need a completely different mechanism. For now, this PR is just doing an interim step, which is to establish a special channel arg key prefix to indicate that an arg is not needed in the subchannel key.	1 year ago
Eugene Ostroukhov	fc4736c1ad	[interop] Fix crash in pick_first LB policy (#33519 ) Fixes http://b/288420022	1 year ago
Yash Tibrewal	441ff0e757	[logging] Add tests for cases where we don't send any metadata and improve debuggability (#33486 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Vignesh Babu	cd4ff81b3f	Revert "Revert "[resource quota] Fix bugs in iomgr and event engine endpoint interactions with resource quota"" (#33499 ) Reverts grpc/grpc#33417 Deadlock https://fusion2.corp.google.com/invocations/99834386-79ff-4707-86eb-52e604774ea9/details fixed in the `c9a1bdc3dc` commit.	1 year ago
Eugene Ostroukhov	d55431995c	[interop] Implement "hostname" for RPC behavior (#33446 ) This enables outlier detection test. See #33135	1 year ago
Mark D. Roth	d20e8d141b	[LB policies] add delegating helper classes (#33445 ) This eliminates the need to modify every parent policy whenever we add new helper methods. It should also eliminate some binary bloat.	1 year ago

... 2 3 4 5 6 ...

6381 Commits (00a35e24e516296158517f46eddbfa9cd5c49cc3)