Chiebot-Mirror/grpc - grpc - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Mark D. Roth	21cb320080	[reorg] move service config code to src/core/service_config (#35843 ) Closes #35843 COPYBARA_INTEGRATE_REVIEW=https://github.com/grpc/grpc/pull/35843 from markdroth:client_channel_reorg4 `0c50ada6f9` PiperOrigin-RevId: 605466874	10 months ago
Mark D. Roth	f22c954ef5	[reorg] move client channel code to src/core/client_channel (#35827 ) Also rename client_channel.{h,cc} -> client_channel_filter.{h,cc}. Closes #35827 COPYBARA_INTEGRATE_REVIEW=https://github.com/grpc/grpc/pull/35827 from markdroth:client_channel_reorg3 `449bff563f` PiperOrigin-RevId: 605006293	10 months ago
Mark D. Roth	10e83973e7	[reorg] move resolver code to src/core/resolver (#35804 ) This new directory combines code from the following locations: - src/core/ext/filters/client_channel/resolver - src/core/lib/resolver Closes #35804 COPYBARA_INTEGRATE_REVIEW=https://github.com/grpc/grpc/pull/35804 from markdroth:client_channel_resolver_reorg2 `30660e6b00` PiperOrigin-RevId: 604665835	10 months ago
Mark D. Roth	148f59c15a	[reorg] move LB policy code to src/core/load_balancing (#35786 ) This new directory combines code from the following locations: - src/core/ext/filters/client_channel/lb_policy - src/core/lib/load_balancing Closes #35786 COPYBARA_INTEGRATE_REVIEW=https://github.com/grpc/grpc/pull/35786 from markdroth:client_channel_resolver_reorg `98554efb98` PiperOrigin-RevId: 604351832	10 months ago
Mark D. Roth	fa5603c72c	[client channel] rename ClientChannel to ClientChannelFilter (#35783 ) Closes #35783 COPYBARA_INTEGRATE_REVIEW=https://github.com/grpc/grpc/pull/35783 from markdroth:client_channel_filter_rename `ea8b74a33a` PiperOrigin-RevId: 603424220	10 months ago
Yijie Ma	77ad5a786e	[CSM O11Y] CSM Service Label Plumbing from LB Policies to CallAttemptTracer (#35210 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. --> Closes #35210 COPYBARA_INTEGRATE_REVIEW=https://github.com/grpc/grpc/pull/35210 from yijiem:csm-service-label `6a6a7d1774` PiperOrigin-RevId: 597641393	11 months ago
Mark D. Roth	a446df61f4	[xDS] remove unnecessary string from XdsConfig struct (#35503 ) I realized that this field wasn't actually necessary, since the string is already present in the map key. Closes #35503 COPYBARA_INTEGRATE_REVIEW=https://github.com/grpc/grpc/pull/35503 from markdroth:xds_config_remove_cluster_name `94d5edc133` PiperOrigin-RevId: 597375018	11 months ago
Mark D. Roth	6a4b5ccea3	[SSA] change xds_override_host policy to manage subchannels based on last-used time rather than EDS health state (#35397 ) Part of the work needed for in-progress gRFC A75 (https://github.com/grpc/proposal/pull/405). Closes #35397 COPYBARA_INTEGRATE_REVIEW=https://github.com/grpc/grpc/pull/35397 from markdroth:xds_ssa_subchannel_management_revamp `8902deafad` PiperOrigin-RevId: 597288930	11 months ago
Mark D. Roth	c7101d0867	[xDS] move CDS and EDS watchers into xds resolver (#35011 ) Implements gRFC A74 (https://github.com/grpc/proposal/pull/404). Closes #35011 COPYBARA_INTEGRATE_REVIEW=https://github.com/grpc/grpc/pull/35011 from markdroth:xds_watchers_in_xds_resolver `a39f71f37f` PiperOrigin-RevId: 595134549	11 months ago
Mark D. Roth	3e785d395d	[RefCounted and friends] Fix type safety of ref-counted types. Previously, `RefCountedPtr<>` and `WeakRefCountedPtr<>` incorrectly allowed implicit casting of any type to any other type. This hadn't caused a problem until recently, but now that it has, we need to fix it. I have fixed this by changing these smart pointer types to allow type conversions only when the type used is convertible to the type of the smart pointer. This means that if `Subclass` inherits from `Base`, then we can set a `RefCountedPtr<BaseClass>` to a value of type `RefCountedPtr<Subclass>`, but we cannot do the reverse. We had been (ab)using this bug to make it more convenient to deal with down-casting in subclasses of ref-counted types. For example, because `Resolver` inherits from `InternallyRefCounted<Resolver>`, calling `Ref()` on a subclass of `Resolver` will return `RefCountedPtr<Resolver>` rather than returning the subclass's type. The ability to implicitly convert to the subclass type made this a bit easier to deal with. Now that that ability is gone, we need a different way of dealing with that problem. I considered several ways of dealing with this, but none of them are quite as ergonomic as I would ideally like. For now, I've settled on requiring callers to explicitly down-cast as needed, although I have provided some utility functions to make this slightly easier: - `RefCounted<>`, `InternallyRefCounted<>`, and `DualRefCounted<>` all provide a templated `RefAsSubclass<>()` method that will return a new ref as a subclass. The type used with `RefAsSubclass()` must be a subclass of the type passed to `RefCounted<>`, `InternallyRefCounted<>`, or `DualRefCounted<>`. - In addition, `DualRefCounted<>` provides a templated `WeakRefAsSubclass<T>()` method. This is the same as `RefAsSubclass()`, except that it returns a weak ref instead of a strong ref. - In `RefCountedPtr<>`, I have added a new `Ref()` method that takes debug tracing parameters. This can be used instead of calling `Ref()` on the underlying object in cases where the caller already has a `RefCountedPtr<>` and is calling `Ref()` only to specify the debug tracing parameters. Using this method on `RefCountedPtr<>` is more ergonomic, because the smart pointer is already using the right subclass, so no down-casting is needed. - In `WeakRefCountedPtr<>`, I have added a new `WeakRef()` method that takes debug tracing parameters. This is the same as the new `Ref()` method on `RefCountedPtr<>`. - In both `RefCountedPtr<>` and `WeakRefCountedPtr<>`, I have added a templated `TakeAsSubclass<>()` method that takes the ref out of the smart pointer and returns a new smart pointer of the down-casted type. Just as with the `RefAsSubclass()` method above, the type used with `TakeAsSubclass()` must be a subclass of the type passed to `RefCountedPtr<>` or `WeakRefCountedPtr<>`. Note that I have not provided an `AsSubclass<>()` variant of the `RefIfNonZero()` methods. Those methods are used relatively rarely, so it's not as important for them to be quite so ergonomic. Callers of these methods that need to down-cast can use `RefIfNonZero().TakeAsSubclass<>()`. PiperOrigin-RevId: 592327447	12 months ago
Yijie Ma	86d90f54b0	[EventEngine] Skip `dns_resolver_cooldown_test` for `event_engine_dns` experiment (#35251 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. --> Closes #35251 COPYBARA_INTEGRATE_REVIEW=https://github.com/grpc/grpc/pull/35251 from yijiem:fix-dns-resolver-cooldown-test `857835200a` PiperOrigin-RevId: 589159895	1 year ago
Mark D. Roth	fcdc9b4d29	[LB policy API] pass address lists down via an iterator interface (#34753 ) This avoids storing unnecessary copies of the address list in each node of the LB policy tree. Closes #34753 COPYBARA_INTEGRATE_REVIEW=https://github.com/grpc/grpc/pull/34753 from markdroth:lb_address_list_iterator `1d39465fbc` PiperOrigin-RevId: 582891475	1 year ago
Mark D. Roth	8a000f45f8	[grpclb and fake resolver] clean up e2e tests and simplify fake resolver (#34887 ) Changes to fake resolver: - Add `WaitForReresolutionRequest()` method to fake resolver response generator to allow tests to tell when re-resolution has been requested. - Change fake resolver response generator API to have only one mechanism for injecting results, regardless of whether the result is an error or whether it's triggered by a re-resolution. Changes to grpclb_end2end_test: - Change balancer interface such that instead of setting a list of responses with fixed delays, the test can control exactly when each response is set. - Change balancer impl to always send the initial LB response, as expected by the grpclb protocol. - Change balancer impl to always read load reports, even if load reporting is not expected to be enabled. (The latter case will still cause the test to fail.) Reads are done in a different thread than writes. - Allow each test to directly control how many backends and balancers are started and the client load reporting interval, so that (a) we don't waste resources starting servers we don't need and (b) there is no need to arbitrarily split tests across different test classes. - Add timeouts to `WaitForAllBackends()` functionality, so that tests will fail with a useful error rather than timing out. - Improved ergonomics of various helper functions in the test framework. In the process of making these changes, I found a couple of bugs: - A bug in pick_first, which I fixed in #34885. - A bug in grpclb, in which we were using the wrong condition to decide whether to propagate a re-resolution request from the child policy, which I've fixed in this PR. (This bug probably originated way back in #18344.) This should address a lot of the flakes seen in grpclb_e2e_test recently.	1 year ago
Mark D. Roth	1324ce42ea	[pick_first] fix race condition for detecting idleness (#34885 ) This fixes a bug accidentally introduced in #33753. The symptom is that if we exit idle and then get a new address list before any of the subchannels in the old list can report their initial connectivity state, we will incorrectly ignore the new address list.	1 year ago
Mark D. Roth	4826efa619	[pick_first] fix happy eyeballs address interleaving bug (#34804 ) The original logic from #34615 was incorrect in cases where one address family has a different number of addresses than the other(s). Fixes b/307937051.	1 year ago
Eugene Ostroukhov	1a76e7cb42	[Proxy] Support for setting proxy for addresses (#34617 )	1 year ago
Mark D. Roth	b2d5a3c8da	[pick_first] don't finish Happy Eyeballs pass until all subchannels fail at least once (#34717 )	1 year ago
Mark D. Roth	f15635287b	[xxhash] add a wrapper header to avoid clang-format breakage (#34658 )	1 year ago
Mark D. Roth	067fc48dca	[pick_first] implement address interleaving for Happy Eyeballs (#34615 )	1 year ago
Yijie Ma	bae0c705aa	[Deps] Update to Clang-16 (#34492 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Mark D. Roth	01907a7767	[ring_hash] add test and make some minor fixes and improvements (#34610 ) - Fixes support for the same address being present more than once in the address list, which was accidentally broken in #34244. - Change the call attribute to encode the hash as an integer instead of a string.	1 year ago
Mark D. Roth	bb6a6faa69	[SSA] support multiple addresses per endpoint (#34472 )	1 year ago
Mark D. Roth	36b70504e5	[outlier detection] support multiple addresses per endpoint (#34526 )	1 year ago
Mark D. Roth	7a06614f95	[resolver and LB policy APIs] reland: change address list to support multiple addresses per endpoint (#34531 ) Re-land #33567, which was reverted in #34527. First commit is a pure revert, second commit is a small fix needed to avoid breaking internal callers.	1 year ago
Mark D. Roth	41f26de3b6	Revert "[resolver and LB policy APIs] change address list to support multiple addresses per endpoint" (#34527 ) Reverts grpc/grpc#33567 due to import problems.	1 year ago
Mark D. Roth	fd2e8c9462	[resolver and LB policy APIs] change address list to support multiple addresses per endpoint (#33567 ) More changes as part of the dualstack design: - Change resolver and LB policy APIs to support multiple addresses per endpoint. Specifically, replace `ServerAddress` with `EndpointAddresses`, which encodes more than one address. Per-address channel args are retained at the same level, so they are now per-endpoint. For now, `EndpointAddress` provides a single-address ctor and a single-address accessor for backward compatibility, so `ServerAdress` is an alias for `EndpointAddresses`; eventually, this alias and the single-address methods will be removed. - Add an `EndpointAddressSet` class, which represents an unordered set of addresses to be used as a map key. This will be used in a number of LB policies that need to store per-endpoint state. - Change the LB policy API's `ChannelControlHelper::CreateSubchannel()` method to take the address and per-endpoint channel args as separate parameters, so that we don't need to construct a legacy `ServerAddress` object as we create a new subchannel for each address in the endpoint. - Change pick_first to flatten the address list. - Change ring_hash to use `EndpointAddressSet` as the key for its endpoint map, and to use the first address of the endpoint as the hash key. - Change WRR to use `EndpointAddressSet` as the key for its endpoint weight map. Note that support for multiple addresses per endpoint is guarded in RR by the existing `round_robin_delegate_to_pick_fist` experiment and in WRR by the existing `wrr_delegate_to_pick_first` experiment. This PR does not include support for multiple addresses per endpoint for the outlier_detection or xds_override_host LB policies; those will come in subsequent PRs.	1 year ago
Mark D. Roth	835775e347	[pick_first] implement Happy Eyeballs (#34426 )	1 year ago
Craig Tiller	47306d78f4	[work-serializer] Add some basic process-wide monitoring (#34369 ) Add some basic metrics to work serializer, keep them process wide for now (though it may be interesting to get these into channelz in the future). Collected are: - time spent running a work serializer when it starts - time spent actually executing work when the work serializer runs - number of items executed each run A high disparity between the first two indicates our dispatching mechanism is adding large amounts of latency (perhaps due to thread starvation like effects). A high value for any of these indicate contention on the serializer. It's likely a future iteration on these will select different metrics - I'm not entirely sure which will be useful in production analysis yet. I'm using `std::chrono::steady_clock` here for precision (nanoseconds) with a compact representation (better than timespec) and a robust & portable api - I think it's appropriate for metrics, but wouldn't use it much beyond that at this point.	1 year ago
Mark D. Roth	25cb8e6ed2	[WRR] delegate to pick_first as per dualstack design (#34245 ) Rolls forward the changes from #33087, which were rolled back in #33718. This change is now guarded by a disablable experiment.	1 year ago
Craig Tiller	86b931c354	[work-serializer] Dispatch on run experiment (relanding) (#34372 ) Reverts grpc/grpc#34371	1 year ago
Craig Tiller	d589caa679	Revert "[work-serializer] Dispatch on run experiment" (#34371 ) Reverts grpc/grpc#34274 (needs some changes internally)	1 year ago
Craig Tiller	1705470950	[work-serializer] Dispatch on run experiment (#34274 ) Co-authored-by: ctiller <ctiller@users.noreply.github.com> Co-authored-by: Mark D. Roth <roth@google.com>	1 year ago
Mark D. Roth	1986007e1e	[round_robin] 4th attempt: delegate to pick_first as per dualstack design (#34337 ) Most recent attempt was #34320, reverted in #34335. The first commit here is a pure revert. The second commit fixes the outlier_detection unit test to pass both with and without the experiment.	1 year ago
Mark D. Roth	6534f0a6bf	Revert "[round_robin] third attempt: delegate to pick_first as per dualstack design" (#34335 ) Reverts grpc/grpc#34320	1 year ago
Mark D. Roth	d713427cec	[round_robin] third attempt: delegate to pick_first as per dualstack design (#34320 ) Previous attempt was #34241, reverted in #34317. The second commit here makes the experiment disablable, so that we can roll it out slowly internally.	1 year ago
Craig Tiller	e6bf7c12cf	Revert "[round_robin] delegate to pick_first as per dualstack design" (#34317 ) Reverts grpc/grpc#34241	1 year ago
Mark D. Roth	97571ebf81	[round_robin] delegate to pick_first as per dualstack design (#34241 ) Rolls forward the remaining changes from #32692, which were rolled back in #33718.	1 year ago
Eugene Ostroukhov	3824288bad	[Tests] Move the http_proxy_mapper_test.cc back (#34268 )	1 year ago
Eugene Ostroukhov	a5e9feeb04	[HTTP Proxy] Rename source/header and move test (#34221 )	1 year ago
Mark D. Roth	b7e680ad46	[health checking] move to generic health watch for dualstack design (#34222 ) Rolls forward part of the dualstack changes, mostly from #33427 and a little bit from #32692, both of which were reverted in #33718. Specifically: - For petiole policies, unconditionally start health watch on subchannels, even if client side health checking is not enabled; in this case, the health watch will report the subchannel's raw connectivity state. - Fix edge cases in health check reporting that occur when a watcher is started before the initial state is reported. - When client-side health checking fails, add the subchannel's address to the RPC failure status message. - Outlier detection now works only via the health checking watch, not via the raw connectivity state watch. - Remove now-unnecessary hack to ensure that outlier detection does not work for pick_first.	1 year ago
Mark D. Roth	b8fd38d7cb	[xds_override_host] improve logging for debuggability (#34223 ) I wound up needing this to debug some problems in the dualstack code.	1 year ago
Mark D. Roth	6412412ae1	[pick_first] changes to support dualstack design (#34218 ) This rolls forward only the pick_first changes from #32692, which were rolled back in #33718. Specifically: - Changes PF to use its own subchannel list implementation instead of using the subchannel_list library, since the latter will be going away with the dualstack changes. - As a result of no longer using the subchannel_list library, PF no longer needs to set the `GRPC_ARG_INHIBIT_HEALTH_CHECKING` channel arg. - Adds an option to start a health watch on the chosen subchannel, to be used in the future when pick_first is the child of a petiole policy. (Currently, this code is not actually called anywhere.)	1 year ago
Craig Tiller	b85b57fdc7	[wrr] Add metrics to help debug high WRR cost (#34095 ) WRR is showing a very high CPU cost relative to previous solutions, and it's unclear why this is. Add two metrics that should help us see the shape of the subchannel sets that are being passed to high cost systems in order to confirm/deny theories. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Mohan Li	ab024624da	[pick_first] de-experiment pick first (#34054 ) De-experiment pick first since we have both affinity and randomness E2E test running successfully. --------- Co-authored-by: Yash Tibrewal <yashkt@google.com>	1 year ago
Mark D. Roth	64a318acd4	[pick_first] fix sticky-TF and handling of subchannels in TRANSIENT_FAILURE (#33753 ) Fix sticky-TF behavior such that once we enter TRANSIENT_FAILURE, we do not leave that state if we get a new address list. Also, fix handling of subchannels in state TRANSIENT_FAILURE. Previously, if a subchannel was already in state TRANSIENT_FAILURE when we wanted to start a connection attempt on it (e.g., because the subchannel already existed from a different channel, or because it already existed in the previous subchannel list), we would wait for it to report IDLE before attempting to connect. This PR changes pick_first to instead immediately skip the subchannel and move on to the next one. Now, the only time we wait for a subchannel in TRANSIENT_FAILURE is when we wrap back around to the first subchannel in the list.	1 year ago
Craig Tiller	3717ff04ba	[chttp2] Split ping policy from transport (#33703 ) Why: Cleanup for chttp2_transport ahead of promise conversion - lots of logic has become interleaved throughout chttp2, so some effort to isolate logic out is warranted ahead of that conversion. What: Split configuration and policy tracking for each of ping rate throttling and abuse detection into their own modules. Add tests for them. Incidentally: Split channel args into their own header so that we can split the policy stuff into separate build targets. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Mark D. Roth	083bbee480	[LB policies] revert changes for dualstack design (#33718 ) This reverts the following PRs: #32692 #33087 #33093 #33427 #33568 These changes seem to have introduced some flaky crashes. Reverting while I investigate.	1 year ago
Mark D. Roth	ec39600872	[WRR] fix bugs that caused us to re-enter blackout period upon updates (#33694 ) As per gRFC A58, when WRR sees a subchannel report READY, it reset the non_empty_since value, thus restarting the blackout period. However, there were two cases where we were incorrectly triggering this code: 1. When WRR got an updated address list that contained addresses that were already present on the old list and whose subchannels were already in READY state, the initial notification for those subchannels on the new list was READY, which incorrectly triggered resetting the non_empty_since value. 2. Due to a bug in the outlier_detection policy, whenever an update was propagated down through the OD policy without actually enabling OD, it would incorrectly send a duplicate connectivity state notification for the subchannels. This meant that a subchannel that was already in state READY would report READY again, which would also incorrectly trigger resetting the non_empty_since value. This PR makes two changes: 1. Fix the bug in outlier_detection that caused it to generate the spurious duplicate READY updates. 2. Fix WRR to reset the non_empty_since value when a subchannel goes READY only if the subchannel has seen a previous state update and only if that previous state was not READY. (The duplicate READY notifications should not actually happen anymore now that the OD policy has been fixed, but better to be defensive.) Fixes b/290983884.	1 year ago
Mark D. Roth	38816cf327	[WRR] delegate to pick_first instead of creating subchannels directly (#33087 ) As part of the dualstack backend design, change WRR to delegate to pick_first instead of creating subchannels directly.	1 year ago
Mark D. Roth	27a778fece	[round robin] delegate to pick_first instead of creating subchannels directly (#32692 ) More work on the dualstack backend design: - Change round_robin to delegate to pick_first instead of creating subchannels directly. - Change pick_first such that when it is the child of a petiole policy, it will unconditionally start a health watch. - Change the client-side health checking code such that if client-side health checking is not enabled, it will return the subchannel's raw connectivity state. - As part of this, we introduce a new endpoint_list library to be used by petiole policies, which is intended to replace the existing subchannel_list library. The only policy that will still directly interact with subchannels is pick_first, so the relevant parts of the subchannel_list functionality have been copied directly into that policy. The subchannel_list library will be removed after all petiole policies are updated to delegate to pick_first.	1 year ago

1 2 3 4 5 ...

566 Commits (61e1c030b28c3b3bc5849015cc19f2dbe01ed059)