Chiebot-Mirror/grpc

Commit Graph

Author	SHA1	Message	Date
Craig Tiller	01fb4a0fe4	[experiments] Clean up some rolled out experiments (#35195 ) Closes #35195 COPYBARA_INTEGRATE_REVIEW=https://github.com/grpc/grpc/pull/35195 from ctiller:cleanup-cleanup `1f22298ac9` PiperOrigin-RevId: 587857022	12 months ago
Craig Tiller	399fded213	Reapply "[experiments] Explicit requirement check" (#34911 ) (#34915 ) This reverts commit `b0e0659bab`. Closes #34915 COPYBARA_INTEGRATE_REVIEW=https://github.com/grpc/grpc/pull/34915 from ctiller:requires2 `8e4f033317` PiperOrigin-RevId: 583110606	1 year ago
Alisha Nanda	b0e0659bab	Revert "[experiments] Explicit requirement check" (#34911 ) Reverts grpc/grpc#34880, needs to be cherry-picked in.	1 year ago
Craig Tiller	88011e05f5	[experiments] Explicit requirement check (#34880 ) Add a config to experiments & rollouts to allow dependent experiments to be flagged. We're getting past the point where it's possible to reason about which experiments need to be turned off if we disable some other experiment, and so this provides some additional rollout safety. Can be specified in both experiments and rollouts: experiments.yaml makes the most sense and we should default to it, but rollouts.yaml lets us put dependencies between internal & external dependencies internally and that's gonna be a little useful. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
AJ Heller	64207f8b08	[EventEngine] Enable EventEngine Listener experiment for Posix builds (#34851 ) Previously disabled due to * a problem in the core/end2end http_proxy_fixture (fixed in https://github.com/grpc/grpc/pull/34838) * a race with the EventEngine in the bad_server_response test (fixed in https://github.com/grpc/grpc/pull/34816)	1 year ago
AJ Heller	1fd45fcd01	[EventEngine] Disable Posix EventEngine Listener experiment (#34793 ) There are a small handful of failures: TSAN issue: * https://source.cloud.google.com/results/invocations/f82a4be1-a38e-4a19-b22f-295ed6f7d2d2/targets/%2F%2Ftest%2Fcore%2Fend2end:bad_server_response_test@poller%3Dpoll/log Flakes: * https://source.cloud.google.com/results/invocations/a11b04d8-e0d1-4175-a7a2-6e712b9bef9b/targets/%2F%2Ftest%2Fcore%2Fend2end:cancel_with_status_test@poller%3Depoll1/tests	1 year ago
AJ Heller	4c834d4721	[EventEngine] Enable Posix EventEngine Listener on all builds (#34748 )	1 year ago
Craig Tiller	c9e362238c	[chttp2] Re-enable overload protection experiment (#34770 ) Seems like a small pause is needed for max_concurrent_streams to allow things to settle out properly with this experiment.	1 year ago
AJ Heller	872c7dbed5	[chttp2] Disable overload_protection experiment (#34767 ) This causes breakages in multiple tests. The event_engine_listener tests are particularly sensitive to the breakage.	1 year ago
AJ Heller	c806680535	[EventEngine] Enable Windows EventEngine Listener on all builds (#34436 )	1 year ago
Craig Tiller	e81d181fd7	[chttp2] Alternative protection for too many streams in the system (#34697 ) Cancel streams if we have too much concurrency on a single channel to allow that server to recover. There seems to be a convergence in the HTTP2 community about this being a reasonable thing to do, so I'd like to try it in some real scenarios. If this pans out well then I'll likely drop the `red_max_concurrent_streams` and the `rstpit` experiments in preference to this. I'm also considering tying in resource quota so that under high memory pressure we just default to this path. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	f832772d89	[chttp2] Fix behavior on max concurrent streams exceeded (#34654 ) Noticed that our behavior conflicts with what's mandated: https://www.rfc-editor.org/rfc/rfc9113.html#section-5.1.2 So here's a quick fix - I like it because it's not a disconnection, which always feels like one outage avoided. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	7c59c09f43	[chttp2] Bound write sizes based on observed write performance (#34665 ) Instead of fixing a target size for writes, try to adapt it a little to observed bandwidth. The initial algorithm tries to get large writes within 100-1000ms maximum delay - this range probably wants to be tuned, but let's see. The hope here is that on slow connections we can not back buffer so much and so when we need to send a ping-ack it's possible without great delay.	1 year ago
Craig Tiller	6a49e953a4	[chttp2] Experiments for rst_stream pushback (#34642 ) Experiment 1: On RST_STREAM: reduce MAX_CONCURRENT_STREAMS for one round trip. Experiment 2: If a settings frame is outstanding with a lower MAX_CONCURRENT_STREAMS than is configured, and we receive a new incoming stream that would exceed the new cap, randomly reject it. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	e527581a93	[chttp2] Tarpit invalid requests (#34641 ) If a request is invalid, take a random amount of time before sending the RST_STREAM, so that MAX_CONCURRENT_STREAMS remaining becomes unpredictable.	1 year ago
Craig Tiller	d94313b949	[chttp2] Limit work per read cycle (#34639 ) Cap requests per read, rst_stream handled per read. If these caps are exceeded, offload processing of the connection to a backing thread pool, and allow other connections to make progress.	1 year ago
Craig Tiller	954b285dd2	[chttp2] Limit request count before receiving settings ack (#34638 ) Previously chttp2 would allow infinite requests prior to a settings ack - as the agreed upon limit for requests in that state is infinite. Instead, after MAX_CONCURRENT_STREAMS requests have been attempted, start blanket cancelling requests until the settings ack is received. This can be done efficiently without allocating request state structures.	1 year ago
Craig Tiller	a17f08b49d	[chttp2] Continue refactoring towards promises (#34437 ) Isolate ping callback tracking to its own file. Also takes the opportunity to simplify keepalive code by applying the ping timeout to all pings. Adds an experiment to allow multiple pings outstanding too (this was originally an accidental behavior change of the work, but one that I think may be useful going forward). --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Mark D. Roth	835775e347	[pick_first] implement Happy Eyeballs (#34426 )	1 year ago
Craig Tiller	5a28bcb574	[promises] Re-enable CI for promise-based-client-call (#34466 ) We disabled this a little while ago for lack of CI bandwidth, but #34404 ought to have freed up enough capacity that we can keep running this. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
AJ Heller	15a8aebc6d	[experiment] Remove work_stealing experiment configuration (#34468 ) Cleanup after https://github.com/grpc/grpc/pull/34315	1 year ago
Yash Tibrewal	06b55bdaa4	[RegisteredMethod] Set information on initial metadata about whether method is registered or not (#34432 ) Summary - On the server-side, we are changing the point at which we decide whether a method is registered or not from the surface to the transport at the point where we are done receiving initial metadata and before we invoke the recv_initial_metadata_ready closures from the filters. The main motivation for this is to allow filters to check whether the incoming method is registered or not. The exact use-case is for observability where we only want to record the method if it is registered. We store the information about the registered method in the initial metadata. On the client-side, we also set information about whether the method is registered or not in the outgoing initial metadata. Since we are effectively changing the lookup point of the registered method, there are slight concerns of this being a potentially breaking change, so we are guarding this with an experiment to be safe. Changes - * Transport API changes - * Along with `accept_stream_fn`, a new callback `registered_method_matcher_cb` will be sent down as a transport op on the server side. When initial metadata is received on the server side, this callback is invoked. This happens before invoking the `recv_initial_metadata_ready` closure. * Metadata changes - * We add a new non-serializable metadata trait `GrpcRegisteredMethod()`. On the client-side, the value is a uintptr_t with a value of 1 if the call has a registered/known method, or 0 if it's not known. On the server side, the value is a (ChannelRegisteredMethod). This metadata information can be used throughout the stack to check whether a call is registered or not. Server Changes - * When a new transport connection is accepted, the server sets `registered_method_matcher_cb` along with `accept_stream_fn`. This function checks whether the method is registered or not and sets the RegisteredMethod matcher in the metadata for use later. * Client Changes - * Set the metadata on call creation on whether the method is registered or not.	1 year ago
Mark D. Roth	ddd4d6e318	[client_channel] don't hop into WorkSerializer to unref ConfigSelector per-call (#34399 ) Also fold the `client_channel_subchannel_wrapper_work_serializer_orphan` experiment into the `work_serializer_dispatch` experiment.	1 year ago
Mark D. Roth	25cb8e6ed2	[WRR] delegate to pick_first as per dualstack design (#34245 ) Rolls forward the changes from #33087, which were rolled back in #33718. This change is now guarded by a disablable experiment.	1 year ago
Craig Tiller	86b931c354	[work-serializer] Dispatch on run experiment (relanding) (#34372 ) Reverts grpc/grpc#34371	1 year ago
Mark D. Roth	5a4e8f3dbd	[client_channel] second attempt: SubchannelWrapper hops into WorkSerializer before destruction (#34321 ) Original PR was #34307, reverted in #34318 due to internal test failures. The first commit is a revert of the revert. The second commit contains the fix. The original idea here was that `SubchannelWrapper::Orphan()`, which is called when the strong refcount reaches 0, would take a new weak ref and then hop into the `WorkSerializer` before dropping that weak ref, thus ensuring that the `SubchannelWrapper` is destroyed inside the `WorkSerializer` (which is needed because the `SubchannelWrapper` dtor cleans up some state in the channel related to the subchannel). The problem is that `DualRefCounted<>::Unref()` itself actually increments the weak ref count before calling `Orphan()` and then decrements it afterwards. So in the case where the `SubchannelWrapper` is unreffed outside of the `WorkSerializer` and no other thread happens to be holding the `WorkSerializer`, the weak ref that we were taking in `Orphan()` was unreffed inline, which meant that it wasn't actually the last weak ref -- the last weak ref was the one taken by `DualRefCounted<>::Unref()`, and it wasn't released until after the `WorkSerializer` was released. To fix this problem, we move the code from the `SubchannelWrapper` dtor that cleans up the channel's state into the `WorkSerializer` callback that is scheduled in `Orphan()`. Thus, regardless of whether or not the last weak ref is released inside of the `WorkSerializer`, we are definitely doing that cleanup inside the `WorkSerializer`, which is what we actually care about. Also adds an experiment to guard this behavior.	1 year ago
Craig Tiller	d589caa679	Revert "[work-serializer] Dispatch on run experiment" (#34371 ) Reverts grpc/grpc#34274 (needs some changes internally)	1 year ago
Craig Tiller	1705470950	[work-serializer] Dispatch on run experiment (#34274 ) Co-authored-by: ctiller <ctiller@users.noreply.github.com> Co-authored-by: Mark D. Roth <roth@google.com>	1 year ago
Mark D. Roth	1986007e1e	[round_robin] 4th attempt: delegate to pick_first as per dualstack design (#34337 ) Most recent attempt was #34320, reverted in #34335. The first commit here is a pure revert. The second commit fixes the outlier_detection unit test to pass both with and without the experiment.	1 year ago
Mark D. Roth	6534f0a6bf	Revert "[round_robin] third attempt: delegate to pick_first as per dualstack design" (#34335 ) Reverts grpc/grpc#34320	1 year ago
Mark D. Roth	d713427cec	[round_robin] third attempt: delegate to pick_first as per dualstack design (#34320 ) Previous attempt was #34241, reverted in #34317. The second commit here makes the experiment disablable, so that we can roll it out slowly internally.	1 year ago
Craig Tiller	aed2797cd2	[chttp2] Fix inefficiency in flow control (#34265 ) In certain situations the current flow control algorithm can result in sending one flow control update write for every write sent (known situation: rollout of promise based server calls with qps_test). Fix things up so that the updates are only sent when truly needed, and then fix the fallout (turns out our fuzzer had some bugs) I've placed actual logic changes behind an experiment so that it can be incrementally & safely rolled out.	1 year ago
Craig Tiller	b478add7ec	[experiments] Remove unused experiment (#34090 ) We added this as an exploratory measure for a customer that thought they were using open census (this turned out to be emphatically false). Remove it since it's probably not how we ultimately want to do this, and wait for something better to come along. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
AJ Heller	cbfae38403	[EventEngine] Enable core end2end tests for the event_engine_client experiment (#34060 )	1 year ago
AJ Heller	03eed0f696	[EventEngine] Enable core end2end tests for the event_engine_listener experiment (#34058 ) These may be needed again to help debug remote build environment problems.	1 year ago
Craig Tiller	22b5e8d389	[experiments] Default unique metadata strings to on in OSS, disable tests (#34039 ) This is pretty robust and doesn't warrant the level of testing it's getting right now.	1 year ago
AJ Heller	3ac675b389	[EventEngine] Temporarily disable EventEngine experiments in end2end tests (#34041 ) We are not currently using the signal that these experiment tests provide, so we can speed up the CI by disabling them for now.	1 year ago
Craig Tiller	c9fe64c409	Revert "[promises] Enable promise-based calls on server side for OSS build" (#33989 ) Reverts grpc/grpc#33945	1 year ago
Craig Tiller	2feb821eb5	[promises] Enable promise-based calls on server side for OSS build (#33945 )	1 year ago
AJ Heller	c5246fc059	[EventEngine] Enable the work_stealing experiment in debug builds (#33912 ) Over the past 5 days, this experiment has not introduced any new flakes, nor increased any flake rates. Let's enable it for debug builds. To prevent issues over the weekend, I plan to merge it next week, July 31st (with announcement).	1 year ago
AJ Heller	29dd271d44	[testing] Enable end2end experiments for Windows continuous integration jobs (#32567 ) Co-authored-by: drfloob <drfloob@users.noreply.github.com>	1 year ago
Vignesh Babu	f85b7c79ee	[experiments] Fix processing of platform specific test tags (#33749 ) Also adds a unit test: experiments_tag_test which should fail if the appropriate tags are not set for it.	1 year ago
Yijie Ma	a7bf07e86a	[EventEngine] PosixEventEngine DNS Resolver (#32701 ) This PR implements a c-ares based DNS resolver for EventEngine with the reference from the original [grpc_ares_wrapper.h](../blob/master/src/core/ext/filters/client_channel/resolver/dns/c_ares/grpc_ares_wrapper.h). The PosixEventEngine DNSResolver is implemented on top of that. Tests which use the client channel resolver API ([resolver.h](../blob/master/src/core/lib/resolver/resolver.h#L54)) are ported, namely the [resolver_component_test.cc](../blob/master/test/cpp/naming/resolver_component_test.cc) and the [cancel_ares_query_test.cc](../blob/master/test/cpp/naming/cancel_ares_query_test.cc). The WindowsEventEngine DNSResolver will use the same EventEngine's grpc_ares_wrapper and will be worked on next. The [resolve_address_test.cc](https://github.com/grpc/grpc/blob/master/test/core/iomgr/resolve_address_test.cc) which uses the iomgr [DNSResolver](../blob/master/src/core/lib/iomgr/resolve_address.h#L44) API has been ported to EventEngine's dns_test.cc. That leaves only 2 tests which use iomgr's API, notably the [dns_resolver_cooldown_test.cc](../blob/master/test/core/client_channel/resolvers/dns_resolver_cooldown_test.cc) and the [goaway_server_test.cc](../blob/master/test/core/end2end/goaway_server_test.cc) which probably need to be restructured to use EventEngine DNSResolver (for one thing they override the original grpc_ares_wrapper's free functions). I will try to tackle these in the next step.	1 year ago
Craig Tiller	801f106992	[promises] Add logging_test to promise_based_server_call testing (#33774 )	1 year ago
Craig Tiller	d139c4a014	[metadata] Add an experiment to ensure a unique refcount on parsed slice strings (#33205 ) The intuition here is that these strings may end up in the hpack table, and then unnecessarily extend the lifetime of the read blocks. Instead, take a copy of these short strings when we need to and allow the incoming large memory object to be discarded. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
AJ Heller	f974491065	[EventEngine] Return work stealing experiment to CI-only (#33389 ) There is potential for a long shutdown process under load (seconds instead of milliseconds).	1 year ago
AJ Heller	28692fdcae	[EventEngine] Enable work stealing thread pool in debug builds (#33333 )	1 year ago
Vignesh Babu	d11a62e3d0	[experiments] Re-structure experiments codegen to make it more modular and re-usable (#33263 )	2 years ago
Vignesh Babu	4d85f514cb	[experiments] Split experiments into two separate experiment definition and rollout definition files (#33228 ) The PR does the following: * Splits the single experiments.yaml file into two files: experiments.yaml and rollouts.yaml. * The experiments.yaml will now only include experiment definitions. The default values of the experiments must now be specified in rollouts.yaml * Removes the 'release' default value because it is not used. * Adds an additional_constraints character string to ExperimentMetadata. * Introduces a hook in src/core/lib/experiments/config.h to allow registering arbitrary experiment constraint validation callbacks. These callbacks would take an ExperimentMetadata object as input and return the correct value to use for an experiment subject to additional constraints.	2 years ago
Craig Tiller	ad3c6273c6	[experiments] Remove flow_control_fixes experiment (#33230 ) We defaulted this on 5 months ago, and it seems to be working... let's remove the experiment bit! --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
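
As a rough illustration of the dependency idea in commit 88011e05f5 ("Explicit requirement check"), the Python sketch below shows one way a dependent experiment could be switched off when an experiment it requires is disabled. The function name `resolve_experiments`, the experiment names, and the force-disable policy are all assumptions made for illustration; the log does not say how gRPC's generated code actually enforces the check.

```python
from typing import Dict, List


def resolve_experiments(
    requested: Dict[str, bool],
    requires: Dict[str, List[str]],
) -> Dict[str, bool]:
    """Return effective experiment states (hypothetical policy).

    An experiment stays enabled only if every experiment it requires
    is also effectively enabled. Unknown experiments count as disabled.
    """
    effective = dict(requested)
    changed = True
    # Repeat until stable so that transitive dependents are disabled too.
    # Each pass can only flip True -> False, so the loop terminates.
    while changed:
        changed = False
        for name, deps in requires.items():
            if effective.get(name, False) and not all(
                effective.get(dep, False) for dep in deps
            ):
                effective[name] = False
                changed = True
    return effective


if __name__ == "__main__":
    # Hypothetical experiment names, not taken from experiments.yaml.
    requested = {"base": False, "child": True, "grandchild": True, "other": True}
    requires = {"child": ["base"], "grandchild": ["child"]}
    print(resolve_experiments(requested, requires))
```

With `base` disabled, both `child` and `grandchild` end up disabled while the unrelated `other` stays on, which is the kind of rollout safety the commit message is asking for.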

107 Commits (e497eed25169d19e8bd9a39ea6526a861ead1ec7)