Chiebot-Mirror/grpc

Commit Graph

Author	SHA1	Message	Date
Craig Tiller	7c59c09f43	[chttp2] Bound write sizes based on observed write performance (#34665 ) Instead of fixing a target size for writes, try to adapt it a little to observed bandwidth. The initial algorithm tries to get large writes within 100-1000ms maximum delay - this range probably wants to be tuned, but let's see. The hope here is that on slow connections we can not back buffer so much and so when we need to send a ping-ack it's possible without great delay.	1 year ago
Craig Tiller	6a49e953a4	[chttp2] Experiments for rst_stream pushback (#34642 ) Experiment 1: On RST_STREAM: reduce MAX_CONCURRENT_STREAMS for one round trip. Experiment 2: If a settings frame is outstanding with a lower MAX_CONCURRENT_STREAMS than is configured, and we receive a new incoming stream that would exceed the new cap, randomly reject it. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	e527581a93	[chttp2] Tarpit invalid requests (#34641 ) If a request is invalid, take a random amount of time before sending the RST_STREAM, so that MAX_CONCURRENT_STREAMS remaining becomes unpredictable.	1 year ago
Craig Tiller	d94313b949	[chttp2] Limit work per read cycle (#34639 ) Cap requests per read, rst_stream handled per read. If these caps are exceeded, offload processing of the connection to a backing thread pool, and allow other connections to make progress.	1 year ago
Craig Tiller	954b285dd2	[chttp2] Limit request count before receiving settings ack (#34638 ) Previously chttp2 would allow infinite requests prior to a settings ack - as the agreed upon limit for requests in that state is infinite. Instead, after MAX_CONCURRENT_STREAMS requests have been attempted, start blanket cancelling requests until the settings ack is received. This can be done efficiently without allocating request state structures.	1 year ago
Craig Tiller	a17f08b49d	[chttp2] Continue refactoring towards promises (#34437 ) Isolate ping callback tracking to its own file. Also takes the opportunity to simplify keepalive code by applying the ping timeout to all pings. Adds an experiment to allow multiple pings outstanding too (this was originally an accidental behavior change of the work, but one that I think may be useful going forward). --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Mark D. Roth	835775e347	[pick_first] implement Happy Eyeballs (#34426 )	1 year ago
Craig Tiller	5a28bcb574	[promises] Re-enable CI for promise-based-client-call (#34466 ) We disabled this a little while ago for lack of CI bandwidth, but #34404 ought to have freed up enough capacity that we can keep running this. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
AJ Heller	15a8aebc6d	[experiment] Remove work_stealing experiment configuration (#34468 ) Cleanup after https://github.com/grpc/grpc/pull/34315	1 year ago
Yash Tibrewal	06b55bdaa4	[RegisteredMethod] Set information on initial metadata about whether method is registered or not (#34432 ) Summary - On the server-side, we are changing the point at which we decide whether a method is registered or not from the surface to the transport at the point where we are done receiving initial metadata and before we invoke the recv_initial_metadata_ready closures from the filters. The main motivation for this is to allow filters to check whether the incoming method is registered or not. The exact use-case is for observability where we only want to record the method if it is registered. We store the information about the registered method in the initial metadata. On the client-side, we also set information about whether the method is registered or not in the outgoing initial metadata. Since we are effectively changing the lookup point of the registered method, there are slight concerns of this being a potentially breaking change, so we are guarding this with an experiment to be safe. Changes - * Transport API changes - * Along with `accept_stream_fn`, a new callback `registered_method_matcher_cb` will be sent down as a transport op on the server side. When initial metadata is received on the server side, this callback is invoked. This happens before invoking the `recv_initial_metadata_ready` closure. * Metadata changes - * We add a new non-serializable metadata trait `GrpcRegisteredMethod()`. On the client-side, the value is a uintptr_t with a value of 1 if the call has a registered/known method, or 0, if it's not known. On the server side, the value is a (ChannelRegisteredMethod). This metadata information can be used throughout the stack to check whether a call is registered or not. * Server Changes - * When a new transport connection is accepted, the server sets `registered_method_matcher_cb` along with `accept_stream_fn`. This function checks whether the method is registered or not and sets the RegisteredMethod matcher in the metadata for use later. * Client Changes - * Set the metadata on call creation on whether the method is registered or not.	1 year ago
Mark D. Roth	ddd4d6e318	[client_channel] don't hop into WorkSerializer to unref ConfigSelector per-call (#34399 ) Also fold the `client_channel_subchannel_wrapper_work_serializer_orphan` experiment into the `work_serializer_dispatch` experiment.	1 year ago
Mark D. Roth	25cb8e6ed2	[WRR] delegate to pick_first as per dualstack design (#34245 ) Rolls forward the changes from #33087, which were rolled back in #33718. This change is now guarded by a disablable experiment.	1 year ago
Craig Tiller	86b931c354	[work-serializer] Dispatch on run experiment (relanding) (#34372 ) Reverts grpc/grpc#34371	1 year ago
Mark D. Roth	5a4e8f3dbd	[client_channel] second attempt: SubchannelWrapper hops into WorkSerializer before destruction (#34321 ) Original PR was #34307, reverted in #34318 due to internal test failures. The first commit is a revert of the revert. The second commit contains the fix. The original idea here was that `SubchannelWrapper::Orphan()`, which is called when the strong refcount reaches 0, would take a new weak ref and then hop into the `WorkSerializer` before dropping that weak ref, thus ensuring that the `SubchannelWrapper` is destroyed inside the `WorkSerializer` (which is needed because the `SubchannelWrapper` dtor cleans up some state in the channel related to the subchannel). The problem is that `DualRefCounted<>::Unref()` itself actually increments the weak ref count before calling `Orphan()` and then decrements it afterwards. So in the case where the `SubchannelWrapper` is unreffed outside of the `WorkSerializer` and no other thread happens to be holding the `WorkSerializer`, the weak ref that we were taking in `Orphan()` was unreffed inline, which meant that it wasn't actually the last weak ref -- the last weak ref was the one taken by `DualRefCounted<>::Unref()`, and it wasn't released until after the `WorkSerializer` was released. To fix this problem, we move the code from the `SubchannelWrapper` dtor that cleans up the channel's state into the `WorkSerializer` callback that is scheduled in `Orphan()`. Thus, regardless of whether or not the last weak ref is released inside of the `WorkSerializer`, we are definitely doing that cleanup inside the `WorkSerializer`, which is what we actually care about. Also adds an experiment to guard this behavior.	1 year ago
Craig Tiller	d589caa679	Revert "[work-serializer] Dispatch on run experiment" (#34371 ) Reverts grpc/grpc#34274 (needs some changes internally)	1 year ago
Craig Tiller	1705470950	[work-serializer] Dispatch on run experiment (#34274 ) Co-authored-by: ctiller <ctiller@users.noreply.github.com> Co-authored-by: Mark D. Roth <roth@google.com>	1 year ago
Mark D. Roth	1986007e1e	[round_robin] 4th attempt: delegate to pick_first as per dualstack design (#34337 ) Most recent attempt was #34320, reverted in #34335. The first commit here is a pure revert. The second commit fixes the outlier_detection unit test to pass both with and without the experiment.	1 year ago
Mark D. Roth	6534f0a6bf	Revert "[round_robin] third attempt: delegate to pick_first as per dualstack design" (#34335 ) Reverts grpc/grpc#34320	1 year ago
Mark D. Roth	d713427cec	[round_robin] third attempt: delegate to pick_first as per dualstack design (#34320 ) Previous attempt was #34241, reverted in #34317. The second commit here makes the experiment disablable, so that we can roll it out slowly internally.	1 year ago
Craig Tiller	aed2797cd2	[chttp2] Fix inefficiency in flow control (#34265 ) In certain situations the current flow control algorithm can result in sending one flow control update write for every write sent (known situation: rollout of promise based server calls with qps_test). Fix things up so that the updates are only sent when truly needed, and then fix the fallout (turns out our fuzzer had some bugs) I've placed actual logic changes behind an experiment so that it can be incrementally & safely rolled out.	1 year ago
Craig Tiller	b478add7ec	[experiments] Remove unused experiment (#34090 ) We added this as an exploratory measure for a customer that thought they were using open census (this turned out to be emphatically false). Remove it since it's probably not how we ultimately want to do this, and wait for something better to come along. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
AJ Heller	cbfae38403	[EventEngine] Enable core end2end tests for the event_engine_client experiment (#34060 )	1 year ago
AJ Heller	03eed0f696	[EventEngine] Enable core end2end tests for the event_engine_listener experiment (#34058 ) These may be needed again to help debug remote build environment problems.	1 year ago
Craig Tiller	22b5e8d389	[experiments] Default unique metadata strings to on in OSS, disable tests (#34039 ) This is pretty robust and doesn't warrant the level of testing it's getting right now.	1 year ago
AJ Heller	3ac675b389	[EventEngine] Temporarily disable EventEngine experiments in end2end tests (#34041 ) We are not currently using the signal that these experiment tests provide, so we can speed up the CI by disabling them for now.	1 year ago
Craig Tiller	c9fe64c409	Revert "[promises] Enable promise-based calls on server side for OSS build" (#33989 ) Reverts grpc/grpc#33945	1 year ago
Craig Tiller	2feb821eb5	[promises] Enable promise-based calls on server side for OSS build (#33945 )	1 year ago
AJ Heller	c5246fc059	[EventEngine] Enable the work_stealing experiment in debug builds (#33912 ) Over the past 5 days, this experiment has not introduced any new flakes, nor increased any flake rates. Let's enable it for debug builds. To prevent issues over the weekend, I plan to merge it next week, July 31st (with announcement).	1 year ago
AJ Heller	29dd271d44	[testing] Enable end2end experiments for Windows continuous integration jobs (#32567 ) Co-authored-by: drfloob <drfloob@users.noreply.github.com>	1 year ago
Vignesh Babu	f85b7c79ee	[experiments] Fix processing of platform specific test tags (#33749 ) Also adds a unit test: experiments_tag_test which should fail if the appropriate tags are not set for it.	1 year ago
Yijie Ma	a7bf07e86a	[EventEngine] PosixEventEngine DNS Resolver (#32701 ) This PR implements a c-ares based DNS resolver for EventEngine with the reference from the original [grpc_ares_wrapper.h](../blob/master/src/core/ext/filters/client_channel/resolver/dns/c_ares/grpc_ares_wrapper.h). The PosixEventEngine DNSResolver is implemented on top of that. Tests which use the client channel resolver API ([resolver.h](../blob/master/src/core/lib/resolver/resolver.h#L54)) are ported, namely the [resolver_component_test.cc](../blob/master/test/cpp/naming/resolver_component_test.cc) and the [cancel_ares_query_test.cc](../blob/master/test/cpp/naming/cancel_ares_query_test.cc). The WindowsEventEngine DNSResolver will use the same EventEngine's grpc_ares_wrapper and will be worked on next. The [resolve_address_test.cc](https://github.com/grpc/grpc/blob/master/test/core/iomgr/resolve_address_test.cc) which uses the iomgr [DNSResolver](../blob/master/src/core/lib/iomgr/resolve_address.h#L44) API has been ported to EventEngine's dns_test.cc. That leaves only 2 tests which use iomgr's API, notably the [dns_resolver_cooldown_test.cc](../blob/master/test/core/client_channel/resolvers/dns_resolver_cooldown_test.cc) and the [goaway_server_test.cc](../blob/master/test/core/end2end/goaway_server_test.cc) which probably need to be restructured to use EventEngine DNSResolver (for one thing they override the original grpc_ares_wrapper's free functions). I will try to tackle these in the next step.	1 year ago
Craig Tiller	801f106992	[promises] Add logging_test to promise_based_server_call testing (#33774 )	1 year ago
Craig Tiller	d139c4a014	[metadata] Add an experiment to ensure a unique refcount on parsed slice strings (#33205 ) The intuition here is that these strings may end up in the hpack table, and then unnecessarily extend the lifetime of the read blocks. Instead, take a copy of these short strings when we need to and allow the incoming large memory object to be discarded. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
AJ Heller	f974491065	[EventEngine] Return work stealing experiment to CI-only (#33389 ) There is potential for a long shutdown process under load (seconds instead of milliseconds).	2 years ago
AJ Heller	28692fdcae	[EventEngine] Enable work stealing thread pool in debug builds (#33333 )	2 years ago
Vignesh Babu	d11a62e3d0	[experiments] Re-structure experiments codegen to make it more modular and re-usable (#33263 )	2 years ago
Vignesh Babu	4d85f514cb	[experiments] Split experiments into two separate experiment definition and rollout definition files (#33228 ) The PR does the following: * Splits the single experiments.yaml file into two files: experiments.yaml and rollouts.yaml. * The experiments.yaml will now only include experiment definitions. The default values of the experiments must now be specified in rollouts.yaml * Removes the 'release' default value because it is not used. * Adds an additional_constraints character string to ExperimentMetadata. * Introduces a hook in src/core/lib/experiments/config.h to allow registering arbitrary experiment constraint validation callbacks. These callbacks would take an ExperimentMetadata object as input and return the correct value to use for an experiment subject to additional constraints.	2 years ago
Craig Tiller	ad3c6273c6	[experiments] Remove flow_control_fixes experiment (#33230 ) We defaulted this on 5 months ago, and it seems to be working... let's remove the experiment bit! --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
AJ Heller	b225083b34	[test] Re-enable work stealing thread pool experiment (#33213 ) I have not been able to reproduce the non-empty pool @ shutdown bug in around 200k runs of various kinds. Now that experiments are marked flaky by default, any similar failures should not block PR submission, and this will give me good signal if the bugs reproduce more frequently in the CI environment. I have a fix in theory, but I don't think it should be necessary. If the bug reproduces, I'll try the fix.	2 years ago
Craig Tiller	8a8f1eba4b	[promises] Enable server promise calls in C++ e2e tests (#33097 ) #thistimeforsure `a863532c62` adds some debug to help track which batches get leaked by a transport `3203e75ec5` makes connected_channel respect the high level intent of cancellation better (and fixes the last reason we needed to turn these tests off) `aaf5fa036b` re-enables testing of c++ e2e tests with server based promise calls	2 years ago
Vignesh Babu	b8a6b4267d	Revert "[test] Disable end2end tests for the EventEngine listener experiment" (#33100 ) Reverts grpc/grpc#32968	2 years ago
AJ Heller	7b74b07885	[experiment] Disable work stealing end2end test experiments (#33081 ) A known bug is being worked on. This will temporarily reduce flaky test noise.	2 years ago
AJ Heller	3fb738b9b1	[EventEngine] Implement work-stealing in the EventEngine ThreadPool (#32869 ) This PR implements a work-stealing thread pool for use inside EventEngine implementations. Because of historical risks here, I've guarded the new implementation behind an experiment flag: `GRPC_EXPERIMENTS=work_stealing`. Current default behavior is the original thread pool implementation. Benchmarks look very promising: ``` bazel test \ --test_timeout=300 \ --config=opt -c opt \ --test_output=streamed \ --test_arg='--benchmark_format=csv' \ --test_arg='--benchmark_min_time=0.15' \ --test_arg='--benchmark_filter=_FanOut' \ --test_arg='--benchmark_repetitions=15' \ --test_arg='--benchmark_report_aggregates_only=true' \ test/cpp/microbenchmarks:bm_thread_pool ``` 2023-05-04: `bm_thread_pool` benchmark results on my local machine (64 core ThreadRipper PRO 3995WX, 256GB memory), comparing this PR to master: ![image](https://user-images.githubusercontent.com/295906/236315252-35ed237e-7626-486c-acfa-71a36f783d22.png) 2023-05-04: `bm_thread_pool` benchmark results in the Linux RBE environment (unsure of machine configuration, likely small), comparing this PR to master. ![image](https://user-images.githubusercontent.com/295906/236317164-2c5acbeb-fdac-4737-9b2d-4df9c41cb825.png) --------- Co-authored-by: drfloob <drfloob@users.noreply.github.com>	2 years ago
Craig Tiller	0e7cc360eb	[promises] Disable C++ e2e tests with server_promise_based_call for now (#33008 )	2 years ago
Craig Tiller	ad41fe96b6	[promises] Re-enable C++ end2end tests (with fixes) (#32837 ) Makes some awkward fixes to compression filter, call, connected channel to hold the semantics we have upheld now in tests. Once the fixes described here https://github.com/grpc/grpc/blob/master/src/core/lib/channel/connected_channel.cc#L636 are in this gets a lot less ad-hoc, but that's likely going to be post-landing promises client & server side. We specifically need special handling for server side cancellation in response to reads wrt the inproc transport - which doesn't track cancellation thoroughly enough itself. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
AJ Heller	e5725e4730	[test] Disable end2end tests for the EventEngine listener experiment (#32968 ) Sanitizer issues found MSAN: https://source.cloud.google.com/results/invocations/f1667575-9397-41e9-ae8f-8560ddac9187/targets/%2F%2Ftest%2Fcore%2Fend2end:core_end2end_tests@poller%3Dpoll@experiment%3Devent_engine_listener/log ASAN: https://source.cloud.google.com/results/invocations/651d71a5-0676-4f0d-a109-531abe5d218e/targets/%2F%2Ftest%2Fcore%2Fend2end:core_end2end_tests@poller%3Depoll1@experiment%3Devent_engine_listener/log	2 years ago
AJ Heller	a9afd1cde8	[test] Re-land: Enable EventEngine experiments for Posix end2end tests (#32948 ) Relands #32844. End2end tests will now wait for the default EventEngine to shut down between tests. This should avoid some use-after-frees and leaks.	2 years ago
AJ Heller	ca92648aa3	Revert "[test] Enable EventEngine experiments for Posix end2end tests." (#32855 ) Reverts grpc/grpc#32844. CI revealed multiple EventEngine issues overnight.	2 years ago
AJ Heller	b16bf18bc3	[test] Enable EventEngine experiments for Posix end2end tests. (#32844 ) This enables the EventEngine experiments in end2end tests, excluding the ResourceQuota tests which have known failures. Some Windows tests are hanging, so they will be enabled later. --------- Co-authored-by: drfloob <drfloob@users.noreply.github.com>	2 years ago
Craig Tiller	4a4e2889a1	[promises] Disable C++ e2e tests with server_promise_based_call for now (#32831 )	2 years ago
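
For readers who want a concrete picture of the behavior described in 954b285dd2 ("Limit request count before receiving settings ack"), here is a minimal sketch of the idea only. It is not code from the gRPC tree: the class `PreAckStreamLimiter` and its method names are hypothetical, and the real chttp2 transport tracks this state differently. The assumption is simply that a counter is enough to decide whether to reject a stream, which is consistent with the commit's note that no request state needs to be allocated.

```cpp
#include <cstdint>

// Hypothetical helper (not part of gRPC): decides whether a new incoming
// stream is admitted while our SETTINGS frame is still unacknowledged.
class PreAckStreamLimiter {
 public:
  explicit PreAckStreamLimiter(uint32_t max_concurrent_streams)
      : max_concurrent_streams_(max_concurrent_streams) {}

  // Called once the peer acknowledges our SETTINGS frame.
  void OnSettingsAck() { settings_acked_ = true; }

  // Returns true if the stream may proceed, false if it should be
  // cancelled right away. Only a counter is touched on either path.
  bool AdmitNewStream() {
    // After the ack, the normal negotiated limit applies (enforced elsewhere).
    if (settings_acked_) return true;
    // Before the ack, stop admitting once the configured cap was attempted.
    if (attempted_before_ack_ >= max_concurrent_streams_) return false;
    ++attempted_before_ack_;
    return true;
  }

 private:
  const uint32_t max_concurrent_streams_;
  uint32_t attempted_before_ack_ = 0;
  bool settings_acked_ = false;
};
```

With a cap of 2, the first two calls to `AdmitNewStream()` return true and later calls return false until `OnSettingsAck()` has been called.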


95 Commits (a54c7f7266934a567c92d50f42c3a6eda40c2a91)