Chiebot-Mirror/grpc - grpc - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Craig Tiller	accc1688a8	[build] Exclude some e2e suites from experiments tests (#34404 ) We have a bunch of experiments testing against core e2e - and this is good for robustness, bad for CI times. We also have a bunch of marginal but overall necessary fixtures in the e2e suites - again good for robustness, bad for CI times. We can eliminate some of the cross product though, and I think safely: run experiments on a broad range of suites, but not ALL the suites, and get a bunch of our CI time back. Here I introduce an environment variable: `GRPC_CI_EXPERIMENTS` that's set when running bazel @experiment= configs, cleared otherwise (so we can still execute those tests directly when necessary). When that env var is set we filter out a bunch of suites from the test configurations.	1 year ago
Craig Tiller	e6359c34a4	[fuzzing] Extend deadline to fix fuzzer failure (#34389 )	1 year ago
Craig Tiller	c0155b4188	[experiments] Make codegen more merge friendly (#34393 ) Remove the explicit numbering that's hostile to source code merge tools.	1 year ago
Craig Tiller	47306d78f4	[work-serializer] Add some basic process-wide monitoring (#34369 ) Add some basic metrics to work serializer, keep them process wide for now (though it may be interesting to get these into channelz in the future). Collected are: - time spent running a work serializer when it starts - time spent actually executing work when the work serializer runs - number of items executed each run A high disparity between the first two indicates our dispatching mechanism is adding large amounts of latency (perhaps due to thread starvation like effects). A high value for any of these indicates contention on the serializer. It's likely a future iteration on these will select different metrics - I'm not entirely sure which will be useful in production analysis yet. I'm using `std::chrono::steady_clock` here for precision (nanoseconds) with a compact representation (better than timespec) and a robust & portable api - I think it's appropriate for metrics, but wouldn't use it much beyond that at this point.	1 year ago
Mark D. Roth	25cb8e6ed2	[WRR] delegate to pick_first as per dualstack design (#34245 ) Rolls forward the changes from #33087, which were rolled back in #33718. This change is now guarded by a disablable experiment.	1 year ago
Craig Tiller	4cfa676045	[combiner] Add a force-offload mechanism (#34377 ) Add a mechanism to allow the transport to force an offload when it knows that's appropriate.	1 year ago
Craig Tiller	86b931c354	[work-serializer] Dispatch on run experiment (relanding) (#34372 ) Reverts grpc/grpc#34371	1 year ago
AJ Heller	2467562e4b	[EventEngine] Delete OriginalThreadPool, remove work_stealing experiment (#34315 ) This has been stable for a bit, everywhere that the EventEngine is enabled. Going forward, I think the event_engine_{client|listener} experiments can probably be used to regulate thread-pool-specific issues. --------- Co-authored-by: drfloob <drfloob@users.noreply.github.com>	1 year ago
Craig Tiller	d589caa679	Revert "[work-serializer] Dispatch on run experiment" (#34371 ) Reverts grpc/grpc#34274 (needs some changes internally)	1 year ago
Craig Tiller	1705470950	[work-serializer] Dispatch on run experiment (#34274 ) Co-authored-by: ctiller <ctiller@users.noreply.github.com> Co-authored-by: Mark D. Roth <roth@google.com>	1 year ago
Esun Kim	9a7ecfad00	[Fix] Added missing #include (#34359 ) One more missing #include	1 year ago
nanahpang	a4ac80c394	Revert "[Security] Move ownership of tsi_ssl_client_handshaker_factory to grpc_ssl_credentials." (#34355 ) Reverts grpc/grpc#34180	1 year ago
Gregory Cooke	36dc5e7391	[Security] Move ownership of tsi_ssl_client_handshaker_factory to grpc_ssl_credentials. (#34180 ) Move the SSL_CTX to the level of the credentials rather than the subchannel. The SSL_CTX should only get created once per credential rather than once per subchannel. We should observe no behavior change with this PR, only efficiency gains.	1 year ago
Mark D. Roth	1986007e1e	[round_robin] 4th attempt: delegate to pick_first as per dualstack design (#34337 ) Most recent attempt was #34320, reverted in #34335. The first commit here is a pure revert. The second commit fixes the outlier_detection unit test to pass both with and without the experiment.	1 year ago
Esun Kim	1d55e8dd88	[Fix] Added missing #include (#34339 ) To fix the following build error with the head of abseil ``` /var/local/git/grpc/test/core/tsi/ssl_transport_security_utils_test.cc:231:42: error: no member named 'StrCat' in namespace 'absl' return absl::InternalError(absl::StrCat("Client error:", client_err)); ~~~~~~^ /var/local/git/grpc/test/core/tsi/ssl_transport_security_utils_test.cc:238:42: error: no member named 'StrCat' in namespace 'absl' return absl::InternalError(absl::StrCat("Server error:", server_err)); ~~~~~~^ ```	1 year ago
Mark D. Roth	6534f0a6bf	Revert "[round_robin] third attempt: delegate to pick_first as per dualstack design" (#34335 ) Reverts grpc/grpc#34320	1 year ago
Mark D. Roth	d713427cec	[round_robin] third attempt: delegate to pick_first as per dualstack design (#34320 ) Previous attempt was #34241, reverted in #34317. The second commit here makes the experiment disablable, so that we can roll it out slowly internally.	1 year ago
Craig Tiller	a9bf741735	[fuzzing] Make it easier for fuzzers to find experiments (#34296 ) The previous approach of generating strings was not converging well. Instead, load a bitfield from the protobuf and use the bits to select experiments. The fuzzers can explore this space swiftly. Downside is that as experiments rotate in/out the corpus gets a bit messed up, but I'm reasonably confident we'll recover quickly. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Romain Geissler @ Amadeus	d0d826750f	[C++] Stop using std::aligned_storage. (#34110 ) Indeed this is now deprecated since C++23. Fix #32848.	1 year ago
Craig Tiller	e6bf7c12cf	Revert "[round_robin] delegate to pick_first as per dualstack design" (#34317 ) Reverts grpc/grpc#34241	1 year ago
Craig Tiller	3e3c828f91	[fuzzing] Add TickUntil variants to FuzzingEventEngine (#34308 ) Sometimes we just want to wait until a specific time	1 year ago
Mark D. Roth	97571ebf81	[round_robin] delegate to pick_first as per dualstack design (#34241 ) Rolls forward the remaining changes from #32692, which were rolled back in #33718.	1 year ago
Craig Tiller	768a224711	[fuzzing] Extend deadline to fix fuzzer failure (#34301 )	1 year ago
Eugene Ostroukhov	3d1f242abe	[Session Affinity] Update validation and add a test case (#34277 )	1 year ago
Yash Tibrewal	b388a7e250	[Core Config] Use absl::Invocable instead of std::function (#34282 ) Splitting off from https://github.com/grpc/grpc/pull/34273	1 year ago
Craig Tiller	aed2797cd2	[chttp2] Fix inefficiency in flow control (#34265 ) In certain situations the current flow control algorithm can result in sending one flow control update write for every write sent (known situation: rollout of promise based server calls with qps_test). Fix things up so that the updates are only sent when truly needed, and then fix the fallout (turns out our fuzzer had some bugs) I've placed actual logic changes behind an experiment so that it can be incrementally & safely rolled out.	1 year ago
Eugene Ostroukhov	3824288bad	[Tests] Move the http_proxy_mapper_test.cc back (#34268 )	1 year ago
Craig Tiller	5bab2976c4	[max-age] Add jitter to max idle, use absl bitgen for rng (#34225 ) Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Mark D. Roth	6dea42c874	[XdsClient] replace e2e test with unit test (#34258 ) This should address one of the failures we're seeing in #34224. The test failure is caused by the changes in timing triggering a race condition. In the code at head, we delay sending out the subscription for the first CDS watch until we've already seen the other two CDS watches, because the previous send_message op has not yet completed, and by the time it does, we've seen all 3 watches, so we can send a subscription for all 3 at the same time. With the WorkSerializer change, the send_message op is complete by the time we see the first CDS watch, so we subscribe to only that resource, and then later add the other two. The result is that we'll NACK twice with two different messages, the first one including only the error about the first resource, and the second one including all three. I suspect this same race condition would have been triggered eventually by the EventEngine migration anyway; the current test basically depends on the single-thread timing of the iomgr approach. So I'm addressing it by replacing the e2e test with a unit test that covers the same cases without the timing issue.	1 year ago
Eugene Ostroukhov	a5e9feeb04	[HTTP Proxy] Rename source/header and move test (#34221 )	1 year ago
Mark D. Roth	b7e680ad46	[health checking] move to generic health watch for dualstack design (#34222 ) Rolls forward part of the dualstack changes, mostly from #33427 and a little bit from #32692, both of which were reverted in #33718. Specifically: - For petiole policies, unconditionally start health watch on subchannels, even if client side health checking is not enabled; in this case, the health watch will report the subchannel's raw connectivity state. - Fix edge cases in health check reporting that occur when a watcher is started before the initial state is reported. - When client-side health checking fails, add the subchannel's address to the RPC failure status message. - Outlier detection now works only via the health checking watch, not via the raw connectivity state watch. - Remove now-unnecessary hack to ensure that outlier detection does not work for pick_first.	1 year ago
Craig Tiller	b38bb68e80	[chttp2] Review feedback for new framing layer (#34179 ) Missed your comments on #33692 before merging, so here's some updates.	1 year ago
Mark D. Roth	b8fd38d7cb	[xds_override_host] improve logging for debuggability (#34223 ) I wound up needing this to debug some problems in the dualstack code.	1 year ago
Mark D. Roth	6412412ae1	[pick_first] changes to support dualstack design (#34218 ) This rolls forward only the pick_first changes from #32692, which were rolled back in #33718. Specifically: - Changes PF to use its own subchannel list implementation instead of using the subchannel_list library, since the latter will be going away with the dualstack changes. - As a result of no longer using the subchannel_list library, PF no longer needs to set the `GRPC_ARG_INHIBIT_HEALTH_CHECKING` channel arg. - Adds an option to start a health watch on the chosen subchannel, to be used in the future when pick_first is the child of a petiole policy. (Currently, this code is not actually called anywhere.)	1 year ago
Craig Tiller	be22006bc1	[resource_quota] Add a mechanism to query all of the memory quotas in the system (#34169 ) Pre-req for adding observability for this stuff	1 year ago
Craig Tiller	79a983472c	[promises] Client channel promise conversion (#33210 ) --------- Co-authored-by: Mark D. Roth <roth@google.com> Co-authored-by: markdroth <markdroth@users.noreply.github.com> Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	a749d07acf	[promises] Add an unbuffered/immediate send to mpsc for cancellations (#34208 )	1 year ago
Craig Tiller	60c6b6bb3b	[promises] Inter-activity pipe (#34188 ) Pipe-like type (has a send end, a receive end, and a closing mechanism) for cross-activity transfers. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
apolcyn	a35f282d58	[c-ares] fix spin loop bug when c-ares gives up on a socket that still has data left in its read buffer (#34185 ) If we get a readable event on an fd and both of the following happen: - c-ares does not read all bytes off the fd - c-ares removes the fd from the set ARES_GETSOCK_READABLE ... then we have a busy loop here, where we'd keep asking c-ares to process an fd that it no longer cares about. This is indirectly related to a change in this code one month ago: https://github.com/grpc/grpc/pull/33942 - before that change, c-ares would close the socket when it called [handle_error](`7f3262312f/src/lib/ares_process.c (L707)`) and so `IsFdStillReadableLocked` would start returning `false`, causing us to get away with [this loop](`f6a994229e/src/core/ext/filters/client_channel/resolver/dns/c_ares/grpc_ares_wrapper.cc (L371)`). Now, because `IsFdStillReadableLocked` will keep returning true (because of our overridden `close` API), we'll loop forever.	1 year ago
Esun Kim	1a1124903c	[Deps] Upgrade Protobuf and Upb to 24.x (#34123 ) On top of https://github.com/grpc/grpc/pull/34120	1 year ago
Craig Tiller	f6fd5172ad	[fuzzing] Increase deadline, fix b/296076392, b/296712500 (#34176 )	1 year ago
Craig Tiller	2b0d2a7115	[fuzzing] Tune deadline parameters to avoid fuzzing crash (#34174 ) fix for b/295670348	1 year ago
Craig Tiller	12c9748134	[chttp2] New framing layer (#33692 ) Building out a new framing layer for chttp2. The central idea here is to have the framing layer be solely responsible for serialization of frames, and their deserialization - the framing layer can reject frames that have invalid syntax - but the enacting of what that frame means is left to a higher layer. This class will become foundational for the promise conversion of chttp2 - by eliminating action from the parsing of frames we can reuse this sensitive code. Right now the new layer is inactive - there's a test that exercises it relatively well, and not much more. In the next PRs I'll add an experiment to enable using this layer or the existing code in the writing and reading paths. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Mark D. Roth	de98c1c9ad	[xDS] ref-count xDS resources instead of copying them (#34111 )	1 year ago
nanahpang	9907d94da4	[chaotic-good] Client transport write path (#33876 ) This is the initial implementation of the chaotic-good client transport write path. There will be a follow-up PR to fulfill the read path.	1 year ago
Craig Tiller	6c86e79c95	[promises] Fix handling of `absl::Status` by `TrySeq` (#34162 )	1 year ago
Gregory Cooke	e9c2feb788	[Testing] Disable failing OpenSSL Test (#34131 ) We enabled OpenSSL3 testing with #31256 and missed a failing test. It wasn't running before, so this isn't a regression - disabling it so master doesn't fail while we figure out how to fix it.	1 year ago
jrandolf	3489b6304e	[OpenSSL] Support for OpenSSL 3 (#31256 ) Update from gtcooke94: This PR adds support to build gRPC and its tests with OpenSSL3. There were some hiccups with tests as the tests with openssl haven't been built or exercised in a few months, so they needed some work to fix. Right now I expect all test files to pass except the following: - h2_ssl_cert_test - ssl_transport_security_utils_test I confirmed locally that these tests fail with OpenSSL 1.1.1 as well, thus we are at least not introducing regressions. Thus, I've added compiler directives around these tests so they only build when using BoringSSL. --------- Co-authored-by: Gregory Cooke <gregorycooke@google.com> Co-authored-by: Esun Kim <veblush@google.com>	1 year ago
Mark D. Roth	1e818c98bb	[client_channel] ensure that subchannels are always destroyed inside the WorkSerializer (#34077 ) - add debug-only `WorkSerializer::IsRunningInWorkSerializer()` method and use it in client_channel to verify that subchannels are destroyed in the `WorkSerializer` - note: this mechanism uses `std::thread::id`, so I had to exclude work_serializer.cc from the core_banned_constructs check - fix `WorkSerializer::Run()` to unref the callback before releasing ownership of the `WorkSerializer`, so that any refs captured by the `std::function<>` will be released before releasing ownership - fix the WRR timer callback to hop into the `WorkSerializer` to release its ref to the picker, since that transitively releases refs to subchannels - fix subchannel connectivity state notifications to unref the watcher inside the `WorkSerializer`, since the watcher often transitively holds refs to subchannels	1 year ago
AJ Heller	108af0a94f	[EventEngine] Improve lock contention in WorkStealingThreadPool (alternative) (#34065 ) Proposed alternative to https://github.com/grpc/grpc/pull/34024. This version has a simpler, faster busy-count implementation based on a sharded set of atomic counts: fast increment/decrement operations, relatively slower summation of total counts (which need to happen much less frequently).	1 year ago


8013 Commits (113dbf518389ecc643ec6025deaf68b6def7373e)