Chiebot-Mirror/grpc - grpc - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Mark D. Roth	41f26de3b6	Revert "[resolver and LB policy APIs] change address list to support multiple addresses per endpoint" (#34527 ) Reverts grpc/grpc#33567 due to import problems.	1 year ago
Mark D. Roth	fd2e8c9462	[resolver and LB policy APIs] change address list to support multiple addresses per endpoint (#33567 ) More changes as part of the dualstack design: - Change resolver and LB policy APIs to support multiple addresses per endpoint. Specifically, replace `ServerAddress` with `EndpointAddresses`, which encodes one or more addresses. Per-address channel args are retained at the same level, so they are now per-endpoint. For now, `EndpointAddresses` provides a single-address ctor and a single-address accessor for backward compatibility, and `ServerAddress` is an alias for `EndpointAddresses`; eventually, this alias and the single-address methods will be removed. - Add an `EndpointAddressSet` class, which represents an unordered set of addresses to be used as a map key. This will be used in a number of LB policies that need to store per-endpoint state. - Change the LB policy API's `ChannelControlHelper::CreateSubchannel()` method to take the address and per-endpoint channel args as separate parameters, so that we don't need to construct a legacy `ServerAddress` object as we create a new subchannel for each address in the endpoint. - Change pick_first to flatten the address list. - Change ring_hash to use `EndpointAddressSet` as the key for its endpoint map, and to use the first address of the endpoint as the hash key. - Change WRR to use `EndpointAddressSet` as the key for its endpoint weight map. Note that support for multiple addresses per endpoint is guarded in RR by the existing `round_robin_delegate_to_pick_first` experiment and in WRR by the existing `wrr_delegate_to_pick_first` experiment. This PR does not include support for multiple addresses per endpoint for the outlier_detection or xds_override_host LB policies; those will come in subsequent PRs.	1 year ago
AJ Heller	3707b42bec	[reland][EventEngine] Move combiner executor usage to EventEngine (#34396 ) Relands #31713	1 year ago
Craig Tiller	accc1688a8	[build] Exclude some e2e suites from experiments tests (#34404 ) We have a bunch of experiments testing against core e2e - and this is good for robustness, bad for CI times. We also have a bunch of marginal but overall necessary fixtures in the e2e suites - again good for robustness, bad for CI times. We can eliminate some of the cross product though, and I think safely: run experiments on a broad range of suites, but not ALL the suites, and get a bunch of our CI time back. Here I introduce an environment variable: `GRPC_CI_EXPERIMENTS` that's set when running bazel @experiment= configs, cleared otherwise (so we can still execute those tests directly when necessary). When that env var is set we filter out a bunch of suites from the test configurations.	1 year ago
Craig Tiller	e6359c34a4	[fuzzing] Extend deadline to fix fuzzer failure (#34389 )	1 year ago
Craig Tiller	86b931c354	[work-serializer] Dispatch on run experiment (relanding) (#34372 ) Reverts grpc/grpc#34371	1 year ago
Craig Tiller	d589caa679	Revert "[work-serializer] Dispatch on run experiment" (#34371 ) Reverts grpc/grpc#34274 (needs some changes internally)	1 year ago
Craig Tiller	1705470950	[work-serializer] Dispatch on run experiment (#34274 ) Co-authored-by: ctiller <ctiller@users.noreply.github.com> Co-authored-by: Mark D. Roth <roth@google.com>	1 year ago
Craig Tiller	a9bf741735	[fuzzing] Make it easier for fuzzers to find experiments (#34296 ) The previous approach of generating strings was not converging well. Instead, load a bitfield from the protobuf and use the bits to select experiments. The fuzzers can explore this space swiftly. Downside is that as experiments rotate in/out the corpus gets a bit messed up, but I'm reasonably confident we'll recover quickly. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	768a224711	[fuzzing] Extend deadline to fix fuzzer failure (#34301 )	1 year ago
Craig Tiller	79a983472c	[promises] Client channel promise conversion (#33210 ) --------- Co-authored-by: Mark D. Roth <roth@google.com> Co-authored-by: markdroth <markdroth@users.noreply.github.com> Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	f6fd5172ad	[fuzzing] Increase deadline, fix b/296076392, b/296712500 (#34176 )	1 year ago
Craig Tiller	2b0d2a7115	[fuzzing] Tune deadline parameters to avoid fuzzing crash (#34174 ) fix for b/295670348	1 year ago
jrandolf	3489b6304e	[OpenSSL] Support for OpenSSL 3 (#31256 ) Update from gtcooke94: This PR adds support to build gRPC and its tests with OpenSSL3. There were some hiccups with tests as the tests with openssl haven't been built or exercised in a few months, so they needed some work to fix. Right now I expect all test files to pass except the following: - h2_ssl_cert_test - ssl_transport_security_utils_test I confirmed locally that these tests fail with OpenSSL 1.1.1 as well, thus we are at least not introducing regressions. Thus, I've added compiler directives around these tests so they only build when using BoringSSL. --------- Co-authored-by: Gregory Cooke <gregorycooke@google.com> Co-authored-by: Esun Kim <veblush@google.com>	1 year ago
Mark D. Roth	64a318acd4	[pick_first] fix sticky-TF and handling of subchannels in TRANSIENT_FAILURE (#33753 ) Fix sticky-TF behavior such that once we enter TRANSIENT_FAILURE, we do not leave that state if we get a new address list. Also, fix handling of subchannels in state TRANSIENT_FAILURE. Previously, if a subchannel was already in state TRANSIENT_FAILURE when we wanted to start a connection attempt on it (e.g., because the subchannel already existed from a different channel, or because it already existed in the previous subchannel list), we would wait for it to report IDLE before attempting to connect. This PR changes pick_first to instead immediately skip the subchannel and move on to the next one. Now, the only time we wait for a subchannel in TRANSIENT_FAILURE is when we wrap back around to the first subchannel in the list.	1 year ago
Craig Tiller	2e2f5c9ba6	[fuzzer] Fix another deadline exceeded case (#34015 )	1 year ago
Craig Tiller	8e18f1c1df	[fuzzer] Fix another deadline exceeded case (#34014 )	1 year ago
Craig Tiller	ad9e8f45eb	[end2end] Fix fuzzer found crash (#34004 ) Fixes b/293789128	1 year ago
Craig Tiller	862e6d0346	[fuzzing] Increase deadline, fix b/292258333 (#33877 )	1 year ago
Craig Tiller	1f05719c56	[end2end] Ensure deterministic ordering of tests (#33984 ) This doesn't matter for gtest (it does its own sorting later), but for the fuzzers this will probably save a bit of churn in the corpus and lead to faster discovery of interesting cases.	1 year ago
AJ Heller	0e9553cf4e	[EventEngine] Add TODOs to re-enable EventEngine end2end tests (#33911 ) These tests should be re-enabled before we claim confidence in the engine implementations. It seems these tests are still being run, not sure if that's true in all cases ([example](https://source.cloud.google.com/results/invocations/be524340-c98f-4915-a833-192047ae9925/targets/%2F%2Ftest%2Fcore%2Fend2end:call_creds_test@experiment%3Devent_engine_client/log)). Alternatively, we can scrap this PR and enable all tests now if you feel you're ready to start looking at PosixEventEngine test failures. CC @yijiem @ctiller	1 year ago
AJ Heller	0897f0faf3	[EventEngine][Windows] Temporary changes for rare-flake debugging (#33894 ) CNR a WindowsEventEngine listener flake in: * 10k local Windows development machine runs * 50k Windows RBE runs * 10k Windows VM runs It fails ~5 times per day on the master CI jobs. This PR adds some logging to try to see if an edge is missed, and switches the thread pool implementation to see if that makes the flake go away. If the flakes disappear, I'll try removing one or the other to see if either independently fixes the problem (hopefully not logging). --------- Co-authored-by: drfloob <drfloob@users.noreply.github.com>	1 year ago
Craig Tiller	a008026890	[fuzzing] Increase deadline, fix b/293425905 (#33897 )	1 year ago
Craig Tiller	3717ff04ba	[chttp2] Split ping policy from transport (#33703 ) Why: Cleanup for chttp2_transport ahead of promise conversion - lots of logic has become interleaved throughout chttp2, so some effort to isolate logic out is warranted ahead of that conversion. What: Split configuration and policy tracking for each of ping rate throttling and abuse detection into their own modules. Add tests for them. Incidentally: Split channel args into their own header so that we can split the policy stuff into separate build targets. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	417c8e3499	[fuzzing] Increase deadline, tweak timeouts for b/291372661 (#33766 )	1 year ago
Craig Tiller	5b46c8bdba	[fuzzing] Increase deadline, fix b/291630910 (#33768 )	1 year ago
Craig Tiller	112a29c6af	[fuzzing] Increase deadline (#33765 ) Fix b/290782226	1 year ago
Craig Tiller	7223a9e5fe	[fuzzing] Increase deadline (#33663 ) Fix b/290886936	1 year ago
Craig Tiller	86d7c8125e	[fuzzing] Increase deadline (#33658 ) Resolves b/290812157	1 year ago
Craig Tiller	dc5c99c9b4	[fuzzing] Increase deadline (#33600 ) Similar pattern to many others.. increase this deadline to have the fuzzer pass. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	d3d4d5309d	[end2end] Fix fuzzer found deadline bug (#33633 ) fix b/290140776	1 year ago
Craig Tiller	cdfbb0ced7	[end2end] Fix fuzzer found deadline bug (#33629 ) Fixes b/288888511	1 year ago
Craig Tiller	e28729fe0a	[end2end] Fix fuzzer found deadline bug (#33630 ) fix b/288965746	1 year ago
Craig Tiller	f417da77a6	[end2end] Fix fuzzer found deadline bug (#33631 ) fix b/288718007	1 year ago
Craig Tiller	4b7a360041	[end2end] Fix fuzzer found deadline bug (#33632 ) fix b/289593034	1 year ago
Craig Tiller	08f1cc3ba8	[end2end] Explain failures a little better (#33621 ) I'd been adding the following stanza regularly to debug flakes/fuzz failures: ``` Expect(1, CoreEnd2endTest::MaybePerformAction{[&](bool success) { Crash(absl::StrCat( "Unexpected completion of client side call: success=", success ? "true" : "false", " status=", server_status.ToString(), " initial_md=", server_initial_metadata.ToString())); }}); ``` it was helpful because it indicated why a call batch finished successfully and helped quickly identify next steps. It occurred to me however that this would better be done inside of the framework, and for all ops that have outputs, so this PR does just that. Any time a batch with an op that outputs information finishes successfully but unexpectedly we now display those outputs in human readable form in the error message. Sample output: ``` [ RUN ] CorpusExamples/FuzzerCorpusTest.RunOneExample/0 RUN TEST: Http2SingleHopTest.SimpleDelayedRequestShort/Chttp2SimpleSslFullstack E0101 00:00:05.000000000 396633 simple_delayed_request.cc:37] Create client side call E0101 00:00:05.000000000 396633 simple_delayed_request.cc:41] Start initial batch E0101 00:00:05.000000000 396633 simple_delayed_request.cc:47] Start server E0101 00:00:05.000000000 396633 cq_verifier.cc:364] Verify tag(101)-✅ for 600000ms test/core/end2end/cq_verifier.cc:316: Unexpected event: OP_COMPLETE: tag:0x1 OK with: incoming_metadata: {} status_on_client: status=4 msg=Deadline Exceeded trailing_metadata={} checked @ test/core/end2end/tests/simple_delayed_request.cc:51 expected: test/core/end2end/tests/simple_delayed_request.cc:50: tag(101) success=true ```	1 year ago
Yash Tibrewal	c0889a4f23	[fuzz] Increase call timeout for retry_unref_before_recv (#33608 ) Noticed this failing on an internal cl due to deadline exceeded errors.	1 year ago
Mark D. Roth	15db5cd16a	[resolvers] use proper %-encoding of authority by default (#33571 ) - Change the `ResolverFactory::GetDefaultAuthority()` method to %-encode the authority by default, so individual resolver impls don't need to remember to do this. - Remove the hack in the xds resolver for setting the authority to everything after the last `/` character. - Change the `unix`, `unix-abstract`, and `vsock` resolvers to use a real authority instead of hard-coding to "localhost".	1 year ago
Craig Tiller	b28c4048f9	[fuzzing] Fix failures found by max_connection_idle_fuzzer (#33487 ) In chttp2: a pending but not yet sent goaway should block incoming requests just like a sent one (we will send that data momentarily!) In the test: - handle the case of the connection idle timeout happening before the request arrives at the server - disable retries, as these cause the request to get stuck (as we don't have an additional server to retry on) Fix b/287897932 --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	223117fc85	[fuzzing] Increase deadline to accommodate fuzzing injected delays (#33480 ) Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	a8132669f4	[flake] Raise deadline to eliminate flake in request_with_payload (#33488 ) Observed on CI over the weekend (and quite reproducible) --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	153f4e262c	[fuzzing] Increase deadline to accommodate fuzzing injected delays (#33481 )	1 year ago
Craig Tiller	26ae8a1d96	[fuzzing] Fix fuzzer found test bug (#33471 ) I think this is the right fix (doesn't seem deadline is related to the test logic), but please check.	1 year ago
Craig Tiller	23f5810264	[no_logging] Restrict regexes by module name (#33468 ) Promised follow-up to #33444	1 year ago
Craig Tiller	a5272fa3c3	[test] Disable cq verifier logs for stress test (#33467 ) They were tipping this test off into timeout land... but they're proving useful for non-stress tests, so added an option just for here to disable.	1 year ago
Craig Tiller	ecb7549a99	[end2end] Increase deadline on filtered_metadata test after observed failure (#33466 )	1 year ago
Mark D. Roth	d8db05a068	[core e2e tests] increase RPC deadline in a couple of tests to avoid flakes (#33456 )	1 year ago
Craig Tiller	d4be39a6ab	[fuzzing] Use a smaller max delay for writes than run-after (#33455 ) We want writes to participate in event re-ordering, but it's unlikely that we can sustain one byte per 500ms on all tests and keep them passing (which is the degenerate case right now). Tune write delays down to 50ms for the moment, though I expect we'll want to talk about going lower.	1 year ago
Craig Tiller	e0ad9e5746	[end2end] Fix simple delayed request (#33450 ) omgwtfbbq This test relies on WAIT_FOR_READY semantics, but we don't do that in the proxy, so it got assigned the wrong suite. Fix the suite, fix the flakes. Also add some handy dandy logging to help figure this stuff out in the future.	1 year ago
Craig Tiller	e5035063e8	[end2end] Further robustness tuning for uds path name selection (#33443 ) I can still make the old algorithm break and assign duplicate names on my machine... make it a little more robust. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
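
Commit a9bf741735 in the table above describes selecting experiments from a bitfield rather than from generated strings. A minimal sketch of that idea follows, assuming a 64-bit field and a plain list of experiment names; `SelectExperiments` and the names passed to it are hypothetical and are not taken from the gRPC sources.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical helper: returns the names whose bit is set in `bits`.
// Bit i (least significant bit first) corresponds to names[i].
std::vector<std::string> SelectExperiments(
    uint64_t bits, const std::vector<std::string>& names) {
  // A 64-bit field can address at most 64 experiments.
  assert(names.size() <= 64);
  std::vector<std::string> enabled;
  for (size_t i = 0; i < names.size(); ++i) {
    if ((bits >> i) & 1u) enabled.push_back(names[i]);
  }
  return enabled;
}
```

With this mapping every integer is a valid selection, which is presumably why a fuzzer can explore the space quickly. It also shows the downside the commit mentions: when the list of names changes, existing bit patterns silently refer to different experiments.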


3003 Commits (406fbf07a4429d2ca23ee93a5c0ee06ceca9fc4f)
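
Commit fd2e8c9462 describes an unordered set of addresses used as a map key for per-endpoint state. The sketch below illustrates that idea only; it is not the gRPC `EndpointAddressSet` class. It assumes addresses are plain strings, and `AddressSetKey`, `MakeKey`, `EndpointState`, and `EndpointStateMap` are hypothetical names.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-in for an endpoint's address set. std::set keeps its
// elements sorted, so the key does not depend on the order in which the
// addresses were listed.
using AddressSetKey = std::set<std::string>;

AddressSetKey MakeKey(const std::vector<std::string>& addresses) {
  // An endpoint is assumed to have at least one address.
  assert(!addresses.empty());
  return AddressSetKey(addresses.begin(), addresses.end());
}

// Hypothetical per-endpoint state, e.g. a weight in a weighted policy.
struct EndpointState {
  int weight = 0;
};

using EndpointStateMap = std::map<AddressSetKey, EndpointState>;
```

Two address lists that contain the same addresses in a different order produce equal keys, so they share one `EndpointState` entry, while a list with a different set of addresses gets its own entry.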