Chiebot-Mirror/grpc - grpc - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Matthew Stevenson	fae2982647	[ssl] Fix SSL stack to handle large handshake messages whose length exceeds the BIO buffer size. (#33638 ) There is a bug in the SSL stack that was only partially fixed in #29176: if more than 17kb is written to the BIO buffer, then everything over 17kb will be discarded, and the SSL handshake will fail with a bad record mac error or hang if not enough bytes have arrived yet. It's relatively uncommon to hit this bug, because the TLS handshake messages need to be much larger than normal for you to have a chance of hitting this bug. However, there was a separate bug in the SSL stack (recently fixed in #33558) that causes the ServerHello produced by a gRPC-C++ TLS server to grow linearly in size with the size of the trust bundle; these 2 bugs combined to cause a large number of TLS handshake failures for gRPC-C++ clients talking to gRPC-C++ servers when the server had a large trust bundle. This PR fixes the bug by ensuring that all bytes are successfully written to the BIO buffer. An initial quick fix for this bug was planned in #33611, but abandoned because we were worried about temporarily doubling the memory footprint of all SSL channels. The complexity in this PR is mostly in the test: it is fairly tricky to force gRPC-C++'s SSL stack to generate a sufficiently large ServerHello to trigger this bug.	1 year ago
Mark D. Roth	083bbee480	[LB policies] revert changes for dualstack design (#33718 ) This reverts the following PRs: #32692 #33087 #33093 #33427 #33568 These changes seem to have introduced some flaky crashes. Reverting while I investigate.	1 year ago
Craig Tiller	e9ba954eef	[owners] Remove CODEOWNERS for ctiller where its no longer necessary (#33704 )	1 year ago
Mark D. Roth	ec39600872	[WRR] fix bugs that caused us to re-enter blackout period upon updates (#33694 ) As per gRFC A58, when WRR sees a subchannel report READY, it reset the non_empty_since value, thus restarting the blackout period. However, there were two cases where we were incorrectly triggering this code: 1. When WRR got an updated address list that contained addresses that were already present on the old list and whose subchannels were already in READY state, the initial notification for those subchannels on the new list was READY, which incorrectly triggered resetting the non_empty_since value. 2. Due to a bug in the outlier_detection policy, whenever an update was propagated down through the OD policy without actually enabling OD, it would incorrectly send a duplicate connectivity state notification for the subchannels. This meant that a subchannel that was already in state READY would report READY again, which would also incorrectly trigger resetting the non_empty_since value. This PR makes two changes: 1. Fix the bug in outlier_detection that caused it to generate the spurious duplicate READY updates. 2. Fix WRR to reset the non_empty_since value when a subchannel goes READY only if the subchannel has seen a previous state update and only if that previous state was not READY. (The duplicate READY notifications should not actually happen anymore now that the OD policy has been fixed, but better to be defensive.) Fixes b/290983884.	1 year ago
Mario Jones Vimal	a934848de5	[core/security] Add Custom Token Lifetime - Service Acc Impersonation (#33351 ) Adds access token lifetime configuration for workload identity federation with service account impersonation for both explicit and implicit flows. Changes: 1. Adds a new member "service_account_impersonation" to the ExternalAccountCredentials class. "token_lifetime_seconds" is a member of "service_account_impersonation". 2. Adds validation checks, like token_lifetime_seconds should be between the minimum and maximum accepted value, during the creation of an ExternalAccountCredentials object. 3. Appends "lifetime" to the body of the service account impersonation request. Tests: 1. Modifies a test to check if the default value is passed when "service_account_impersonation" is empty. 2. Adds tests to check if the token_lifetime_seconds value is propagated to the request body. 3. Adds tests to verify that an error is thrown when token_lifetime_seconds is invalid.	1 year ago
Craig Tiller	7223a9e5fe	[fuzzing] Increase deadline (#33663 ) Fix b/290886936	1 year ago
Craig Tiller	86d7c8125e	[fuzzing] Increase deadline (#33658 ) Resolves b/290812157	1 year ago
Craig Tiller	8845e290db	[filter-test] Enhancements for better testing (#33652 ) - Support call finalizers in filter test. - Add an accessor to the filter implementation from the channel, so that it can be interrogated by tests. - Matcher to ensure that some metadata is not in a metadata batch (functionality needed to support the additional testing we talked about this morning) --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	b7077f4bbf	[hpack] Rollforward huffman read optimization (#33657 ) Rollforward in first commit, fixes in subsequent.	1 year ago
Craig Tiller	57c697d8ae	Revert "[hpack] Huffman read optimization" (#33655 ) Reverts grpc/grpc#33269	1 year ago
Craig Tiller	4ce51fe45d	[hpack] Huffman read optimization (#33269 ) In real services most of our time ends up in the `Read1()` function, which populates one byte into the bit buffer. Change this to read in as many as possible bytes at a time into that buffer. Additionally, generate all possible (to some depth) parser geometries, and add a benchmark for them. Run that benchmark and select the best geometry for decoding base64 strings (since this is the main use-case). (gives about a 30% speed boost parsing base64 then huffman encoded random binary strings) --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Vignesh Babu	974798a427	[tracing] Fix flakiness in tcp_posix_test (#33639 ) tcp_posix_test is incorrectly assuming that all endpoint_writes with timestamps enabled will be successfully traced. Remove the timestamps checking related tests to prevent flakes when the test is enabled internally.	1 year ago
Craig Tiller	ed587f2b07	[hpack] Reduce parse table size in the rare case of a parse error (#33637 ) Most of the time parsing succeeds, and only rarely do we see an error. This change reduces the parse memento size from 120 bytes to 56 bytes. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	dc5c99c9b4	[fuzzing] Increase deadline (#33600 ) Similar pattern to many others.. increase this deadline to have the fuzzer pass. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	d3d4d5309d	[end2end] Fix fuzzer found deadline bug (#33633 ) fix b/290140776	1 year ago
Craig Tiller	cdfbb0ced7	[end2end] Fix fuzzer found deadline bug (#33629 ) Fixes b/288888511	1 year ago
Craig Tiller	e28729fe0a	[end2end] Fix fuzzer found deadline bug (#33630 ) fix b/288965746	1 year ago
Craig Tiller	f417da77a6	[end2end] Fix fuzzer found deadline bug (#33631 ) fix b/288718007	1 year ago
Craig Tiller	4b7a360041	[end2end] Fix fuzzer found deadline bug (#33632 ) fix b/289593034	1 year ago
nanahpang	0cc9d16e9c	[chaotic-good] Implement a promise-based endpoint for chaotic-good transport to read & write to EventEngine::Endpoint. (#33257 ) This PR is continuing the work of prototyping in https://github.com/grpc/grpc/pull/31592, and the design doc is at [link](https://docs.google.com/document/d/1vRy0yse-d1heLQRmLPo_0figsTPXJAnNN84tBCAne_s/edit?pli=1&resourcekey=0-JvUPdq0LaZq8gMkgT9Pzlw#heading=h.qgvc5vr55ytg). <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Yash Tibrewal	9984f1bd5b	Revert "Revert "Revert "Revert "[HTTP2] Fix inconsistencies in keepalive configuration ( #33428 )" (#33512 )"" (#33601 ) Reverts grpc/grpc#33599 Needs to be cherry-picked	1 year ago
Craig Tiller	08f1cc3ba8	[end2end] Explain failures a little better (#33621 ) I'd been adding the following stanza regularly to debug flakes/fuzz failures: ``` Expect(1, CoreEnd2endTest::MaybePerformAction{[&](bool success) { Crash(absl::StrCat( "Unexpected completion of client side call: success=", success ? "true" : "false", " status=", server_status.ToString(), " initial_md=", server_initial_metadata.ToString())); }}); ``` it was helpful because it indicated why a call batch finished successfully and helped quickly identify next steps. It occurred to me however that this would better be done inside of the framework, and for all ops that have outputs, so this PR does just that. Any time a batch with an op that outputs information finishes successfully but unexpectedly we now display those outputs in human readable form in the error message. Sample output: ``` [ RUN ] CorpusExamples/FuzzerCorpusTest.RunOneExample/0 RUN TEST: Http2SingleHopTest.SimpleDelayedRequestShort/Chttp2SimpleSslFullstack E0101 00:00:05.000000000 396633 simple_delayed_request.cc:37] Create client side call E0101 00:00:05.000000000 396633 simple_delayed_request.cc:41] Start initial batch E0101 00:00:05.000000000 396633 simple_delayed_request.cc:47] Start server E0101 00:00:05.000000000 396633 cq_verifier.cc:364] Verify tag(101)-✅ for 600000ms test/core/end2end/cq_verifier.cc:316: Unexpected event: OP_COMPLETE: tag:0x1 OK with: incoming_metadata: {} status_on_client: status=4 msg=Deadline Exceeded trailing_metadata={} checked @ test/core/end2end/tests/simple_delayed_request.cc:51 expected: test/core/end2end/tests/simple_delayed_request.cc:50: tag(101) success=true ```	1 year ago
Craig Tiller	d139c4a014	[metadata] Add an experiment to ensure a unique refcount on parsed slice strings (#33205 ) The intuition here is that these strings may end up in the hpack table, and then unnecessarily extend the lifetime of the read blocks. Instead, take a copy of these short strings when we need to and allow the incoming large memory object to be discarded. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	c5bb43ab61	[chttp2] Eliminate grpc_chttp2_stream_map (#33503 ) No need for a bespoke type anymore... and a step along the path to C++ification. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Yash Tibrewal	c0889a4f23	[fuzz] Increase call timeout for retry_unref_before_recv (#33608 ) Noticed this failing on an internal cl due to deadline exceeded errors. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Craig Tiller	43f67d3640	[fuzzing] Allow hpack fuzzers to supply config (#33242 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. --> --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
AJ Heller	eb5c4da829	[EventEngine] Only use fork handlers when enabled via an environment variable (#33582 ) Co-authored-by: drfloob <drfloob@users.noreply.github.com>	1 year ago
Vignesh Babu	63037989ca	[experiments] Add specification of different default values for different platforms. (#33543 ) It introduces the following syntax: The following would mark the experiment as broken on ios, false on windows and debug on posix. If a platform is un-specified, the default for that platform will be set to false. Refer to test/core/experiments/fixtures/test_experiments_rollout.yaml for examples which are tested. - name: experiment_1 default:   ios: broken   windows: false   posix: debug It also supports the already existing syntax and interprets it as just specifying one default for all platforms. Supported platform tags: ios, windows, posix	1 year ago
Esun Kim	415f2f9ee3	[Core] Improve HPACK (#33597 )	1 year ago
Yash Tibrewal	523d9a2135	Revert "Revert "Revert "[HTTP2] Fix inconsistencies in keepalive configuration ( #33428 )" (#33512 )" (#33599 ) Reverts grpc/grpc#33586 Looks like this has to be cherry-picked	1 year ago
Yash Tibrewal	8d9164fc16	Revert "Revert "[HTTP2] Fix inconsistencies in keepalive configuration ( #33428 )" (#33512 ) (#33586 ) Along with an experiment this time <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Bjorn Svensson	ac874c2c83	[core] Add a channel argument to set DSCP on streams (#28322 ) This adds a new channel argument `GRPC_ARG_DSCP` which allows users to create classified gRPC streams with a Differentiated Services Code Point (DSCP) marking on the IP frames. The channel argument is handled on both clients and servers, but currently only on posix based systems. Fixes #17225 Background: In addition to what is already described is #17225, when gRPC is used in telco systems there is often a need to classify streams of importance. There can be multiple hops between two endpoints (e.g. between 2 telecom operators) and some streams that are more important than others (e.g. emergency call related or similar). By marking the IP packets using DSCP the aware routers can make a sound decision of the prioritization. This PR propose to use DSCP as the configuration value since its common for both IPv4/IPv6, an alternative would be to use a config name that includes TOS and Traffic Class. There might be more needed regarding documentation and end2end testing, but there I need some advice. References https://datatracker.ietf.org/doc/html/rfc2474 https://www.iana.org/assignments/dscp-registry/dscp-registry.xhtml <!-- Your pull request will be routed to the following person by default for triaging. If you know who should review your pull request, please remove the mentioning below. --> @yashykt	1 year ago
Eugene Ostroukhov	e0bc8a2c85	[xDS LB] xDS pick first support (#33540 )	1 year ago
Mark D. Roth	15db5cd16a	[resolvers] use proper %-encoding of authority by default (#33571 ) - Change the `ResolverFactory::GetDefaultAuthority()` method to %-encode the authority by default, so individual resolver impls don't need to remember to do this. - Remove the hack in the xds resolver for setting the authority to everything after the last `/` character. - Change the `unix`, `unix-abstract`, and `vsock` resolvers to use a real authority instead of hard-coding to "localhost".	1 year ago
Vignesh Babu	bf3ffcf600	[experiments] A unit test for the experiments framework (#33334 ) Adds a test for the experiments codegen. It updates the codegen to parse test_experiments.yaml and test_experiments_rollouts.yaml files and generate test_experiments.h and test_experiments.cc files along with an experiments_test.cc file. The experiments test verifies the returned value of IsExperimentEnabled with the expected value.	1 year ago
Matthew Stevenson	278978d6f0	[tls] Remove use of SSL_CTX_set_client_CA_list for TLS server credentials. (#33558 ) This PR does the following: for the TLS server credentials, stops calling `SSL_CTX_set_client_CA_list` by default in `ssl_transport_security.cc`, and gives users a knob to re-enable calling this API. ## What does the `SSL_CTX_set_client_CA_list` API do? When this API is called, a gRPC TLS server sends the following data in the ServerHello: for each certificate in the server's trust bundle, the CA name in the certificate. This API does not change the set of certificates trusted by the server in any way. Rather, it is just providing a hint to the client about what client certificate should be sent to the server. ## Why are we removing the use of `SSL_CTX_set_client_CA_list` by default for the TLS server credentials? Removing the use of this API by default has 2 benefits: 1. Calling this API makes gRPC TLS unusable for servers with a sufficiently large trust bundle. Indeed, if the server trust bundle is too large, then the server will always fail to build the ServerHello. 2. Calling this API is introducing a huge amount of overhead (1000s of bytes) to each ServerHello, so removing this feature will improve connection establishment latency for all users of the TLS server credentials.	1 year ago
Mark D. Roth	38816cf327	[WRR] delegate to pick_first instead of creating subchannels directly (#33087 ) As part of the dualstack backend design, change WRR to delegate to pick_first instead of creating subchannels directly.	1 year ago
Mark D. Roth	27a778fece	[round robin] delegate to pick_first instead of creating subchannels directly (#32692 ) More work on the dualstack backend design: - Change round_robin to delegate to pick_first instead of creating subchannels directly. - Change pick_first such that when it is the child of a petiole policy, it will unconditionally start a health watch. - Change the client-side health checking code such that if client-side health checking is not enabled, it will return the subchannel's raw connectivity state. - As part of this, we introduce a new endpoint_list library to be used by petiole policies, which is intended to replace the existing subchannel_list library. The only policy that will still directly interact with subchannels is pick_first, so the relevant parts of the subchannel_list functionality have been copied directly into that policy. The subchannel_list library will be removed after all petiole policies are updated to delegate to pick_first.	1 year ago
Mark D. Roth	ababacd5fc	[xDS] fix fuzzer-found failure in ClusterSpecifierPlugin handling (#33552 ) Fixes b/288542307.	1 year ago
Yijie Ma	e3c22b944b	[Fuzzing] Fix use-after-free in resolver_fuzzer (#33553 ) In FuzzingDNSResolver, capturing the engine as raw pointers in the lambda functions instead of capturing the `this` pointer. By the time the lambda is ran, the FuzzingDNSResolver might already be destroyed but the engine should still be alive. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Matthew Stevenson	4b55f22e6e	[alts] Remove ExecCtx check from alts_zero_copy_grpc_protector_create function. (#33494 ) I'm fixing the ALTS/Envoy transport socket extension (which is currently broken). Along the way, I'm trying to remove as many uses of gRPC internals as possible (with the eventual goal of only relying on public gRPC APIs and the alts_zero_copy_grpc_protector). To this end, I need to remove the ExecCtx check in the alts_zero_copy_grpc_protector_create function, so that Envoy can call into this function without needing to create an ExecCtx.	1 year ago
Mark D. Roth	8427bacaea	[resolver API] remove address attribute interface (#33514 ) The address attribute interface was intended to provide a mechanism to pass attributes separately from channel args, for values that do not affect subchannel behavior and therefore do not need to be present in the subchannel key, which does include channel args. However, the mechanism as currently designed is fairly clunky and is probably not the direction we will want to go in the long term. Eventually, we will want some mechanism for registering channel args, which would provide a cleaner way to indicate that a given channel arg should not be used in the subchannel key, so that we don't need a completely different mechanism. For now, this PR is just doing an interim step, which is to establish a special channel arg key prefix to indicate that an arg is not needed in the subchannel key.	1 year ago
Hannah Shi	c6e4a60f8f	[ObjC] run cf event engine test suit on ios (#33534 ) Run cf event engine test suit on ios CC: @sampajano @mikefan1991	1 year ago
Yijie Ma	6e95cebbd0	[EventEngine] Simplify `EventEngine::DNSResolver` API (#33459 ) This change simplifies `EventEngine::DNSResolver`'s API based on the proposal: [go/event-engine-dns-resolver-api-changes](http://go/event-engine-dns-resolver-api-changes). Note that this API change + the implementation described in [go/event-engine-dns-resolver-implementation](http://go/event-engine-dns-resolver-implementation) has already been tested against our main test suites and are passing them. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Yash Tibrewal	ba878c804d	Revert "[HTTP2] Fix inconsistencies in keepalive configuration (#33428 )" (#33512 ) This reverts commit `e107ff5e99`. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Ben Beasley	011e1162c2	[Test] Use `ssl.SSLContext.wrap_socket`, not `ssl.wrap_socket` (#33492 ) In the HTTP(S) test server in the core tests, use `ssl.SSLContext.wrap_socket`, not `ssl.wrap_socket`. The latter emits a `DeprecationWarning` since Python 3.10 and is [removed in Python 3.12](https://github.com/python/cpython/issues/94199). This fixes the core tests (but not necessarily the `grpcio` tests) for Python 3.12. This is relevant to https://github.com/grpc/grpc/issues/33063.	1 year ago
Eugene Ostroukhov	7bce35ed41	Revert "Revert "[lb pick_first] Enable random shuffling of address list" (#33497 ) Original: #33496 This reverts commit `d59c8eb0f5`.	1 year ago
Yash Tibrewal	f8dd1a05dc	[http2] Rename configuration_test to ping_configuration_test (#33500 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Eugene Ostroukhov	d59c8eb0f5	Revert "[lb pick_first] Enable random shuffling of address list (#33254 )" (#33496 ) Original PR: 33254 This reverts commit `7e14a322a2`.	1 year ago
Craig Tiller	b28c4048f9	[fuzzing] Fix failures found by max_connection_idle_fuzzer (#33487 ) In chttp2: a pending but not yet sent goaway should block incoming requests just like a sent one (we will sent that data momentarily!) In the test: - handle the case of the connection idle timeout happening before the request arrives at the server - disable retries, as these cause the request to get stuck (as we don't have an additional server to retry on) Fix b/287897932 --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago

1 2 3 4 5 ...

8013 Commits (113dbf518389ecc643ec6025deaf68b6def7373e)