Chiebot-Mirror/grpc - grpc - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Mark D. Roth	7f555bd9a1	[TLS creds] fix cancel_check_peer() to actually work (#34434 ) The `cancel_check_peer()` method is [always called with a non-OK status](`866fc41067/src/core/lib/security/transport/security_handshaker.cc (L560)`), since it's used only in cancellation cases. However, the implementation of this method for TLS creds was bailing out if the status was non-OK, meaning that `cancel_check_peer()` was never actually cancelling the verification request. This bug seems to have been introduced back in #25631, when the method was initially implemented. I don't think we actually have any async verifier implementations today, so this isn't actually causing a problem. I discovered this bug as part of #34426, which was triggering the core e2e `no_logging` test to fail. That test is designed to ensure that we don't generate any logs while processing individual RPCs, since that would be bad for performance and would flood logfiles. My PR caused a connection attempt to be cancelled during the test, which triggered the error log that I am removing in this PR. Note that with this PR, the TLS creds `cancel_check_peer()` methods are not actually doing anything with the status. Ideally, they should be passing the status through to the verifier's `Cancel()` method, but we apparently didn't add a parameter for that, which means that although cancellation will work now, it will not properly pass through the right error message. At some point, we should fix this and add tests covering cancellation of async verifier requests to prove that the error message is propagated correctly.	1 year ago
Jan Tattermusch	60f25c289b	[bazelified tests] Bazelify tests from "linux/grpc_bazel_build" and make the original test job a noop. (#34429 ) Bazelify tests from "linux/grpc_bazel_build" kokoro job by creating 3 bazelified tests - "build with strict warning", "build with no_xds=True" and "build with no_xds=True negative test". - also make the original "linux/grpc_bazel_build" kokoro job a no-op (since bazelified tests now provide the same coverage).	1 year ago
apolcyn	c48250dc1a	[test scripts] fix GRPC_VERBOSITY setting for run_tests jobs on CI (#34433 ) The deleted code here was overriding the [intended](`866fc41067/tools/run_tests/run_tests.py (L62)`) default test env of `GRPC_VERBOSITY=DEBUG`. I'm just deleting it because it looks like`GRPC_TRACE=api` is not having any affect anyways, since it relies on `GRPC_VERBOSITY=DEBUG` which it happens to be unsetting.	1 year ago
Hannah Shi	e5d41f2a1f	[ObjC] cf event engine supports resolve recursively from on_resolve callback (#34385 ) If the client calls LookupHostname again within the on_resolve callback, it re-acquires the `request_mu_` before releasing it which results in deadlock. With this PR it extracts the request and releases the lock before calling on_resolve callback so it won't deadlock any more.	1 year ago
Stan Hu	b3467cdab4	[ruby] Fix linking errors on x86-darwin (#34134 ) https://github.com/grpc/grpc/pull/33538 added `-weak_framework CoreFoundation` in `DLDFLAGS` for only `arm64-darwin` builds, but the issue reported in https://github.com/grpc/grpc/issues/33483 can also happen on `x86-darwin` builds. This can happen if: 1. The Ruby interpreter is compiled without `-Wl,-undefined,dynamic_lookup`. 2. This happens if the Ruby interpreter is built with XCode 14.0 to 14.2 (https://bugs.ruby-lang.org/issues/19005). Simplify the logic and always include `-weak_framework CoreFoundation` for macOS builds.	1 year ago
Hannah Shi	d636507ba9	[ObjC] Remove grpc core podspec module map (#34361 ) Not really needed, this should help the firestore upgrade issue Tested with https://github.com/wu-hui/ReproGrpcCyclic	1 year ago
Hannah Shi	22aff69c82	[ObjC] require osx version > 10.12 for cf event engine (#34061 ) fixes #34049	1 year ago
Yash Tibrewal	1091cc3211	[CSM] Update labels (#34412 ) Changes - * Remove `csm.remote_workload_pod_name` and `csm.remote_workload_container_name`. * Add `csm.remote_workload_name`, the value for which is sent through MetadataExchange, from the `CSM_WORKLOAD_NAME` env var. (Note that this is not added in local labels.) * Add a local `csm.canonical_service` (@markdroth, please verify the key that we want here) that is read from `CSM_CANONICAL_SERVICE_NAME` env var, and we continue to send it over via MetadataExchange	1 year ago
Jan Tattermusch	866fc41067	[bazelified tests] Make bazelified C basictests build only (#34428 ) - make C-core basictests use `--build_only` when running as bazelified tests. This is because the volume of C core tests is expected to grow very significantly after https://github.com/grpc/grpc/pull/34419 and currently the non-bazelified counterpart of the tests (the presubmit grpc_basictests_c_cpp_build_only job) is also "build only". - make the linux presubmit job `grpc_basictests_c_cpp_build_only` a noop, since the bazelified tests already give the same coverage on presubmit.	1 year ago
Gregory Cooke	aa17285f8e	[Security] Move ownership of tsi_ssl_client_handshaker_factory to grpc_ssl_credentials, version 2. (#34408 ) Revert the reversion of the SSL_CTX_new change (#34355 reverted #34180 ) with a fix. There was an issue with using `strcpy` on a `new[] string` in the constructor of `ssl_credentials`. An ASAN test caught this in some CI down the line - `ERROR: AddressSanitizer: alloc-dealloc-mismatch (operator new [] vs free)` That `strcpy` call was changed to `grp_strdup` which duplicates a string in a way that can be freed by `gpr_free` and should resolve the ASAN failure.	1 year ago
Yash Tibrewal	112fffcdb4	[OTel] Minor test cleanup (#34353 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Esun Kim	6c11f4f181	[Testing] Added windows/grpc_distribtests_cpp_dll (#34425 ) Added a separate distribtests for gRPC C++ DLL build on Windows. This DLL build is a community support so it should be independently run from the existing Windows distribtests. Actual DLL test will be added.	1 year ago
Craig Tiller	9b4e2c06e5	[pings] Trace abuse counters (#34414 ) We're seeing some reports of the ping abuse policy not working like it ought... add some tracing here to debug. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Eugene Ostroukhov	98ac00d7d2	[interop test] Fix the test (#34424 )	1 year ago
Roman-Byshliaha-Bose	49bb52d7f4	[build] Add detection of QNX platform (#34418 )	1 year ago
Mark D. Roth	ddd4d6e318	[client_channel] don't hop into WorkSerializer to unref ConfigSelector per-call (#34399 ) Also fold the `client_channel_subchannel_wrapper_work_serializer_orphan` experiment into the `work_serializer_dispatch` experiment.	1 year ago
Eugene Ostroukhov	490f6a3ee9	[test interop] Add HookService to the maintenence server (#34413 ) This pull request adds another hook service on the maintenance server. This will enable clients to gradually migrate from the standalone hook server. Changes: 1. Hook service can now be used separately. 2. Copied latest protos and updated the hook service to new API. 3. Added the hook service to the maintenance server.	1 year ago
Stanley Cheung	fc159a6901	[Observability Testing] register prometheus exporter (#34380 ) Working towards testing against CSM Observability. Added ability to register a prometheus exporter with our Opentelemetry plugin. This will allow our metrics to be available at the standard prometheus port `:9464`.	1 year ago
AJ Heller	3707b42bec	[reland][EventEngine] Move combiner executor usage to EventEngine (#34396 ) Relands #31713	1 year ago
Mark D. Roth	83c35169e5	[client_channel] don't unref picker while holding the LB mutex (#34407 ) This fixes a deadlock seen when both the `round_robin_delegate_to_pick_first` and `client_channel_subchannel_wrapper_work_serializer_orphan` experiments are enabled -- although I think the bug really has to do only with the latter. The problem here was that we were unreffing the picker while holding the channel's LB mutex. Destroying the picker was triggering a `SubchannelWrapper` to be orphaned, which triggered a hop into the `WorkSerializer`. Once there, we were also running a queued subchannel connectivity state notification, which triggered an update to the LB policy, which triggered returning a new picker to the channel, which tried to acquire the channel's LB mutex again. Note that the `work_serializer_dispatch` experiment would have avoided this problem.	1 year ago
Richard Belleville	24420100bb	[PSM Interop] Collect metadata in appnet ssa tests (#34406 ) Follow-up fix to https://github.com/grpc/grpc/pull/34387	1 year ago
alto-ruby	dfa040f49f	[Ruby] replace strdup with gpr_strdup (#34177 ) grpc 1.57.0 crashes win ruby and alpine due to no `strdup` in musl libc. This diff replace `strdup` with `grp_strdup` ``` Thread 1 "ruby" received signal SIGSEGV, Segmentation fault. 0x00000000000a4596 in ?? () (gdb) bt #0 0x00000000000a4596 in ?? () #1 0x00007ffff14e298c in grpc_rb_channel_create_in_process_add_args_hash_cb (key=<optimized out>, val=<optimized out>, args_obj=<optimized out>) at rb_channel_args.c:84 #2 0x00007ffff7c2b9ea in hash_ar_foreach_iter (error=0, argp=140737488344784, value=<optimized out>, key=<optimized out>) at hash.c:1341 ``` fixes #34044 closes #27995	1 year ago
Craig Tiller	52369882d8	[promises] Add tracing to seq, join variants (#34401 ) Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Mark D. Roth	0bffb766ff	[grpclb e2e test] increase RPC deadlines to fix flakes (#34403 )	1 year ago
Jan Tattermusch	113dbf5183	[bazelify tests] make grpc_distribtests_standalone and grpc_bazel_distribtest presubmit jobs a noop (#34391 ) Since many tests now run reliably as bazelified tests on RBE, we can remove them from presubmit runs to speedup testing of PRs. (for now, these jobs will still run on master, they can be removed from master as a followup). - linux/grpc_distribtests_standalone is now fully covered by bazel test suite `a3b4c797a7/tools/bazelify_tests/test/BUILD (L202)`, setting them to `presubmit=False` will stop tests from running on PRs. - stop running tests from grpc_bazel_distribtest on PR, instead rely on bazel distribtests running as bazelified tests.	1 year ago
Jan Tattermusch	b62edde1e3	[bazelified tests] Add more bazelified portability tests (and make them SOT for presubmit) (#34388 ) - bring parity between legacy portability_build_only linux job and bazelified RBE portability tests. - make grpc_portability_build_only job a no-op, since it's fully replaced by that bazelified tests (but keep grpc_portability on master job for now). For comparison: [legacy portability linux build only tests](https://source.cloud.google.com/results/invocations/f26f8b32-6878-43c7-9613-05f3111fa07c/targets) [bazel RBE bazelified tests](https://source.cloud.google.com/results/invocations/b656625f-b1e8-4b5c-9be0-b34ac9fcb31e/targets) (before this PR).	1 year ago
Craig Tiller	accc1688a8	[build] Exclude some e2e suites from experiments tests (#34404 ) We have a bunch of experiments testing against core e2e - and this is good for robustness, bad for CI times. We also have a bunch of marginal but overall necessary fixtures in the e2e suites - again good for robustness, bad for CI times. We can eliminate some of the cross product though, and I think safely: run experiments on a broad range of suites, but not ALL the suites, and get a bunch of our CI time back. Here I introduce an environment variable: `GRPC_CI_EXPERIMENTS` that's set when running bazel @experiment= configs, cleared otherwise (so we can still execute those tests directly when necessary). When that env var is set we filter out a bunch of suites from the test configurations.	1 year ago
Richard Belleville	62521a889f	[Interop] Tests for SSA and GAMMA (#34387 ) This is just an initial scope of tests. Much of this code was written by @ginayeh . I just did the final polish/integration step. There are 3 main tests included: 1. The GAMMA baseline test, including the [actual GAMMA API](https://gateway-api.sigs.k8s.io/geps/gep-1426/) rather than vendor extensions. 2. Kubernetes-based stateful session affinity tests, where the mesh (including SSA configuration) is configured using CRDs 3. GCP-based stateful session affinity tests, where the mesh is configured using the networkservices APIs directly Tests 1 and 2 will run in both prod and GKE staging, i.e. `container.googleapis.com` and `staging-container.sandbox.googleapis.com`. The latter of these will act as an early detection mechanism for regressions in the controller that translates Gateway resources into networkservices resources. Test 3 will run against `staging-networkservices.sandbox.googleapis.com` to act as an early detection mechanism for regressions in the control plane SSA implementation. The scope of the SSA tests is still fairly minimal. Session drain testing is in-progress but not included in this PR, though several elements required for it are (grace period, pre-stop hook, and the ability to kill a single pod in a deployment). --------- Co-authored-by: Jung-Yu (Gina) Yeh <ginayeh@google.com> Co-authored-by: Sergii Tkachenko <sergiitk@google.com>	1 year ago
Zach Reyes	e57b32588b	[python][interop] Add bootstrap generator test to nightly cron job against python master (#33933 )	1 year ago
Craig Tiller	b964cd50c9	[experiments] Remove unique_metadata_strings experiment (#34303 ) Submit after: 2023/09/18 Remove this experiment since it's sufficiently rolled out.	1 year ago
Craig Tiller	e6359c34a4	[fuzzing] Extend deadline to fix fuzzer failure (#34389 )	1 year ago
Craig Tiller	c0155b4188	[experiments] Make codegen more merge friendly (#34393 ) Remove the explicit numbering that's hostile to source code merge tools.	1 year ago
Craig Tiller	47306d78f4	[work-serializer] Add some basic process-wide monitoring (#34369 ) Add some basic metrics to work serializer, keep them process wide for now (though it may be interesting to get these into channelz in the future). Collected are: - time spent running a work serializer when it starts - time spent actually executing work when the work serializer runs - number of items executed each run A high disparity between the first two indicates our dispatching mechanism is adding large amounts of latency (perhaps due to thread starvation like effects). A high value for any of these indicate contention on the serializer. It's likely a future iteration on these will select different metrics - I'm not entirely sure which will be useful in production analysis yet. I'm using `std::chrono::steady_clock` here for precision (nanoseconds) with a compact representation (better than timespec) and a robust & portable api - I think it's appropriate for metrics, but wouldn't use it much beyond that at this point.	1 year ago
Mark D. Roth	214776e6aa	[LB policies] hop into WorkSerializer in subchannel wrappers' Orphan() (#34394 ) The one in xds_override_host was the one that was actually triggering test failures, but I audited all of the other policies and fixed a couple of other places that could also be problematic.	1 year ago
apolcyn	87eed73a47	[dns] unskip c-ares tests on arm (#34232 ) Now that we have https://github.com/grpc/grpc/pull/33942 (and another follow-up [fix](https://github.com/grpc/grpc/pull/34185)), I think the issue from https://github.com/grpc/grpc/issues/25289 is likely fixed	1 year ago
Craig Tiller	e1d78a2394	[experiments] Extend expiry of memory_pressure_controller (#34390 )	1 year ago
Craig Tiller	0375a585e2	[work-serializer] Fix synchronous test assumption (#34392 ) This test assumed synchronous work serializer execution (or at least faster async than we always get)... make a trivial change to keep the test semantics but allow for the implementation to be more async.	1 year ago
Mark D. Roth	25cb8e6ed2	[WRR] delegate to pick_first as per dualstack design (#34245 ) Rolls forward the changes from #33087, which were rolled back in #33718. This change is now guarded by a disablable experiment.	1 year ago
Jan Tattermusch	a3b4c797a7	[bazelified tests] Reenable runtests_php_linux_dbg after #34257 (#34266 ) The valgrind based test (removed in https://github.com/grpc/grpc/pull/34257) was the reason why the dbg test was disabled in the past.	1 year ago
Paulo Castello da Costa	0218f7d3a6	[benchmark] Add golang tests back to main CI job. (#34370 )	1 year ago
Craig Tiller	4cfa676045	[combiner] Add a force-offload mechanism (#34377 ) Add a mechanism to allow the transport to force an offload when it knows that's appropriate.	1 year ago
Yash Tibrewal	d670ffa92c	[CSM] Create an experimental target (#34381 )	1 year ago
Yash Tibrewal	b038da5072	[CSM] Second attempt: Add a server selector based on channel args (#34376 ) This reverts commit `2db446aa9a`. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Craig Tiller	17662c66c3	[build] Fix merge problem (#34378 )	1 year ago
Michael Lumish	2f05ddc278	[PSM Interop] Enable xDS affinity test for Node (#34288 ) Similar to #34146, this will only run on master for now. This will work after grpc/grpc-node#2568 is merged.	1 year ago
Craig Tiller	86b931c354	[work-serializer] Dispatch on run experiment (relanding) (#34372 ) Reverts grpc/grpc#34371	1 year ago
Mark D. Roth	b6f01c68aa	[pick_first] ignore duplicate calls to ExitIdleLocked() (#34374 )	1 year ago
Mark D. Roth	5a4e8f3dbd	[client_channel] second attempt: SubchannelWrapper hops into WorkSerializer before destruction (#34321 ) Original PR was #34307, reverted in #34318 due to internal test failures. The first commit is a revert of the revert. The second commit contains the fix. The original idea here was that `SubchannelWrapper::Orphan()`, which is called when the strong refcount reaches 0, would take a new weak ref and then hop into the `WorkSerializer` before dropping that weak ref, thus ensuring that the `SubchannelWrapper` is destroyed inside the `WorkSerializer` (which is needed because the `SubchannelWrapper` dtor cleans up some state in the channel related to the subchannel). The problem is that `DualRefCounted<>::Unref()` itself actually increments the weak ref count before calling `Orphan()` and then decrements it afterwards. So in the case where the `SubchannelWrapper` is unreffed outside of the `WorkSerializer` and no other thread happens to be holding the `WorkSerializer`, the weak ref that we were taking in `Orphan()` was unreffed inline, which meant that it wasn't actually the last weak ref -- the last weak ref was the one taken by `DualRefCounted<>::Unref()`, and it wasn't released until after the `WorkSerializer` was released. To this this problem, we move the code from the `SubchannelWrapper` dtor that cleans up the channel's state into the `WorkSerializer` callback that is scheduled in `Orphan()`. Thus, regardless of whether or not the last weak ref is released inside of the `WorkSerializer`, we are definitely doing that cleanup inside the `WorkSerializer`, which is what we actually care about. Also adds an experiment to guard this behavior.	1 year ago
nanahpang	2db446aa9a	Revert "[CSM] Add a server selector based on channel args" (#34375 ) Reverts grpc/grpc#34312	1 year ago
Paulo Castello da Costa	02ee4d9173	[benchmark] Add golang tests back to experimental CI job. (#34368 )	1 year ago

1 2 3 4 5 ...

53792 Commits (7f555bd9a1e72d85c716086f0241b50c0e544e62) All Branches Search

53792 Commits (7f555bd9a1e72d85c716086f0241b50c0e544e62)

All Branches