Chiebot-Mirror/grpc - grpc - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Eugene Ostroukhov	98ac00d7d2	[interop test] Fix the test (#34424 )	1 year ago
Roman-Byshliaha-Bose	49bb52d7f4	[build] Add detection of QNX platform (#34418 )	1 year ago
Mark D. Roth	ddd4d6e318	[client_channel] don't hop into WorkSerializer to unref ConfigSelector per-call (#34399 ) Also fold the `client_channel_subchannel_wrapper_work_serializer_orphan` experiment into the `work_serializer_dispatch` experiment.	1 year ago
Eugene Ostroukhov	490f6a3ee9	[test interop] Add HookService to the maintenence server (#34413 ) This pull request adds another hook service on the maintenance server. This will enable clients to gradually migrate from the standalone hook server. Changes: 1. Hook service can now be used separately. 2. Copied latest protos and updated the hook service to new API. 3. Added the hook service to the maintenance server.	1 year ago
Stanley Cheung	fc159a6901	[Observability Testing] register prometheus exporter (#34380 ) Working towards testing against CSM Observability. Added ability to register a prometheus exporter with our Opentelemetry plugin. This will allow our metrics to be available at the standard prometheus port `:9464`.	1 year ago
AJ Heller	3707b42bec	[reland][EventEngine] Move combiner executor usage to EventEngine (#34396 ) Relands #31713	1 year ago
Mark D. Roth	83c35169e5	[client_channel] don't unref picker while holding the LB mutex (#34407 ) This fixes a deadlock seen when both the `round_robin_delegate_to_pick_first` and `client_channel_subchannel_wrapper_work_serializer_orphan` experiments are enabled -- although I think the bug really has to do only with the latter. The problem here was that we were unreffing the picker while holding the channel's LB mutex. Destroying the picker was triggering a `SubchannelWrapper` to be orphaned, which triggered a hop into the `WorkSerializer`. Once there, we were also running a queued subchannel connectivity state notification, which triggered an update to the LB policy, which triggered returning a new picker to the channel, which tried to acquire the channel's LB mutex again. Note that the `work_serializer_dispatch` experiment would have avoided this problem.	1 year ago
Richard Belleville	24420100bb	[PSM Interop] Collect metadata in appnet ssa tests (#34406 ) Follow-up fix to https://github.com/grpc/grpc/pull/34387	1 year ago
alto-ruby	dfa040f49f	[Ruby] replace strdup with gpr_strdup (#34177 ) grpc 1.57.0 crashes win ruby and alpine due to no `strdup` in musl libc. This diff replace `strdup` with `grp_strdup` ``` Thread 1 "ruby" received signal SIGSEGV, Segmentation fault. 0x00000000000a4596 in ?? () (gdb) bt #0 0x00000000000a4596 in ?? () #1 0x00007ffff14e298c in grpc_rb_channel_create_in_process_add_args_hash_cb (key=<optimized out>, val=<optimized out>, args_obj=<optimized out>) at rb_channel_args.c:84 #2 0x00007ffff7c2b9ea in hash_ar_foreach_iter (error=0, argp=140737488344784, value=<optimized out>, key=<optimized out>) at hash.c:1341 ``` fixes #34044 closes #27995	1 year ago
Craig Tiller	52369882d8	[promises] Add tracing to seq, join variants (#34401 ) Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Mark D. Roth	0bffb766ff	[grpclb e2e test] increase RPC deadlines to fix flakes (#34403 )	1 year ago
Jan Tattermusch	113dbf5183	[bazelify tests] make grpc_distribtests_standalone and grpc_bazel_distribtest presubmit jobs a noop (#34391 ) Since many tests now run reliably as bazelified tests on RBE, we can remove them from presubmit runs to speedup testing of PRs. (for now, these jobs will still run on master, they can be removed from master as a followup). - linux/grpc_distribtests_standalone is now fully covered by bazel test suite `a3b4c797a7/tools/bazelify_tests/test/BUILD (L202)`, setting them to `presubmit=False` will stop tests from running on PRs. - stop running tests from grpc_bazel_distribtest on PR, instead rely on bazel distribtests running as bazelified tests.	1 year ago
Jan Tattermusch	b62edde1e3	[bazelified tests] Add more bazelified portability tests (and make them SOT for presubmit) (#34388 ) - bring parity between legacy portability_build_only linux job and bazelified RBE portability tests. - make grpc_portability_build_only job a no-op, since it's fully replaced by that bazelified tests (but keep grpc_portability on master job for now). For comparison: [legacy portability linux build only tests](https://source.cloud.google.com/results/invocations/f26f8b32-6878-43c7-9613-05f3111fa07c/targets) [bazel RBE bazelified tests](https://source.cloud.google.com/results/invocations/b656625f-b1e8-4b5c-9be0-b34ac9fcb31e/targets) (before this PR).	1 year ago
Craig Tiller	accc1688a8	[build] Exclude some e2e suites from experiments tests (#34404 ) We have a bunch of experiments testing against core e2e - and this is good for robustness, bad for CI times. We also have a bunch of marginal but overall necessary fixtures in the e2e suites - again good for robustness, bad for CI times. We can eliminate some of the cross product though, and I think safely: run experiments on a broad range of suites, but not ALL the suites, and get a bunch of our CI time back. Here I introduce an environment variable: `GRPC_CI_EXPERIMENTS` that's set when running bazel @experiment= configs, cleared otherwise (so we can still execute those tests directly when necessary). When that env var is set we filter out a bunch of suites from the test configurations.	1 year ago
Richard Belleville	62521a889f	[Interop] Tests for SSA and GAMMA (#34387 ) This is just an initial scope of tests. Much of this code was written by @ginayeh . I just did the final polish/integration step. There are 3 main tests included: 1. The GAMMA baseline test, including the [actual GAMMA API](https://gateway-api.sigs.k8s.io/geps/gep-1426/) rather than vendor extensions. 2. Kubernetes-based stateful session affinity tests, where the mesh (including SSA configuration) is configured using CRDs 3. GCP-based stateful session affinity tests, where the mesh is configured using the networkservices APIs directly Tests 1 and 2 will run in both prod and GKE staging, i.e. `container.googleapis.com` and `staging-container.sandbox.googleapis.com`. The latter of these will act as an early detection mechanism for regressions in the controller that translates Gateway resources into networkservices resources. Test 3 will run against `staging-networkservices.sandbox.googleapis.com` to act as an early detection mechanism for regressions in the control plane SSA implementation. The scope of the SSA tests is still fairly minimal. Session drain testing is in-progress but not included in this PR, though several elements required for it are (grace period, pre-stop hook, and the ability to kill a single pod in a deployment). --------- Co-authored-by: Jung-Yu (Gina) Yeh <ginayeh@google.com> Co-authored-by: Sergii Tkachenko <sergiitk@google.com>	1 year ago
Zach Reyes	e57b32588b	[python][interop] Add bootstrap generator test to nightly cron job against python master (#33933 )	1 year ago
Craig Tiller	b964cd50c9	[experiments] Remove unique_metadata_strings experiment (#34303 ) Submit after: 2023/09/18 Remove this experiment since it's sufficiently rolled out.	1 year ago
Craig Tiller	e6359c34a4	[fuzzing] Extend deadline to fix fuzzer failure (#34389 )	1 year ago
Craig Tiller	c0155b4188	[experiments] Make codegen more merge friendly (#34393 ) Remove the explicit numbering that's hostile to source code merge tools.	1 year ago
Craig Tiller	47306d78f4	[work-serializer] Add some basic process-wide monitoring (#34369 ) Add some basic metrics to work serializer, keep them process wide for now (though it may be interesting to get these into channelz in the future). Collected are: - time spent running a work serializer when it starts - time spent actually executing work when the work serializer runs - number of items executed each run A high disparity between the first two indicates our dispatching mechanism is adding large amounts of latency (perhaps due to thread starvation like effects). A high value for any of these indicate contention on the serializer. It's likely a future iteration on these will select different metrics - I'm not entirely sure which will be useful in production analysis yet. I'm using `std::chrono::steady_clock` here for precision (nanoseconds) with a compact representation (better than timespec) and a robust & portable api - I think it's appropriate for metrics, but wouldn't use it much beyond that at this point.	1 year ago
Mark D. Roth	214776e6aa	[LB policies] hop into WorkSerializer in subchannel wrappers' Orphan() (#34394 ) The one in xds_override_host was the one that was actually triggering test failures, but I audited all of the other policies and fixed a couple of other places that could also be problematic.	1 year ago
apolcyn	87eed73a47	[dns] unskip c-ares tests on arm (#34232 ) Now that we have https://github.com/grpc/grpc/pull/33942 (and another follow-up [fix](https://github.com/grpc/grpc/pull/34185)), I think the issue from https://github.com/grpc/grpc/issues/25289 is likely fixed	1 year ago
Craig Tiller	e1d78a2394	[experiments] Extend expiry of memory_pressure_controller (#34390 )	1 year ago
Craig Tiller	0375a585e2	[work-serializer] Fix synchronous test assumption (#34392 ) This test assumed synchronous work serializer execution (or at least faster async than we always get)... make a trivial change to keep the test semantics but allow for the implementation to be more async.	1 year ago
Mark D. Roth	25cb8e6ed2	[WRR] delegate to pick_first as per dualstack design (#34245 ) Rolls forward the changes from #33087, which were rolled back in #33718. This change is now guarded by a disablable experiment.	1 year ago
Jan Tattermusch	a3b4c797a7	[bazelified tests] Reenable runtests_php_linux_dbg after #34257 (#34266 ) The valgrind based test (removed in https://github.com/grpc/grpc/pull/34257) was the reason why the dbg test was disabled in the past.	1 year ago
Paulo Castello da Costa	0218f7d3a6	[benchmark] Add golang tests back to main CI job. (#34370 )	1 year ago
Craig Tiller	4cfa676045	[combiner] Add a force-offload mechanism (#34377 ) Add a mechanism to allow the transport to force an offload when it knows that's appropriate.	1 year ago
Yash Tibrewal	d670ffa92c	[CSM] Create an experimental target (#34381 )	1 year ago
Yash Tibrewal	b038da5072	[CSM] Second attempt: Add a server selector based on channel args (#34376 ) This reverts commit `2db446aa9a`. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Craig Tiller	17662c66c3	[build] Fix merge problem (#34378 )	1 year ago
Michael Lumish	2f05ddc278	[PSM Interop] Enable xDS affinity test for Node (#34288 ) Similar to #34146, this will only run on master for now. This will work after grpc/grpc-node#2568 is merged.	1 year ago
Craig Tiller	86b931c354	[work-serializer] Dispatch on run experiment (relanding) (#34372 ) Reverts grpc/grpc#34371	1 year ago
Mark D. Roth	b6f01c68aa	[pick_first] ignore duplicate calls to ExitIdleLocked() (#34374 )	1 year ago
Mark D. Roth	5a4e8f3dbd	[client_channel] second attempt: SubchannelWrapper hops into WorkSerializer before destruction (#34321 ) Original PR was #34307, reverted in #34318 due to internal test failures. The first commit is a revert of the revert. The second commit contains the fix. The original idea here was that `SubchannelWrapper::Orphan()`, which is called when the strong refcount reaches 0, would take a new weak ref and then hop into the `WorkSerializer` before dropping that weak ref, thus ensuring that the `SubchannelWrapper` is destroyed inside the `WorkSerializer` (which is needed because the `SubchannelWrapper` dtor cleans up some state in the channel related to the subchannel). The problem is that `DualRefCounted<>::Unref()` itself actually increments the weak ref count before calling `Orphan()` and then decrements it afterwards. So in the case where the `SubchannelWrapper` is unreffed outside of the `WorkSerializer` and no other thread happens to be holding the `WorkSerializer`, the weak ref that we were taking in `Orphan()` was unreffed inline, which meant that it wasn't actually the last weak ref -- the last weak ref was the one taken by `DualRefCounted<>::Unref()`, and it wasn't released until after the `WorkSerializer` was released. To this this problem, we move the code from the `SubchannelWrapper` dtor that cleans up the channel's state into the `WorkSerializer` callback that is scheduled in `Orphan()`. Thus, regardless of whether or not the last weak ref is released inside of the `WorkSerializer`, we are definitely doing that cleanup inside the `WorkSerializer`, which is what we actually care about. Also adds an experiment to guard this behavior.	1 year ago
nanahpang	2db446aa9a	Revert "[CSM] Add a server selector based on channel args" (#34375 ) Reverts grpc/grpc#34312	1 year ago
Paulo Castello da Costa	02ee4d9173	[benchmark] Add golang tests back to experimental CI job. (#34368 )	1 year ago
AJ Heller	2467562e4b	[EventEngine] Delete OriginalThreadPool, remove work_stealing experiment (#34315 ) This has been stable for a bit, everywhere that the EventEngine is enabled. Going forward, I think the event_engine_{client\|listener} experiments can probably be used to regulate thread-pool-specific issues. --------- Co-authored-by: drfloob <drfloob@users.noreply.github.com>	1 year ago
Yash Tibrewal	c145b7910e	[CSM] Add a server selector based on channel args (#34312 ) I've added channel args to `CreateNewServerCallTracer` on the `ServerCallTracerFactory`. The motivation is for CSM Observability where the OTel plugin will be configured to only do stats on servers which are xDS enabled, so I plan to check this via channel args. In the future, with the new scopes for metrics, I think I'll be able to change this to only check once per server or server connection instead of per call. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Craig Tiller	d589caa679	Revert "[work-serializer] Dispatch on run experiment" (#34371 ) Reverts grpc/grpc#34274 (needs some changes internally)	1 year ago
Craig Tiller	1705470950	[work-serializer] Dispatch on run experiment (#34274 ) Co-authored-by: ctiller <ctiller@users.noreply.github.com> Co-authored-by: Mark D. Roth <roth@google.com>	1 year ago
Esun Kim	9a7ecfad00	[Fix] Added missing #include (#34359 ) One more missing #include	1 year ago
Yash Tibrewal	037979c0d8	[CSM] Remaining cleanup from GSM to CSM renaming (#34352 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Craig Tiller	f002863cef	[code-review] Fix paths for code generated stuff (#34357 ) missed a `/lib/` on a few paths...	1 year ago
Yash Tibrewal	16593fdb3c	[SpellCheck] s/heding/hedging (#34354 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
nanahpang	a4ac80c394	Revert "[Security] Move ownership of tsi_ssl_client_handshaker_factory to grpc_ssl_credentials." (#34355 ) Reverts grpc/grpc#34180	1 year ago
Gregory Cooke	36dc5e7391	[Security] Move ownership of tsi_ssl_client_handshaker_factory to grpc_ssl_credentials. (#34180 ) Move the SSL_CTX to the level of the credentials rather than the subchannel. The SSL_CTX should only get created once per credential rather than once per subchannel. We should observe no behavior change with this PR, only efficiency gains.	1 year ago
Eugene Ostroukhov	58f1c74383	[test] Update NDK image with newer CMake (#34341 ) 1. Switch to CMake 1.18. 2. Make ergonomic change to push_testing_images.sh to allow building just a single image. 3. Update packages to reduce a number of vulnerabilities reported.	1 year ago
Gregory Cooke	8d62fc2b0b	[Test] Add concurrent test for session reuse (#34293 ) Add a test that runs concurrent requests using session caching.	1 year ago
Mark D. Roth	1986007e1e	[round_robin] 4th attempt: delegate to pick_first as per dualstack design (#34337 ) Most recent attempt was #34320, reverted in #34335. The first commit here is a pure revert. The second commit fixes the outlier_detection unit test to pass both with and without the experiment.	1 year ago

1 2 3 4 5 ...

53829 Commits (153824a21c2f9cc2f9d16d86af5e539cc49fae84) All Branches Search

53829 Commits (153824a21c2f9cc2f9d16d86af5e539cc49fae84)

All Branches