Chiebot-Mirror/grpc - grpc - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Yash Tibrewal	5029af9578	OpenCensus: Use new CallTracer interfaces (#32618 ) This change mostly aims to get OpenCensus to use the new ServerCallTracer interface. Note that the interfaces nor the code are in their final states. There are a bunch of moving pieces, but I thought this might be a nice mid-step to check-in and make sure that our internal traces can also work with these changes. Overall changes - 1) call_tracer.h shows what the hierarchy of new call tracer interfaces looks like. Open to renaming suggestions. 2) Moved most of the common interface between `CallAttemptTracer` and `ServerCallTracer` into a common `CallTracerInterface`. We should be able to eventually move `RecordReceivedTrailingMetadata` and `RecordEnd` as well to these common interfaces, but it requires some additional work. 3) The compression filter is now responsible for recording the recv and send messages for both the subchannel call and the server, and adds in ability to record compressed and decompressed messages as well. 4) The OpenCensus server filter now uses the new `ServerCallTracer` interface, and so doesn't need to be a filter anymore. 5) A new ServerCallTracerFilter was added. Ideally, we should be able to move it to the current connected filter, but it is in a bit of an interesting state right now, so I would prefer making those changes in a separate PR with Craig's eyes on it. 6) A new context element `GRPC_CONTEXT_CALL_TRACER_ANNOTATION_INTERFACE` was created that replaces the old `GRPC_CONTEXT_CALL_TRACER`, and the new `GRPC_CONTEXT_CALL_TRACER` is mainly to pass the `CallAttemptTracer` down the stack. This should go away in the new promise-based world. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. --> <!-- Reviewable:start --> - - - This change is [<img src="https://reviewable.io/review_button.svg" height="34" align="absmiddle" alt="Reviewable"/>](https://reviewable.io/reviews/grpc/grpc/32618) <!-- Reviewable:end -->	2 years ago
Craig Tiller	d025d50b54	[resource_quota] Fix setpoint for memory pressure controller (#32625 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Mark D. Roth	a67a46e7a3	xds_cluster_e2e_test: fix test flake from #32571 (#32623 )	2 years ago
Eugene Ostroukhov	acec3a6975	[testing]: Add "orca_oob" test case (#32599 )	2 years ago
Eugene Ostroukhov	555f3e26a4	PSM Interop: add orca proto to the legacy test driver dependencies (#32620 ) ref b/273575071	2 years ago
Sergii Tkachenko	c16338581d	PSM Interop: add orca proto to the new test driver dependencies (#32619 ) Now `messages.proto` requires `xds/v3/orca_load_report.proto`. The dependency introduced in https://github.com/grpc/grpc/pull/32524. ref b/273575071	2 years ago
Sergii Tkachenko	dce2d8729c	PSM Interop: Retry on recoverable kubernetes errors (#32596 ) - Increase kubernetes library default for urlib3 retries to 10 - Add custom retry logic to all API calls made by framework.k8s Custom retry logic handles various errors we're experienced over two years, and based on ~140 failure reports: 1. Errors returned by the k8s API server itself: - 401 Unauthorized - 409 Conflict - 429 Too Many Requests - 500 Internal Server Error 2. Connection errors that might indicate k8s API server is temporarily unavailable (such as a restart, upgrade, etc): - All `NewConnectionError`s, f.e. "Connection timed out", "Connection refused" - All "connection aborted" `ProtocolError`s, f.e. "Remote end closed connection without response", "Connection reset by peer" ref b/178378578, b/258546394	2 years ago
Mike Kruskal	b7e430174b	Fix unused variable warning (#32616 )	2 years ago
Eugene Ostroukhov	c62ecd5cb4	[testing]: Add "orca_per_rpc" test case (#32524 )	2 years ago
AJ Heller	ae55fb04ab	[EventEngine] Windows Endpoint: optimize reads by chaining synchronous WSARecv operations (#32563 ) Built on https://github.com/grpc/grpc/pull/32560 When calling EventEngine::Read, if a synchronous WSARecv call completes successfully and 1) the read buffer is not full, and 2) the stream remains open, then the endpoint will now chain execution of more synchronous WSARecvs. The chain is broken and the on_read callback is called when either there are errors, the next call would block, the buffer is full, or the stream is closed. Something like this is helpful to prevent excessive read callback execution under a flood of tiny payloads, presuming messages are not being combined as one would usually expect (see `//test/core/iomgr:endpoint_pair_test`, and Nagle's algorithm).	2 years ago
Yash Tibrewal	1088046a57	HttpFilters: Disable disabling of compression filters (#32613 ) Note that there is no behavior change associated with this PR. In other words, folks that use `GRPC_ARG_ENABLE_PER_MESSAGE_DECOMPRESSION` and `GRPC_ARG_ENABLE_PER_MESSAGE_COMPRESSION` will still see the same behavior as before. The actual change - The compression filter will always be added to the filter stack for HTTP transports even if it is a no-op due to the above channel args. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Esun Kim	0b2609a61d	Update minimum MSVC version to 2019 (#32614 ) To be aligned with By https://github.com/google/oss-policies-info/pull/8	2 years ago
Yash Tibrewal	6f960be41b	Gcp Observability: Make GcpObservabilityInit blocking (#32612 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
apolcyn	e97f632886	[testing] make error check in log_too_many_open_files_test portable (#32595 ) The error message for EMFILE is different on Musl, so this test is currently broken portability tests on Musl: https://source.cloud.google.com/results/invocations/d742bd83-bd28-49a3-b64a-442d0667231e/targets/github%2Fgrpc%2Frun_tests%2Fcpp_linux_dbg_native_x64_gcc_musl/tests	2 years ago
Eugene Ostroukhov	fd7c85f310	Update utilization/util to be in 1..0 range	2 years ago
Jan Tattermusch	d812dc6757	Simplify tools/buildgen/extract_metadata_from_bazel_xml.py to prepare for protobuf upgrade. (#32590 ) `bazel query deps(//src/proto/...)` seems unnecessary (regenerated projects are identical) and causes trouble with protobuf 22.x (since it basically breaks `tools/buildgen/generate_projects.sh` run and that makes upgrade experiments painful).	2 years ago
Yijie Ma	ac7faf75ba	Fix a race on vptr for UnimplementedAsyncRequest (#32547 ) It is reported in https://github.com/grpc/grpc/issues/32356 that there is a race on vptr for `UnimplementedAsyncRequest` which would cause crashes for multi-threaded server if clients send unimplemented RPC request to the server. The cause is that the server requests a call for `UnimplementedAsyncRequest` in its base class `GenericAsyncRequest` when the `vptr` still points to the base class's `vtable`. If the call went in and another server thread picks up the tag before the `vptr` points back to the derived class's `vtable`, it would call the wrong virtual function and also this is a data race. This fix makes the request of the call inside the derived class's constructor. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Stanley Cheung	40ccf97217	Observability Testing: Pass interop parameters to each lang's run.sh script as-is (#32586 ) Each `run.sh` should just pass those parameters through to the interop client/server binaries as-is. Corresponding framework PR: https://github.com/GoogleCloudPlatform/grpc-gcp-tools/pull/28	2 years ago
Sergii Tkachenko	198a9f6fe9	PSM Interop: Local dev various improvements (#32575 ) PSM Interop: Local dev various improvements - Cleanup resources on ctrl+c - Add startup probes to address the issue with port forwarding starting before the workload listens on a port - Remove misleading restartPolicy: it's silently ignored by k8s - Extra debug message with port-forwarding command	2 years ago
Craig Tiller	62bb99d163	[e2e] c++-ify core e2e test fixtures (#32550 ) A step toward #14016. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
Yousuk Seung	0003c320f4	xDS e2e tests: use ServerMetricRecorder (#32546 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Yash Tibrewal	9551e3ef5a	Add ServerCallTracer interfaces (#32555 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Mark D. Roth	e81002cfda	xDS: fix crash when removing the last endpoint from the last locality in weighted_target (#32571 ) Fixes #32486.	2 years ago
Jan Tattermusch	0cde9a7550	make_grpcio_tools.py improvements: more readable diffs (#32583 ) - sort source files to ensure stable ordering - generate one source file per line together this should produce diffs that are much more readable by humans when sources get added/removed to/from protobuf (and make_grpcio_tools.py is used to regenerate).	2 years ago
Jan Tattermusch	a27b86fb95	Switch linux RBE to ubuntu18.04, get rid of rbe_autoconfig (#32559 ) First step in the modernization of our RBE stack (see go/rbe-tech-debt-notes). - Get rid of the deprecated rbe_autoconfig and start using [rbe_configs_gen](https://github.com/bazelbuild/bazel-toolchains#rbe_configs_gen---cli-tool-to-generate-configs) + check in the generated toolchain configs. - Switch from marketplace.gcr.io/google/rbe-ubuntu16-04 to marketplace.gcr.io/google/rbe-ubuntu18-04 (this image is still not owned by us, but at least it's newer and demonstrates how a switch to a newer docker image is done). - provide script for generating the linux RBE toolchain configs. - cleanup RBE configuration in the bazelrc files used for remote build	2 years ago
apolcyn	d47b569330	[testing]: remove server-side check on number of concurrent RPCs in alts_concurrent_connectivity_test (#32585 ) This check only works if all handshake RPCs have an OK status, and it's racey e.g. if the client is cancelling handshake RPCs (being when an RPC is cancelled, termination of the RPC at the client is asynchronous from termination at the server, so the client can resume the queue before the server RPC completes).	2 years ago
Yash Tibrewal	97ba987132	GCP Observability: Docs on Init (#32573 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Craig Tiller	2df6ca26dc	[chttp2] Fix tsan race (#32576 ) Noticed here: https://source.cloud.google.com/results/invocations/779f3614-42bd-44bb-a00d-ab56f9749095/targets/%2F%2Ftest%2Fcore%2Fend2end:h2_full_test@retry_cancel4@poller%3Depoll1@experiment%3Devent_engine_client/log A race between starting reading and transport close. Fix: move the closure setting inside the combiner, and check if the transport is closed at that time. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
AJ Heller	febed5121a	[EventEngine] Skip pathological iomgr test on Windows experiments (#32569 ) With iomgr, this test is effectively rate limited by ExecCtx and the single thread running pollset_work, which results in thousands of tiny writes happening before every read. A small set of _synchronous_ 8k reads then dominate the read-side of the test. This is an efficient balance. With the Windows EventEngine, the fully asynchronous, multi-threaded reads and writes end up alternating roughly 1:1, meaning that a read callback is executed for every tiny handful of bytes, tens of thousands of times. Compared to the Posix EventEngine, without things like TCP_INQ and/or recvmsg's timeout, I don't know of any great signal for how much data can safely be received in a batch (e.g., we don't want to wait for data that will never come, and we don't want to run callbacks for 2 bytes over and over again if we have KB in the pipe). I believe the Windows EventEngine is WAI. I can significantly improve this test performance by artificially slowing the reader down (adding a >= 1ms sleep), but I believe that improves this use case to the detriment of all others.	2 years ago
AJ Heller	e77548d662	[EventEngine] Fix PosixEventEngine IPv4 support (#32574 ) This fixes a bug where connections cannot be made in IPv4-only environments. To test, hard-code `IsIpv6LoopbackAvailable` to return false. Example Error: ` D0309 00:29:49.514359445 235 tcp_client.cc:67] (event_engine) EventEngine::Connect Status: INTERNAL: socket: Address family not supported by protocol ` This can also be reproduced in gRPC's benchmark environment, which does not have IPv6 enabled.	2 years ago
Craig Tiller	822dab21d9	[promises] Support marking calls as traced (#32355 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Yash Tibrewal	657c1da1b0	HttpProxyMapperTest: cleanup (#32572 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Jin	89980d01f6	feat: Auth lib: Remove 3PI config url validation (#32450 ) To support TPC feature for BYOID (3PI), we need to remove the validation the pattern of impersonation endpoints, sts endpoints and token info endpoints since they are different in TPC regions. A security review is already passed at b/261634871 <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Kim Vandry	0e38b075ff	ALTS: Do not pass a host to grpc_channel_create_pollset_set_call(); let the channel supply it. (#31867 ) When the handshaker_service_url is in "host:port" format such as it normally is when using ALTS in GCE (in which case it comes from then this makes no difference as the authority and the URL are the same. But when different URLs are used, the correct authority to use is not always the same as the URL. For example if the URL is unix:///some/path then the correct authority is "localhost". This is correctly computed by grpc_core::UnixResolverFactory and stored as the channel's default authority, but we throw that away when we override the authority for individual RPCs. Note indeed that the majority of other callers of grpc_channel_create_* pass nullptr for the host/authority argument.	2 years ago
AJ Heller	2cdc98a614	[EventEngine] Enable the client experiment for all h2_full end2end tests (#32568 )	2 years ago
Kim Vandry	84c084527a	Fix str vs. bytes error in Python bindings for ALTS. (#31980 ) It looks like nobody ever created ALTS redentials from Python with a list of accepted service accounts before. Simple reproduction: ``` import grpc grpc.alts_channel_credentials(None) # works grpc.alts_channel_credentials(['foo']) # fails ``` Without this change, generates this error: ``` [...] File "src/python/grpcio/grpc/_cython/_cygrpc/credentials.pyx.pxi", line 414, in _cython.cygrpc.channel_credentials_alts File "src/python/grpcio/grpc/_cython/_cygrpc/credentials.pyx.pxi", line 403, in _cython.cygrpc.ALTSChannelCredentials.__cinit__ TypeError: expected bytes, str found ``` (And the error cannot be worked around by the caller by passing a bytes object from the Python side: you still get the same error.)	2 years ago
Gregory Cooke	ca9e365002	Added verified_root_cert_subject pass up through cpp api (#32335 ) PR #32215 added the verified root cert subject to the lower level `tsi_peer`. This PR is a companion to that and completes the feature by bubbling the information up to the `TsiCustomVerificationCheckRequest` which is part of the user facing API for implementing custom verification callbacks.	2 years ago
Richard Belleville	86bd0721ea	[fork] Generate GDB backtraces in fork tests on Kokoro (#32535 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
AJ Heller	1548038a09	[EventEngine] Windows endpoints keep their EventEngines alive (#32560 ) Discovered via `bazel test --test_env=GRPC_EXPERIMENTS=event_engine_client //test/core/iomgr:endpoint_pair_test`. CI experiments can be enabled generally on Windows once a few fixes and improvements are completed.	2 years ago
edmondj	cd4154f21d	Fixed memory leak in Alarm's move assignment	2 years ago
Frank de Jonge	7b3977e9f1	Use correct namespace for checking if the isDefaultRootsPemSet method exists. (#31580 ) The `method_exists` function requires a fully qualified class name to be sent to check if a method exists. The current class was missing the namespace, which means the function always returns `false`. In our application this caused the credentials to be loaded many times over, which ate up some CPU. This bug fix ensures that this is only run once per request.	2 years ago
Rokya	4dd0873ff7	[Binder Tansport] Move trans stream receiver callbacks out of `mu_` (#32438 ) This prevents deadlock against wire writer issues. Currently, there are some `transport_stream_receiver_` callbacks triggered by NDK binder may acquire `WireReaderImpl::mu_` first then `WireWriterImpl::write_mu_`. We don't like see this. We have this problem since some client and server are in the same process. The behavior of NDK binder seems more aggressive when the Tx and Rx are in the same process.	2 years ago
Xuan Wang	95d6325768	Fix DeprecationWarning when calling asyncio.get_event_loop() (#32533 ) Fix: https://github.com/grpc/grpc/issues/32526#event-8652983465 ### Context: Calling `asyncio.get_event_loop()` [without an running loop will emitted a DeprecationWarning](https://docs.python.org/3.10/library/asyncio-eventloop.html#asyncio.get_event_loop), in this case, we should call `asyncio.get_event_loop_policy().get_event_loop()` to get the loop.	2 years ago
Richard Belleville	c5213ceff9	[fork] Entirely opt EE threads out of ExecCtx counting (#32536 ) Follow-up to https://github.com/grpc/grpc/pull/32229. https://github.com/grpc/grpc/pull/32229 incremented the `ExecCtx` count unconditionally. It was previously impossible for a thread to exit `IncExecCtxCount` while `fork_complete_` was `false`. These same threads then went on to _decrement_ `count_` while the fork was still in progress, putting `count_` well below its expected range ([0, 1] while blocking and [2, inf) while not blocking). This resulted in cases where `count_` would be stuck at a negative number with a thread infinitely looping through `IncExecCtxCount`. This PR instead opts EE threads out of ExecCtx counting entirely. They handle clean-up of their threads separately through a separate set of handlers registered by an entirely separate invocation of `pthread_atfork`. This resolves the issue pointed out in [this comment](https://github.com/grpc/grpc/issues/31885#issuecomment-1426445192).	2 years ago
apolcyn	27ee3913d1	[iomgr and EE logging] Log ERROR if socket() returns EMFILE on posix (#32204 ) There are potentially surprising deployment bugs that can cause `EMFILE` to be hit. For example, file descriptor limits can be easily reached if - the round robin LB policy is used - the load balancer hands out an assignment with a lot of backends - using debian's default 1024 file descriptor limit. To make such problems more apparent, we can pay special attention to this error and log ERROR when it happens. Related: b/265199104	2 years ago
Cheng-Yu Chung	141592df55	Fix typo in `src/core/lib/channel/call_finalization.h` (#32543 )	2 years ago
AJ Heller	b5fc93aa18	[codehealth] Prevent the labeler workflow from removing labels (#32542 ) It seems as if we ran into this bug https://github.com/actions/labeler/issues/442 on at least one PR https://github.com/grpc/grpc/pull/32515. This PR utilizes the suggested workaround until we can upgrade to a new major version release of actions/labeler.	2 years ago
Mark D. Roth	e7f86da5ac	client channel: fix use-after-free bug (#32539 ) b/268292646	2 years ago
Mark D. Roth	5c1351883e	WRR: add trace message when updating scheduler (#32534 ) This will help detect when we fall back to RR.	2 years ago
Stanley Cheung	db62c06171	GCP Observability testing for C++ (#32531 ) Third try for #32466. This adds an interop client / server for GCP Observability integration testing. Everything is new here with no refactor. Plan is to get this in first before trying to refactor out the flags.	2 years ago

1 2 3 4 5 ...

52737 Commits (5029af9578bc3a7b67290e14c6d80703b8f3232f) All Branches Search

52737 Commits (5029af9578bc3a7b67290e14c6d80703b8f3232f)

All Branches