Chiebot-Mirror/grpc - grpc - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Craig Tiller	cd44a2433e	[call] Dont take grpclb_client_stats from the app (#33118 ) This metadata doesn't actually encode so passing it through from an app will force a crash. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Vignesh Babu	915d7c4a70	[Fuzzing] Bound RunAfter duration in fuzzing event engine (#33128 ) Bounds duration to 1 year. Fixes b/258949216	2 years ago
Yijie Ma	3526defc19	[JsonWriter] Do not break in EscapeString when encountering a null byte (#33127 ) Instead just Utf-16 encode the null byte when dumping the value to a string form. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Craig Tiller	997af8d073	[api_fuzzer] Attempt to clean up fuzzer memory leak (#33120 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Mark D. Roth	59dbdfeea2	[xds_client_fuzzer] fail bootstrap parsing if xds_servers is empty (#33119 ) b/269022924	2 years ago
Mark D. Roth	5ae1cfcce3	[xds_client_fuzzer] fix null pointer dereference in `FakeXdsTransport::TriggerConnectionFailure()` (#33117 ) b/259358608	2 years ago
Mark D. Roth	13133ae703	[xds_client_fuzzer] fix bug in fake transport (#33115 ) Fixes `FakeXdsTransport` to remove itself from the map in `FakeXdsTransportFactory` when it gets orphaned by the `XdsClient`, so that a subsequent creation of a new transport for the same server does not trigger an assertion due to the transport already existing in the map. Fixes internal b/259362837.	2 years ago
Craig Tiller	66d9f52fbd	[api-fuzzer] Fix memory leak (#33109 ) ApiFuzzer::CreateChannel() called twice creates two channels but doesn't delete the first. Choose some reasonable behavior.	2 years ago
Craig Tiller	123811399b	[promises] Remove bad log statement (#33113 ) Was leading to a nullptr deref, and we just don't need this one anymore. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Craig Tiller	18d369a6f4	[fuzzing] Avoid initialization order fiasco in core_end2end_test_fuzzer (#33108 )	2 years ago
Craig Tiller	ee0cf2fada	[filter-fuzzer] Delete this fuzzer until I can spend time on it (#33096 ) It's not finished and won't be for a bit...	2 years ago
Craig Tiller	9760ce9d0a	[end2end] Shorten corpora filenames (#33095 ) Avoids long path name problems on Windows <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Mark D. Roth	8fdfb22848	[JSON] generalize handling of RefCountedPtr<> (#33048 ) Also remove a check in the weighted_target LB policy that I somehow missed in #32932.	2 years ago
Craig Tiller	4674f2ccf7	[fuzz] Turn core end2end tests into fuzzers (#33013 ) Add a new binary that runs all core end2end tests in fuzzing mode. In this mode FuzzingEventEngine is substituted for the default event engine. This means that time is simulated, as is IO. The FEE gets control of callback delays also. In our tests the `Step()` function becomes, instead of a single call to `completion_queue_next`, a series of calls to that function and `FuzzingEventEngine::Tick`, driving forward the event loop until progress can be made. PR guide: --- New binaries `core_end2end_test_fuzzer` - the new fuzzer itself `seed_end2end_corpus` - a tool that produces an interesting seed corpus Config changes for safe fuzzing The implementation tries to use the config fuzzing work we've previously deployed in api_fuzzer to fuzz across experiments. Since some experiments are far too experimental to be safe in such fuzzing (and this will always be the case): - a new flag is added to experiments to opt-out of this fuzzing - a new hook is added to the config system to allow variables to re-write their inputs before setting them during the fuzz Event manager/IO changes Changes are made to the event engine shims so that tcp_server_posix can run with a non-FD carrying EventEngine. These are in my mind a bit clunky, but they work and they're in code that we expect to delete in the medium term, so I think overall the approach is good. Changes to time A small tweak is made to fix a bug initializing time for fuzzers in time.cc - we were previously failing to initialize `g_process_epoch_cycles` Changes to `Crash` A version that prints to stdio is added so that we can reliably print a crash from the fuzzer. Changes to CqVerifier Hooks are added to allow the top level loop to hook the verification functions with a function that steps time between CQ polls. Changes to end2end fixtures State machinery moves from the fixture to the test infra, to keep the customizations for fuzzing or not in one place. This means that fixtures are now just client/server factories, which is overall nice. It did necessitate moving some bespoke machinery into h2_ssl_cert_test.cc - this file is beginning to be problematic in borrowing parts but not all of the e2e test machinery. Some future PR needs to solve this. A cq arg is added to the Make functions since the cq is now owned by the test and not the fixture. Changes to test registration `TEST_P` is replaced by `CORE_END2END_TEST` and our own test registry is used as a first depot for test information. The gtest version of these tests: queries that registry to manually register tests with gtest. This ultimately changes the name of our tests again (I think for the last time) - the new names are shorter and more readable, so I don't count this as a regression. The fuzzer version of these tests: constructs a database of fuzzable tests that it can consult to look up a particular suite/test/config combination specified by the fuzzer to fuzz against. This gives us a single fuzzer that can test all 3k-ish fuzzing ready tests and cross polinate configuration between them. Changes to test config The zero size registry stuff was causing some problems with the event engine feature macros, so instead I've removed those and used GTEST_SKIP in the problematic tests. I think that's the approach we move towards in the future. Which tests are included Configs that are compatible - those that do not do fd manipulation directly (these are incompatible with FuzzingEventEngine), and those that do not join threads on their shutdown path (as these are incompatible with our cq wait methodology). Each we can talk about in the future - fd manipulation would be a significant expansion of FuzzingEventEngine, and is probably not worth it, however many uses of background threads now should probably evolve to be EventEngine::Run calls in the future, and then would be trivially enabled in the fuzzers. Some tests currently fail in the fuzzing environment, a `SKIP_IF_FUZZING` macro is used for these few to disable them if in the fuzzing environment. We'll burn these down in the future. Changes to fuzzing_event_engine Changes are made to time: an exponential sweep forward is used now - this catches small time precision things early, but makes decade long timers (we have them) able to be used right now. In the future we'll just skip time forward to the next scheduled timer, but that approach doesn't yet work due to legacy timer system interactions. Changes to port assignment: we ensure that ports are legal numbers before assigning them via `grpc_pick_port_or_die`. A race condition between time checking and io is fixed. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
Hannah Shi	ad2a5dd355	[ObjC] Cf event engine client (#33034 ) Added `//:gpr_platform` to cf_engine_test to fix build_cleaner check in the previous merge. More details in https://github.com/grpc/grpc/pull/33027	2 years ago
Mark D. Roth	2c423d277c	[outlier detection] fix crash with pick_first and add tests (#33069 ) Fixes #32967. Also fix incorrect defaults for `enforcementPercentage` fields.	2 years ago
AJ Heller	0ed3bb7955	[EventEngine] Disable more thread pool tests for legacy implementation (#33068 )	2 years ago
AJ Heller	63ec566f3e	[EventEngine] Reduce the size of some thread pool tests (#33055 ) The `CanStartLotsOfClosures` test was sometimes taking over 60s to run ([example](https://source.cloud.google.com/results/invocations/d96c89f9-03f9-43fd-a729-d744d7499532/targets;query=thread_pool_test/%2F%2Ftest%2Fcore%2Fevent_engine:thread_pool_test@poller%3Depoll1/log)). More often than not, though, the test would take < 5s ([example](https://source.cloud.google.com/results/invocations/95d32b32-5df7-4dd4-a82c-1024869b09c8/targets;query=thread_pool_test/%2F%2Ftest%2Fcore%2Fevent_engine:thread_pool_test/log)). Both examples are from before the tests changed with the introduction of the work-stealing thread pool (`3fb738b9b1`). This PR reduces the closure count to 500k for the `CanStartLotsOfClosures` test, and changes the blocking-closure scale-test to exercise the work stealing implementation alone.	2 years ago
Mark D. Roth	17315823c2	[client channel] assume LB policies start in CONNECTING state (#33009 ) Currently, we are not very consistent in what we assume the initial state of an LB policy will be and whether or not we assume that it will immediately report a new picker when it gets its initial address update; different parts of our code make different assumptions. This PR establishes the convention that LB policies will be assumed to start in state CONNECTING and will not be assumed to report a new picker immediately upon getting their initial address update, and we now assume that convention everywhere consistently. This is a preparatory step for changing policies like round_robin to delegate to pick_first, which I'm working on in #32692. As part of that change, we need pick_first to not report a connectivity state until it actually sees the connectivity state of the underlying subchannels, so that round_robin knows when to swap over to a new child list without reintroducing the problem fixed in #31939.	2 years ago
Esun Kim	37e9903ecb	[Build] Fix json error (#33051 ) To fix this error ``` test/core/security/grpc_authorization_engine_test.cc:88:32: error: unknown type name 'Json'; did you mean 'experimental::Json'? ParseAuditLoggerConfig(const Json&) override { ^~~~ experimental::Json ```	2 years ago
Mark D. Roth	1432fe4e4c	[JSON] make API public but experimental (#32987 ) This makes the JSON API visible as part of the C-core API, but in the `experimental` namespace. It will be used as part of various experimental APIs that we will be introducing in the near future, such as the audit logging API.	2 years ago
Ming-Chuan	6c2f4371bb	[Binder Transport] Flush ExecCtx in e2e test (#32971 ) WireWriter implementation schedules actions to be run by `ExecCtx`. We should flush pending actions before destructing `end2end_testing::g_transaction_processor`, which need to be alive to handle the scheduled actions. Otherwise, we get heap-use-after-free error because the testing fixture (`end2end_testing::g_transaction_processor`) is destructed before all the scheduled actions are run. This lowers end2end binder transport test failure rate from 0.23% to 0.15%, according to internal tool that runs the test for 15000 times under various configuration.	2 years ago
Mark D. Roth	e872fb91d9	[WRR] fix some edge cases in scheduler logic (#33045 ) This corresponds to two recent changes made to our internal implementation. See b/276292666 for details.	2 years ago
AJ Heller	3fb738b9b1	[EventEngine] Implement work-stealing in the EventEngine ThreadPool (#32869 ) This PR implements a work-stealing thread pool for use inside EventEngine implementations. Because of historical risks here, I've guarded the new implementation behind an experiment flag: `GRPC_EXPERIMENTS=work_stealing`. Current default behavior is the original thread pool implementation. Benchmarks look very promising: ``` bazel test \ --test_timeout=300 \ --config=opt -c opt \ --test_output=streamed \ --test_arg='--benchmark_format=csv' \ --test_arg='--benchmark_min_time=0.15' \ --test_arg='--benchmark_filter=_FanOut' \ --test_arg='--benchmark_repetitions=15' \ --test_arg='--benchmark_report_aggregates_only=true' \ test/cpp/microbenchmarks:bm_thread_pool ``` 2023-05-04: `bm_thread_pool` benchmark results on my local machine (64 core ThreadRipper PRO 3995WX, 256GB memory), comparing this PR to master: ![image](https://user-images.githubusercontent.com/295906/236315252-35ed237e-7626-486c-acfa-71a36f783d22.png) 2023-05-04: `bm_thread_pool` benchmark results in the Linux RBE environment (unsure of machine configuration, likely small), comparing this PR to master. ![image](https://user-images.githubusercontent.com/295906/236317164-2c5acbeb-fdac-4737-9b2d-4df9c41cb825.png) --------- Co-authored-by: drfloob <drfloob@users.noreply.github.com>	2 years ago
Yijie Ma	7df0e11755	[EventEngine] Change TXT lookup result type to std::vector<std::string> (#33030 ) One TXT lookup query can return multiple TXT records (see the following example). `EventEngine::DNSResolver` should return all of them to let the caller (e.g. `event_engine_client_channel_resolver`) decide which one they would use. ``` $ dig TXT wikipedia.org ; <<>> DiG 9.18.12-1+build1-Debian <<>> TXT wikipedia.org ;; global options: +cmd ;; Got answer: ;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 49626 ;; flags: qr rd ra; QUERY: 1, ANSWER: 3, AUTHORITY: 0, ADDITIONAL: 1 ;; OPT PSEUDOSECTION: ; EDNS: version: 0, flags:; udp: 512 ;; QUESTION SECTION: ;wikipedia.org. IN TXT ;; ANSWER SECTION: wikipedia.org. 600 IN TXT "google-site-verification=AMHkgs-4ViEvIJf5znZle-BSE2EPNFqM1nDJGRyn2qk" wikipedia.org. 600 IN TXT "yandex-verification: 35c08d23099dc863" wikipedia.org. 600 IN TXT "v=spf1 include:wikimedia.org ~all" ``` Note that this change also deviates us from the iomgr's DNSResolver API which uses std::string as the result type. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
AJ Heller	ee0aaacbde	Revert "[ObjC] CF Stream Event Engine Client" (#33027 ) Reverts grpc/grpc#32924. This breaks the build again, unfortunately. From `test/core/event_engine/cf:cf_engine_test`: ``` error: module .../grpc/test/core/event_engine/cf:cf_engine_test does not depend on a module exporting 'grpc/support/port_platform.h' ``` @sampajano I recommend looking into CI tests to catch iOS problems before merging. We can enable EventEngine experiments in the CI generally once this PR lands, but this broken test is not one of those experiments. A normal build should have caught this. cc @HannahShiSFB	2 years ago
Hannah Shi	d0c1809840	[ObjC] CF Stream Event Engine Client (#32924 ) bazel build --config=macos --genrule_strategy=local --copt="-DGRPC_CFSTREAM=1" //test/cpp/end2end:cfstream_test succeeds Fixing failure described here: https://github.com/grpc/grpc/pull/32882#issuecomment-1512210309	2 years ago
Mark D. Roth	1fcaccdf5f	[client channel] Second attempt: use ChunkedVector for call attributes (#33015 ) Original was #33002, reverted in #33014. The second commit here adds a build visibility tag necessary to fix the internal build problems.	2 years ago
AJ Heller	18aab6ffb5	Revert "[client channel] use ChunkedVector for call attributes" (#33014 ) Reverts grpc/grpc#33002. Breaks internal builds: `.../privacy_context:filters does not depend on a module exporting '.../src/core/lib/channel/context.h'`	2 years ago
Mark D. Roth	2f89fd5528	[client channel] use ChunkedVector for call attributes (#33002 ) Change call attributes to be stored in a `ChunkedVector` instead of `std::map<>`, so that the storage can be allocated on the arena. This means that we're now doing a linear search instead of a map lookup, but the total number of attributes is expected to be low enough that that should be okay. Also, we now hide the actual data structure inside of the `ServiceConfigCallData` object, which required some changes to the `ConfigSelector` API. Previously, the `ConfigSelector` would return a `CallConfig` struct, and the client channel would then use the data in that struct to populate the `ServiceConfigCallData`. This PR changes that such that the client channel creates the `ServiceConfigCallData` before invoking the `ConfigSelector`, and it passes the `ServiceConfigCallData` into the `ConfigSelector` so that the `ConfigSelector` can populate it directly.	2 years ago
Luwei Ge	4c7da485c5	[xDS] Protect RBAC audit logging options field with environment variable. (#33004 ) The protection is added at `xds_http_rbac_filter.cc` where we read the new field. With this disabling the feature, nothing from things like `xds_audit_logger_registry.cc` shall be invoked.	2 years ago
Craig Tiller	ad41fe96b6	[promises] Re-enable C++ end2end tests (with fixes) (#32837 ) Makes some awkward fixes to compression filter, call, connected channel to hold the semantics we have upheld now in tests. Once the fixes described here https://github.com/grpc/grpc/blob/master/src/core/lib/channel/connected_channel.cc#L636 are in this gets a lot less ad-hoc, but that's likely going to be post-landing promises client & server side. We specifically need special handling for server side cancellation in response to reads wrt the inproc transport - which doesn't track cancellation thoroughly enough itself. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. --> --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
Craig Tiller	65a2a895af	[chttp2] Fix some fuzzer found bugs. (#33005 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Luwei Ge	abc82b9e19	[Audit Logging] Audit logging support in authorization engines. (#32995 ) 1. `GrpcAuthorizationEngine` creates the logger from the given config in its ctor. 2. `Evaluate()` invokes audit logging when needed. --------- Co-authored-by: rockspore <rockspore@users.noreply.github.com>	2 years ago
Craig Tiller	79e46a6022	[channelz] Save some memory per channel (#32996 ) Whilst the per cpu counters probably help single channel contention, we think it's likely that they're a pessimization when taken fleetwide. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Luwei Ge	3541ef5d69	[Audit Logging] Authz policy support for audit logging (#32944 ) Add audit condition and audit logger config into `grpc_core::Rbac`. Support translation of audit logging options from authz policy to it. Audit logging options in authz policy looks like: ```json { "audit_logging_options": { "audit_condition": "ON_DENY", "audit_loggers": [ { "name": "logger", "config": {}, "is_optional": false } ] } } ``` which is consistent with what's in the xDS RBAC proto but a little flattened. --------- Co-authored-by: rockspore <rockspore@users.noreply.github.com>	2 years ago
Mark D. Roth	844e740183	[JSON] Replace ctors with factory methods (#32834 )	2 years ago
Eugene Ostroukhov	ac228814a0	[core] Expand core attributes to hold values of any type (#32835 )	2 years ago
Luwei Ge	f02ce240d7	[xDS] pass HTTP filter name to `GenerateServiceConfig()` method. (#32976 ) We need the RBAC filter name as the `policy_name` field in audit logging context.	2 years ago
Craig Tiller	9f00eda536	[examine-stack] Try to unblock Ubuntu 20.04 (#32975 ) Try a different approach to this test and check some non-leaf functions in the returned text - looks like we're running into problems getting the leaf function out of the stack trace on that platform (which is probably fine): https://source.cloud.google.com/results/invocations/09e8e1ea-df48-4fdb-96dd-916bd5014f90/targets/%2F%2Ftest%2Fcore%2Fgprpp:examine_stack_test/tests Needed to unblock https://github.com/grpc/grpc/pull/32748 <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
AJ Heller	a9afd1cde8	[test] Re-land: Enable EventEngine experiments for Posix end2end tests (#32948 ) Relands #32844. End2end tests will now wait for the default EventEngine to shut down between tests. This should avoid some use-after-frees and leaks.	2 years ago
nanahpang	d1dda5c8a2	[Fix fuzzer error] Memory address points to zero page. (#32894 ) Found memory access error in frame_fuzzer_test. Located the root cause in ExecCtx::Get(), where ExecCtx needs to be initialized before using HPackParser:ParseInput(). Error logs: MemorySanitizer:DEADLYSIGNAL ==2812845==ERROR: MemorySanitizer: SEGV on unknown address 0x000000000030 (pc 0x55869275574e bp 0x7fffd7d9fb50 sp 0x7fffd7d9fb20 T2812845) ==2812845==The signal is caused by a READ memory access. ==2812845==Hint: address points to the zero page. #0 0x55869275574e in starting_cpu [third_party/grpc/src/core/lib/iomgr/exec_ctx.h:129](https://cs.corp.google.com/piper///depot/google3/third_party/grpc/src/core/lib/iomgr/exec_ctx.h?l=129&ws=ladynana/2900&snapshot=42):9 #1 0x55869275574e in grpc_core::PerCpu<grpc_core::GlobalStatsCollector::Data>::this_cpu() [third_party/grpc/src/core/lib/gprpp/per_cpu.h:38](https://cs.corp.google.com/piper///depot/google3/third_party/grpc/src/core/lib/gprpp/per_cpu.h?l=38&ws=ladynana/2900&snapshot=42):48 #2 0x558692753cda in IncrementHttp2MetadataSize [third_party/grpc/src/core/lib/debug/stats_data.h:265](https://cs.corp.google.com/piper///depot/google3/third_party/grpc/src/core/lib/debug/stats_data.h?l=265&ws=ladynana/2900&snapshot=42):11 #3 0x558692753cda in grpc_core::HPackParser::ParseInput(grpc_core::HPackParser::Input, bool) [third_party/grpc/src/core/ext/transport/chttp2/transport/hpack_parser.cc:933](https://cs.corp.google.com/piper///depot/google3/third_party/grpc/src/core/ext/transport/chttp2/transport/hpack_parser.cc?l=933&ws=ladynana/2900&snapshot=42):20 <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Mark D. Roth	020e9b4dd6	[WRR] Remove env var guard for WRR policy (#32936 ) - remove the `_experimental` suffix from the gRPC policy name - remove the env var guard for the xDS policy config	2 years ago
Luwei Ge	dcfc5d6904	[Audit Logging] Logger and factory APIs in C-Core and C++. (#32750 ) Audit logging APIs for both built-in loggers and third-party logger implementations. C++ uses using decls referring to C-Core APIs. --------- Co-authored-by: rockspore <rockspore@users.noreply.github.com>	2 years ago
Craig Tiller	706352a86e	[filter-fuzzer] Disable this fuzzer until its ready. (#32929 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. --> --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
Luwei Ge	2917804b9a	[Audit Logging] Xds Audit Logger Registry (#32828 ) Third-party loggers will be added in subsequent PRs once the logger factory APIs are available to validate the configs here. This registry is used in `xds_http_rbac_filter.cc` to generate service config json.	2 years ago
Vignesh Babu	c515eba30b	[Transport] Update Chttp2 context list to include relative offset of traced RPCs within outgoing buffer (#32825 ) The PR also creates a separate BUILD target for: - chttp2 context list - iomgr buffer_list - iomgr internal errqueue This would allow the context list to be included as standalone dependencies for EventEngine implementations.	2 years ago
apolcyn	017d9943ef	[XDS] Revert "Revert "XDS: enable XDS federation by default (#32711 )" (#32814 ) (#32902 ) Previous lack-of-load-reporting issue has been fixed (b/276944116)	2 years ago
Craig Tiller	efa939ac1f	[cleanup] Remove public_headers_must_be_c89 test (#32898 ) We're starting to introduce C++ APIs to C-core, so this test is no longer relevant.	2 years ago
Craig Tiller	5da7cbb2c8	[gprpp] Better test for examine_stack (#32897 ) In order to help https://github.com/grpc/grpc/pull/32748, change the test so that it tells us what the problem is in the logs. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago

1 2 3 4 5 ...

7871 Commits (100231973605df0c29620364f15215f460fe3329)