Chiebot-Mirror/grpc - grpc - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Craig Tiller	cd96210215	[fuzzer] Fix event ordering in retry_max_concurrent_streams (#33454 ) Just like #33405	1 year ago
Craig Tiller	133640507b	[end2end] Better logging, and a fix (by increasing timeout) on disappearing_server_test (#33444 ) I've had local runs with a 10 second gap between creating the call and issuing the first batch client side. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	1 year ago
Craig Tiller	2055cce132	[fuzzing] Fix bug on endpoint shutdown whereby we leave read requests dangling (#33406 ) Fix fuzzer found bug b/286716972 Follows up on https://github.com/grpc/grpc/pull/33266 but gets the edge case right of when there's a read queued before the peer closes - in that case we weren't waking up the read.	1 year ago
Craig Tiller	51a6857fa1	[end2end] Shard some longer running tests a little more to reduce timeout risk (#33439 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	1 year ago
Craig Tiller	7e6606f5a6	[windows] Add a check for too long path names (#33418 ) We should probably cap this so that our customers have a chance of cloning the repository. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
Craig Tiller	8e137e524a	[end2end] Reduce likelihood of two tests opening the same UDS socket (#33410 )	2 years ago
Craig Tiller	14c63c70af	[fuzzing] Fix recursive mutex acquisition in fuzzing event engine (#33404 ) Fixes internal fuzzing bug b/286717107	2 years ago
Craig Tiller	495bcbcb74	[fuzzing] Increase timeout to accomodate slow internal callbacks (#33407 ) Fixes b/286780969	2 years ago
Craig Tiller	eba6cecdcb	[fuzzing] Remove ambiguity in test case, make fuzzer pass (#33405 ) Here the recv message batch 103 was returning end of stream. Per the reasoning in https://github.com/grpc/proposal/blob/master/L104-core-ban-recv-with-send-status.md Sending status is the final thing for a call on the server, so requiring a recv message to complete when we've sent status is getting into at best a gray area in out spec. Add a strict ordering between that recv and the sending of status to make a more deterministic test. fixes b/286708835, b/286727273	2 years ago
Sergii Tkachenko	de6ed9ba9f	[Python] Migrate from yapf to black (#33138 ) - Switched from yapf to black - Reconfigure isort for black - Resolve black/pylint idiosyncrasies Note: I used `--experimental-string-processing` because black was producing "implicit string concatenation", similar to what described here: https://github.com/psf/black/issues/1837. While currently this feature is experimental, it will be enabled by default: https://github.com/psf/black/issues/2188. After running black with the new string processing so that the generated code merges these `"hello" " world"` strings concatenations, then I removed `--experimental-string-processing` for stability, and regenerated the code again. To the reviewer: don't even try to open "Files Changed" tab 😄 It's better to review commit-by-commit, and ignore `run black and isort`.	2 years ago
Craig Tiller	82534bab24	[end2end] Shard some longer running tests a little more to reduce timeout risk (#33387 ) Also drop a few deadlines so that tests can run faster (where that's safe) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Craig Tiller	d8f3dab96c	[end2end] Binary & fuzzer per test .cc (#33374 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Craig Tiller	72da46fa5c	[fuzzing] Handle closing after the final write in fuzzing-event-engine (#33266 ) If an endpoint closes it should still report any pending writes.	2 years ago
Craig Tiller	f964961ba8	[fuzzing] Tune down max delay (#33345 ) This had been intended to be 500ms for the first round, but inadvertently got bumped up during some last minute investigations. Tune it back down, let things settle out, and then see whether we want to increase it or not. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Craig Tiller	21996c3784	[end2end] Force crash on failure to receive an event (#33260 ) When I converted these tests there was an intent to use gtest failures to see past an unexpected event/no event received error - however that doesn't work out because our tests rely on the prior events happening. What we got instead was misattributed failures, folks chasing wild theories on uninitialized data, dogs and cats living together, mass hysteria. Let's just crash and see if it makes diagnosis actually easier. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Craig Tiller	bc70a67e94	[fuzzer] Increase timeouts to accommodate delayed callbacks (#33271 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Craig Tiller	ea58add8bf	[end2end] Fix fuzzer found memory leak (#33264 ) (this is a bug in the core e2e fuzzer infrastructure)	2 years ago
Craig Tiller	e0ba7b720a	[fuzzing] Increase timeout to accommodate delayed callbacks (#33267 ) Put enough internal delays into this test and it hits deadline exceeded... extend the deadline to cover that. (this is likely to become a common edit over the next few weeks...)	2 years ago
Craig Tiller	7a3e2e45da	[fuzzer] Change core_end2end_test_fuzzer test selection scheme (#33197 ) Use an index instead of a string to select tests (and use that index module total test count to ensure whatever the fuzzer selects we always run a test). This will make the fuzzer corpus unstable when the test count changes, which I think is fine - it'll regenerate. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
AJ Heller	4fde5dabf6	[fuzzing] Extract and modernize ChannelArgs fuzzer configuration (#33161 ) ChannelArgs fuzz configuration is expected to be used in other fuzzing targets as well. This PR extracts the common code from the API fuzzer and converts to use the C++ types.	2 years ago
Vignesh Babu	9c59671936	[testing] Skip more flaky event engine tests (#33160 ) Expand the set with more new flaky tests.	2 years ago
Craig Tiller	5fac4ad47b	[fuzzing] Improve OSA distance performance (#33149 ) Early out evaluating this function where we can, and use macros to eliminate function calls in debug builds. Takes per-example time from 5400ms to 1200ms in debug asan builds. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. --> --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
Mark D. Roth	a78001a087	[resolver] remove unused ctor for ServerAddress (#33148 ) Co-authored-by: markdroth <markdroth@users.noreply.github.com>	2 years ago
Vignesh Babu	63ecc4ba3e	[testing] Temporarily skip flaky event engine tests. (#33136 ) Based on flaky tests reported by dashboard: https://dashboards.corp.google.com/stubby_team.grpc_flaky_dashboard#1318s66d4f	2 years ago
Craig Tiller	239d3e6857	[fuzzing] Allow core_end2end_test_fuzzer, api_fuzzer to change experiments (#33147 ) They were intended to be able to, but since these are currently frozen across the process it wasn't possible. Fix that for these fuzzers.	2 years ago
Craig Tiller	cd44a2433e	[call] Dont take grpclb_client_stats from the app (#33118 ) This metadata doesn't actually encode so passing it through from an app will force a crash. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Vignesh Babu	915d7c4a70	[Fuzzing] Bound RunAfter duration in fuzzing event engine (#33128 ) Bounds duration to 1 year. Fixes b/258949216	2 years ago
Craig Tiller	997af8d073	[api_fuzzer] Attempt to clean up fuzzer memory leak (#33120 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Craig Tiller	66d9f52fbd	[api-fuzzer] Fix memory leak (#33109 ) ApiFuzzer::CreateChannel() called twice creates two channels but doesn't delete the first. Choose some reasonable behavior.	2 years ago
Craig Tiller	123811399b	[promises] Remove bad log statement (#33113 ) Was leading to a nullptr deref, and we just don't need this one anymore. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Craig Tiller	18d369a6f4	[fuzzing] Avoid initialization order fiasco in core_end2end_test_fuzzer (#33108 )	2 years ago
Craig Tiller	9760ce9d0a	[end2end] Shorten corpora filenames (#33095 ) Avoids long path name problems on Windows <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Craig Tiller	4674f2ccf7	[fuzz] Turn core end2end tests into fuzzers (#33013 ) Add a new binary that runs all core end2end tests in fuzzing mode. In this mode FuzzingEventEngine is substituted for the default event engine. This means that time is simulated, as is IO. The FEE gets control of callback delays also. In our tests the `Step()` function becomes, instead of a single call to `completion_queue_next`, a series of calls to that function and `FuzzingEventEngine::Tick`, driving forward the event loop until progress can be made. PR guide: --- New binaries `core_end2end_test_fuzzer` - the new fuzzer itself `seed_end2end_corpus` - a tool that produces an interesting seed corpus Config changes for safe fuzzing The implementation tries to use the config fuzzing work we've previously deployed in api_fuzzer to fuzz across experiments. Since some experiments are far too experimental to be safe in such fuzzing (and this will always be the case): - a new flag is added to experiments to opt-out of this fuzzing - a new hook is added to the config system to allow variables to re-write their inputs before setting them during the fuzz Event manager/IO changes Changes are made to the event engine shims so that tcp_server_posix can run with a non-FD carrying EventEngine. These are in my mind a bit clunky, but they work and they're in code that we expect to delete in the medium term, so I think overall the approach is good. Changes to time A small tweak is made to fix a bug initializing time for fuzzers in time.cc - we were previously failing to initialize `g_process_epoch_cycles` Changes to `Crash` A version that prints to stdio is added so that we can reliably print a crash from the fuzzer. Changes to CqVerifier Hooks are added to allow the top level loop to hook the verification functions with a function that steps time between CQ polls. Changes to end2end fixtures State machinery moves from the fixture to the test infra, to keep the customizations for fuzzing or not in one place. This means that fixtures are now just client/server factories, which is overall nice. It did necessitate moving some bespoke machinery into h2_ssl_cert_test.cc - this file is beginning to be problematic in borrowing parts but not all of the e2e test machinery. Some future PR needs to solve this. A cq arg is added to the Make functions since the cq is now owned by the test and not the fixture. Changes to test registration `TEST_P` is replaced by `CORE_END2END_TEST` and our own test registry is used as a first depot for test information. The gtest version of these tests: queries that registry to manually register tests with gtest. This ultimately changes the name of our tests again (I think for the last time) - the new names are shorter and more readable, so I don't count this as a regression. The fuzzer version of these tests: constructs a database of fuzzable tests that it can consult to look up a particular suite/test/config combination specified by the fuzzer to fuzz against. This gives us a single fuzzer that can test all 3k-ish fuzzing ready tests and cross polinate configuration between them. Changes to test config The zero size registry stuff was causing some problems with the event engine feature macros, so instead I've removed those and used GTEST_SKIP in the problematic tests. I think that's the approach we move towards in the future. Which tests are included Configs that are compatible - those that do not do fd manipulation directly (these are incompatible with FuzzingEventEngine), and those that do not join threads on their shutdown path (as these are incompatible with our cq wait methodology). Each we can talk about in the future - fd manipulation would be a significant expansion of FuzzingEventEngine, and is probably not worth it, however many uses of background threads now should probably evolve to be EventEngine::Run calls in the future, and then would be trivially enabled in the fuzzers. Some tests currently fail in the fuzzing environment, a `SKIP_IF_FUZZING` macro is used for these few to disable them if in the fuzzing environment. We'll burn these down in the future. Changes to fuzzing_event_engine Changes are made to time: an exponential sweep forward is used now - this catches small time precision things early, but makes decade long timers (we have them) able to be used right now. In the future we'll just skip time forward to the next scheduled timer, but that approach doesn't yet work due to legacy timer system interactions. Changes to port assignment: we ensure that ports are legal numbers before assigning them via `grpc_pick_port_or_die`. A race condition between time checking and io is fixed. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
Mark D. Roth	1fcaccdf5f	[client channel] Second attempt: use ChunkedVector for call attributes (#33015 ) Original was #33002, reverted in #33014. The second commit here adds a build visibility tag necessary to fix the internal build problems.	2 years ago
AJ Heller	18aab6ffb5	Revert "[client channel] use ChunkedVector for call attributes" (#33014 ) Reverts grpc/grpc#33002. Breaks internal builds: `.../privacy_context:filters does not depend on a module exporting '.../src/core/lib/channel/context.h'`	2 years ago
Mark D. Roth	2f89fd5528	[client channel] use ChunkedVector for call attributes (#33002 ) Change call attributes to be stored in a `ChunkedVector` instead of `std::map<>`, so that the storage can be allocated on the arena. This means that we're now doing a linear search instead of a map lookup, but the total number of attributes is expected to be low enough that that should be okay. Also, we now hide the actual data structure inside of the `ServiceConfigCallData` object, which required some changes to the `ConfigSelector` API. Previously, the `ConfigSelector` would return a `CallConfig` struct, and the client channel would then use the data in that struct to populate the `ServiceConfigCallData`. This PR changes that such that the client channel creates the `ServiceConfigCallData` before invoking the `ConfigSelector`, and it passes the `ServiceConfigCallData` into the `ConfigSelector` so that the `ConfigSelector` can populate it directly.	2 years ago
AJ Heller	a9afd1cde8	[test] Re-land: Enable EventEngine experiments for Posix end2end tests (#32948 ) Relands #32844. End2end tests will now wait for the default EventEngine to shut down between tests. This should avoid some use-after-frees and leaks.	2 years ago
Yijie Ma	1d10ca77ce	[Fuzzing] Migrate client and server_fuzzer to structured fuzzing (#32878 ) - Added `fuzzer_input.proto` and `NetworkInput` proto message - Migrated client_fuzzer and server_fuzzer to proto fuzzer - Migrated the existing corpus and verified that the code coverage (e.g. chttp2) stays the same Probably need to cherrypick due to amount of files changed. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Vignesh Babu	a2c89d0b24	[fuzzing] Define a common fuzzing interface and move API fuzzer to it (#32853 ) Requires cherrypick for grpc_fuzzer.bzl file.	2 years ago
AJ Heller	ca92648aa3	Revert "[test] Enable EventEngine experiments for Posix end2end tests." (#32855 ) Reverts grpc/grpc#32844. CI revealed multiple EventEngine issues overnight.	2 years ago
AJ Heller	b16bf18bc3	[test] Enable EventEngine experiments for Posix end2end tests. (#32844 ) This enables the EventEngine experiments in end2end tests, excluding the ResourceQuota tests which have known failures. Some Windows tests are hanging, so they will be enabled later. --------- Co-authored-by: drfloob <drfloob@users.noreply.github.com>	2 years ago
Craig Tiller	63c094cf5b	[promises] Run C++ end to end tests with server promises (#32537 ) Expand server promises to run with C++ end2end tests. Across connected_channel/call/batch_builder/pipe/transport: - fix a bug where read errors weren't propagated from transport to call so that we can populate failed_before_recv_message for the c++ bindings - ensure those errors are not, however, used to populate the returned call status Add a new latch call arg to lazily propagate the bound CQ for a server call (and client call, but here it's used degenerately - it's always populated). This allows server calls to be properly bound to pollsets.(1)/(2) In call.cc: - move some profiling code from FilterStackCall to Call, and then use it in PromiseBasedCall (this should be cleaned up with tracing work) - implement GetServerAuthority In server.cc: - use an RAII pattern on `MatchResult` to avoid a bug whereby a tag could be dropped if we cancel a request after it's been matched but before it's published - fix deadline export to ServerContext In resource_quota_server.cc: - fix some long standing flakes (that were finally obvious with the new test code) - it's legal here to have client calls not arrive at the server due to resource starvation, work through that (includes adding expectations during a `Step` call, which required some small tweaks to cq_verifier) In the C++ end2end_test.cc: - strengthen a flaky test so it passes consistently (it's likely we'll revisit this with the fuzzing efforts to strengthen it into an actually robust test) (1) It's time to remove this concept (2) Surprisingly the only test that reliably demonstrates this not being done is time_change_test --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
Craig Tiller	afddf1a70c	[chttp2] Better error message on metadata size exceeded message (#32809 ) This error can trigger for either initial or trailing metadata (and we've had outages where the latter was the cause). I don't think we know at this layer if we're parsing initial or trailing - though it'd be a good exercise to plumb that through. For now remove the word initial because it's better to give less information than wrong information. <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Alisha Nanda	4e2f92bf9c	[metadata] Fix fuzzer bug with metadata arg. (#32787 ) Bug: b/276525236.	2 years ago
Craig Tiller	1f0630fd91	[core-test] Ensure grpc is fully shutdown between e2e tests (#32797 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. -->	2 years ago
Craig Tiller	724441d85b	[tests] Convert core e2e tests to gtest (#32603 ) Notes: - `+trace` fixtures haven't run since 2016, so they're disabled for now (`7ad2d0b463 (diff-780fce7267c34170c1d0ea15cc9f65a7f4b79fefe955d185c44e8b3251cf9e38R76)`) - all current fixtures define `FEATURE_MASK_SUPPORTS_AUTHORITY_HEADER` and hence `authority_not_supported` has not been run in years - deleted - bad_hostname similarly hasn't been triggered in a long while, so deleted - load_reporting_hook has never been enabled, so deleted (`f23fb4cf31/test/core/end2end/generate_tests.bzl (L145-L148)`) - filter_latency & filter_status_code rely on global variables and so don't convert particularly cleanly - and their value seems marginal, so deleted --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
Craig Tiller	db3daf567b	[api-fuzzer] Enable fuzzing over config vars (#32736 ) Add the capability for api-fuzzer to fuzz over different config variables, to enable us to spot incompatible configurations there sooner. --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
Craig Tiller	175ccc3a90	Reland global config changes (#32661 ) <!-- If you know who should review your pull request, please assign it to that person, otherwise the pull request would get assigned randomly. If your pull request is for a specific language, please add the appropriate lang label. --> --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
Craig Tiller	a363b6c001	[fuzzing] Implement endpoints for FuzzingEventEngine (#32689 ) Implement listeners, connection, endpoints for `FuzzingEventEngine`. Allows the fuzzer to select write sizes and delays, connection delays, and port assignments. I made a few modifications to the test suite to admit this event engine to pass the client & server tests: 1. the test factories return shared_ptr<> to admit us to return the same event engine for both the oracle and the implementation - necessary because FuzzingEventEngine forms a closed world of addresses & ports. 2. removed the WaitForSingleOwner calls - these seem unnecessary, and we don't ask our users to do this - tested existing linux tests 1000x across debug, asan, tsan with this change Additionally, the event engine overrides the global port picker logic so that port assignments are made by the fuzzer too. This PR is a step along a longer journey, and has some outstanding brethren PR's, and some follow-up work: * #32603 will convert all the core e2e tests into a more malleable form * we'll then use #32667 to turn all of these into fuzzers * finally we'll integrate this into that work and turn all core e2e tests into fuzzers over timer & callback reorderings and io size/spacings --------- Co-authored-by: ctiller <ctiller@users.noreply.github.com>	2 years ago
Alisha Nanda	19d06a78ec	Add random early rejection for metadata (#32600 ) (hopefully last try) Add new channel arg GRPC_ARG_ABSOLUTE_MAX_METADATA_SIZE as hard limit for metadata. Change GRPC_ARG_MAX_METADATA_SIZE to be a soft limit. Behavior is as follows: Hard limit (1) if hard limit is explicitly set, this will be used. (2) if hard limit is not explicitly set, maximum of default and soft limit * 1.25 (if soft limit is set) will be used. Soft limit (1) if soft limit is explicitly set, this will be used. (2) if soft limit is not explicitly set, maximum of default and hard limit * 0.8 (if hard limit is set) will be used. Requests between soft and hard limit will be rejected randomly, requests above hard limit will be rejected.	2 years ago

... 3 4 5 6 7 ...

3003 Commits (406fbf07a4429d2ca23ee93a5c0ee06ceca9fc4f)