c-ares

Commit Graph

Author	SHA1	Message	Date
Brad House	f68992a159	Propagate record duplication error code (#820 ) In c-ares 1.30.0 we started validating strings parsed are printable. This caused a regression in a pycares test case due to a wrong response code being returned as the error was being propagated from a different section of code that was assuming the only possible failure condition was out-of-memory. This PR adds a fix for this and also a test case to validate it. Ref: https://github.com/saghul/pycares/issues/200 Fix By: Brad House (@bradh352)	4 months ago
Brad House	ef6a3dfe76	CI: Add solaris (#814 )	4 months ago
Brad House	b19c186ce7	Rework WinAFD event code (#811 ) We've had reports of use-after-free type crashes in Windows cleanup code for the Event Thread. In evaluating the code, it appeared there were some memory leaks on per-connection handles that may have remained open during shutdown. While trying to resolve that, it became apparent the methodology chosen may not have been the right one for interfacing with the Windows AFD system, as stability issues were seen during this debugging process. Since this system is completely undocumented, there was no clear resolution path other than to switch to the other methodology, which involves directly opening `\Device\Afd` rather than spawning a "peer socket" to use to queue AFD operations. The original methodology more closely resembled what is employed by [libuv](https://github.com/libuv/libuv), and libuv's widespread use was the reason it was chosen. The new methodology more closely resembles [wepoll](https://github.com/piscisaureus/wepoll). It's not clear if there are any scalability or performance advantages or disadvantages for either method. They both seem like different ways to do the same thing, but this current way does seem more stable. Fixes #798 Fix By: Brad House (@bradh352)	4 months ago
Brad House	f90a81ed81	tests: use std::chrono instead of pulling in ares__tvnow and ares__timeval_remaining (#809 ) This will allow more tests to run even when internal symbols aren't accessible. Fix By: Brad House (@bradh352)	5 months ago
Brad House	b649b85917	tests: fix compile warning	5 months ago
Brad House	614bdd88b9	Tests: fix test cleanup race condition (#803 ) A thread was passed data for processing that was cleaned up before the thread exited, which could cause a use-after-free in the test suite. This doesn't affect c-ares itself. This was found while trying to reproduce #798 but appears unrelated. Don't use a helper thread, as it isn't necessary. Fix By: Brad House (@bradh352)	5 months ago
Brad House	378d26144d	DNS RR TXT strings should not be automatically concatenated (#801 ) As per #738, there are use cases where the DNS TXT record strings should not be concatenated like RFC 7208 indicates. We cannot break ABI with those using the new API, so we need to support retrieving the concatenated version as well as a new API to retrieve the individual strings which will be used by `ares_parse_text_reply_ext()` to restore the old behavior prior to c-ares 1.20. Fixes Issue: #738 Fix By: Brad House (@bradh352)	5 months ago
Brad House	70f10a85f3	DNS 0x20 implementation (#800 ) This PR enables DNS 0x20 as per https://datatracker.ietf.org/doc/html/draft-vixie-dnsext-dns0x20-00 . DNS 0x20 adds additional entropy to the request by randomly altering the case of the DNS question to help prevent cache poisoning attacks. Google DNS has implemented this support as of 2023, even though this is a proposed and expired standard from 2008: https://groups.google.com/g/public-dns-discuss/c/KxIDPOydA5M There have been documented cases of name server and caching server non-conformance, though it is expected to become more rare, especially since Google has started using this. This can be enabled via the `ARES_FLAG_DNS0x20` flag, which is currently disabled by default. The test cases do however enable this flag to validate this feature. Implementors using this flag will notice that responses will retain the mixed case, but since DNS names are case-insensitive, any proper implementation should not be impacted. There is currently no fallback mechanism implemented as it isn't immediately clear how this may affect a stub resolver like c-ares where we aren't querying the authoritative name server, but instead an intermediate recursive resolver where some domains may return invalid results while others return valid results, all while querying the same nameserver. Likely using DNS cookies as suggested by #620 is a better mechanism to fight cache poisoning attacks for stub resolvers. TCP queries do not use this feature even if the `ARES_FLAG_DNS0x20` flag is specified since they are not subject to cache poisoning attacks. Fixes Issue: #795 Fix By: Brad House (@bradh352)	5 months ago
Brad House	c96200353d	valgrind: fix warning in test case	5 months ago
Brad House	51ca744459	Clean up header inclusion, simplification (#797 ) The header inclusion logic in c-ares is hard to follow. Let's try to simplify the way it works to make it easier to understand and less likely to break on new code changes. There's still more work to be done, but this is a good start at simplifying things. Fix By: Brad House (@bradh352)	5 months ago
Brad House	9b6f197fec	warning in test	5 months ago
Brad House	1b8cfdedc9	build fix	5 months ago
Brad House	853244bc58	build fix	5 months ago
Brad House	54808a5190	fix comments	5 months ago
Brad House	8293a05f63	cleanup more warnings due to new compiler flags	5 months ago
Brad House	bbcb1a2bdf	clang-format	5 months ago
Brad House	827a1d523c	ares_queryloop: output server list	5 months ago
Brad House	f8d1e63840	ares_queryloop: properly capture CTRL-C and cleanup	5 months ago
Brad House	7ea18a83b3	test: clean up some minor warnings	5 months ago
Brad House	f9faa3f05c	try to work around windows ASAN issue by not using std::string::npos	5 months ago
Brad House	268092a390	MSVC: enable strict warnings (#792 ) MSVC has been building with /W3 which isn't considered a safe level for modern code. /W4 is recommended, but it too is lacking some recommended options, so we enable /W4 and also the recommended options. We do, however, have to disable a couple of options due to Windows headers not being fully compliant sometimes as well as some things we do in c-ares that it doesn't like, but aren't actually bad. Fix By: Brad House (@bradh352)	5 months ago
Brad House	4248c642d2	Enable QueryCache by default (#786 ) The query cache should be enabled by default. This will help with determining proper timeouts for #736. It can still be disabled by setting the ttl to 0. There should be no negative consequences of this in real-world scenarios since DNS is based on the TTL concept and upstream servers will cache results and not recurse based on this information anyhow. DNS queries and responses are very small, this should have negligible impact on memory consumption. Fix By: Brad House (@bradh352)	5 months ago
Brad House	f05465e59b	tests: set ndots:1 as default, don't honor system config as it may skew results	5 months ago
Brad House	c0d41d08ab	Coverage code annotations for identification of desirable paths that need testing (#775 ) Add code annotations for ignoring specific code paths for coverage calculations. The primary purpose of this is to make it easy to see the code paths that we could (and probably should) write test cases for, as these would have the most impact on delivery of a stable product. The annotations used are: `LCOV_EXCL_LINE: <designation>`, `LCOV_EXCL_START: <designation>`, `LCOV_EXCL_STOP` Unfortunately `LCOV_EXCL_BR_LINE` does not appear to be supported by coveralls as it would have been a more elegant solution over START/STOP. We specifically include the `<designation>` not just for future reference but because it makes it easy to identify in case we want to address these conditions in a different way in the future. The main areas designated for exclusion are: 1. `OutOfMemory` - these are hard-to-test cases, and on modern systems, are likely to never occur due to optimistic memory allocations, which can then later cause the kernel to terminate your application due to memory not actually being available. c-ares does have some testing framework for this; if we wish to expand in the future, we can easily use sed to get rid of these annotations. 2. `DefensiveCoding` - these are impossible to reach paths at the point in time the code was written. They are there for defensive coding in case code is refactored in the future to prevent unexpected behavior. 3. `UntestablePath` - these are code paths that aren't possible to test, such as failure of a system call. 4. `FallbackCode` - This is an entire set of code that is untestable because it's not able to simulate a failure of the primary path. This PR also does add some actual coverage in the test cases where it is easy to do. Fix By: Brad House (@bradh352)	5 months ago
Gregor Jasny	9d36fd2030	fix some obvious errors reported by the CLion Project Analyzer (#779 ) Fix By: Gregor Jasny (@gjasny)	6 months ago
Brad House	6129d9b79f	Basic support for SIG RR record (RFC 2931 / RFC 2535) (#773 ) With the current c-ares parser, as per PR #765 parsing was broken due to validation that didn't understand the `SIG` record class. This PR adds basic, non validating, and incomplete support for the `SIG` record type. The additional `KEY` and `NXT` which would be required for additional verification of the records is not implemented. It also does not store the raw unprocessed RR data that would be required for the validation. The primary purpose of this PR is to be able to recognize the record and handle some periphery aspects such as validation of the class associated with the RR and to not honor the TTL in the RR in the c-ares query cache since it will always be 0. Fixes #765 Fix By: Brad House (@bradh352)	6 months ago
Brad House	f70f09f01c	Fix windows y2k38 issue by creating our own timeval datatype (#772 ) As per Issue #760, the use of `struct timeval` is meant for only time differentials, however it could be used to denote an exact timeout. This could lead to y2k38 issues on some platforms. Fixes Issue #760 Fix By: Brad House (@bradh352)	6 months ago
Brad House	7497991ae5	clang-format	6 months ago
Brad House	8d80486e04	Auto reload config on changes (requires EventThread) (#759 ) Automatically detect configuration changes and reload. On systems which provide notification mechanisms, use those, otherwise fallback to polling. When a system configuration change is detected, it asynchronously applies the configuration in order to ensure it is a non-blocking operation for any queries which may still be being processed. On Windows, however, changes aren't detected if a user manually sets/changes the DNS servers on an interface, it doesn't appear there is any mechanism capable of this. We are relying on `NotifyIpInterfaceChange()` for notifications. Fixes Issue: #613 Fix By: Brad House (@bradh352)	6 months ago
Oliver Welsh	89a8856cca	Add observability into DNS server health via a server state callback, invoked whenever a query finishes (#744 ) Summary This PR adds a server state callback that is invoked whenever a query to a DNS server finishes. The callback is invoked with the server details (as a string), a boolean indicating whether the query succeeded or failed, flags describing the query (currently just indicating whether TCP or UDP was used), and custom userdata. This can be used by user applications to gain observability into DNS server health and usage. For example, alerts when a DNS server fails/recovers or metrics to track how often a DNS server is used and responds successfully. Testing Three new regression tests `MockChannelTest.ServStateCallback*` have been added to test the new callback in different success/failure scenarios. Fix By: Oliver Welsh (@oliverwelsh)	7 months ago
Oliver Welsh	09e82e05a3	Improve reliability in the server retry delay regression tests (#747 ) Improve reliability in the server retry delay regression tests by increasing the retry delay and sleeping for a little more than the retry delay when attempting to force retries. This helps to account for unreliable timing (e.g. NTP slew) intermittently breaking pipelines. Fix By: Oliver Welsh (@oliverwelsh)	7 months ago
Oliver Welsh	fd81f36d3e	Add server failover retry behavior, where failed servers are retried with small probability after a minimum delay (#731 ) Summary By default c-ares will select the server with the least number of consecutive failures when sending a query. However, this means that if a server temporarily goes down and hits failures (e.g. a transient network issue), then that server will never be retried until all other servers hit the same number of failures. This is an issue if the failed server is preferred to other servers in the list. For example if a primary server and a backup server are configured. This PR adds new server failover retry behavior, where failed servers are retried with small probability after a minimum delay has passed. The probability and minimum delay are configurable via the `ARES_OPT_SERVER_FAILOVER` option. By default c-ares will use a probability of 10% and a minimum delay of 5 seconds. In addition, this PR includes a small change to always close out connections to servers which have hit failures, even with `ARES_FLAG_STAYOPEN`. It's possible that resetting the connection can resolve some server issues (e.g. by resetting the source port). Testing A new set of regression tests have been added to test the new server failover retry behavior. Fixes Issue: #717 Fix By: Oliver Welsh (@oliverwelsh)	7 months ago
Brad House	458c937213	Allow configuration value for NDots to be zero (#735 ) As per Issue #734 some people use `ndots:0` in their configuration which is allowed by the system resolver but not by c-ares. Add support for `ndots:0` and add a test case to validate this behavior. Fixes Issue: #734 Fix By: Brad House (@bradh352)	8 months ago
Brad House	7d455baa27	remove tests that have been disabled forever	8 months ago
Brad House	5fd3fc3ab3	mark deprecated functions as such (#732 ) Multiple functions have been deprecated over the years, annotate them with attribute deprecated. When possible show a message about their replacements. This is a continuation/completion of PR #706 Fix By: Cristian Rodríguez (@crrodriguez)	8 months ago
Brad House	a516bbbbaf	tests: mockserver is local, shorten timeouts to make test cases run faster to use less CI resources	8 months ago
Oliver Welsh	fab65acae9	Add function ares_search_dnsrec() to search for records using the new DNS record parser (#719 ) This PR adds a new function `ares_search_dnsrec()` to search for records using the new DNS record parser. The function takes an arbitrary DNS record object to search (that must represent a query for a single name). The function takes a new callback type, `ares_callback_dnsrec`, that is invoked with a parsed DNS record object rather than the raw buffer(+length). The original motivation for this change is to provide support for [draft-kaplan-enum-sip-routing-04](https://datatracker.ietf.org/doc/html/draft-kaplan-enum-sip-routing-04); when routing phone calls using an ENUM server, it can be useful to include identifying source information in an OPT RR options value, to help select the appropriate route for the call. The new function allows for more customisable searches like this. Summary of code changes A new function `ares_search_dnsrec()` has been added and exposed. Moreover, the entire `ares_search_int()` internal code flow has been refactored to use parsed DNS record objects and the new DNS record parser. The DNS record object is passed through the `search_query` structure by encoding/decoding to/from a buffer (if multiple search domains are used). A helper function `ares_dns_write_query_altname()` is used to re-write the DNS record object with a new query name (used to append search domains). `ares_search()` is now a wrapper around the new internal code, where the DNS record object is created based on the name, class and type parameters. The new function uses a new callback type, `ares_callback_dnsrec`. This is invoked with a parsed DNS record object. For now, we convert from `ares_callback` to this new type using `ares__dnsrec_convert_cb()`. Some functions that are common to both `ares_query()` and `ares_search()` have been refactored using the new DNS record parser. See `ares_dns_record_create_query()` and `ares_dns_query_reply_tostatus()`. Testing A new FV has been added to test the new function, which searches for a DNS record containing an OPT RR with custom options value. As part of this, I needed to enhance the mock DNS server to expect request text (and assert that it matches actual request text). This is because the FV needs to check that the request contains the correct OPT RR. Documentation The man page docs have been updated to describe the new feature. Futures In the future, a new variant of `ares_send()` could be introduced in the same vein (`ares_send_dnsrec()`). This could be used by `ares_search_dnsrec()`. Moreover, we could migrate internal code to use `ares_callback_dnsrec` as the default callback. This will help to make the new DNS record parser the norm in C-Ares. --------- Co-authored-by: Oliver Welsh (@oliverwelsh)	8 months ago
Brad House	a2a8578ee0	Replace configuration file parsers with memory-safe parser (#725 ) Rewrite configuration parsers using new memory safe parsing functions. After CVE-2024-25629 it's obvious that we need to prioritize again on getting all the hand written parsers with direct pointer manipulation replaced. They're just not safe and hard to audit. It was yet another example of 20+yr old code having a memory safety issue just now coming to light. Though these parsers are definitely less efficient, they're written with memory safety in mind, and any performance difference is going to be meaningless for something that only happens once in a while. Fix By: Brad House (@bradh352)	8 months ago
Oliver Welsh	035c4c3776	Add flag to not use a default local named server on channel initialization (#713 ) Hello, I work on an application for Microsoft which uses c-ares to perform DNS lookups. We have made some minor changes to the library over time, and would like to contribute these back to the project in case they are useful more widely. This PR adds a new channel init flag, described below. Please let me know if I can include any more information to make this PR better/easier for you to review. Thanks! Summary When initializing a channel with `ares_init_options()`, if there are no nameservers available (because `ARES_OPT_SERVERS` is not used and `/etc/resolv.conf` is either empty or not available) then a default local named server will be added to the channel. However in some applications a local named server will never be available. In this case, all subsequent queries on the channel will fail. If we know this ahead of time, then it may be preferred to fail channel initialization directly rather than wait for the queries to fail. This gives better visibility, since we know that the failure is due to missing servers rather than something going wrong with the queries. This PR adds a new flag `ARES_FLAG_NO_DFLT_SVR`, to indicate that a default local named server should not be added to a channel in this scenario. Instead, a new error `ARES_EINITNOSERVER` is returned and initialization fails. Testing I have added 2 new FV tests: - `ContainerNoDfltSvrEmptyInit` to test that initialization fails when no nameservers are available and the flag is set. - `ContainerNoDfltSvrFullInit` to test that initialization still succeeds when the flag is set but other nameservers are available. Existing FVs are all passing. Documentation I have had a go at manually updating the docs to describe the new flag/error, but couldn't see any contributing guidance about testing this. Please let me know if you'd like anything more here. --------- Fix By: Oliver Welsh (@oliverwelsh)	9 months ago
Brad House	fe04c6cadd	clang-format	9 months ago
Brad House	fed3559cfc	Add ares_queue_wait_empty() for use with EventThreads (#710 ) It may be useful to wait for the queue to be empty under certain conditions (mainly test cases), expose a function to efficiently do this and rework test cases to use it. Fix By: Brad House (@bradh352)	10 months ago
Brad House	906d2c1041	sanity check GTest includes GMock component	10 months ago
Brad House	0e4c0f2600	build-time disabled threads breaks c-ares (#700 ) Regression introduced in 1.26.0, building c-ares with threading disabled results in ares_init{_options}() failing. Also adds a new CI test case to prevent this regression in the future. Fixes Bug: #699 Fix By: Brad House (@bradh352)	10 months ago
Brad House	7963c519fc	Event Subsystem: No longer require integrators to have their own (#696 ) This PR implements an event thread to process all events on file descriptors registered by c-ares. Prior to this feature, integrators were required to understand the internals of c-ares and how to monitor file descriptors and timeouts and process events. Implements OS-specific efficient polling such as epoll(), kqueue(), or IOCP, and falls back to poll() or select() if otherwise unsupported. At this point, it depends on basic threading primitives such as pthreads or windows threads. If enabled via the ARES_OPT_EVENT_THREAD option passed to ares_init_options(), then socket callbacks cannot be used. Fixes Bug: #611 Fix By: Brad House (@bradh352)	10 months ago
Erik Lax	26642c1014	Added flags to ares_dns_parse to force RAW packet parsing (#693 ) This pull request adds six flags to instruct the parser under various circumstances to skip parsing of the returned RR records so the raw data can be retrieved. Fixes Bug: #686 Fix By: Erik Lax (@eriklax)	10 months ago
Brad House	d6850eb4ad	Autotools allow make to override CFLAGS/CPPFLAGS/CXXFLAGS (#695 ) The previous build system allowed overwriting of CFLAGS/CPPFLAGS/CXXFLAGS on the make command line. Switch to using AM_CFLAGS/AM_CPPFLAGS/AM_CXXFLAGS when we set our own flags for building which ensures they are kept even when a user tries to override. Fixes Bug: #694 Fix By: Brad House (@bradh352)	10 months ago
Brad House	626dcb155b	Do not sanity check RR Name vs Question (#685 ) It appears as though we should never sanity check the RR name vs the question name as some DNS servers may return results for alias records. Fixes Bug: #683 Fix By: Brad House (@bradh352)	11 months ago
Gregor Jasny	2a6a420cb6	cmake: improve some include related code (#680 ) * cmake: avoid warning about non-existing include dir In the Debian build logs I noticed the following warning: cc1: warning: /build/c-ares-1.25.0/test/include: No such file or directory [-Wmissing-include-dirs] This happened because ${CMAKE_INSTALL_INCLUDEDIR} had been added to caresinternal. I believe it has been copied from the "real" lib where it's used in the INSTALL_INTERFACE context. But because caresinternal is never installed we don't need that include here. * cmake: drop CARES_TOPLEVEL_DIR variable The CARES_TOPLEVEL_DIR variable is the same as the automatically created PROJECT_SOURCE_DIR variable. Let's stick to the official one. Also because it is already used at places where CARES_TOPLEVEL_DIR is used as well. Fix By: Gregor Jasny (@gjasny)	11 months ago
Brad House	4f5767ed69	test: fix outdated license headers	11 months ago
Brad House	1231aa739f	tests: replace google DNS with CloudFlare for reverse lookups as google's servers stopped responding properly	11 months ago
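
Commit 70f10a85f3 above describes DNS 0x20 as randomly altering the case of the DNS question to add entropy against cache poisoning. The idea can be sketched in plain C as below. This is an illustration only and says nothing about how c-ares implements it internally: the helper names `dns0x20_randomize`, `dns_name_equal_nocase`, and `dns0x20_response_matches` are hypothetical, and `rand()` stands in for the cryptographically secure random source a real resolver would need.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/* ASCII-only letter test; DNS case folding is defined on ASCII letters. */
static int is_ascii_alpha(unsigned char c)
{
  return (c >= 'a' && c <= 'z') || (c >= 'A' && c <= 'Z');
}

/* Hypothetical helper (not a c-ares function): give every ASCII letter in
 * the query name a random case. Bit 0x20 is the only difference between an
 * upper-case and a lower-case ASCII letter, hence the name "DNS 0x20".
 * rand() is an assumption made to keep the sketch short. */
static void dns0x20_randomize(char *name)
{
  size_t i;

  assert(name != NULL);
  for (i = 0; name[i] != '\0'; i++) {
    unsigned char c = (unsigned char)name[i];
    if (is_ascii_alpha(c)) {
      if (rand() & 1) {
        c = (unsigned char)(c | 0x20); /* force lower case */
      } else {
        c = (unsigned char)(c & 0xDF); /* force upper case */
      }
      name[i] = (char)c;
    }
  }
}

/* DNS names are case-insensitive, so the randomized name still refers to
 * the same domain. Returns 1 when the names match ignoring ASCII case. */
static int dns_name_equal_nocase(const char *a, const char *b)
{
  for (; *a != '\0' && *b != '\0'; a++, b++) {
    unsigned char ca = (unsigned char)*a;
    unsigned char cb = (unsigned char)*b;
    if (is_ascii_alpha(ca)) ca = (unsigned char)(ca | 0x20);
    if (is_ascii_alpha(cb)) cb = (unsigned char)(cb | 0x20);
    if (ca != cb) return 0;
  }
  return *a == *b; /* both strings must end at the same position */
}

/* The extra entropy only helps if the resolver checks that the question
 * echoed in the response has exactly the case pattern that was sent. */
static int dns0x20_response_matches(const char *sent, const char *echoed)
{
  return strcmp(sent, echoed) == 0;
}
```

A caller would randomize a copy of the name before sending a UDP query, keep that copy, and pass it together with the question name from the response to `dns0x20_response_matches`. A spoofed answer that guessed the wrong case pattern fails the byte-for-byte comparison even though it names the same domain.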


162 Commits (2bd73c9a7dbb79cc56667ced13fceb70a68a5e26)
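
Commit fd81f36d3e in the list above describes the server failover retry behavior: the server with the fewest consecutive failures is normally chosen, but a failed server is retried with small probability once a minimum delay has passed. The following C sketch shows one way such a selection could look. It is an assumption-laden illustration rather than the c-ares implementation: `server_t`, its fields, `pick_server`, and the choice to express the probability as a percentage are all hypothetical, and the random roll is passed in by the caller so the behavior is deterministic.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical server record; field names are illustrative only and do not
 * mirror c-ares internals. Servers are stored in order of preference. */
typedef struct {
  unsigned int failures;       /* consecutive failures so far */
  long         last_fail_time; /* time of the most recent failure, seconds */
} server_t;

/* Return the index of the server to use for the next query.
 *
 * retry_chance_pct and retry_delay_s model the probability and minimum
 * delay from the commit message (10% and 5 seconds by default there).
 * roll is a caller-supplied random value in [0, 100). */
static size_t pick_server(const server_t *servers, size_t n, long now,
                          unsigned int retry_chance_pct, long retry_delay_s,
                          unsigned int roll)
{
  size_t best = 0;
  size_t i;

  assert(servers != NULL && n > 0);

  /* Baseline: fewest consecutive failures; earlier entries win ties. */
  for (i = 1; i < n; i++) {
    if (servers[i].failures < servers[best].failures) {
      best = i;
    }
  }

  /* Failover retry: with small probability, give a more preferred server
   * that has failed another chance, but only after the minimum delay. */
  if (roll < retry_chance_pct) {
    for (i = 0; i < best; i++) {
      if (servers[i].failures > 0 &&
          now - servers[i].last_fail_time >= retry_delay_s) {
        return i;
      }
    }
  }

  return best;
}
```

With a primary server that has failed three times and a healthy backup, this sketch keeps returning the backup until the delay has elapsed and the roll falls under the configured chance, at which point the primary is tried again. That is the scenario the commit message gives as motivation: without the retry, the primary would not be used again until the backup had accumulated the same number of failures.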