c-ares

Commit Graph

Author	SHA1	Message	Date
Brad House	aed78f1665	Windows UBSAN tests (#649 ) Fix UBSAN error, and enable UBSAN testing in AppVeyor. Fixes Bug #648 Fix By: Gisle Vanem (@gvanem)	12 months ago
Brad House	d974c556bb	Support ipv6 link-local servers and %iface syntax (#646 ) Some environments may send router advertisements on a link setting their link-local (fe80::/10) address as a valid DNS server to the remote system. This will cause a DNS entry to be created like `fe80::1%iface`, since all link-local network interfaces are technically part of the same /10 subnet, it must be told what interface to send packets through explicitly if there are multiple physical interfaces. This PR adds support for the %iface modifier when setting DNS servers via `/etc/resolv.conf` as well as via `ares_set_servers_csv()`. For MacOS and iOS it is assumed that libresolve will set the `sin6_scope_id` and should be supported, but my test systems don't seem to read the Router Advertisement for RDNSS link-local. Specifying the link-local dns server on MacOS via adig has been tested and confirmed working. For Windows, this is similar to MacOS in that the system doesn't seem to honor the RDNSS RA, but specifying manually has been tested to work. At this point, Android support does not exist. Fixes Bug #462 Supersedes PR #463 Fix By: Brad House (@bradh352) and Serhii Purik (@sergvpurik)	12 months ago
Brad House	86ac418970	Nameserver parsing: replace another hand-written parser (#643 ) This replaces the nameserver parsing code with code that use ares__buf_*() in the name of memory safety. Fix By: Brad House (@bradh352)	1 year ago
Brad House	6e83c28ab4	clang-format	1 year ago
Brad House	fb52c3f9d1	ares_init_options with ARES_OPT_UDP_PORT wrong byte order Regression from c-ares 1.19.1, ARES_OPT_UDP_PORT and ARES_OPT_TCP_PORT are specified from the user in host-byte order, but there was a regression that caused it to be read as if it was network byte order. Fixes Bug: #640 Reported By: @Flow86 Fix By: Brad House (@bradh352)	1 year ago
Brad House	a9442bd828	Basic Thread Safety (#636 ) c-ares does not have any concept of thread-safety. It has always been 100% up to the implementor to ensure they never call c-ares from more than one thread at a time. This patch adds basic thread-safety support, which can be disabled at compile time if not desired. It uses a single recursive mutex per channel, which should be extremely quick when uncontested so overhead should be minimal. Fixes Bug: #610 Also sets the stage to implement #611 Fix By: Brad House (@bradh352)	1 year ago
Christian Clauss	054f474a29	Fix typos discovered by codespell (#634 ) % `codespell --ignore-words-list="aas,aci,acter,atleast,contentss,firey,fo,sais,seh,statics"` * https://pypi.org/project/codespell Fix By: Christian Clauss (@cclauss)	1 year ago
Ignat	2985ce35d0	Support attempts and timeout options from resolv.conf (#632 ) c-ares parses only antique version of options for timeout and number of retries from resolv.conf (`retrans` and `retry` are missing in modern documentation https://man7.org/linux/man-pages/man5/resolv.conf.5.html). I add support of `attempts` and `timeout` options Fix By: Ignat (@Kontakter)	1 year ago
Brad House	00f7cf6d25	fix test case regression due to missing parens	1 year ago
Brad House	5262da7e88	now that warnings are enabled on test cases, clear a bunch of warnings	1 year ago
Brad House	33ee6c0573	CMake: CXXFLAGS environment wasn't being read because C++ compiler was enabled after settings warnings.	1 year ago
Brad House	ecf7a7f166	fix additional windows test warnings	1 year ago
Brad House	5c1a5d8050	cleanup some Windows warnings in test	1 year ago
Brad House	3db69bbfeb	Fix Windows UWP (Store) building and add to CI/CD (#627 ) When building for UWP (WindowsStore), additional headers are needed and some functions are not available. This also adds AppVeyor CI/CD support to catch these issues in the future. Fix By: Deal (@halx99) and Brad House (@bradh352)	1 year ago
Brad House	320cefe1c7	ares_set_servers_() should allow an empty server list For historic reasons, we have users depending on ares_set_servers_() to return ARES_SUCCESS when passing no servers and actually clear the server list. It appears they do this for test cases to simulate DNS unavailable or similar. Presumably they could achieve the same effect in other ways (point to localhost on a port that isn't in use). But it seems like this might be wide-spread enough to cause headaches so we just will document and test for this behavior, clearly it hasn't caused "issues" for anyone with the old behavior. See: https://github.com/nodejs/node/pull/50800 Fix By: Brad House (@bradh352)	1 year ago
Brad House	4982f76a2f	Query Cache support (#625 ) This PR implements a query cache at the lowest possible level, the actual dns request and response messages. Only successful and `NXDOMAIN` responses are cached. The lowest TTL in the response message determines the cache validity period for the response, and is capped at the configuration value for `qcache_max_ttl`. For `NXDOMAIN` responses, the SOA record is evaluated. For a query to match the cache, the opcode, flags, and each question's class, type, and name are all evaluated. This is to prevent matching a cached entry for a subtly different query (such as if the RD flag is set on one request and not another). For things like ares_getaddrinfo() or ares_search() that may spawn multiple queries, each individual message received is cached rather than the overarching response. This makes it possible for one query in the sequence to be purged from the cache while others still return cached results which means there is no chance of ever returning stale data. We have had a lot of user requests to return TTLs on all the various parsers like `ares_parse_caa_reply()`, and likely this is because they want to implement caching mechanisms of their own, thus this PR should solve those issues as well. Due to the internal data structures we have these days, this PR is less than 500 lines of new code. Fixes #608 Fix By: Brad House (@bradh352)	1 year ago
Brad House	818904643e	additional test coverage	1 year ago
Brad House	b2ea0cd6b2	Extended RCODE in OPT RR PsuedoRecord should not be exposed directly, it should be presented as part of the normal rcode	1 year ago
Ignat	7a140cb478	Randomize retry penalties to prevent thundering herd type issues (#606 ) The retry timeout values were using a fixed calculation which could cause multiple simultaneous queries to timeout and retry at the exact same time. If a DNS server is throttling requests, this could cause the issue to never self-resolve due to all requests recurring at the same instance again. This PR also creates a maximum timeout option to make sure the random value selected does not exceed this value. Fix By: Ignat (@Kontakter)	1 year ago
Brad House	0cc570eabe	Implement ares_reinit() to reload system configuration into existing channel (#614 ) This PR implements ares_reinit() to safely reload a channel's configuration even if there are existing queries. This function can be called when system configuration is detected to be changed, however since c-ares isn't thread aware, care must be taken to ensure no other c-ares calls are in progress at the time this function is called. Also, this function may update the open file descriptor list so care must also be taken to wake any event loops and reprocess the list of file descriptors. Fixes Bug #301 Fix By: Brad House (@bradh352)	1 year ago
Brad House	956b7ebf68	rewrite adig using new helpers (#607 ) adig previously performed manual parsing of the DNS records. Now it can focus strictly on formatting of output data for printing. It simply iterates across the parsed DNS packet and queries for the RRs, parameters for each RR, and the datatypes for each parameter. adig will now automatically pick up new RRs from the c-ares library due to the dynamic nature. The adig format also now more closely resembles that of BIND's `dig` output. A few more helpers needed to be added to the c-ares library that were missing. There ware a couple of minor bugs and enhancements also needed. Example: ``` ./adig -t ANY www.google.com ; <<>> c-ares DiG 1.21.0 <<>> www.google.com ;; Got answer: ;; ->>HEADER<<- opcode: QUERY, status: RCODE, id: 23913 ;; flags: qr rd ra; QUERY: 1, ANSWER: 11, AUTHORITY: 0, ADDITIONAL: 1 ;; OPT PSEUDOSECTION: ; EDNS: version: 0, flags: 0; udp: 512 ;; QUESTION SECTION: ;www.google.com. IN ANY ;; ANSWER SECTION: www.google.com. 162 IN A 142.251.107.99 www.google.com. 162 IN A 142.251.107.105 www.google.com. 162 IN A 142.251.107.103 www.google.com. 162 IN A 142.251.107.147 www.google.com. 162 IN A 142.251.107.104 www.google.com. 162 IN A 142.251.107.106 www.google.com. 162 IN AAAA 2607:f8b0:400c:c32::93 www.google.com. 162 IN AAAA 2607:f8b0:400c:c32::69 www.google.com. 162 IN AAAA 2607:f8b0:400c:c32::68 www.google.com. 162 IN AAAA 2607:f8b0:400c:c32::6a www.google.com. 21462 IN HTTPS 1 . alpn="h2,h3" ;; MSG SIZE rcvd: 276 ``` Fix By: Brad House (@bradh352)	1 year ago
Brad House	c433477753	Add SVCB and HTTPS RR (RFC 9460) (#603 ) This PR adds parsing and writing support for SVCB and HTTPS RRs as defined in RFC 9460. This should fix #566 Fix By: Brad House (@bradh352)	1 year ago
Brad House	df1cbdccf7	OPT RR should support parsing key/value options (#602 ) The OPT RR record has some seldom used options with a 16bit key and a binary value. The current parser and writer was not supporting this. This PR adds support. The same format is also used for SVCB/HTTPS records, so getting this in there is necessary to support that RR type. Also, we split the Binary record format into BIN and BINP, where BINP is an indicator that the binary data is _likely_ printable and will guarantee a NULL terminator. This is helpful for those attempting to print RRs. Fix By: Brad House (@bradh352)	1 year ago
Brad House	80fdc96f9a	clang-format	1 year ago
Brad House	fb43c04bae	Add TLSA record support (#600 ) As per #470, c-ares is missing a parser for the TLSA record format (RFC 6698). This PR introduces that parser. Once the new parser interface becomes public and this PR is merged, then #470 can be closed. Fix By: Brad House (@bradh352)	1 year ago
Brad House	eaa172a157	memory leak in test case	1 year ago
Brad House	5bbf3f3962	DNS Write: fix name compression	1 year ago
Brad House	35c1300078	Coverage: add tests for writing and parsing various record formats	1 year ago
Brad House	784ee5a754	DNS Record Write (#598 ) The `ares_dns_record_t` data structure created in the prior release is capable of holding a complete parsed DNS message and also provides all helpers in order to fill in the data structure. This PR adds write capabilities for this data structure to form a complete message and supports features such as DNS name compression as defined in RFC1035. Though this message writing capability goes further than c-ares internally needs, external users may find it useful ... and we may find it useful for test validation as well. This also replaces the existing message writing code in `ares_create_query()`, as well rewriting the request message without EDNS in ares_process.c's `process_answer()`. Fix By: Brad House (@bradh352)	1 year ago
Brad House	17136d8f29	build fix	1 year ago
Brad House	a116fede19	remove tests that depend on randomness	1 year ago
Brad House	d2389cd3b7	`ares_channel` -> `ares_channel_t `: don't bury the pointer (#595 ) `ares_channel` is defined as `typedef struct ares_channeldata ares_channel;`. The problem with this, is it embeds the pointer into the typedef, which means an `ares_channel` can never be declared as `const` as if you write `const ares_channel channel`, that expands to `struct ares_channeldata * const ares_channel` and not `const struct ares_channeldata channel`. We will now typedef `ares_channel_t` as `typedef struct ares_channeldata ares_channel_t;`, so if you write `const ares_channel_t channel`, it properly expands to `const struct ares_channeldata channel`. We are maintaining the old typedef for API compatibility with existing integrations, and due to typedef expansion this should not even cause any compiler warnings for existing code. There are no ABI implications with this change. I could be convinced to keep existing public functions as `ares_channel` if a sufficient argument exists, but internally we really need make this change for modern best practices. This change will allow us to internally use `const ares_channel_t ` where appropriate. Whether or not we decide to change any public interfaces to use `const` may require further discussion on if there might be ABI implications (I don't think so, but I'm also not 100% sure what a compiler internally does with `const` when emitting machine code ... I think more likely ABI implications would occur going the opposite direction). FYI, This PR was done via a combination of sed and clang-format, the only manual code change was the addition of the new typedef, and a couple doc fixes :) Fix By: Brad House (@bradh352)	1 year ago
Brad House	c8bd83a4ca	Dynamic Server List (#594 ) This PR makes the server list a dynamic sorted list of servers. The sort order is [ consecutive failures, system config index ]. The server list can be updated via ares_set_servers_*(). Any queries currently directed to servers that are no longer in the list will be automatically re-queued to a different server. Also, any time a failure occurs on the server, the sort order of the servers will be updated so that the one with the fewest consecutive failures is chosen for the next query that goes on the wire, this way bad or non-responsive servers are automatically isolated. Since the server list is now dynamic, the tracking of query failures per server has been removed and instead is relying on the server sort order as previously described. This simplifies the logic while also reducing the amount of memory required per query. However, because of this dynamic nature, it may not be easy to determine the server attempt order for enqueued queries if there have been any failures. If using the ARES_OPT_ROTATE, this is now implemented to be a random selection of the configured servers. Since the server list is dynamic, its not possible to go to the next server as configuration could have changed between queries or attempts for the same query. Finally, this PR moved some existing functions into new files to logically separate them. This should address issues #550 and #440, while also setting the framework to implement #301. #301 needs a little more effort since it configures things other than the servers themselves (domains, search, sortlist, lookups), which need to make sure they can be safely updated. Fix By: Brad House (@bradh352)	1 year ago
Brad House	fb361064eb	AppVeyor: update compiler versions, use Msys2, and Windows fixes (#593 ) AppVeyor was using Visual Studio 2015 along with old versions of MinGW. Update to the latest AppVeyor provides and also add an MSYS2 build test using MinGW which will use the bleeding edge version. When researching #590 this also uncovered a bug in cmake not properly detecting if_indextoname() on windows. This has been corrected as well as the underlying issue reported in #590. Fix By: Brad House (@bradh352) and Jonas Kvinge (@jonaski)	1 year ago
Brad House	bcd80b5313	Coverage: add some code misuse test cases	1 year ago
Brad House	8a3664b8cb	Replace hosts parser, add caching capabilities (#591 ) HOSTS FILE PROCESSING OVERVIEW ============================== The hosts file on the system contains static entries to be processed locally rather than querying the nameserver. Each row is an IP address followed by a list of space delimited hostnames that match the ip address. This is used for both forward and reverse lookups. We are caching the entire parsed hosts file for performance reasons. Some files may be quite sizable and as per Issue #458 can approach 1/2MB in size, and the parse overhead on a rapid succession of queries can be quite large. The entries are stored in forwards and backwards hashtables so we can get O(1) performance on lookup. The file is cached until the file modification timestamp changes (or 60s if there is no implemented stat() capability). The hosts file processing is quite unique. It has to merge all related hosts and ips into a single entry due to file formatting requirements. For instance take the below: ``` 127.0.0.1 localhost.localdomain localhost ::1 localhost.localdomain localhost 192.168.1.1 host.example.com host 192.168.1.5 host.example.com host 2620🔢:1 host.example.com host6.example.com host6 host ``` This will yield 2 entries. 1) ips: `127.0.0.1,::1` hosts: `localhost.localdomain,localhost` 2) ips: `192.168.1.1,192.168.1.5,2620🔢:1` hosts: `host.example.com,host,host6.example.com,host6` It could be argued that if searching for `192.168.1.1` that the `host6` hostnames should not be returned, but this implementation will return them since they are related (both ips have the fqdn of host.example.com). It is unlikely this will matter in the real world. Fix By: Brad House (@bradh352)	1 year ago
Brad House	424243dc49	add test vector said to cause a memory leak	1 year ago
Brad House	7dc3d98fba	Modernization: replace multiple hand-parsers with new memory-safe parser (#581 ) New DNS record parsing code. The old code was basically just some helper macros and functions for parsing an entire DNS message. The caller had to know the RFCs to use the parsers, except for some pre-made exceptions. The new parsing code parses the entire DNS message into an opaque data structure in a memory safe manner with various accessors for reading and manipulating the data. The existing parser helpers for the various record types were reimplemented as wrappers around the new parser. The accessors allow easy iteration across the DNS record datastructure, and can be used to easily create dig-like output without needing to know anything about the various record types and formats as dynamic helpers are provided for enumeration of values and data types of those values. At some point in the future, this new DNS record structure, accessors, and parser will be exposed publicly. This is not done at this point as we don't want to do that until the API is completely stable. Likely a write() function to output the DNS record back into an actual message buffer will be introduced with the stable API as well. Some subtle bugs in the existing code were uncovered, some which had test cases which turned out to be bogus. Validation with third-party implementations (e.g. BIND9) were performed to validate such cases were indeed bugs. Adding additional RR parsers such as for TLSA (#470) or SVCB/HTTPS (#566) are trivial now since focus can be put on only parsing the data within the RR, not the entire message. That said, as the new parser is not yet public, it isn't clear the best way to expose any new RRs (probably best to wait for the new parser to be public rather than hacking in another legacy function). Some additional RRs that are part of DNS RFC1035 or EDNS RFC6891 that didn't have previously implemented parsers are now also implemented (e.g. HINFO, OPT). Any unrecognized RRs are encapsulated into a "RAW_RR" as binary data which can be inserted or extracted, but are otherwise not interpreted in any way. Fix By: Brad House (@bradh352)	1 year ago
Brad House	c3e050ac72	remove ares_nowarn helpers #580 Now that the code internally is using proper datatypes, there is no longer a need for ares_nowarn helpers. Remove them. Fix By: Brad House (@bradh352)	1 year ago
Brad House	1549415228	Reformat code using clang-format (#579 ) c-ares uses multiple code styles, standardize on one. Talking with @bagder he feels strongly about maintaining an 80 column limit, but feels less strongly about things I feel strongly about (like alignment). Can re-run the formatter on the codebase via: ``` clang-format -i /.c /.h //.c //.h ``` Fix By: Brad House (@bradh352)	1 year ago
Brad House	3b10e571da	Fix internal datatype usage and warnings (#573 ) PR #568 increased the warning levels and c-ares code emitted a bunch of warnings. This PR fixes those warnings and starts transitioning internal data types into more proper forms (e.g. data lengths should be size_t not int). It does, however, have to manually cast back to what the public API needs due to API and ABI compliance (we aren't looking to break integrations, just clean up internals). Fix By: Brad House (@bradh352)	1 year ago
Brad House	432c42572d	Enhance test of ares_getsock() In an attempt to see if ares_getsock() was broken as per #571, do further sanity checks of the results of ares_getsock(). It seems as though ares_getsock() is fine. Fix By: Brad House (@bradh352)	1 year ago
Brad House	973023b4b9	ares_status_t enum for status codes (#567 ) The list of possible error codes in c-ares was a #define list. This not only doesn't provide for any sort of type safety but it also lacks clarification on what a function may return or what it takes, as an int could be an ares status, a boolean, or possibly even a length in the current code. We are not changing any public APIs as though the C standard states the underlying size and type of an enum is int, there are compiler attributes to override this as well as compiler flags like -fshort-enums. GCC in particular is known to expand an enum's width based on the data values (e.g., it can emit a 64bit integer enum). All internal usages should be changed by this PR, but of course, there may be some I missed. Fix By: Brad House (@bradh352)	1 year ago
Daniel Stenberg	e8fe27eaa8	ares-test: silence warning (#564 ) warning: comparison of integer expressions of different signedness Fix By: Daniel Stenberg (@bagder)	1 year ago
Brad House	17931888ec	fix reference to freed memory (#562 ) Issue #561 shows free'd memory could be accessed in some error conditions. Fixes Issue #561 Fix By: Brad House (@bradh352)	1 year ago
Brad House	9e542a8839	reported build/test systems may timeout on intensive tests. reduce test case to still be relevant but to reduce false positive errors	1 year ago
Gregor Jasny	ca85a4bd6f	Regression: Fix typo in fuzzcheck target name (#559 ) This seems to be a vim'esque typo introduced with `c1b00c41`. Fix By: Gregor Jasny (@gjasny)	1 year ago
Brad House	4d4fb34075	Test Harness: use ares_timeout() to calculate the value to pass to select() these days. (#555 ) The test framework was using 100ms timeout passed to select(), and not using ares_timeout() to calculate the actual recommended value based on the queries in queue. Using ares_timeout() tests the functionality of ares_timeout() itself and will provide more responsive results. Fix By: Brad House (@bradh352)	1 year ago
Brad House	fab4039b9b	Fix for TCP back to back queries (#552 ) As per #266, TCP queries are basically broken. If we get a partial reply, things just don't work, but unlike UDP, TCP may get fragmented and we need to properly handle that. I've started creating a basic parser/buffer framework for c-ares for memory safety reasons, but it also helps for things like this where we shouldn't be manually tracking positions and fetching only a couple of bytes at a time from a socket. This parser/buffer will be expanded and used more in the future. This also resolves #206 by allowing NULL to be specified for some socket callbacks so they will auto-route to the built-in c-ares functions. Fixes: #206, #266 Fix By: Brad House (@bradh352)	1 year ago
Brad House	21f3b77440	ares_getaddrinfo(): Fail faster on AF_UNSPEC if we've already received one address class (#551 ) As per #541, when using AF_UNSPEC with ares_getaddrinfo() (and in turn with ares_gethostbynam()) if we receive a successful response for one address class, we should not allow the other address class to continue on with retries, just return the address class we have. This will limit the overall query time to whatever timeout remains for the pending query for the other address class, it will not, however, terminate the other query as it may still prove to be successful (possibly coming in less than a millisecond later) and we'd want that result still. It just turns off additional error processing to get the result back quicker. Fixes Bug: #541 Fix By: Brad House (@bradh352)	1 year ago

1 2

99 Commits (aed78f166553d3c935793879987839c52618f7d8)