tailscale

mirror of https://github.com/tailscale/tailscale.git synced 2025-12-25 20:23:43 +00:00

Author	SHA1	Message	Date
Brad Fitzpatrick	4e0fc037e6	all: use iterators over slice views more This gets close to all of the remaining ones. Updates #12912 Change-Id: I9c672bbed2654a6c5cab31e0cbece6c107d8c6fa Signed-off-by: Brad Fitzpatrick <bradfitz@tailscale.com>	2024-11-11 13:22:34 -08:00
Brad Fitzpatrick	01185e436f	types/result, util/lineiter: add package for a result type, use it This adds a new generic result type (motivated by golang/go#70084) to try it out, and uses it in the new lineutil package (replacing the old lineread package), changing that package to return iterators: sometimes over []byte (when the input is all in memory), but sometimes iterators over results of []byte, if errors might happen at runtime. Updates #12912 Updates golang/go#70084 Change-Id: Iacdc1070e661b5fb163907b1e8b07ac7d51d3f83 Signed-off-by: Brad Fitzpatrick <bradfitz@tailscale.com>	2024-11-05 10:27:52 -08:00
VimT	43138c7a5c	net/socks5: optimize UDP relay Key changes: - No mutex for every udp package: replace syncs.Map with regular map for udpTargetConns - Use socksAddr as map key for better type safety - Add test for multi udp target Updates #7581 Change-Id: Ic3d384a9eab62dcbf267d7d6d268bf242cc8ed3c Signed-off-by: VimT <me@vimt.me>	2024-11-01 15:47:52 -07:00
VimT	b0626ff84c	net/socks5: fix UDP relay in userspace-networking mode This commit addresses an issue with the SOCKS5 UDP relay functionality when using the --tun=userspace-networking option. Previously, UDP packets were not being correctly routed into the Tailscale network in this mode. Key changes: - Replace single UDP connection with a map of connections per target - Use c.srv.dial for creating connections to ensure proper routing Updates #7581 Change-Id: Iaaa66f9de6a3713218014cf3f498003a7cac9832 Signed-off-by: VimT <me@vimt.me>	2024-11-01 15:47:52 -07:00
Jordan Whited	49de23cf1b	net/netcheck: add addReportHistoryAndSetPreferredDERP() test case (#13989 ) Add an explicit case for exercising preferred DERP hysteresis around the branch that compares latencies on a percentage basis. Updates #cleanup Signed-off-by: Jordan Whited <jordan@tailscale.com>	2024-10-31 19:25:00 -07:00
Andrea Gottardo	6985369479	net/sockstats: prevent crash in setNetMon (#13985 )	2024-10-31 12:00:34 -07:00
Anton Tolchanov	b4f46c31bb	wgengine/magicsock: export packet drop metric for outbound errors Some checks failed CI / windows (push) Has been cancelled CI / vm (push) Has been cancelled CI / cross (386, linux) (push) Successful in 16m47s CI / cross (amd64, darwin) (push) Successful in 16m50s CI / cross (amd64, freebsd) (push) Successful in 16m45s CI / cross (amd64, openbsd) (push) Successful in 16m44s CI / cross (amd64, windows) (push) Successful in 16m18s CI / cross (arm, 5, linux) (push) Successful in 16m34s CI / cross (arm, 7, linux) (push) Successful in 16m27s CI / cross (arm64, darwin) (push) Successful in 17m30s CI / cross (arm64, linux) (push) Successful in 16m25s CI / cross (arm64, windows) (push) Successful in 15m45s CI / ios (push) Successful in 1m36s CI / cross (loong64, linux) (push) Successful in 16m30s CI / crossmin (amd64, plan9) (push) Successful in 10m38s CI / android (push) Successful in 1m26s CI / crossmin (ppc64, aix) (push) Successful in 10m43s CI / tailscale_go (push) Successful in 45s CI / fuzz (push) Has been skipped CI / notify_slack (push) Has been cancelled CI / check_mergeability (push) Has been cancelled CI / depaware (push) Successful in 1m1s CI / go_generate (push) Successful in 2m11s CI / go_mod_tidy (push) Successful in 59s CI / licenses (push) Successful in 9s CI / staticcheck (386, windows) (push) Failing after 1m15s CI / staticcheck (amd64, darwin) (push) Failing after 1m19s CI / staticcheck (amd64, linux) (push) Failing after 1m19s CI / staticcheck (amd64, windows) (push) Failing after 1m13s CI / wasm (push) Successful in 27m59s This required sharing the dropped packet metric between two packages (tstun and magicsock), so I've moved its definition to util/usermetric. Updates tailscale/corp#22075 Signed-off-by: Anton Tolchanov <anton@tailscale.com>	2024-10-31 08:33:24 +00:00
James Tucker	e1e22785b4	net/netcheck: ensure prior preferred DERP is always in netchecks Some checks are pending CI / windows (push) Waiting to run CI / privileged (push) Waiting to run CI / vm (push) Waiting to run CI / race-root-integration (1/4) (push) Waiting to run CI / race-root-integration (2/4) (push) Waiting to run CI / test (-race, amd64, 1/3) (push) Waiting to run CI / test (-race, amd64, 2/3) (push) Waiting to run CI / test (-race, amd64, 3/3) (push) Waiting to run CI / test (386) (push) Waiting to run CI / cross (386, linux) (push) Waiting to run CI / cross (amd64, darwin) (push) Waiting to run CI / cross (amd64, freebsd) (push) Waiting to run CI / cross (amd64, openbsd) (push) Waiting to run CI / cross (amd64, windows) (push) Waiting to run CI / cross (arm, 5, linux) (push) Waiting to run CI / cross (arm, 7, linux) (push) Waiting to run CI / cross (arm64, darwin) (push) Waiting to run CI / cross (arm64, linux) (push) Waiting to run CI / cross (arm64, windows) (push) Waiting to run CI / cross (loong64, linux) (push) Waiting to run CI / ios (push) Waiting to run CI / crossmin (amd64, plan9) (push) Waiting to run CI / crossmin (ppc64, aix) (push) Waiting to run CI / android (push) Waiting to run CI / wasm (push) Waiting to run CI / tailscale_go (push) Waiting to run CI / fuzz (push) Waiting to run CI / depaware (push) Waiting to run CI / notify_slack (push) Blocked by required conditions CI / check_mergeability (push) Blocked by required conditions In an environment with unstable latency, such as upstream bufferbloat, there are cases where a full netcheck could drop the prior preferred DERP (likely home DERP) from future netcheck probe plans. This will then likely result in a home DERP having a missing sample on the next incremental netcheck, ultimately resulting in a home DERP move. This change does not fix our overall response to highly unstable latency, but it is an incremental improvement to prevent single spurious samples during a full netcheck from alone triggering a flapping condition, as now the prior changes to include historical latency will still provide the desired resistance, and the home DERP should not move unless latency is consistently worse over a 5 minute period. Note that there is a nomenclature and semantics issue remaining in the difference between a report preferred DERP and a home DERP. A report preferred DERP is aspirational, it is what will be picked as a home DERP if a home DERP connection needs to be established. A nodes home DERP may be different than a recent preferred DERP, in which case a lot of netcheck logic is fallible. In future enhancements much of the DERP move logic should move to consider the home DERP, rather than recent report preferred DERP. Updates #8603 Updates #13969 Signed-off-by: James Tucker <james@tailscale.com>	2024-10-30 17:19:26 -07:00
Renato Aguiar	5d07c17b93	net/dns: fix blank lines being added to resolv.conf on OpenBSD (#13928 ) During resolv.conf update, old 'search' lines are cleared but '\n' is not deleted, leaving behind a new blank line on every update. This adds 's' flag to regexp, so '\n' is included in the match and deleted when old lines are cleared. Also, insert missing `\n` when updated 'search' line is appended to resolv.conf. Signed-off-by: Renato Aguiar <renato@renatoaguiar.net>	2024-10-28 08:00:48 -07:00
Andrew Dunham	7fe6e50858	net/dns/resolver: fix test flake Some checks failed CI / vm (push) Waiting to run CI / race-build (push) Waiting to run CI / go_mod_tidy (push) Waiting to run CI / licenses (push) Waiting to run CI / cross (386, linux) (push) Waiting to run CI / cross (amd64, darwin) (push) Waiting to run CI / cross (amd64, freebsd) (push) Waiting to run CI / cross (amd64, openbsd) (push) Waiting to run CI / cross (amd64, windows) (push) Waiting to run CI / cross (arm, 5, linux) (push) Waiting to run CI / cross (arm, 7, linux) (push) Waiting to run CI / cross (arm64, darwin) (push) Waiting to run CI / cross (arm64, linux) (push) Waiting to run CI / cross (arm64, windows) (push) Waiting to run CI / cross (loong64, linux) (push) Waiting to run CI / ios (push) Waiting to run CI / crossmin (amd64, plan9) (push) Waiting to run CI / staticcheck (386, windows) (push) Waiting to run CI / crossmin (ppc64, aix) (push) Waiting to run CI / android (push) Waiting to run CI / wasm (push) Waiting to run CI / tailscale_go (push) Waiting to run CI / fuzz (push) Waiting to run CI / depaware (push) Waiting to run CI / staticcheck (amd64, darwin) (push) Waiting to run CI / staticcheck (amd64, linux) (push) Waiting to run CI / staticcheck (amd64, windows) (push) Waiting to run CI / notify_slack (push) Blocked by required conditions CI / check_mergeability (push) Blocked by required conditions CodeQL / Analyze (go) (push) Has been cancelled Updates #13902 Signed-off-by: Andrew Dunham <andrew@du.nham.ca> Change-Id: Ib2def19caad17367e9a31786ac969278e65f51c6	2024-10-24 13:36:57 -05:00
Andrew Dunham	b2665d9b89	net/netcheck: add a Now field to the netcheck Report Some checks failed CI / test (-race, amd64, 1/3) (push) Has been cancelled CI / test (-race, amd64, 2/3) (push) Has been cancelled CI / test (-race, amd64, 3/3) (push) Has been cancelled CI / test (386) (push) Has been cancelled CI / cross (386, linux) (push) Has been cancelled CI / cross (amd64, windows) (push) Has been cancelled CI / cross (arm, 5, linux) (push) Has been cancelled CI / cross (arm, 7, linux) (push) Has been cancelled CI / cross (arm64, darwin) (push) Has been cancelled CI / cross (arm64, linux) (push) Has been cancelled CI / cross (arm64, windows) (push) Has been cancelled CI / cross (loong64, linux) (push) Has been cancelled CI / ios (push) Has been cancelled CI / crossmin (amd64, plan9) (push) Has been cancelled CI / licenses (push) Has been cancelled CI / crossmin (ppc64, aix) (push) Has been cancelled CI / android (push) Has been cancelled CI / wasm (push) Has been cancelled CI / tailscale_go (push) Has been cancelled CI / fuzz (push) Has been cancelled CI / depaware (push) Has been cancelled CI / staticcheck (386, windows) (push) Has been cancelled CI / staticcheck (amd64, darwin) (push) Has been cancelled CI / staticcheck (amd64, linux) (push) Has been cancelled CI / staticcheck (amd64, windows) (push) Has been cancelled CI / notify_slack (push) Has been cancelled CI / check_mergeability (push) Has been cancelled CI / cross (amd64, darwin) (push) Has been cancelled CI / cross (amd64, freebsd) (push) Has been cancelled CI / cross (amd64, openbsd) (push) Has been cancelled This allows us to print the time that a netcheck was run, which is useful in debugging. Updates #10972 Signed-off-by: Andrew Dunham <andrew@du.nham.ca> Change-Id: Id48d30d4eb6d5208efb2b1526a71d83fe7f9320b	2024-10-22 15:52:42 -04:00
Maisem Ali	85241f8408	net/tstun: use /10 as subnet for TAP mode; read IP from netmap Some checks failed test installer.sh / test (curl, alpine:3.14) (push) Has been cancelled test installer.sh / test (curl, alpine:edge) (push) Has been cancelled test installer.sh / test (curl, alpine:latest) (push) Has been cancelled test installer.sh / test (curl, amazonlinux:latest) (push) Has been cancelled test installer.sh / test (curl, archlinux:latest) (push) Has been cancelled test installer.sh / test (curl, debian:oldstable-slim) (push) Has been cancelled test installer.sh / test (curl, debian:sid-slim) (push) Has been cancelled test installer.sh / test (curl, debian:stable-slim) (push) Has been cancelled test installer.sh / test (curl, debian:testing-slim) (push) Has been cancelled test installer.sh / test (curl, elementary/docker:stable) (push) Has been cancelled test installer.sh / test (curl, elementary/docker:unstable) (push) Has been cancelled test installer.sh / test (curl, fedora:latest) (push) Has been cancelled test installer.sh / test (curl, kalilinux/kali-dev) (push) Has been cancelled test installer.sh / test (curl, kalilinux/kali-rolling) (push) Has been cancelled test installer.sh / test (curl, opensuse/leap:latest) (push) Has been cancelled test installer.sh / test (curl, opensuse/tumbleweed:latest) (push) Has been cancelled test installer.sh / test (curl, oraclelinux:8) (push) Has been cancelled test installer.sh / test (curl, oraclelinux:9) (push) Has been cancelled test installer.sh / test (curl, parrotsec/core:latest) (push) Has been cancelled test installer.sh / test (curl, parrotsec/core:lts-amd64) (push) Has been cancelled test installer.sh / test (curl, rockylinux:8.7) (push) Has been cancelled test installer.sh / test (curl, rockylinux:9) (push) Has been cancelled test installer.sh / test (curl, ubuntu:18.04) (push) Has been cancelled test installer.sh / test (curl, ubuntu:20.04) (push) Has been cancelled test installer.sh / test (curl, ubuntu:22.04) (push) Has been cancelled test installer.sh / test (curl, ubuntu:23.04) (push) Has been cancelled test installer.sh / test (wget apt-transport-https, ubuntu:16.04) (push) Has been cancelled test installer.sh / test (wget, debian:oldstable-slim) (push) Has been cancelled test installer.sh / test (wget, debian:sid-slim) (push) Has been cancelled test installer.sh / test (wget, ubuntu:23.04) (push) Has been cancelled Few changes to resolve TODOs in the code: - Instead of using a hardcoded IP, get it from the netmap. - Use 100.100.100.100 as the gateway IP - Use the /10 CGNAT range instead of a random /24 Updates #2589 Signed-off-by: Maisem Ali <maisem@tailscale.com>	2024-10-21 17:24:29 -07:00
Maisem Ali	d4d21a0bbf	net/tstun: restore tap mode functionality It had bit-rotted likely during the transition to vector io in `76389d8baf`. Tested on Ubuntu 24.04 by creating a netns and doing the DHCP dance to get an IP. Updates #2589 Signed-off-by: Maisem Ali <maisem@tailscale.com>	2024-10-21 17:02:53 -07:00
Andrea Gottardo	f8f53bb6d4	health: remove SysDNSOS, add two Warnables for read+set system DNS config (#13874 )	2024-10-21 13:40:43 -07:00
Andrea Gottardo	fd77965f23	net/tlsdial: call out firewalls blocking Tailscale in health warnings (#13840 ) Updates tailscale/tailscale#13839 Adds a new blockblame package which can detect common MITM SSL certificates used by network appliances. We use this in `tlsdial` to display a dedicated health warning when we cannot connect to control, and a network appliance MITM attack is detected. Signed-off-by: Andrea Gottardo <andrea@gottardo.me>	2024-10-19 00:35:46 +00:00
Jordan Whited	877fa504b4	net/netcheck: remove arbitrary deadlines from GetReport() tests (#13832 ) GetReport() may have side effects when the caller enforces a deadline that is shorter than ReportTimeout. Updates #13783 Updates #13394 Signed-off-by: Jordan Whited <jordan@tailscale.com>	2024-10-18 13:12:07 -07:00
Kristoffer Dalby	e0d711c478	{net/connstats,wgengine/magicsock}: fix packet counting in connstats connstats currently increments the packet counter whenever it is called to store a length of data, however when udp batch sending was introduced we pass the length for a series of packages, and it is only incremented ones, making it count wrongly if we are on a platform supporting udp batches. Updates tailscale/corp#22075 Signed-off-by: Kristoffer Dalby <kristoffer@tailscale.com>	2024-10-14 14:17:56 +02:00
Nick Khyl	f07ff47922	net/dns/resolver: add tests for using a forwarder with multiple upstream resolvers If multiple upstream DNS servers are available, quad-100 sends requests to all of them and forwards the first successful response, if any. If no successful responses are received, it propagates the first failure from any of them. This PR adds some test coverage for these scenarios. Updates #13571 Signed-off-by: Nick Khyl <nickk@tailscale.com>	2024-10-11 12:02:27 -05:00
Nick Hill	c2144c44a3	net/dns/resolver: update (forwarder).forwardWithDestChan to always return an error unless it sends a response to responseChan We currently have two executions paths where (forwarder).forwardWithDestChan returns nil, rather than an error, without sending a DNS response to responseChan. These paths are accompanied by a comment that reads: // Returning an error will cause an internal retry, there is // nothing we can do if parsing failed. Just drop the packet. But it is not (or no longer longer) accurate: returning an error from forwardWithDestChan does not currently cause a retry. Moreover, although these paths are currently unreachable due to implementation details, if (forwarder).forwardWithDestChan were to return nil without sending a response to responseChan, it would cause a deadlock at one call site and a panic at another. Therefore, we update (forwarder).forwardWithDestChan to return errors in those two paths and remove comments that were no longer accurate and misleading. Updates #cleanup Updates #13571 Signed-off-by: Nick Hill <mykola.khyl@gmail.com>	2024-10-11 12:02:27 -05:00
Nick Hill	e7545f2eac	net/dns/resolver: translate 5xx DoH server errors into SERVFAIL DNS responses If a DoH server returns an HTTP server error, rather than a SERVFAIL within a successful HTTP response, we should handle it in the same way as SERVFAIL. Updates #13571 Signed-off-by: Nick Hill <mykola.khyl@gmail.com>	2024-10-11 12:02:27 -05:00
Nick Hill	17335d2104	net/dns/resolver: forward SERVFAIL responses over PeerDNS As per the docstring, (forwarder).forwardWithDestChan should either send to responseChan and returns nil, or returns a non-nil error (without sending to the channel). However, this does not hold when all upstream DNS servers replied with an error. We've been handling this special error path in (Resolver).Query but not in (Resolver).HandlePeerDNSQuery. As a result, SERVFAIL responses from upstream servers were being converted into HTTP 503 responses, instead of being properly forwarded as SERVFAIL within a successful HTTP response, as per RFC 8484, section 4.2.1: A successful HTTP response with a 2xx status code (see Section 6.3 of [RFC7231]) is used for any valid DNS response, regardless of the DNS response code. For example, a successful 2xx HTTP status code is used even with a DNS message whose DNS response code indicates failure, such as SERVFAIL or NXDOMAIN. In this PR we fix (forwarder).forwardWithDestChan to no longer return an error when it sends a response to responseChan, and remove the special handling in (*Resolver).Query, as it is no longer necessary. Updates #13571 Signed-off-by: Nick Hill <mykola.khyl@gmail.com>	2024-10-11 12:02:27 -05:00
Jordan Whited	33029d4486	net/netcheck: fix netcheck cli-triggered nil pointer deref (#13782 ) Updates #13780 Signed-off-by: Jordan Whited <jordan@tailscale.com>	2024-10-10 15:52:47 -07:00
Brad Fitzpatrick	841eaacb07	net/sockstats: quiet some log spam in release builds Updates #13731 Change-Id: Ibee85426827ebb9e43a1c42a9c07c847daa50117 Signed-off-by: Brad Fitzpatrick <bradfitz@tailscale.com>	2024-10-08 11:02:46 -07:00
Andrew Dunham	8ee7f82bf4	net/netcheck: don't panic if a region has no Nodes Updates #13728 Signed-off-by: Andrew Dunham <andrew@du.nham.ca> Change-Id: I1e8319d6b2da013ae48f15113b30c9333e69cc0b	2024-10-08 12:52:27 -04:00
Andrea Gottardo	6de6ab015f	net/dns: tweak DoH timeout, limit MaxConnsPerHost, require TLS 1.3 (#13564 ) Updates tailscale/tailscale#6148 This is the result of some observations we made today with @raggi. The DNS over HTTPS client currently doesn't cap the number of connections it uses, either in-use or idle. A burst of DNS queries will open multiple connections. Idle connections remain open for 30 seconds (this interval is defined in the dohTransportTimeout constant). For DoH providers like NextDNS which send keep-alives, this means the cellular modem will remain up more than expected to send ACKs if any keep-alives are received while a connection remains idle during those 30 seconds. We can set the IdleConnTimeout to 10 seconds to ensure an idle connection is terminated if no other DNS queries come in after 10 seconds. Additionally, we can cap the number of connections to 1. This ensures that at all times there is only one open DoH connection, either active or idle. If idle, it will be terminated within 10 seconds from the last query. We also observed all the DoH providers we support are capable of TLS 1.3. We can force this TLS version to reduce the number of packets sent/received each time a TLS connection is established. Signed-off-by: Andrea Gottardo <andrea@gottardo.me>	2024-10-02 09:26:11 -07:00
Brad Fitzpatrick	f49d218cfe	net/dnscache: don't fall back to an IPv6 dial if we don't have IPv6 I noticed while debugging a test failure elsewhere that our failure logs (when verbosity is cranked up) were uselessly attributing dial failures to failure to dial an invalid IP address (this IPv6 address we didn't have), rather than showing me the actual IPv4 connection failure. Updates #13597 (tangentially) Change-Id: I45ffbefbc7e25ebfb15768006413a705b941dae5 Signed-off-by: Brad Fitzpatrick <bradfitz@tailscale.com>	2024-10-02 10:41:08 -05:00
Andrea Gottardo	ed1ac799c8	net/captivedetection: set Timeout on net.Dialer (#13613 ) Updates tailscale/tailscale#1634 Updates tailscale/tailscale#13265 Captive portal detection uses a custom `net.Dialer` in its `http.Client`. This custom Dialer ensures that the socket is bound specifically to the Wi-Fi interface. This is crucial because without it, if any default routes are set, the outgoing requests for detecting a captive portal would bypass Wi-Fi and go through the default route instead. The Dialer did not have a Timeout property configured, so the default system timeout was applied. This caused issues in #13265, where we attempted to make captive portal detection requests over an IPsec interface used for Wi-Fi Calling. The call to `connect()` would fail and remain blocked until the system timeout (approximately 1 minute) was reached. In #13598, I simply excluded the IPsec interface from captive portal detection. This was a quick and safe mitigation for the issue. This PR is a follow-up to make the process more robust, by setting a 3 seconds timeout on any connection establishment on any interface (this is the same timeout interval we were already setting on the HTTP client). Signed-off-by: Andrea Gottardo <andrea@gottardo.me>	2024-10-02 15:29:46 +00:00
Brad Fitzpatrick	262c526c4e	net/portmapper: don't treat 0.0.0.0 as a valid IP Updates tailscale/corp#23538 Change-Id: I58b8c30abe43f1d1829f01eb9fb2c1e6e8db9476 Signed-off-by: Brad Fitzpatrick <bradfitz@tailscale.com>	2024-10-01 16:11:47 -05:00
Andrew Dunham	16ef88754d	net/portmapper: don't return unspecified/local external IPs We were previously not checking that the external IP that we got back from a UPnP portmap was a valid endpoint; add minimal validation that this endpoint is something that is routeable by another host. Updates tailscale/corp#23538 Signed-off-by: Andrew Dunham <andrew@du.nham.ca> Change-Id: Id9649e7683394aced326d5348f4caa24d0efd532	2024-10-01 14:13:40 -04:00
Andrea Gottardo	69be54c7b6	net/captivedetection: exclude ipsec interfaces from captive portal detection (#13598 ) Updates tailscale/tailscale#1634 Logs from some iOS users indicate that we're pointlessly performing captive portal detection on certain interfaces named ipsec*. These are tunnels with the cellular carrier that do not offer Internet access, and are only used to provide internet calling functionality (VoLTE / VoWiFi). ``` attempting to do captive portal detection on interface ipsec1 attempting to do captive portal detection on interface ipsec6 ``` This PR excludes interfaces with the `ipsec` prefix from captive portal detection. Signed-off-by: Andrea Gottardo <andrea@gottardo.me>	2024-09-26 17:28:10 +00:00
Kristoffer Dalby	7d1160ddaa	{ipn,net,tsnet}: use tsaddr helpers Signed-off-by: Kristoffer Dalby <kristoffer@tailscale.com>	2024-09-26 12:17:31 +02:00
Kristoffer Dalby	3dc33a0a5b	net/tsaddr: add WithoutExitRoutes and IsExitRoute Updates #cleanup Signed-off-by: Kristoffer Dalby <kristoffer@tailscale.com>	2024-09-26 12:17:31 +02:00
Kristoffer Dalby	0e0e53d3b3	util/usermetrics: make usermetrics non-global this commit changes usermetrics to be non-global, this is a building block for correct metrics if a go process runs multiple tsnets or in tests. Updates #13420 Updates tailscale/corp#22075 Signed-off-by: Kristoffer Dalby <kristoffer@tailscale.com>	2024-09-25 15:57:00 +02:00
Andrea Gottardo	8a6f48b455	cli: add `tailscale dns query` (#13368 ) Updates tailscale/tailscale#13326 Adds a CLI subcommand to perform DNS queries using the internal DNS forwarder and observe its internals (namely, which upstream resolvers are being used). Signed-off-by: Andrea Gottardo <andrea@gottardo.me>	2024-09-24 20:18:45 +00:00
James Tucker	af5a845a87	net/dns/resolver: fix dns-sd NXDOMAIN responses from quad-100 mdnsResponder at least as of macOS Sequoia does not find NXDOMAIN responses to these dns-sd PTR queries acceptable unless they include the question section in the response. This was found debugging #13511, once we turned on additional diagnostic reporting from mdnsResponder we witnessed: ``` Received unacceptable 12-byte response from 100.100.100.100 over UDP via utun6/27 -- id: 0x7F41 (32577), flags: 0x8183 (R/Query, RD, RA, NXDomain), counts: 0/0/0/0, ``` If the response includes a question section, the resposnes are acceptable, e.g.: ``` Received acceptable 59-byte response from 8.8.8.8 over UDP via en0/17 -- id: 0x2E55 (11861), flags: 0x8183 (R/Query, RD, RA, NXDomain), counts: 1/0/0/0, ``` This may be contributing to an issue under diagnosis in #13511 wherein some combination of conditions results in mdnsResponder no longer answering DNS queries correctly to applications on the system for extended periods of time (multiple minutes), while dig against quad-100 provides correct responses for those same domains. If additional debug logging is enabled in mdnsResponder we see it reporting: ``` Penalizing server 100.100.100.100 for 60 seconds ``` It is also possible that the reason that macOS & iOS never "stopped spamming" these queries is that they have never been replied to with acceptable responses. It is not clear if this special case handling of dns-sd PTR queries was ever beneficial, and given this evidence may have always been harmful. If we subsequently observe that the queries settle down now that they have acceptable responses, we should remove these special cases - making upstream queries very occasionally isn't a lot of battery, so we should be better off having to maintain less special cases and avoid bugs of this class. Updates #2442 Updates #3025 Updates #3363 Updates #3594 Updates #13511 Signed-off-by: James Tucker <james@tailscale.com>	2024-09-18 18:43:03 -07:00
Jordan Whited	951884b077	net/netcheck,wgengine/magicsock: plumb OnlyTCP443 controlknob through netcheck (#13491 ) Updates tailscale/corp#17879 Signed-off-by: Jordan Whited <jordan@tailscale.com>	2024-09-17 12:24:42 -07:00
Jordan Whited	afec2d41b4	wgengine/magicsock: remove redundant deadline from netcheck report call (#13395 ) netcheck.Client.GetReport() applies its own deadlines. This 2s deadline was causing GetReport() to never fall back to HTTPS/ICMP measurements as it was shorter than netcheck.stunProbeTimeout, leaving no time for fallbacks. Updates #13394 Updates #6187 Signed-off-by: Jordan Whited <jordan@tailscale.com>	2024-09-13 10:51:30 -07:00
Nick Khyl	4dfde7bffc	net/dns: disable DNS registration for Tailscale interface on Windows We already disable dynamic updates by setting DisableDynamicUpdate to 1 for the Tailscale interface. However, this does not prevent non-dynamic DNS registration from happening when `ipconfig /registerdns` runs and in similar scenarios. Notably, dns/windowsManager.SetDNS runs `ipconfig /registerdns`, triggering DNS registration for all interfaces that do not explicitly disable it. In this PR, we update dns/windowsManager.disableDynamicUpdates to also set RegistrationEnabled to 0. Fixes #13411 Signed-off-by: Nick Khyl <nickk@tailscale.com>	2024-09-07 19:00:38 +01:00
Jordan Whited	7aa766ee65	net/tstun: probe TCP GRO (#13376 ) Disable TCP & UDP GRO if the probe fails. torvalds/linux@e269d79c7d broke virtio_net TCP & UDP GRO causing GRO writes to return EINVAL. The bug was then resolved later in torvalds/linux@89add40066. The offending commit was pulled into various LTS releases. Updates #13041 Signed-off-by: Jordan Whited <jordan@tailscale.com>	2024-09-05 09:59:31 -07:00
Andrew Dunham	7dcf65a10a	net/dns: fix IsZero and Equal methods on OSConfig Discovered this while investigating the following issue; I think it's unrelated, but might as well fix it. Also, add a test helper for checking things that have an IsZero method using the reflect package. Updates tailscale/support-escalations#55 Signed-off-by: Andrew Dunham <andrew@du.nham.ca> Change-Id: I57b7adde43bcef9483763b561da173b4c35f49e2	2024-09-05 00:05:36 -04:00
Brad Fitzpatrick	3d401c11fa	all: use new Go 1.23 slices.Sorted more Updates #12912 Change-Id: If1294e5bc7b5d3cf0067535ae10db75e8b988d8b Signed-off-by: Brad Fitzpatrick <bradfitz@tailscale.com>	2024-09-04 14:52:21 -07:00
Andrea Gottardo	d060b3fa02	cli: implement `tailscale dns status` (#13353 ) Updates tailscale/tailscale#13326 This PR begins implementing a `tailscale dns` command group in the Tailscale CLI. It provides an initial implementation of `tailscale dns status` which dumps the state of the internal DNS forwarder. Two new endpoints were added in LocalAPI to support the CLI functionality: - `/netmap`: dumps a copy of the last received network map (because the CLI shouldn't have to listen to the ipn bus for a copy) - `/dns-osconfig`: dumps the OS DNS configuration (this will be very handy for the UI clients as well, as they currently do not display this information) My plan is to implement other subcommands mentioned in tailscale/tailscale#13326, such as `query`, in later PRs. Signed-off-by: Andrea Gottardo <andrea@gottardo.me>	2024-09-04 19:43:55 +00:00
Andrea Gottardo	0112da6070	net/dns: support GetBaseConfig on Darwin OSS tailscaled (#13351 ) Updates tailscale/tailscale#177 It appears that the OSS distribution of `tailscaled` is currently unable to get the current system base DNS configuration, as GetBaseConfig() in manager_darwin.go is unimplemented. This PR adds a basic implementation that reads the current values in `/etc/resolv.conf`, to at least unblock DNS resolution via Quad100 if `--accept-dns` is enabled. Signed-off-by: Andrea Gottardo <andrea@gottardo.me>	2024-09-04 10:31:58 -07:00
Andrew Dunham	1c972bc7cb	wgengine/magicsock: actually use AF_PACKET socket for raw disco Previously, despite what the commit said, we were using a raw IP socket that was not an AF_PACKET socket, and thus was subject to the host firewall rules. Switch to using a real AF_PACKET socket to actually get the functionality we want. Updates #13140 Signed-off-by: Andrew Dunham <andrew@du.nham.ca> Change-Id: If657daeeda9ab8d967e75a4f049c66e2bca54b78	2024-09-03 12:50:09 -04:00
Jordan Whited	45c97751fb	net/tstun: clarify GROFilterFunc *gro.GRO usage (#13318 ) Updates #cleanup Signed-off-by: Jordan Whited <jordan@tailscale.com>	2024-08-29 13:04:46 -07:00
Andrea Gottardo	a584d04f8a	dns: increase TimeToVisible before DNS unavailable warning (#13317 ) Updates tailscale/tailscale#13314 Some users are reporting 'DNS unavailable' spurious (?) warnings, especially on Android: https://old.reddit.com/r/Tailscale/comments/1f2ow3w/health_warning_dns_unavailable_on_tailscale/ https://old.reddit.com/r/Tailscale/comments/1f3l2il/health_warnings_dns_unavailable_what_does_it_mean/ I suspect this is caused by having a too low TimeToVisible setting on the Warnable, which triggers the unhealthy state during slow network transitions. Signed-off-by: Andrea Gottardo <andrea@gottardo.me>	2024-08-29 11:43:38 -07:00
Jordan Whited	0926954cf5	net/tstun,wgengine/netstack: implement TCP GRO for local services (#13315 ) Throughput improves substantially when measured via netstack loopback (TS_DEBUG_NETSTACK_LOOPBACK_PORT). Before (`d21ebc2`): jwhited@i5-12400-2:~$ iperf3 -V -c 100.100.100.100 Starting Test: protocol: TCP, 1 streams, 131072 byte blocks Test Complete. Summary Results: [ ID] Interval Transfer Bitrate Retr [ 5] 0.00-10.00 sec 5.77 GBytes 4.95 Gbits/sec 0 sender [ 5] 0.00-10.01 sec 5.77 GBytes 4.95 Gbits/sec receiver After: jwhited@i5-12400-2:~$ iperf3 -V -c 100.100.100.100 Starting Test: protocol: TCP, 1 streams, 131072 byte blocks Test Complete. Summary Results: [ ID] Interval Transfer Bitrate Retr [ 5] 0.00-10.00 sec 12.7 GBytes 10.9 Gbits/sec 0 sender [ 5] 0.00-10.00 sec 12.7 GBytes 10.9 Gbits/sec receiver Updates tailscale/corp#22754 Signed-off-by: Jordan Whited <jordan@tailscale.com>	2024-08-29 11:37:48 -07:00
Jordan Whited	31cdbd68b1	net/tstun: fix gvisor inbound GSO packet injection (#13283 ) buffs[0] was not sized to hold pkt with GSO, resulting in a panic. Updates tailscale/corp#22511 Signed-off-by: Jordan Whited <jordan@tailscale.com>	2024-08-27 14:59:43 -07:00
Kristoffer Dalby	a2c42d3cd4	usermetric: add initial user-facing metrics This commit adds a new usermetric package and wires up metrics across the tailscale client. Updates tailscale/corp#22075 Co-authored-by: Anton Tolchanov <anton@tailscale.com> Signed-off-by: Kristoffer Dalby <kristoffer@tailscale.com>	2024-08-27 11:21:35 +02:00
Jordan Whited	d097096ddc	net/tstun,wgengine/netstack: make inbound synthetic packet injection GSO-aware (#13266 ) Updates tailscale/corp#22511 Signed-off-by: Jordan Whited <jordan@tailscale.com>	2024-08-26 19:26:39 -07:00

1 2 3 4 5 ...

1125 Commits