From mboxrd@z Thu Jan  1 00:00:00 1970
Received: from bifrost.lang.hm (mail.lang.hm [64.81.33.126])
	(using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits))
	(Client did not present a certificate)
	by huchra.bufferbloat.net (Postfix) with ESMTPS id B099B21F0E7;
	Mon, 28 Jan 2013 15:30:46 -0800 (PST)
Received: from asgard.lang.hm (asgard.lang.hm [10.0.0.100])
	by bifrost.lang.hm (8.13.4/8.13.4/Debian-3) with ESMTP id r0SNUgbV013057;
	Mon, 28 Jan 2013 15:30:42 -0800
Date: Mon, 28 Jan 2013 15:29:17 -0800 (PST)
From: David Lang
X-X-Sender: dlang@asgard.lang.hm
To: Robert Bradley
User-Agent: Alpine 2.02 (DEB 1266 2009-07-14)
MIME-Version: 1.0
Content-Type: MULTIPART/Mixed; BOUNDARY="===============5079268765366972012=="
Cc: Felix Fietkau
Subject: Re: [Cerowrt-devel] deployed some cero this weekend, chasing checksums
X-BeenThere: cerowrt-devel@lists.bufferbloat.net
X-Mailman-Version: 2.1.13
Precedence: list
List-Id: Development issues regarding the cerowrt test router project
X-List-Received-Date: Mon, 28 Jan 2013 23:30:47 -0000

This message is in MIME format.  The first part should be readable text,
while the remaining parts are likely unreadable without MIME-aware tools.

--===============5079268765366972012==
Content-Type: TEXT/Plain; format=flowed; charset=US-ASCII

Depending on your hardware, you may be seeing bogus checksums on outbound
packets in the capture even though the packets on the wire have valid
checksums, because the hardware computes the checksums and fills them in
(checksum offload).

I know I've run into this one before when doing load testing.
David Lang

On Mon, 28 Jan 2013, Robert Bradley wrote:

> It looks more like data corruption of various forms as opposed to a fault
> in checksumming:
>
> - Truncation of some layer-4 data including headers to 75 octets
> - Some bad TCP packets have stored header lengths of 0 octets
> - I often see lines of incrementing bytes (30 31 32 etc.).  For example,
> packet 962 has a train of values from 0x10 to 0x2f, starting at position
> 0x003a (the TCP timestamps).  I think these are meant to be fragments from
> the ping packets (which contain 8 octets then values 0x10 to 0x37), but
> these are straying into non-ICMP packets.
> - There are pieces of HTTP in non-HTTP protocols.  For example, packet 1394
> is supposed to be UDP, but looks like it is really TCP traffic with the
> wrong protocol number.  The checksum is still invalid in either case.
> - It is possible to corrupt layer-4 checksums only, leaving the IP layer
> untouched.
>
>
> On 28 January 2013 07:52, Dave Taht wrote:
>
>> Put up a pic http://snapon.lab.bufferbloat.net/~d/yurt
>>
>> they aren't bad all the time, but when they go bad, bad things happen.
>>
>>
>> On Sun, Jan 27, 2013 at 11:41 PM, Dave Taht wrote:
>>
>>>
>>> I have been debugging some weirdness for a while. You might want to do
>>> some captures on the latest cero and look at checksums.
>>>
>>> An unreasonably high number of checksum issues seem to be happening, but
>>> there doesn't appear to be a whole lot of pattern to it, as yet.
>>>
>>> I will simplify. I pinged locally and 8.8.8.8 and surfed the web, and a
>>> symptom is that some other routers can't ping sometimes nor access much of
>>> the internet beyond the gateway. They can always reach the gateway.
>>>
>>> in the interim, the topology on this capture is
>>>
>>> 172.30.102.17 - laptop via ethernet to
>>> 172.20.102.1 - cerowrt 3.7.4-4 via ethernet to
>>> 172.20.6.1 - ubnt 3.3.8-26 via mesh to
>>> 172.20.142.11 - ubnt 3.7.4-4 via ethernet to
>>> * 192.168.100.1 - cerowrt 3.7.2 capture point (yes, updating that)
>>> 10.0.10.1 - comcast box (yes, double nat, fixing that)
>>>
>>> I took a capture on the se00 interface
>>>
>>> tcpdump -i se00 -w/tmp/yurt.cap host 172.20.102.17
>>>
>>> and stuck that capture there:
>>>
>>> http://snapon.lab.bufferbloat.net/~d/yurt/yurt.cap
>>>
>>> and then looked at it with wireshark with this filter
>>>
>>> ip.checksum_bad == 1
>>>
>>> and scratched my head at the error rate (about 1%) and the pattern (lack
>>> thereof)
>>>
>>> I will simplify in the morning
>>>
>>> --
>>> Dave Taht
>>>
>>> Fixing bufferbloat with cerowrt:
>>> http://www.teklibre.com/cerowrt/subscribe.html
>>
>>
>>
>>
>> --
>> Dave Taht
>>
>> Fixing bufferbloat with cerowrt:
>> http://www.teklibre.com/cerowrt/subscribe.html
>>
>
>
>

--===============5079268765366972012==
Content-Type: TEXT/PLAIN; CHARSET=us-ascii
Content-Disposition: INLINE

_______________________________________________
Cerowrt-devel mailing list
Cerowrt-devel@lists.bufferbloat.net
https://lists.bufferbloat.net/listinfo/cerowrt-devel

--===============5079268765366972012==--
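
For readers following the thread: the Wireshark filter quoted above
(ip.checksum_bad == 1) selects packets whose IPv4 header checksum fails to
verify. What follows is a minimal sketch of that verification using the
one's-complement sum described in RFC 1071. It is not taken from the thread.
The function names and the sample header bytes are illustrative assumptions,
and the sketch covers only the IPv4 header checksum, not the layer-4 (TCP/UDP)
checksums, which also include a pseudo-header.

```python
import struct


def internet_checksum(data: bytes) -> int:
    """Return the 16-bit one's-complement checksum of data (RFC 1071)."""
    if len(data) % 2:
        data += b"\x00"  # pad odd-length input with a zero byte
    # Sum the data as big-endian 16-bit words.
    total = sum(word for (word,) in struct.iter_unpack("!H", data))
    # Fold any carries above 16 bits back into the low 16 bits.
    while total >> 16:
        total = (total & 0xFFFF) + (total >> 16)
    return ~total & 0xFFFF


def ipv4_header_checksum_ok(packet: bytes) -> bool:
    """Check the IPv4 header checksum of a raw IP packet (hypothetical helper).

    A valid header, summed with its checksum field included, yields 0.
    """
    if len(packet) < 20:
        return False
    header_len = (packet[0] & 0x0F) * 4  # IHL field, in 32-bit words
    if header_len < 20 or len(packet) < header_len:
        return False
    return internet_checksum(packet[:header_len]) == 0


# Illustrative 20-byte IPv4 header (an assumption, not from yurt.cap).
SAMPLE_HEADER = bytes.fromhex(
    "4500007300004000" "4011b861" "c0a80001" "c0a800c7"
)

if __name__ == "__main__":
    print(ipv4_header_checksum_ok(SAMPLE_HEADER))
    # Change one byte (the TTL) without updating the checksum.
    corrupted = SAMPLE_HEADER[:8] + b"\x3f" + SAMPLE_HEADER[9:]
    print(ipv4_header_checksum_ok(corrupted))
```

A check like this only says whether the bytes in the capture are
self-consistent. As noted in the reply at the top of the thread, a capture
taken on the sending host can show bad checksums that the hardware corrects
before the packet reaches the wire.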