From mboxrd@z Thu Jan 1 00:00:00 1970
Sender: Jim Gettys
Message-ID: <4FBE7AAB.5080307@freedesktop.org>
Date: Thu, 24 May 2012 14:15:07 -0400
From: Jim Gettys
Organization: Bell Labs
To: Sebastian Moeller
References: <00404BC8-3761-409D-A1C8-9213D7D9A3DF@gmx.de>
 <1E435715-5C95-49AF-99D0-E8AD6EAD5B44@gmx.de>
 <4FBE5767.6080704@gmail.com>
 <4D0F5C65-2401-470F-A6D8-BE18E8BA25C7@gmx.de>
 <4FBE6290.9000701@freedesktop.org>
 <0E4C11DB-2B8A-411B-A61F-34B2A6BF57B9@gmx.de>
In-Reply-To: <0E4C11DB-2B8A-411B-A61F-34B2A6BF57B9@gmx.de>
Cc: cerowrt-devel@lists.bufferbloat.net
Subject: Re: [Cerowrt-devel] Fwd: 3.3.6-2
List-Id: Development issues regarding the cerowrt test router project

On 05/24/2012 02:12 PM, Sebastian Moeller wrote:
> Hi Jim,
>
> good point, I will go and see whether that is the cause for my crashes… Will return to this post if/when I have new data in either direction…

If you do, see if you can grab the babeld.conf file and add it to:

https://www.bufferbloat.net/issues/392

> best
> Sebastian
>
>
>
> On May 24, 2012, at 9:32 AM, Jim Gettys wrote:
>
>> On 05/24/2012 12:18 PM, Sebastian Moeller wrote:
>>> Hi Robert,
>>>
>>> On May 24, 2012, at 8:44 AM, Robert Bradley wrote:
>>>
>>>> On 24/05/12 04:48, Sebastian Moeller wrote:
>>>>> A) under moderate wireless stress I get a lot of allocation failures from slub, like:
>>>>> [ 1221.664062] ath: skbuff alloc of size 1926 failed
>>>>> In the router's dmesg.
>>>>> And every now and then the router crashes and reboots (I have not yet found a way to make this happen reliably, it seems to require some uptime)
>>>> This looks to me like a possible memory leak somewhere, but I'm no expert.
>>> Not being an expert I concur.
>> My router's /tmp/log/babeld.log had grown to almost 256k. (and my router
>> had been flaky).
>>
>> So I suspect that's making grim trouble as /tmp is a tmpfs: i.e. coming
>> out of RAM.
>> -rw-r--r-- 1 root root 247936 May 24 12:27 babeld.log
>>
>> Tail on the babeld file had:
>>
>> Couldn't determine channel of interface gw00: Invalid argument.
>> Couldn't determine channel of interface gw10: Invalid argument.
>> Couldn't determine channel of interface gw00: Invalid argument.
>> Couldn't determine channel of interface gw10: Invalid argument.
>> Couldn't determine channel of interface gw00: Invalid argument.
>> Couldn't determine channel of interface gw10: Invalid argument.
>> Couldn't determine channel of interface gw00: Invalid argument.
>> Couldn't determine channel of interface gw10: Invalid argument.
>> Couldn't determine channel of interface gw00: Invalid argument.
>> Couldn't determine channel of interface gw10: Invalid argument.
>>
>> I should probably have grabbed a copy before nuking the file. /me bad....
>>
>> Will put into redmine...
>>
>> - Jim
>>
>>>> (Unless cerowrt is using tmpfs and filling up memory with logs, of course.)
>>> I tried to check that, but since I cannot reproduce the crashes easily yet, I have not been able to test that hypothesis (when I checked "df -h" on the router there always was some room left, but heck, for all I know it might be the log entries for the allocation failures that quickly eat up all the remaining memory). I will try to test this hypothesis. So far I have tried to check dmesg and free in rapid succession during the test runs that are prone to cause the crash; free memory fluctuates some, but I never saw it reach 0 just before crashing.
>>>
>>>> Is UDP from the wired side to the Internet also OK? I'm assuming it is, but it would be nice to prove that it is actually a leak in ath9k and/or the wireless stack first!
>>> Actually I have not tested that yet (again with the crash somewhat hard to reproduce I will have to take the wireless out of use for 24 to 48 hours to be reasonably sure that the issue does not occur under wired connections). That said, I will go and work on that. So I have my testing work charted out and will post again once I have more data.
>>>
>>> Best
>>> Sebastian
>>>
>>>> _______________________________________________
>>>> Cerowrt-devel mailing list
>>>> Cerowrt-devel@lists.bufferbloat.net
>>>> https://lists.bufferbloat.net/listinfo/cerowrt-devel