From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-qc0-f171.google.com (mail-qc0-f171.google.com [209.85.216.171]) (using TLSv1 with cipher RC4-MD5 (128/128 bits)) (Client CN "smtp.gmail.com", Issuer "Google Internet Authority" (verified OK)) by huchra.bufferbloat.net (Postfix) with ESMTPS id BD84B200B1B for ; Thu, 24 May 2012 09:32:19 -0700 (PDT) Received: by qcsp15 with SMTP id p15so10319487qcs.16 for ; Thu, 24 May 2012 09:32:18 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=sender:message-id:date:from:organization:user-agent:mime-version:to :cc:subject:references:in-reply-to:content-type :content-transfer-encoding; bh=7v1OxlVyqXs1jXAqJCvg9KFpbiXdHbaYVOluBRNpNFk=; b=uiQTeOri5Za2g1u9lvxBTHxsKbdYQs6px5QV5v6tnaq3OT1jKDt8xU+7ZKQ5SvzAG2 B3jBMbfX3LHU8OhLGIhf0tpZe5EVHn8XW50ICLbzY8+oynYo0xejqd7gutkY05vhadL4 /0eCHVQQrKtjiRrUPajZTDCzS1RAim8G0UCKpQvz7RznyQlX8zkIHasOsRfO1s1HA4cr sj9rVGofKr9YL35v4QU85PNgdaYBMbPwN/NESDS+NDah6Y4z1A9zRVzOCWhkh3X8GLQc C9OcQSZ8mcDsKPP6gDyUGmDkAHJZsmynpBsb+XG5/gkvSMKlunfAzaohCh8J48SWAPb6 BR0w== Received: by 10.224.97.132 with SMTP id l4mr11103090qan.15.1337877138505; Thu, 24 May 2012 09:32:18 -0700 (PDT) Received: from [192.168.1.27] (c-24-218-176-94.hsd1.ma.comcast.net. [24.218.176.94]) by mx.google.com with ESMTPS id dk2sm3397314qab.14.2012.05.24.09.32.17 (version=SSLv3 cipher=OTHER); Thu, 24 May 2012 09:32:17 -0700 (PDT) Sender: Jim Gettys Message-ID: <4FBE6290.9000701@freedesktop.org> Date: Thu, 24 May 2012 12:32:16 -0400 From: Jim Gettys Organization: Bell Labs User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:12.0) Gecko/20120430 Thunderbird/12.0.1 MIME-Version: 1.0 To: Sebastian Moeller References: <00404BC8-3761-409D-A1C8-9213D7D9A3DF@gmx.de> <1E435715-5C95-49AF-99D0-E8AD6EAD5B44@gmx.de> <4FBE5767.6080704@gmail.com> <4D0F5C65-2401-470F-A6D8-BE18E8BA25C7@gmx.de> In-Reply-To: <4D0F5C65-2401-470F-A6D8-BE18E8BA25C7@gmx.de> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: cerowrt-devel@lists.bufferbloat.net Subject: Re: [Cerowrt-devel] Fwd: 3.3.6-2 X-BeenThere: cerowrt-devel@lists.bufferbloat.net X-Mailman-Version: 2.1.13 Precedence: list List-Id: Development issues regarding the cerowrt test router project List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 24 May 2012 16:32:20 -0000 On 05/24/2012 12:18 PM, Sebastian Moeller wrote: > Hi Robert, > > On May 24, 2012, at 8:44 AM, Robert Bradley wrote: > >> On 24/05/12 04:48, Sebastian Moeller wrote: >>> A) under moderate wireless stress I get a lot of allocation failures from slub, like: >>> [ 1221.664062] ath: skbuff alloc of size 1926 failed >>> In the routers dmesg. And every now and then the router crashes and reboots (I have not yet found a way to make this happen reliably, it seems to require some uptime) >> This looks to me like a possible memory leak somewhere, but I'm no expert. > Not being an expert I concur. My router's /tmp/log/babeld.log had grown to almost 256k. (and my router had been flaky). So I suspect that's making grim trouble as /tmp is a tmpfs: e.g. coming out of ram. -rw-r--r-- 1 root root 247936 May 24 12:27 babeld.log Tail on the babeld file had: Couldn't determine channel of interface gw00: Invalid argument. Couldn't determine channel of interface gw10: Invalid argument. Couldn't determine channel of interface gw00: Invalid argument. Couldn't determine channel of interface gw10: Invalid argument. Couldn't determine channel of interface gw00: Invalid argument. Couldn't determine channel of interface gw10: Invalid argument. Couldn't determine channel of interface gw00: Invalid argument. Couldn't determine channel of interface gw10: Invalid argument. Couldn't determine channel of interface gw00: Invalid argument. Couldn't determine channel of interface gw10: Invalid argument. I should probably have grabbed a copy before nuking the file. /me bad.... Will put into redmine... - Jim >> (Unless cerowrt is using tmpfs and filling up memory with logs, of course.) > I tried to check that, but since I can nor reproduce the crashes easily yet I have not been able to test that hypothesis (when I checked "df -h" on the router there always was some room left, but heck for all I know it might be the log entries for the allocation failures that quickly eat up all the remaining memory) I will try to test this hypothesis. Currently I tried to check dmesg and free in rapid succession during the test runs that are prone to cause the crash free memory fluctuates some but I never saw it reach 0 just before crashing. > >> Is UDP from the wired side to the Internet also OK? I'm assuming it is, but it would be nice to prove that it is actually a leak in ath9k and/or the wireless stack first! > Actually I have not tested that yet (again with the crash somewhat hard to reproduce I will have to take the wireless out of use for 24 to 48 hours to be reasonably sure that the issue does not occur under wired connections). That said, I will go and work on that. So I have my testing work charted out and will post again once I have more data. > > Best > Sebastian > > > > >> _______________________________________________ >> Cerowrt-devel mailing list >> Cerowrt-devel@lists.bufferbloat.net >> https://lists.bufferbloat.net/listinfo/cerowrt-devel > _______________________________________________ > Cerowrt-devel mailing list > Cerowrt-devel@lists.bufferbloat.net > https://lists.bufferbloat.net/listinfo/cerowrt-devel