From: Dave Taht
To: Sebastian Moeller
Cc: babel-users@lists.alioth.debian.org, cerowrt-devel@lists.bufferbloat.net
Date: Fri, 25 May 2012 08:02:04 +0100
Subject: Re: [Cerowrt-devel] 3.3.6-2
In-Reply-To: <61BEA217-79A6-47C8-888D-101BC0EAFB45@gmx.de>
List-Id: Development issues regarding the cerowrt test router project

On Fri, May 25, 2012 at 7:41 AM, Sebastian Moeller wrote:
> Hi Robert,
>
> since I see the same log file on my router as Jim, I just want to report my observations below.
>
> On May 24, 2012, at 11:58 AM, Robert Bradley wrote:
>
>> On 24/05/12 19:15, Jim Gettys wrote:
>>> On 05/24/2012 02:12 PM, Sebastian Moeller wrote:
>>>> Hi Jim,
>>>>
>>>> good point, I will go and see whether that is the cause for my crashes… Will return to this post if/when I have new data in either direction…
>>> If you do, see if you can grab the babeld.conf file and add it to:
>>> https://www.bufferbloat.net/issues/392
>>>
>>
>> I don't know if it helps at all, but it looks like Babel's failing to obtain channel information for the guest interfaces (gw00 and gw10).

I am cc'ing the babel-users list.

So it appears that the second interface on a wireless radio does not
report channel information reliably, OR babeld is not getting it on
the second interface for some reason.

...sensing the channel is important so that diversity routing works.

Going back to vacation now.

>
>        True, in my case I had set the 2.4GHz radio to auto channel select, which does not seem to work well with either babeld or its specific configuration.
>
>> Are these disabled on your routers at the moment?  I suppose in the worst case you could try setting an explicit channel for both of the non-mesh guest interfaces and see if the logs clear up (or somehow pass "-L /dev/null" to babeld).
>
>        After setting the 2.4GHz channel to 1 instead of auto, /tmp/babeld.log still grows with the same entries.
> And on a WNDR3700v2 there are 30840 KB of tmpfs on /tmp, so the babeld.log size of 256KB should not by itself cause the router to crash. That said, while testing this hypothesis by filling most of /tmp (dd if=/dev/zero of=/tmp/delete_me bs=1024 count=30000, so that around 340KB stayed free) the router reliably went first into OOM and then rebooted itself. Might it be that the size of the /tmp filesystem is too large if actually used? If I naively add the VSZs of most processes I end up at around 90% of available memory, so worst case there actually only seems to be room for a much smaller /tmp than 30MB. Maybe restricting /tmp to 6000 KB might make this problem go away (or hooking up a swap device). Does this reasoning sound sane? Once I figure out how to reduce the size of /tmp I will test this.
>
>
>>
>> I'm assuming the ad-hoc mesh links are working fine, since gw01/gw11 aren't present in the log fragment.
>
>        In my case I do not know as I never tried to test with a mesh client.

kill babel if you aren't using it, see what happens.

/etc/init.d/babeld disable
/etc/init.d/babeld stop

>
> best
>        Sebastian
>
>> --
>> Robert Bradley
>> _______________________________________________
>> Cerowrt-devel mailing list
>> Cerowrt-devel@lists.bufferbloat.net
>> https://lists.bufferbloat.net/listinfo/cerowrt-devel

-- 
Dave Täht
SKYPE: davetaht
US Tel: 1-239-829-5608
http://www.bufferbloat.net
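
The /tmp sizing argument in the quoted message can be checked with a few lines of shell. This is only a rough sketch, not anything from the thread: `tmp_budget_kb` is a hypothetical helper, the 61680 KB total is an assumption (twice the reported 30840 KB tmpfs, on the guess that /tmp uses the tmpfs default of half of RAM), and 90 is the naive VSZ sum Sebastian mentions. The remount command is printed rather than run, since shrinking a tmpfs below its current usage fails and the result here is untested on CeroWrt.

```shell
#!/bin/sh
# Hypothetical helper: KB of RAM left over for tmpfs once processes
# have claimed their worst-case share.
#   $1 = total RAM in KB (assumed value, see below)
#   $2 = percent of RAM claimed by processes
tmp_budget_kb() {
    echo $(( $1 * (100 - $2) / 100 ))
}

# Assumed figures: 61680 KB = 2 x 30840 KB (tmpfs default of half of RAM);
# 90 = the naive sum of process VSZs quoted in the message.
budget=$(tmp_budget_kb 61680 90)
echo "worst-case /tmp budget: ${budget} KB"

# Print, rather than run, a remount that would apply the cap.
echo "mount -o remount,size=${budget}k /tmp"
```

Under those assumptions the budget comes out at roughly 6 MB, which is in line with the 6000 KB cap suggested above.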