From: Dave Taht
Date: Thu, 16 Feb 2017 19:50:59 -0800
To: Frank Horowitz
Cc: make-wifi-fast@lists.bufferbloat.net, Toke Høiland-Jørgensen
Subject: Re: [Make-wifi-fast] Instrumented ATH9K for Crashes?
In-Reply-To: <47B84782-858A-4CAF-BD22-3CC835483DC9@horow.net>

It is my hope Felix squished it last week.

https://patchwork.kernel.org/patch/9568369/

On Thu, Feb 16, 2017 at 7:18 PM, Frank Horowitz wrote:
> Hi All,
>
> TL/DR: I’ve been seeing reliable crashes from ATH9K drivers in net-next kernels for weeks, but have been unable to capture a crash log.
>
> In an attempt at having a reliably/regularly updatable router running the ATF and BBR codes, I’ve assembled an Atom based Zotac mini-itx board with two different ATH9K based radios. I’ve installed Ubuntu 16.10, with a kernel compiled from Dave Miller’s net-next tree (currently running 4.10-rc7). The radios are set up using 2 different hostapd.conf files (one for the 2.4GHz radio, and one for the 5GHz radio). The motherboard has an RTL8169 ethernet onboard, and I’ve got a 4 port Intel ethernet card also in the mix. The RTL8169 is my WAN port, fed by a DSL modem (running LEDE), and all but one of the other network ports are part of a LAN bridge -- the last port is ultimately meant to feed a DMZ, but there’s nothing on it at the moment.
>
> When the radios are not connected to the bridge, everything has run stably for days.
> When the radios are connected to the bridge, but have no clients, the result has run stably for about 24 hours before I stopped the test.
>
> When a radio is connected to the bridge and has a client, the system reliably crashes within an hour or two.
>
> I’ve tried to get netconsole logs from another Linux box on my bridged LAN, but that’s a Heisenbug because I can’t get the ATH9Ks to play well with netconsole over the bridge. I think this is due to the lack of polling in the ATH9K driver, but would be delighted to find out that it’s something configurable for those radios. Bottom line, I’ve had no luck in snagging a log from the crashes via netconsole. I’ve also tried looking at the systemd logs, but nothing made it to the log database before the crash.
>
> I could reconfigure my network such that the unbridged DMZ is feeding my external Linux box.
>
> Before I try that, I thought I’d ask Toke and the list for advice about any configs for the ATH9K driver that might help with A) capturing a crash log, and/or B) debugging the drivers.
>
> Hopefully, by the time this bites someone else in 4.11 kernels, we’ll have been able to squish this bug. (Just to be explicit, I’m volunteering to be a testbed. Don’t tell my wife! ;-) )
>
> TIA for any hints on how best to proceed.
>
> Frank Horowitz
> frank@horow.net
>
> _______________________________________________
> Make-wifi-fast mailing list
> Make-wifi-fast@lists.bufferbloat.net
> https://lists.bufferbloat.net/listinfo/make-wifi-fast

--
Dave Täht
Let's go make home routers and wifi faster! With better software!
http://blog.cerowrt.org