* [Cerowrt-devel] cerowrt issues (3.10.24-8)
@ 2014-01-24 23:08 Steve Jenson
2014-01-24 23:23 ` Dave Taht
0 siblings, 1 reply; 16+ messages in thread
From: Steve Jenson @ 2014-01-24 23:08 UTC (permalink / raw)
To: cerowrt-devel
[-- Attachment #1: Type: text/plain, Size: 13280 bytes --]
Hi everybody,
I've been using cerowrt as a secondary wifi network (just a single AP for
now) for a few weeks now. Recently, my wndr3800 got stuck in a bad state
and eventually rebooted. I've had this happen a few times now and am
looking for ways to debug the issue. I'm new to cerowrt and openwrt so any
advice is appreciated.
Since I use it as a secondary network, this is no way critical. I'm not
looking for free tech support but I couldn't find anything on the wiki
about troubleshooting. I'd love to start a page and write some shell
scripts to diagnose and report issues. I know that a cerowrt router is
meant to be a research project rather a consumer device but these things
seem helpful regardless.
Please let me know if you'd prefer I not email the list with these issues
or if you'd rather I used trac or a different forum.
--
In this state, I can connect to the cerowrt base station via wifi but am
unable to route packets to the internet. I can connect to :81 and see the
login page but logging in results in a lua error at `/cgi-bin/luci`
/usr/lib/lua/luci/dispatcher.lua:448: Failed to execute function
dispatcher target for entry '/'.
The called action terminated with an exception:
/usr/lib/lua/luci/sauth.lua:87: Session data invalid!
stack traceback:
[C]: in function 'assert'
/usr/lib/lua/luci/dispatcher.lua:448: in function 'dispatch'
/usr/lib/lua/luci/dispatcher.lua:195: in function
</usr/lib/lua/luci/dispatcher.lua:194>
I can ssh into the device and cat various log files until the router hangs
and reboots. here's a few relevant lines from my terminal history before
the device rebooted (I'm assuming a watchdog kicked in and rebooted it).
root@buffy2-1:~# ping google.com
ping: bad address 'google.com'
root@buffy2-1:~# free
total used free shared buffers
Mem: 126336 110332 16004 0 5616
-/+ buffers: 104716 21620
Swap: 0 0 0
root@buffy2-1:~# uptime
02:08:54 up 2 days, 1:26, load average: 0.10, 0.21, 0.17
root@buffy2-1:~# dmesg
[ 0.000000] Linux version 3.10.24 (cero2@snapon) (gcc version 4.6.4
(OpenWrt/Linaro GCC 4.6-2013.05 r38226) ) #1 Tue Dec 24
10:50:15 PST 2013
[skipping some lines]
[ 13.156250] Error: Driver 'gpio-keys-polled' is already registered,
aborting...
[ 19.414062] IPv6: ADDRCONF(NETDEV_UP): ge00: link is not ready
[ 19.421875] ar71xx: pll_reg 0xb8050010: 0x11110000
[ 19.429687] se00: link up (1000Mbps/Full duplex)
[ 22.140625] IPv6: ADDRCONF(NETDEV_UP): sw00: link is not ready
[ 23.351562] IPv6: ADDRCONF(NETDEV_CHANGE): sw00: link becomes ready
[ 23.757812] ar71xx: pll_reg 0xb8050014: 0x11110000
[ 23.757812] ge00: link up (1000Mbps/Full duplex)
[ 23.773437] IPv6: ADDRCONF(NETDEV_CHANGE): ge00: link becomes ready
root@buffy2-1:~# ifconfig
ge00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B1
inet addr:192.168.1.138 Bcast:192.168.1.255 Mask:255.255.255.0
inet6 addr: fe80::2eb0:5dff:fea0:c5b1/64 Scope:Link
inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64 Scope:Global
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:1469670 errors:0 dropped:8 overruns:0 frame:0
TX packets:547733 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:229243410 (218.6 MiB) TX bytes:57304808 (54.6 MiB)
Interrupt:5
lo Link encap:Local Loopback
inet addr:127.0.0.1 Mask:255.0.0.0
inet6 addr: ::1/128 Scope:Host
UP LOOPBACK RUNNING MTU:65536 Metric:1
RX packets:23689 errors:0 dropped:0 overruns:0 frame:0
TX packets:23689 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:2612713 (2.4 MiB) TX bytes:2612713 (2.4 MiB)
pimreg Link encap:UNSPEC HWaddr
00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00
UP RUNNING NOARP MTU:1472 Metric:1
RX packets:0 errors:0 dropped:0 overruns:0 frame:0
TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:0 (0.0 B) TX bytes:0 (0.0 B)
se00 Link encap:Ethernet HWaddr 2E:B0:5D:A0:C5:B0
inet addr:172.30.42.1 Bcast:172.30.42.31 Mask:255.255.255.224
inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64 Scope:Global
inet6 addr: fe80::2cb0:5dff:fea0:c5b0/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:0 errors:0 dropped:0 overruns:0 frame:0
TX packets:191740 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:0 (0.0 B) TX bytes:42184988 (40.2 MiB)
Interrupt:4
sw00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B0
inet addr:172.30.42.65 Bcast:172.30.42.95 Mask:255.255.255.224
inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64 Scope:Global
inet6 addr: fe80::2eb0:5dff:fea0:c5b0/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:70239 errors:0 dropped:0 overruns:0 frame:0
TX packets:286967 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:15590189 (14.8 MiB) TX bytes:127357293 (121.4 MiB)
root@buffy2-1:~# less /var/log/babeld.log
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
send: Cannot assign requested address
send: Cannot assign requested address
send: Cannot assign requested address
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
netlink_read: recvmsg(): No buffer space available
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
Couldn't determine channel of interface sw00: Invalid argument.
</quote>
and then the system hang. Note: the first time I ran 'less
/var/log/babeld.log', it was Killed. I would assume that's an OOM
killer?
I'm running cerowrt 3.10.24-8 on a wndr3800. I have a spare wndr3800
that I'm getting ready to flash with the same version and I can see if I get
the same issue. What's the best way to clone a configuration?
---
Here are some log lines from a separate instance of the same problem, I
recovered these log lines from the web server (I was able to login to
the web interface for another hour until I got the same lua exception
as in the beginning of this email)
This time I was completely unable to connect to wifi from my iPhone
and you can see where hostapd disassociates the iPhone soon after
it connects.
the kernel log:
[158995.550781] icmp6_send: no reply to icmp error
[160930.527343] ath: phy0: Failed to stop TX DMA, queues=0x004!
[160930.542968] ath: phy0: DMA failed to stop in 10 ms AR_CR=0x00000024
AR_DIAG_SW=0x42000020 DMADBG_7=0x000084c0
[160930.554687] ath: phy0: Could not stop RX, we could be confusing the DMA
engine when we start RX up
and in the system log:
Fri Jan 24 15:51:15 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0):
No buffer space available
Fri Jan 24 15:51:15 2014 daemon.err avahi-daemon[1273]: netlink.c:
recvmsg() failed: No buffer space available
Fri Jan 24 15:52:18 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0):
No buffer space available
root@buffy2-1:/tmp/log# free
total used free shared buffers
Mem: 126336 110128 16208 0 5616
-/+ buffers: 104512 21824
Swap: 0 0 0
Fri Jan 24 17:14:53 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0):
No buffer space available
Fri Jan 24 17:15:53 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0):
No buffer space available
Fri Jan 24 17:16:18 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
IEEE 802.11: authenticated
Fri Jan 24 17:16:18 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
IEEE 802.11: associated (aid 1)
Fri Jan 24 17:16:18 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
RADIUS: starting accounting session 52DC70A7-00000010
Fri Jan 24 17:16:18 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
WPA: pairwise key handshake completed (RSN)
Fri Jan 24 17:16:18 2014 daemon.info dnsmasq-dhcp[4999]: DHCPREQUEST(sw00)
192.168.1.114 90:72:40:e9:9c:b6
Fri Jan 24 17:16:18 2014 daemon.info dnsmasq-dhcp[4999]: DHCPNAK(sw00)
192.168.1.114 90:72:40:e9:9c:b6 wrong network
Fri Jan 24 17:16:21 2014 daemon.info dnsmasq-dhcp[4999]: DHCPDISCOVER(sw00)
90:72:40:e9:9c:b6
Fri Jan 24 17:16:21 2014 daemon.info dnsmasq-dhcp[4999]: DHCPOFFER(sw00)
172.30.42.82 90:72:40:e9:9c:b6
Fri Jan 24 17:16:21 2014 daemon.info dnsmasq-dhcp[4999]: DHCPDISCOVER(sw00)
90:72:40:e9:9c:b6
Fri Jan 24 17:16:21 2014 daemon.info dnsmasq-dhcp[4999]: DHCPOFFER(sw00)
172.30.42.82 90:72:40:e9:9c:b6
Fri Jan 24 17:16:22 2014 daemon.info dnsmasq-dhcp[4999]: DHCPREQUEST(sw00)
172.30.42.82 90:72:40:e9:9c:b6
Fri Jan 24 17:16:22 2014 daemon.info dnsmasq-dhcp[4999]: DHCPACK(sw00)
172.30.42.82 90:72:40:e9:9c:b6 stevejs-iPhone
Fri Jan 24 17:16:35 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
IEEE 802.11: disassociated
Fri Jan 24 17:16:36 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
IEEE 802.11: deauthenticated due to inactivity (timer DEAUTH/REMOVE)
Fri Jan 24 17:16:57 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0):
No buffer space available
Fri Jan 24 17:17:59 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0):
No buffer space available
Fri Jan 24 17:19:04 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0):
No buffer space available
Fri Jan 24 17:19:04 2014 daemon.err avahi-daemon[1273]: netlink.c:
recvmsg() failed: No buffer space available
Fri Jan 24 17:19:45 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
IEEE 802.11: authenticated
Fri Jan 24 17:19:45 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
IEEE 802.11: associated (aid 1)
Fri Jan 24 17:19:45 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
RADIUS: starting accounting session 52DC70A7-00000011
Fri Jan 24 17:19:45 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
WPA: pairwise key handshake completed (RSN)
Fri Jan 24 17:19:45 2014 daemon.info dnsmasq-dhcp[4999]: DHCPDISCOVER(sw00)
90:72:40:e9:9c:b6
Fri Jan 24 17:19:45 2014 daemon.info dnsmasq-dhcp[4999]: DHCPOFFER(sw00)
172.30.42.82 90:72:40:e9:9c:b6
Fri Jan 24 17:19:47 2014 daemon.info dnsmasq-dhcp[4999]: DHCPREQUEST(sw00)
172.30.42.82 90:72:40:e9:9c:b6
Fri Jan 24 17:19:47 2014 daemon.info dnsmasq-dhcp[4999]: DHCPACK(sw00)
172.30.42.82 90:72:40:e9:9c:b6 stevejs-iPhone
Fri Jan 24 17:20:01 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
IEEE 802.11: disassociated
Fri Jan 24 17:20:02 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
IEEE 802.11: deauthenticated due to inactivity (timer DEAUTH/REMOVE)
Fri Jan 24 17:20:09 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0):
No buffer space available
Fri Jan 24 17:21:10 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0):
No buffer space available
and then it hung and rebooted.
I bought this particular wndr3800 used so it's possible I have a bad piece
of hardware. Are there any hardware testing scripts or anything like
memtest86 for these devices? Are there entries in /sys that I should be
looking at? I've noticed about 9k unaligned_instructions on boot but that
never grows.
Thanks!
Steve Jenson
@stevej
[-- Attachment #2: Type: text/html, Size: 16203 bytes --]
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [Cerowrt-devel] cerowrt issues (3.10.24-8)
2014-01-24 23:08 [Cerowrt-devel] cerowrt issues (3.10.24-8) Steve Jenson
@ 2014-01-24 23:23 ` Dave Taht
2014-01-27 21:06 ` Steve Jenson
0 siblings, 1 reply; 16+ messages in thread
From: Dave Taht @ 2014-01-24 23:23 UTC (permalink / raw)
To: Steve Jenson; +Cc: cerowrt-devel
On Fri, Jan 24, 2014 at 6:08 PM, Steve Jenson <stevej@fruitless.org> wrote:
> Hi everybody,
>
> I've been using cerowrt as a secondary wifi network (just a single AP for
> now) for a few weeks now. Recently, my wndr3800 got stuck in a bad state and
> eventually rebooted. I've had this happen a few times now and am looking for
> ways to debug the issue. I'm new to cerowrt and openwrt so any advice is
> appreciated.
>
> Since I use it as a secondary network, this is no way critical.
Yea! I appreciate caution before putting alpha software on your gw.
> I'm not
> looking for free tech support but I couldn't find anything on the wiki about
> troubleshooting. I'd love to start a page and write some shell scripts to
> diagnose and report issues. I know that a cerowrt router is meant to be a
> research project rather a consumer device but these things seem helpful
> regardless.
Sure, let me know your wiki account. I have been lax about granting
access of late as the signup process is overrun by spammers.
> Please let me know if you'd prefer I not email the list with these issues or
> if you'd rather I used trac or a different forum.
>
The list is where most stuff happens. Also in the irc channel.
If it gets to where it needs to be tracked we have a bugtracker at
http://www.bufferbloat.net/projects/cerowrt/issues
The first question I have is: Are you on comcast? Cerowrt
had a dhcpv6-pd implementation that "just worked" from feburary through
december. Regrettably they changed the RA announcement interval
to a really low number around then... and this triggers a firewall reload
every minute on everything prior to the release I point to below.
If there is a memory leak somewhere that would have triggered it.
> --
>
> In this state, I can connect to the cerowrt base station via wifi but am
> unable to route packets to the internet. I can connect to :81 and see the
> login page but logging in results in a lua error at `/cgi-bin/luci`
>
>
> /usr/lib/lua/luci/dispatcher.lua:448: Failed to execute function
> dispatcher target for entry '/'.
> The called action terminated with an exception:
> /usr/lib/lua/luci/sauth.lua:87: Session data invalid!
> stack traceback:
> [C]: in function 'assert'
> /usr/lib/lua/luci/dispatcher.lua:448: in function 'dispatch'
> /usr/lib/lua/luci/dispatcher.lua:195: in function
> </usr/lib/lua/luci/dispatcher.lua:194>
>
> I can ssh into the device and cat various log files until the router hangs
> and reboots. here's a few relevant lines from my terminal history before the
> device rebooted (I'm assuming a watchdog kicked in and rebooted it).
>
> root@buffy2-1:~# ping google.com
> ping: bad address 'google.com'
> root@buffy2-1:~# free
> total used free shared buffers
> Mem: 126336 110332 16004 0 5616
> -/+ buffers: 104716 21620
> Swap: 0 0 0
> root@buffy2-1:~# uptime
> 02:08:54 up 2 days, 1:26, load average: 0.10, 0.21, 0.17
> root@buffy2-1:~# dmesg
> [ 0.000000] Linux version 3.10.24 (cero2@snapon) (gcc version 4.6.4
> (OpenWrt/Linaro GCC 4.6-2013.05 r38226) ) #1 Tue Dec 24
> 10:50:15 PST 2013
> [skipping some lines]
>
> [ 13.156250] Error: Driver 'gpio-keys-polled' is already registered,
> aborting...
> [ 19.414062] IPv6: ADDRCONF(NETDEV_UP): ge00: link is not ready
> [ 19.421875] ar71xx: pll_reg 0xb8050010: 0x11110000
> [ 19.429687] se00: link up (1000Mbps/Full duplex)
> [ 22.140625] IPv6: ADDRCONF(NETDEV_UP): sw00: link is not ready
> [ 23.351562] IPv6: ADDRCONF(NETDEV_CHANGE): sw00: link becomes ready
> [ 23.757812] ar71xx: pll_reg 0xb8050014: 0x11110000
> [ 23.757812] ge00: link up (1000Mbps/Full duplex)
> [ 23.773437] IPv6: ADDRCONF(NETDEV_CHANGE): ge00: link becomes ready
>
> root@buffy2-1:~# ifconfig
> ge00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B1
> inet addr:192.168.1.138 Bcast:192.168.1.255 Mask:255.255.255.0
> inet6 addr: fe80::2eb0:5dff:fea0:c5b1/64 Scope:Link
> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64 Scope:Global
> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
> RX packets:1469670 errors:0 dropped:8 overruns:0 frame:0
> TX packets:547733 errors:0 dropped:0 overruns:0 carrier:0
> collisions:0 txqueuelen:1000
> RX bytes:229243410 (218.6 MiB) TX bytes:57304808 (54.6 MiB)
> Interrupt:5
>
> lo Link encap:Local Loopback
> inet addr:127.0.0.1 Mask:255.0.0.0
> inet6 addr: ::1/128 Scope:Host
> UP LOOPBACK RUNNING MTU:65536 Metric:1
> RX packets:23689 errors:0 dropped:0 overruns:0 frame:0
> TX packets:23689 errors:0 dropped:0 overruns:0 carrier:0
> collisions:0 txqueuelen:0
> RX bytes:2612713 (2.4 MiB) TX bytes:2612713 (2.4 MiB)
>
> pimreg Link encap:UNSPEC HWaddr
> 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00
> UP RUNNING NOARP MTU:1472 Metric:1
> RX packets:0 errors:0 dropped:0 overruns:0 frame:0
> TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
> collisions:0 txqueuelen:0
> RX bytes:0 (0.0 B) TX bytes:0 (0.0 B)
>
> se00 Link encap:Ethernet HWaddr 2E:B0:5D:A0:C5:B0
> inet addr:172.30.42.1 Bcast:172.30.42.31 Mask:255.255.255.224
> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64 Scope:Global
How are you assigning your ipv6 addresses?
> inet6 addr: fe80::2cb0:5dff:fea0:c5b0/64 Scope:Link
> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
> RX packets:0 errors:0 dropped:0 overruns:0 frame:0
> TX packets:191740 errors:0 dropped:0 overruns:0 carrier:0
> collisions:0 txqueuelen:1000
> RX bytes:0 (0.0 B) TX bytes:42184988 (40.2 MiB)
> Interrupt:4
>
> sw00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B0
> inet addr:172.30.42.65 Bcast:172.30.42.95 Mask:255.255.255.224
> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64 Scope:Global
> inet6 addr: fe80::2eb0:5dff:fea0:c5b0/64 Scope:Link
> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
> RX packets:70239 errors:0 dropped:0 overruns:0 frame:0
> TX packets:286967 errors:0 dropped:0 overruns:0 carrier:0
> collisions:0 txqueuelen:1000
> RX bytes:15590189 (14.8 MiB) TX bytes:127357293 (121.4 MiB)
>
> root@buffy2-1:~# less /var/log/babeld.log
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> send: Cannot assign requested address
> send: Cannot assign requested address
> send: Cannot assign requested address
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> netlink_read: recvmsg(): No buffer space available
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
> Couldn't determine channel of interface sw00: Invalid argument.
This is a problem in babel detecting the channel on a "normal"
rather than a mesh interface. It's bugged me a long while, but
haven't got around to finding what triggers it. Might "fix" it by
acquiring the channel at babel start time from /etc/config/wireless.
It messes up the diversity routing calculation, grump.
There is a possibility a logfile got really big, but this one
generally doesn't, but I should turn off logging in some
future release...
> </quote>
>
> and then the system hang. Note: the first time I ran 'less
> /var/log/babeld.log', it was Killed. I would assume that's an OOM
> killer?
Yes it really sounds like you were out of memory. Your free does not
indicate that however.
> I'm running cerowrt 3.10.24-8 on a wndr3800. I have a spare wndr3800
> that I'm getting ready to flash with the same version and I can see if I get
> the same issue. What's the best way to clone a configuration?
scp -r therouter:/overlay somewheresafe
then reflash it clean
>
> ---
>
> Here are some log lines from a separate instance of the same problem, I
> recovered these log lines from the web server (I was able to login to
> the web interface for another hour until I got the same lua exception
> as in the beginning of this email)
You can setup remote syslog logging if you lke.
> This time I was completely unable to connect to wifi from my iPhone
> and you can see where hostapd disassociates the iPhone soon after
> it connects.
>
> the kernel log:
>
> [158995.550781] icmp6_send: no reply to icmp error
>
> [160930.527343] ath: phy0: Failed to stop TX DMA, queues=0x004!
We see this all the time for 3 years now.
> [160930.542968] ath: phy0: DMA failed to stop in 10 ms AR_CR=0x00000024
> AR_DIAG_SW=0x42000020 DMADBG_7=0x000084c0
that is kind of new.
>
> [160930.554687] ath: phy0: Could not stop RX, we could be confusing the DMA
> engine when we start RX up
So is that.
>
> and in the system log:
>
> Fri Jan 24 15:51:15 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0): No
> buffer space available
>
> Fri Jan 24 15:51:15 2014 daemon.err avahi-daemon[1273]: netlink.c: recvmsg()
> failed: No buffer space available
>
> Fri Jan 24 15:52:18 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0): No
> buffer space available
>
>
> root@buffy2-1:/tmp/log# free
>
> total used free shared buffers
>
> Mem: 126336 110128 16208 0 5616
>
> -/+ buffers: 104512 21824
>
> Swap: 0 0 0
>
>
> Fri Jan 24 17:14:53 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0): No
> buffer space available
>
> Fri Jan 24 17:15:53 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0): No
> buffer space available
>
> Fri Jan 24 17:16:18 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
> IEEE 802.11: authenticated
>
> Fri Jan 24 17:16:18 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
> IEEE 802.11: associated (aid 1)
>
> Fri Jan 24 17:16:18 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
> RADIUS: starting accounting session 52DC70A7-00000010
>
> Fri Jan 24 17:16:18 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
> WPA: pairwise key handshake completed (RSN)
>
> Fri Jan 24 17:16:18 2014 daemon.info dnsmasq-dhcp[4999]: DHCPREQUEST(sw00)
> 192.168.1.114 90:72:40:e9:9c:b6
>
> Fri Jan 24 17:16:18 2014 daemon.info dnsmasq-dhcp[4999]: DHCPNAK(sw00)
> 192.168.1.114 90:72:40:e9:9c:b6 wrong network
>
> Fri Jan 24 17:16:21 2014 daemon.info dnsmasq-dhcp[4999]: DHCPDISCOVER(sw00)
> 90:72:40:e9:9c:b6
>
> Fri Jan 24 17:16:21 2014 daemon.info dnsmasq-dhcp[4999]: DHCPOFFER(sw00)
> 172.30.42.82 90:72:40:e9:9c:b6
>
> Fri Jan 24 17:16:21 2014 daemon.info dnsmasq-dhcp[4999]: DHCPDISCOVER(sw00)
> 90:72:40:e9:9c:b6
>
> Fri Jan 24 17:16:21 2014 daemon.info dnsmasq-dhcp[4999]: DHCPOFFER(sw00)
> 172.30.42.82 90:72:40:e9:9c:b6
>
> Fri Jan 24 17:16:22 2014 daemon.info dnsmasq-dhcp[4999]: DHCPREQUEST(sw00)
> 172.30.42.82 90:72:40:e9:9c:b6
>
> Fri Jan 24 17:16:22 2014 daemon.info dnsmasq-dhcp[4999]: DHCPACK(sw00)
> 172.30.42.82 90:72:40:e9:9c:b6 stevejs-iPhone
>
> Fri Jan 24 17:16:35 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
> IEEE 802.11: disassociated
>
> Fri Jan 24 17:16:36 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
> IEEE 802.11: deauthenticated due to inactivity (timer DEAUTH/REMOVE)
>
> Fri Jan 24 17:16:57 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0): No
> buffer space available
>
> Fri Jan 24 17:17:59 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0): No
> buffer space available
>
> Fri Jan 24 17:19:04 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0): No
> buffer space available
>
> Fri Jan 24 17:19:04 2014 daemon.err avahi-daemon[1273]: netlink.c: recvmsg()
> failed: No buffer space available
>
> Fri Jan 24 17:19:45 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
> IEEE 802.11: authenticated
>
> Fri Jan 24 17:19:45 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
> IEEE 802.11: associated (aid 1)
>
> Fri Jan 24 17:19:45 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
> RADIUS: starting accounting session 52DC70A7-00000011
>
> Fri Jan 24 17:19:45 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
> WPA: pairwise key handshake completed (RSN)
>
> Fri Jan 24 17:19:45 2014 daemon.info dnsmasq-dhcp[4999]: DHCPDISCOVER(sw00)
> 90:72:40:e9:9c:b6
>
> Fri Jan 24 17:19:45 2014 daemon.info dnsmasq-dhcp[4999]: DHCPOFFER(sw00)
> 172.30.42.82 90:72:40:e9:9c:b6
>
> Fri Jan 24 17:19:47 2014 daemon.info dnsmasq-dhcp[4999]: DHCPREQUEST(sw00)
> 172.30.42.82 90:72:40:e9:9c:b6
>
> Fri Jan 24 17:19:47 2014 daemon.info dnsmasq-dhcp[4999]: DHCPACK(sw00)
> 172.30.42.82 90:72:40:e9:9c:b6 stevejs-iPhone
>
> Fri Jan 24 17:20:01 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
> IEEE 802.11: disassociated
>
> Fri Jan 24 17:20:02 2014 daemon.info hostapd: sw00: STA 90:72:40:e9:9c:b6
> IEEE 802.11: deauthenticated due to inactivity (timer DEAUTH/REMOVE)
>
> Fri Jan 24 17:20:09 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0): No
> buffer space available
>
> Fri Jan 24 17:21:10 2014 daemon.err minissdpd[4933]: recvmsg(s, &hdr, 0): No
> buffer space available
>
> and then it hung and rebooted.
You are running out of ram somewhere unobservable, and yes, I'd be
a little nervous that it isn't actually a memory or hw problem.
>
> I bought this particular wndr3800 used so it's possible I have a bad piece
> of hardware. Are there any hardware testing scripts or anything like
> memtest86 for these devices? Are there entries in /sys that I should be
> looking at? I've noticed about 9k unaligned_instructions on boot but that
> never grows.
I have only observed 4 in the last 24 hours on this build:
https://lists.bufferbloat.net/pipermail/cerowrt-devel/2014-January/002093.html
>
> Thanks!
> Steve Jenson
> @stevej
>
>
>
>
> _______________________________________________
> Cerowrt-devel mailing list
> Cerowrt-devel@lists.bufferbloat.net
> https://lists.bufferbloat.net/listinfo/cerowrt-devel
>
--
Dave Täht
Fixing bufferbloat with cerowrt: http://www.teklibre.com/cerowrt/subscribe.html
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [Cerowrt-devel] cerowrt issues (3.10.24-8)
2014-01-24 23:23 ` Dave Taht
@ 2014-01-27 21:06 ` Steve Jenson
2014-01-27 21:10 ` Steve Jenson
0 siblings, 1 reply; 16+ messages in thread
From: Steve Jenson @ 2014-01-27 21:06 UTC (permalink / raw)
To: Dave Taht; +Cc: cerowrt-devel
[-- Attachment #1: Type: text/plain, Size: 11366 bytes --]
On Fri, Jan 24, 2014 at 3:23 PM, Dave Taht <dave.taht@gmail.com> wrote:
> On Fri, Jan 24, 2014 at 6:08 PM, Steve Jenson <stevej@fruitless.org>
> wrote:
> > Hi everybody,
> >
> > I've been using cerowrt as a secondary wifi network (just a single AP for
> > now) for a few weeks now. Recently, my wndr3800 got stuck in a bad state
> and
> > eventually rebooted. I've had this happen a few times now and am looking
> for
> > ways to debug the issue. I'm new to cerowrt and openwrt so any advice is
> > appreciated.
> >
> > Since I use it as a secondary network, this is no way critical.
>
> Yea! I appreciate caution before putting alpha software on your gw.
>
> > I'm not
> > looking for free tech support but I couldn't find anything on the wiki
> about
> > troubleshooting. I'd love to start a page and write some shell scripts to
> > diagnose and report issues. I know that a cerowrt router is meant to be a
> > research project rather a consumer device but these things seem helpful
> > regardless.
>
> Sure, let me know your wiki account. I have been lax about granting
> access of late as the signup process is overrun by spammers.
My username is stevej on the wiki. Thanks!
> > Please let me know if you'd prefer I not email the list with these
> issues or
> > if you'd rather I used trac or a different forum.
> >
>
> The list is where most stuff happens. Also in the irc channel.
>
> If it gets to where it needs to be tracked we have a bugtracker at
>
> http://www.bufferbloat.net/projects/cerowrt/issues
>
> The first question I have is: Are you on comcast? Cerowrt
> had a dhcpv6-pd implementation that "just worked" from feburary through
> december. Regrettably they changed the RA announcement interval
> to a really low number around then... and this triggers a firewall reload
> every minute on everything prior to the release I point to below.
>
> If there is a memory leak somewhere that would have triggered it.
I am on AT&T ADSL2+ with a Motorola NVG510 modem.
> > In this state, I can connect to the cerowrt base station via wifi but am
> > unable to route packets to the internet. I can connect to :81 and see the
> > login page but logging in results in a lua error at `/cgi-bin/luci`
> >
> >
> > /usr/lib/lua/luci/dispatcher.lua:448: Failed to execute function
> > dispatcher target for entry '/'.
> > The called action terminated with an exception:
> > /usr/lib/lua/luci/sauth.lua:87: Session data invalid!
> > stack traceback:
> > [C]: in function 'assert'
> > /usr/lib/lua/luci/dispatcher.lua:448: in function 'dispatch'
> > /usr/lib/lua/luci/dispatcher.lua:195: in function
> > </usr/lib/lua/luci/dispatcher.lua:194>
> >
> > I can ssh into the device and cat various log files until the router
> hangs
> > and reboots. here's a few relevant lines from my terminal history before
> the
> > device rebooted (I'm assuming a watchdog kicked in and rebooted it).
> >
> > root@buffy2-1:~# ping google.com
> > ping: bad address 'google.com'
> > root@buffy2-1:~# free
> > total used free shared buffers
> > Mem: 126336 110332 16004 0 5616
> > -/+ buffers: 104716 21620
> > Swap: 0 0 0
> > root@buffy2-1:~# uptime
> > 02:08:54 up 2 days, 1:26, load average: 0.10, 0.21, 0.17
> > root@buffy2-1:~# dmesg
> > [ 0.000000] Linux version 3.10.24 (cero2@snapon) (gcc version 4.6.4
> > (OpenWrt/Linaro GCC 4.6-2013.05 r38226) ) #1 Tue Dec 24
> > 10:50:15 PST 2013
> > [skipping some lines]
> >
> > [ 13.156250] Error: Driver 'gpio-keys-polled' is already registered,
> > aborting...
> > [ 19.414062] IPv6: ADDRCONF(NETDEV_UP): ge00: link is not ready
> > [ 19.421875] ar71xx: pll_reg 0xb8050010: 0x11110000
> > [ 19.429687] se00: link up (1000Mbps/Full duplex)
> > [ 22.140625] IPv6: ADDRCONF(NETDEV_UP): sw00: link is not ready
> > [ 23.351562] IPv6: ADDRCONF(NETDEV_CHANGE): sw00: link becomes ready
> > [ 23.757812] ar71xx: pll_reg 0xb8050014: 0x11110000
> > [ 23.757812] ge00: link up (1000Mbps/Full duplex)
> > [ 23.773437] IPv6: ADDRCONF(NETDEV_CHANGE): ge00: link becomes ready
> >
> > root@buffy2-1:~# ifconfig
> > ge00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B1
> > inet addr:192.168.1.138 Bcast:192.168.1.255
> Mask:255.255.255.0
> > inet6 addr: fe80::2eb0:5dff:fea0:c5b1/64 Scope:Link
> > inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
> Scope:Global
> > UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
> > RX packets:1469670 errors:0 dropped:8 overruns:0 frame:0
> > TX packets:547733 errors:0 dropped:0 overruns:0 carrier:0
> > collisions:0 txqueuelen:1000
> > RX bytes:229243410 (218.6 MiB) TX bytes:57304808 (54.6 MiB)
> > Interrupt:5
> >
> > lo Link encap:Local Loopback
> > inet addr:127.0.0.1 Mask:255.0.0.0
> > inet6 addr: ::1/128 Scope:Host
> > UP LOOPBACK RUNNING MTU:65536 Metric:1
> > RX packets:23689 errors:0 dropped:0 overruns:0 frame:0
> > TX packets:23689 errors:0 dropped:0 overruns:0 carrier:0
> > collisions:0 txqueuelen:0
> > RX bytes:2612713 (2.4 MiB) TX bytes:2612713 (2.4 MiB)
> >
> > pimreg Link encap:UNSPEC HWaddr
> > 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00
> > UP RUNNING NOARP MTU:1472 Metric:1
> > RX packets:0 errors:0 dropped:0 overruns:0 frame:0
> > TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
> > collisions:0 txqueuelen:0
> > RX bytes:0 (0.0 B) TX bytes:0 (0.0 B)
> >
> > se00 Link encap:Ethernet HWaddr 2E:B0:5D:A0:C5:B0
> > inet addr:172.30.42.1 Bcast:172.30.42.31 Mask:255.255.255.224
> > inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
> Scope:Global
>
> How are you assigning your ipv6 addresses?
>
It's been a while since I messed with this but I think IPv6 is assigned
thanks to 6relayd? My modem has IPv6 enabled but no DHCPv6 options that I
can find. Here's how cerowrt is configured.
root@buffy2-1:/overlay/etc/config# cat 6relayd
config server 'default'
option fallback_relay 'rd dhcpv6 ndp'
list network 'ge00'
list network 'ge01'
list network 'gw00'
list network 'gw01'
list network 'gw10'
list network 'gw11'
list network 'se00'
list network 'sw00'
list network 'sw10'
option rd 'relay'
option dhcpv6 'relay'
option ndp 'relay'
option master 'ge00'
> > inet6 addr: fe80::2cb0:5dff:fea0:c5b0/64 Scope:Link
> > UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
> > RX packets:0 errors:0 dropped:0 overruns:0 frame:0
> > TX packets:191740 errors:0 dropped:0 overruns:0 carrier:0
> > collisions:0 txqueuelen:1000
> > RX bytes:0 (0.0 B) TX bytes:42184988 (40.2 MiB)
> > Interrupt:4
> >
> > sw00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B0
> > inet addr:172.30.42.65 Bcast:172.30.42.95
> Mask:255.255.255.224
> > inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
> Scope:Global
> > inet6 addr: fe80::2eb0:5dff:fea0:c5b0/64 Scope:Link
> > UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
> > RX packets:70239 errors:0 dropped:0 overruns:0 frame:0
> > TX packets:286967 errors:0 dropped:0 overruns:0 carrier:0
> > collisions:0 txqueuelen:1000
> > RX bytes:15590189 (14.8 MiB) TX bytes:127357293 (121.4 MiB)
> >
> > root@buffy2-1:~# less /var/log/babeld.log
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > send: Cannot assign requested address
> > send: Cannot assign requested address
> > send: Cannot assign requested address
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > netlink_read: recvmsg(): No buffer space available
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
> > Couldn't determine channel of interface sw00: Invalid argument.
>
> This is a problem in babel detecting the channel on a "normal"
> rather than a mesh interface. It's bugged me a long while, but
> haven't got around to finding what triggers it. Might "fix" it by
> acquiring the channel at babel start time from /etc/config/wireless.
>
> It messes up the diversity routing calculation, grump.
>
> There is a possibility a logfile got really big, but this one
> generally doesn't, but I should turn off logging in some
> future release...
I believe I've tracked down part of what's going on. It looks like my tmpfs
is filling up 100% and then the device enters a bad state:
After 24 hours, with tmpfs at 50%, babeld.log is the largest file by far in
tmpfs and the only file that appears to be growing (based on `du`). It
takes about 48 hours from reboot to fill up tmpfs on my device.
# sort babeld.log | uniq -c |sort -rn |head
503236 Couldn't determine channel of interface sw00: Invalid argument.
1376 netlink_read: recvmsg(): No buffer space available
3 send: Cannot assign requested address
# wc -l babeld.log
504617 babeld.log
I sped up system failure by using `dd` to fill up tmpfs and the system
became immediately unusable.
This also explains the luci session store errors as sessions are stored in
tmpfs.
The other buffer issues may or may not be related to this.
Best,
Steve
[-- Attachment #2: Type: text/html, Size: 14513 bytes --]
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [Cerowrt-devel] cerowrt issues (3.10.24-8)
2014-01-27 21:06 ` Steve Jenson
@ 2014-01-27 21:10 ` Steve Jenson
2014-01-27 21:14 ` Dave Taht
0 siblings, 1 reply; 16+ messages in thread
From: Steve Jenson @ 2014-01-27 21:10 UTC (permalink / raw)
To: Dave Taht; +Cc: cerowrt-devel
[-- Attachment #1: Type: text/plain, Size: 11966 bytes --]
Looking more, the buffer errors are showing up in syslog well before tmpfs
fills up. Is the memtester openwrt package available for cerowrt? I don't
see it under `Available packages`.
Thanks,
Steve
On Mon, Jan 27, 2014 at 1:06 PM, Steve Jenson <stevej@fruitless.org> wrote:
> On Fri, Jan 24, 2014 at 3:23 PM, Dave Taht <dave.taht@gmail.com> wrote:
>
>> On Fri, Jan 24, 2014 at 6:08 PM, Steve Jenson <stevej@fruitless.org>
>> wrote:
>> > Hi everybody,
>> >
>> > I've been using cerowrt as a secondary wifi network (just a single AP
>> for
>> > now) for a few weeks now. Recently, my wndr3800 got stuck in a bad
>> state and
>> > eventually rebooted. I've had this happen a few times now and am
>> looking for
>> > ways to debug the issue. I'm new to cerowrt and openwrt so any advice is
>> > appreciated.
>> >
>> > Since I use it as a secondary network, this is no way critical.
>>
>> Yea! I appreciate caution before putting alpha software on your gw.
>>
>> > I'm not
>> > looking for free tech support but I couldn't find anything on the wiki
>> about
>> > troubleshooting. I'd love to start a page and write some shell scripts
>> to
>> > diagnose and report issues. I know that a cerowrt router is meant to be
>> a
>> > research project rather a consumer device but these things seem helpful
>> > regardless.
>>
>> Sure, let me know your wiki account. I have been lax about granting
>> access of late as the signup process is overrun by spammers.
>
>
> My username is stevej on the wiki. Thanks!
>
>
>
>> > Please let me know if you'd prefer I not email the list with these
>> issues or
>> > if you'd rather I used trac or a different forum.
>> >
>>
>> The list is where most stuff happens. Also in the irc channel.
>>
>> If it gets to where it needs to be tracked we have a bugtracker at
>>
>> http://www.bufferbloat.net/projects/cerowrt/issues
>>
>> The first question I have is: Are you on comcast? Cerowrt
>> had a dhcpv6-pd implementation that "just worked" from feburary through
>> december. Regrettably they changed the RA announcement interval
>> to a really low number around then... and this triggers a firewall reload
>> every minute on everything prior to the release I point to below.
>>
>> If there is a memory leak somewhere that would have triggered it.
>
>
> I am on AT&T ADSL2+ with a Motorola NVG510 modem.
>
>
>
>> > In this state, I can connect to the cerowrt base station via wifi but am
>> > unable to route packets to the internet. I can connect to :81 and see
>> the
>> > login page but logging in results in a lua error at `/cgi-bin/luci`
>> >
>> >
>> > /usr/lib/lua/luci/dispatcher.lua:448: Failed to execute function
>> > dispatcher target for entry '/'.
>> > The called action terminated with an exception:
>> > /usr/lib/lua/luci/sauth.lua:87: Session data invalid!
>> > stack traceback:
>> > [C]: in function 'assert'
>> > /usr/lib/lua/luci/dispatcher.lua:448: in function 'dispatch'
>> > /usr/lib/lua/luci/dispatcher.lua:195: in function
>> > </usr/lib/lua/luci/dispatcher.lua:194>
>> >
>> > I can ssh into the device and cat various log files until the router
>> hangs
>> > and reboots. here's a few relevant lines from my terminal history
>> before the
>> > device rebooted (I'm assuming a watchdog kicked in and rebooted it).
>> >
>> > root@buffy2-1:~# ping google.com
>> > ping: bad address 'google.com'
>> > root@buffy2-1:~# free
>> > total used free shared buffers
>> > Mem: 126336 110332 16004 0 5616
>> > -/+ buffers: 104716 21620
>> > Swap: 0 0 0
>> > root@buffy2-1:~# uptime
>> > 02:08:54 up 2 days, 1:26, load average: 0.10, 0.21, 0.17
>> > root@buffy2-1:~# dmesg
>> > [ 0.000000] Linux version 3.10.24 (cero2@snapon) (gcc version 4.6.4
>> > (OpenWrt/Linaro GCC 4.6-2013.05 r38226) ) #1 Tue Dec 24
>> > 10:50:15 PST 2013
>> > [skipping some lines]
>> >
>> > [ 13.156250] Error: Driver 'gpio-keys-polled' is already registered,
>> > aborting...
>> > [ 19.414062] IPv6: ADDRCONF(NETDEV_UP): ge00: link is not ready
>> > [ 19.421875] ar71xx: pll_reg 0xb8050010: 0x11110000
>> > [ 19.429687] se00: link up (1000Mbps/Full duplex)
>> > [ 22.140625] IPv6: ADDRCONF(NETDEV_UP): sw00: link is not ready
>> > [ 23.351562] IPv6: ADDRCONF(NETDEV_CHANGE): sw00: link becomes ready
>> > [ 23.757812] ar71xx: pll_reg 0xb8050014: 0x11110000
>> > [ 23.757812] ge00: link up (1000Mbps/Full duplex)
>> > [ 23.773437] IPv6: ADDRCONF(NETDEV_CHANGE): ge00: link becomes ready
>> >
>> > root@buffy2-1:~# ifconfig
>> > ge00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B1
>> > inet addr:192.168.1.138 Bcast:192.168.1.255
>> Mask:255.255.255.0
>> > inet6 addr: fe80::2eb0:5dff:fea0:c5b1/64 Scope:Link
>> > inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>> Scope:Global
>> > UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>> > RX packets:1469670 errors:0 dropped:8 overruns:0 frame:0
>> > TX packets:547733 errors:0 dropped:0 overruns:0 carrier:0
>> > collisions:0 txqueuelen:1000
>> > RX bytes:229243410 (218.6 MiB) TX bytes:57304808 (54.6 MiB)
>> > Interrupt:5
>> >
>> > lo Link encap:Local Loopback
>> > inet addr:127.0.0.1 Mask:255.0.0.0
>> > inet6 addr: ::1/128 Scope:Host
>> > UP LOOPBACK RUNNING MTU:65536 Metric:1
>> > RX packets:23689 errors:0 dropped:0 overruns:0 frame:0
>> > TX packets:23689 errors:0 dropped:0 overruns:0 carrier:0
>> > collisions:0 txqueuelen:0
>> > RX bytes:2612713 (2.4 MiB) TX bytes:2612713 (2.4 MiB)
>> >
>> > pimreg Link encap:UNSPEC HWaddr
>> > 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00
>> > UP RUNNING NOARP MTU:1472 Metric:1
>> > RX packets:0 errors:0 dropped:0 overruns:0 frame:0
>> > TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
>> > collisions:0 txqueuelen:0
>> > RX bytes:0 (0.0 B) TX bytes:0 (0.0 B)
>> >
>> > se00 Link encap:Ethernet HWaddr 2E:B0:5D:A0:C5:B0
>> > inet addr:172.30.42.1 Bcast:172.30.42.31
>> Mask:255.255.255.224
>> > inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>> Scope:Global
>>
>> How are you assigning your ipv6 addresses?
>>
>
> It's been a while since I messed with this but I think IPv6 is assigned
> thanks to 6relayd? My modem has IPv6 enabled but no DHCPv6 options that I
> can find. Here's how cerowrt is configured.
>
> root@buffy2-1:/overlay/etc/config# cat 6relayd
> config server 'default'
> option fallback_relay 'rd dhcpv6 ndp'
> list network 'ge00'
> list network 'ge01'
> list network 'gw00'
> list network 'gw01'
> list network 'gw10'
> list network 'gw11'
> list network 'se00'
> list network 'sw00'
> list network 'sw10'
> option rd 'relay'
> option dhcpv6 'relay'
> option ndp 'relay'
> option master 'ge00'
>
>
>> > inet6 addr: fe80::2cb0:5dff:fea0:c5b0/64 Scope:Link
>> > UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>> > RX packets:0 errors:0 dropped:0 overruns:0 frame:0
>> > TX packets:191740 errors:0 dropped:0 overruns:0 carrier:0
>> > collisions:0 txqueuelen:1000
>> > RX bytes:0 (0.0 B) TX bytes:42184988 (40.2 MiB)
>> > Interrupt:4
>> >
>> > sw00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B0
>> > inet addr:172.30.42.65 Bcast:172.30.42.95
>> Mask:255.255.255.224
>> > inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>> Scope:Global
>> > inet6 addr: fe80::2eb0:5dff:fea0:c5b0/64 Scope:Link
>> > UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>> > RX packets:70239 errors:0 dropped:0 overruns:0 frame:0
>> > TX packets:286967 errors:0 dropped:0 overruns:0 carrier:0
>> > collisions:0 txqueuelen:1000
>> > RX bytes:15590189 (14.8 MiB) TX bytes:127357293 (121.4 MiB)
>> >
>> > root@buffy2-1:~# less /var/log/babeld.log
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > send: Cannot assign requested address
>> > send: Cannot assign requested address
>> > send: Cannot assign requested address
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > netlink_read: recvmsg(): No buffer space available
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>> > Couldn't determine channel of interface sw00: Invalid argument.
>>
>> This is a problem in babel detecting the channel on a "normal"
>> rather than a mesh interface. It's bugged me a long while, but
>> haven't got around to finding what triggers it. Might "fix" it by
>> acquiring the channel at babel start time from /etc/config/wireless.
>>
>> It messes up the diversity routing calculation, grump.
>>
>> There is a possibility a logfile got really big, but this one
>> generally doesn't, but I should turn off logging in some
>> future release...
>
>
> I believe I've tracked down part of what's going on. It looks like my
> tmpfs is filling up 100% and then the device enters a bad state:
>
> After 24 hours, with tmpfs at 50%, babeld.log is the largest file by far
> in tmpfs and the only file that appears to be growing (based on `du`). It
> takes about 48 hours from reboot to fill up tmpfs on my device.
>
> # sort babeld.log | uniq -c |sort -rn |head
>
> 503236 Couldn't determine channel of interface sw00: Invalid argument.
>
> 1376 netlink_read: recvmsg(): No buffer space available
>
> 3 send: Cannot assign requested address
>
> # wc -l babeld.log
>
> 504617 babeld.log
> I sped up system failure by using `dd` to fill up tmpfs and the system
> became immediately unusable.
>
> This also explains the luci session store errors as sessions are stored in
> tmpfs.
>
> The other buffer issues may or may not be related to this.
>
> Best,
> Steve
>
[-- Attachment #2: Type: text/html, Size: 15217 bytes --]
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [Cerowrt-devel] cerowrt issues (3.10.24-8)
2014-01-27 21:10 ` Steve Jenson
@ 2014-01-27 21:14 ` Dave Taht
2014-01-29 12:45 ` Sebastian Moeller
0 siblings, 1 reply; 16+ messages in thread
From: Dave Taht @ 2014-01-27 21:14 UTC (permalink / raw)
To: Steve Jenson; +Cc: cerowrt-devel
certainly turn off the babeld log! I will leave it off in the next release.
On Mon, Jan 27, 2014 at 4:10 PM, Steve Jenson <stevej@fruitless.org> wrote:
> Looking more, the buffer errors are showing up in syslog well before tmpfs
> fills up. Is the memtester openwrt package available for cerowrt? I don't
> see it under `Available packages`.
>
> Thanks,
> Steve
>
>
> On Mon, Jan 27, 2014 at 1:06 PM, Steve Jenson <stevej@fruitless.org> wrote:
>>
>> On Fri, Jan 24, 2014 at 3:23 PM, Dave Taht <dave.taht@gmail.com> wrote:
>>>
>>> On Fri, Jan 24, 2014 at 6:08 PM, Steve Jenson <stevej@fruitless.org>
>>> wrote:
>>> > Hi everybody,
>>> >
>>> > I've been using cerowrt as a secondary wifi network (just a single AP
>>> > for
>>> > now) for a few weeks now. Recently, my wndr3800 got stuck in a bad
>>> > state and
>>> > eventually rebooted. I've had this happen a few times now and am
>>> > looking for
>>> > ways to debug the issue. I'm new to cerowrt and openwrt so any advice
>>> > is
>>> > appreciated.
>>> >
>>> > Since I use it as a secondary network, this is no way critical.
>>>
>>> Yea! I appreciate caution before putting alpha software on your gw.
>>>
>>> > I'm not
>>> > looking for free tech support but I couldn't find anything on the wiki
>>> > about
>>> > troubleshooting. I'd love to start a page and write some shell scripts
>>> > to
>>> > diagnose and report issues. I know that a cerowrt router is meant to be
>>> > a
>>> > research project rather a consumer device but these things seem helpful
>>> > regardless.
>>>
>>> Sure, let me know your wiki account. I have been lax about granting
>>> access of late as the signup process is overrun by spammers.
>>
>>
>> My username is stevej on the wiki. Thanks!
>>
>>
>>>
>>> > Please let me know if you'd prefer I not email the list with these
>>> > issues or
>>> > if you'd rather I used trac or a different forum.
>>> >
>>>
>>> The list is where most stuff happens. Also in the irc channel.
>>>
>>> If it gets to where it needs to be tracked we have a bugtracker at
>>>
>>> http://www.bufferbloat.net/projects/cerowrt/issues
>>>
>>> The first question I have is: Are you on comcast? Cerowrt
>>> had a dhcpv6-pd implementation that "just worked" from feburary through
>>> december. Regrettably they changed the RA announcement interval
>>> to a really low number around then... and this triggers a firewall reload
>>> every minute on everything prior to the release I point to below.
>>>
>>> If there is a memory leak somewhere that would have triggered it.
>>
>>
>> I am on AT&T ADSL2+ with a Motorola NVG510 modem.
>>
>>
>>>
>>> > In this state, I can connect to the cerowrt base station via wifi but
>>> > am
>>> > unable to route packets to the internet. I can connect to :81 and see
>>> > the
>>> > login page but logging in results in a lua error at `/cgi-bin/luci`
>>> >
>>> >
>>> > /usr/lib/lua/luci/dispatcher.lua:448: Failed to execute function
>>> > dispatcher target for entry '/'.
>>> > The called action terminated with an exception:
>>> > /usr/lib/lua/luci/sauth.lua:87: Session data invalid!
>>> > stack traceback:
>>> > [C]: in function 'assert'
>>> > /usr/lib/lua/luci/dispatcher.lua:448: in function 'dispatch'
>>> > /usr/lib/lua/luci/dispatcher.lua:195: in function
>>> > </usr/lib/lua/luci/dispatcher.lua:194>
>>> >
>>> > I can ssh into the device and cat various log files until the router
>>> > hangs
>>> > and reboots. here's a few relevant lines from my terminal history
>>> > before the
>>> > device rebooted (I'm assuming a watchdog kicked in and rebooted it).
>>> >
>>> > root@buffy2-1:~# ping google.com
>>> > ping: bad address 'google.com'
>>> > root@buffy2-1:~# free
>>> > total used free shared buffers
>>> > Mem: 126336 110332 16004 0 5616
>>> > -/+ buffers: 104716 21620
>>> > Swap: 0 0 0
>>> > root@buffy2-1:~# uptime
>>> > 02:08:54 up 2 days, 1:26, load average: 0.10, 0.21, 0.17
>>> > root@buffy2-1:~# dmesg
>>> > [ 0.000000] Linux version 3.10.24 (cero2@snapon) (gcc version 4.6.4
>>> > (OpenWrt/Linaro GCC 4.6-2013.05 r38226) ) #1 Tue Dec 24
>>> > 10:50:15 PST 2013
>>> > [skipping some lines]
>>> >
>>> > [ 13.156250] Error: Driver 'gpio-keys-polled' is already registered,
>>> > aborting...
>>> > [ 19.414062] IPv6: ADDRCONF(NETDEV_UP): ge00: link is not ready
>>> > [ 19.421875] ar71xx: pll_reg 0xb8050010: 0x11110000
>>> > [ 19.429687] se00: link up (1000Mbps/Full duplex)
>>> > [ 22.140625] IPv6: ADDRCONF(NETDEV_UP): sw00: link is not ready
>>> > [ 23.351562] IPv6: ADDRCONF(NETDEV_CHANGE): sw00: link becomes ready
>>> > [ 23.757812] ar71xx: pll_reg 0xb8050014: 0x11110000
>>> > [ 23.757812] ge00: link up (1000Mbps/Full duplex)
>>> > [ 23.773437] IPv6: ADDRCONF(NETDEV_CHANGE): ge00: link becomes ready
>>> >
>>> > root@buffy2-1:~# ifconfig
>>> > ge00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B1
>>> > inet addr:192.168.1.138 Bcast:192.168.1.255
>>> > Mask:255.255.255.0
>>> > inet6 addr: fe80::2eb0:5dff:fea0:c5b1/64 Scope:Link
>>> > inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>> > Scope:Global
>>> > UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>> > RX packets:1469670 errors:0 dropped:8 overruns:0 frame:0
>>> > TX packets:547733 errors:0 dropped:0 overruns:0 carrier:0
>>> > collisions:0 txqueuelen:1000
>>> > RX bytes:229243410 (218.6 MiB) TX bytes:57304808 (54.6 MiB)
>>> > Interrupt:5
>>> >
>>> > lo Link encap:Local Loopback
>>> > inet addr:127.0.0.1 Mask:255.0.0.0
>>> > inet6 addr: ::1/128 Scope:Host
>>> > UP LOOPBACK RUNNING MTU:65536 Metric:1
>>> > RX packets:23689 errors:0 dropped:0 overruns:0 frame:0
>>> > TX packets:23689 errors:0 dropped:0 overruns:0 carrier:0
>>> > collisions:0 txqueuelen:0
>>> > RX bytes:2612713 (2.4 MiB) TX bytes:2612713 (2.4 MiB)
>>> >
>>> > pimreg Link encap:UNSPEC HWaddr
>>> > 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00
>>> > UP RUNNING NOARP MTU:1472 Metric:1
>>> > RX packets:0 errors:0 dropped:0 overruns:0 frame:0
>>> > TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
>>> > collisions:0 txqueuelen:0
>>> > RX bytes:0 (0.0 B) TX bytes:0 (0.0 B)
>>> >
>>> > se00 Link encap:Ethernet HWaddr 2E:B0:5D:A0:C5:B0
>>> > inet addr:172.30.42.1 Bcast:172.30.42.31
>>> > Mask:255.255.255.224
>>> > inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>> > Scope:Global
>>>
>>> How are you assigning your ipv6 addresses?
>>
>>
>> It's been a while since I messed with this but I think IPv6 is assigned
>> thanks to 6relayd? My modem has IPv6 enabled but no DHCPv6 options that I
>> can find. Here's how cerowrt is configured.
>>
>> root@buffy2-1:/overlay/etc/config# cat 6relayd
>> config server 'default'
>> option fallback_relay 'rd dhcpv6 ndp'
>> list network 'ge00'
>> list network 'ge01'
>> list network 'gw00'
>> list network 'gw01'
>> list network 'gw10'
>> list network 'gw11'
>> list network 'se00'
>> list network 'sw00'
>> list network 'sw10'
>> option rd 'relay'
>> option dhcpv6 'relay'
>> option ndp 'relay'
>> option master 'ge00'
>>
>>>
>>> > inet6 addr: fe80::2cb0:5dff:fea0:c5b0/64 Scope:Link
>>> > UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>> > RX packets:0 errors:0 dropped:0 overruns:0 frame:0
>>> > TX packets:191740 errors:0 dropped:0 overruns:0 carrier:0
>>> > collisions:0 txqueuelen:1000
>>> > RX bytes:0 (0.0 B) TX bytes:42184988 (40.2 MiB)
>>> > Interrupt:4
>>> >
>>> > sw00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B0
>>> > inet addr:172.30.42.65 Bcast:172.30.42.95
>>> > Mask:255.255.255.224
>>> > inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>> > Scope:Global
>>> > inet6 addr: fe80::2eb0:5dff:fea0:c5b0/64 Scope:Link
>>> > UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>> > RX packets:70239 errors:0 dropped:0 overruns:0 frame:0
>>> > TX packets:286967 errors:0 dropped:0 overruns:0 carrier:0
>>> > collisions:0 txqueuelen:1000
>>> > RX bytes:15590189 (14.8 MiB) TX bytes:127357293 (121.4 MiB)
>>> >
>>> > root@buffy2-1:~# less /var/log/babeld.log
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > send: Cannot assign requested address
>>> > send: Cannot assign requested address
>>> > send: Cannot assign requested address
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > netlink_read: recvmsg(): No buffer space available
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>> > Couldn't determine channel of interface sw00: Invalid argument.
>>>
>>> This is a problem in babel detecting the channel on a "normal"
>>> rather than a mesh interface. It's bugged me a long while, but
>>> haven't got around to finding what triggers it. Might "fix" it by
>>> acquiring the channel at babel start time from /etc/config/wireless.
>>>
>>> It messes up the diversity routing calculation, grump.
>>>
>>> There is a possibility a logfile got really big, but this one
>>> generally doesn't, but I should turn off logging in some
>>> future release...
>>
>>
>> I believe I've tracked down part of what's going on. It looks like my
>> tmpfs is filling up 100% and then the device enters a bad state:
>>
>> After 24 hours, with tmpfs at 50%, babeld.log is the largest file by far
>> in tmpfs and the only file that appears to be growing (based on `du`). It
>> takes about 48 hours from reboot to fill up tmpfs on my device.
>>
>> # sort babeld.log | uniq -c |sort -rn |head
>>
>> 503236 Couldn't determine channel of interface sw00: Invalid argument.
>>
>> 1376 netlink_read: recvmsg(): No buffer space available
>>
>> 3 send: Cannot assign requested address
>>
>> # wc -l babeld.log
>>
>> 504617 babeld.log
>>
>> I sped up system failure by using `dd` to fill up tmpfs and the system
>> became immediately unusable.
>>
>> This also explains the luci session store errors as sessions are stored in
>> tmpfs.
>>
>> The other buffer issues may or may not be related to this.
>>
>> Best,
>> Steve
>
>
--
Dave Täht
Fixing bufferbloat with cerowrt: http://www.teklibre.com/cerowrt/subscribe.html
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [Cerowrt-devel] cerowrt issues (3.10.24-8)
2014-01-27 21:14 ` Dave Taht
@ 2014-01-29 12:45 ` Sebastian Moeller
2014-01-29 16:10 ` Dave Taht
0 siblings, 1 reply; 16+ messages in thread
From: Sebastian Moeller @ 2014-01-29 12:45 UTC (permalink / raw)
To: Dave Taht; +Cc: cerowrt-devel
Hi Dave,
quick question, how does one turn of logging for babeld? It seems that if daemonized it defaults to logging to /var/log/babeld.log (or similar). Is setting the log file to /dev/null really the answer?
(Since I have no the IPv6 issue not yet resolved, I assume babeld is unhappy) I resorted to stopping babeld completely, but that feels like a crutch…
Best
Sebastian
On Jan 27, 2014, at 22:14 , Dave Taht <dave.taht@gmail.com> wrote:
> certainly turn off the babeld log! I will leave it off in the next release.
>
> On Mon, Jan 27, 2014 at 4:10 PM, Steve Jenson <stevej@fruitless.org> wrote:
>> Looking more, the buffer errors are showing up in syslog well before tmpfs
>> fills up. Is the memtester openwrt package available for cerowrt? I don't
>> see it under `Available packages`.
>>
>> Thanks,
>> Steve
>>
>>
>> On Mon, Jan 27, 2014 at 1:06 PM, Steve Jenson <stevej@fruitless.org> wrote:
>>>
>>> On Fri, Jan 24, 2014 at 3:23 PM, Dave Taht <dave.taht@gmail.com> wrote:
>>>>
>>>> On Fri, Jan 24, 2014 at 6:08 PM, Steve Jenson <stevej@fruitless.org>
>>>> wrote:
>>>>> Hi everybody,
>>>>>
>>>>> I've been using cerowrt as a secondary wifi network (just a single AP
>>>>> for
>>>>> now) for a few weeks now. Recently, my wndr3800 got stuck in a bad
>>>>> state and
>>>>> eventually rebooted. I've had this happen a few times now and am
>>>>> looking for
>>>>> ways to debug the issue. I'm new to cerowrt and openwrt so any advice
>>>>> is
>>>>> appreciated.
>>>>>
>>>>> Since I use it as a secondary network, this is no way critical.
>>>>
>>>> Yea! I appreciate caution before putting alpha software on your gw.
>>>>
>>>>> I'm not
>>>>> looking for free tech support but I couldn't find anything on the wiki
>>>>> about
>>>>> troubleshooting. I'd love to start a page and write some shell scripts
>>>>> to
>>>>> diagnose and report issues. I know that a cerowrt router is meant to be
>>>>> a
>>>>> research project rather a consumer device but these things seem helpful
>>>>> regardless.
>>>>
>>>> Sure, let me know your wiki account. I have been lax about granting
>>>> access of late as the signup process is overrun by spammers.
>>>
>>>
>>> My username is stevej on the wiki. Thanks!
>>>
>>>
>>>>
>>>>> Please let me know if you'd prefer I not email the list with these
>>>>> issues or
>>>>> if you'd rather I used trac or a different forum.
>>>>>
>>>>
>>>> The list is where most stuff happens. Also in the irc channel.
>>>>
>>>> If it gets to where it needs to be tracked we have a bugtracker at
>>>>
>>>> http://www.bufferbloat.net/projects/cerowrt/issues
>>>>
>>>> The first question I have is: Are you on comcast? Cerowrt
>>>> had a dhcpv6-pd implementation that "just worked" from feburary through
>>>> december. Regrettably they changed the RA announcement interval
>>>> to a really low number around then... and this triggers a firewall reload
>>>> every minute on everything prior to the release I point to below.
>>>>
>>>> If there is a memory leak somewhere that would have triggered it.
>>>
>>>
>>> I am on AT&T ADSL2+ with a Motorola NVG510 modem.
>>>
>>>
>>>>
>>>>> In this state, I can connect to the cerowrt base station via wifi but
>>>>> am
>>>>> unable to route packets to the internet. I can connect to :81 and see
>>>>> the
>>>>> login page but logging in results in a lua error at `/cgi-bin/luci`
>>>>>
>>>>>
>>>>> /usr/lib/lua/luci/dispatcher.lua:448: Failed to execute function
>>>>> dispatcher target for entry '/'.
>>>>> The called action terminated with an exception:
>>>>> /usr/lib/lua/luci/sauth.lua:87: Session data invalid!
>>>>> stack traceback:
>>>>> [C]: in function 'assert'
>>>>> /usr/lib/lua/luci/dispatcher.lua:448: in function 'dispatch'
>>>>> /usr/lib/lua/luci/dispatcher.lua:195: in function
>>>>> </usr/lib/lua/luci/dispatcher.lua:194>
>>>>>
>>>>> I can ssh into the device and cat various log files until the router
>>>>> hangs
>>>>> and reboots. here's a few relevant lines from my terminal history
>>>>> before the
>>>>> device rebooted (I'm assuming a watchdog kicked in and rebooted it).
>>>>>
>>>>> root@buffy2-1:~# ping google.com
>>>>> ping: bad address 'google.com'
>>>>> root@buffy2-1:~# free
>>>>> total used free shared buffers
>>>>> Mem: 126336 110332 16004 0 5616
>>>>> -/+ buffers: 104716 21620
>>>>> Swap: 0 0 0
>>>>> root@buffy2-1:~# uptime
>>>>> 02:08:54 up 2 days, 1:26, load average: 0.10, 0.21, 0.17
>>>>> root@buffy2-1:~# dmesg
>>>>> [ 0.000000] Linux version 3.10.24 (cero2@snapon) (gcc version 4.6.4
>>>>> (OpenWrt/Linaro GCC 4.6-2013.05 r38226) ) #1 Tue Dec 24
>>>>> 10:50:15 PST 2013
>>>>> [skipping some lines]
>>>>>
>>>>> [ 13.156250] Error: Driver 'gpio-keys-polled' is already registered,
>>>>> aborting...
>>>>> [ 19.414062] IPv6: ADDRCONF(NETDEV_UP): ge00: link is not ready
>>>>> [ 19.421875] ar71xx: pll_reg 0xb8050010: 0x11110000
>>>>> [ 19.429687] se00: link up (1000Mbps/Full duplex)
>>>>> [ 22.140625] IPv6: ADDRCONF(NETDEV_UP): sw00: link is not ready
>>>>> [ 23.351562] IPv6: ADDRCONF(NETDEV_CHANGE): sw00: link becomes ready
>>>>> [ 23.757812] ar71xx: pll_reg 0xb8050014: 0x11110000
>>>>> [ 23.757812] ge00: link up (1000Mbps/Full duplex)
>>>>> [ 23.773437] IPv6: ADDRCONF(NETDEV_CHANGE): ge00: link becomes ready
>>>>>
>>>>> root@buffy2-1:~# ifconfig
>>>>> ge00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B1
>>>>> inet addr:192.168.1.138 Bcast:192.168.1.255
>>>>> Mask:255.255.255.0
>>>>> inet6 addr: fe80::2eb0:5dff:fea0:c5b1/64 Scope:Link
>>>>> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>>>> Scope:Global
>>>>> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>>>> RX packets:1469670 errors:0 dropped:8 overruns:0 frame:0
>>>>> TX packets:547733 errors:0 dropped:0 overruns:0 carrier:0
>>>>> collisions:0 txqueuelen:1000
>>>>> RX bytes:229243410 (218.6 MiB) TX bytes:57304808 (54.6 MiB)
>>>>> Interrupt:5
>>>>>
>>>>> lo Link encap:Local Loopback
>>>>> inet addr:127.0.0.1 Mask:255.0.0.0
>>>>> inet6 addr: ::1/128 Scope:Host
>>>>> UP LOOPBACK RUNNING MTU:65536 Metric:1
>>>>> RX packets:23689 errors:0 dropped:0 overruns:0 frame:0
>>>>> TX packets:23689 errors:0 dropped:0 overruns:0 carrier:0
>>>>> collisions:0 txqueuelen:0
>>>>> RX bytes:2612713 (2.4 MiB) TX bytes:2612713 (2.4 MiB)
>>>>>
>>>>> pimreg Link encap:UNSPEC HWaddr
>>>>> 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00
>>>>> UP RUNNING NOARP MTU:1472 Metric:1
>>>>> RX packets:0 errors:0 dropped:0 overruns:0 frame:0
>>>>> TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
>>>>> collisions:0 txqueuelen:0
>>>>> RX bytes:0 (0.0 B) TX bytes:0 (0.0 B)
>>>>>
>>>>> se00 Link encap:Ethernet HWaddr 2E:B0:5D:A0:C5:B0
>>>>> inet addr:172.30.42.1 Bcast:172.30.42.31
>>>>> Mask:255.255.255.224
>>>>> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>>>> Scope:Global
>>>>
>>>> How are you assigning your ipv6 addresses?
>>>
>>>
>>> It's been a while since I messed with this but I think IPv6 is assigned
>>> thanks to 6relayd? My modem has IPv6 enabled but no DHCPv6 options that I
>>> can find. Here's how cerowrt is configured.
>>>
>>> root@buffy2-1:/overlay/etc/config# cat 6relayd
>>> config server 'default'
>>> option fallback_relay 'rd dhcpv6 ndp'
>>> list network 'ge00'
>>> list network 'ge01'
>>> list network 'gw00'
>>> list network 'gw01'
>>> list network 'gw10'
>>> list network 'gw11'
>>> list network 'se00'
>>> list network 'sw00'
>>> list network 'sw10'
>>> option rd 'relay'
>>> option dhcpv6 'relay'
>>> option ndp 'relay'
>>> option master 'ge00'
>>>
>>>>
>>>>> inet6 addr: fe80::2cb0:5dff:fea0:c5b0/64 Scope:Link
>>>>> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>>>> RX packets:0 errors:0 dropped:0 overruns:0 frame:0
>>>>> TX packets:191740 errors:0 dropped:0 overruns:0 carrier:0
>>>>> collisions:0 txqueuelen:1000
>>>>> RX bytes:0 (0.0 B) TX bytes:42184988 (40.2 MiB)
>>>>> Interrupt:4
>>>>>
>>>>> sw00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B0
>>>>> inet addr:172.30.42.65 Bcast:172.30.42.95
>>>>> Mask:255.255.255.224
>>>>> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>>>> Scope:Global
>>>>> inet6 addr: fe80::2eb0:5dff:fea0:c5b0/64 Scope:Link
>>>>> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>>>> RX packets:70239 errors:0 dropped:0 overruns:0 frame:0
>>>>> TX packets:286967 errors:0 dropped:0 overruns:0 carrier:0
>>>>> collisions:0 txqueuelen:1000
>>>>> RX bytes:15590189 (14.8 MiB) TX bytes:127357293 (121.4 MiB)
>>>>>
>>>>> root@buffy2-1:~# less /var/log/babeld.log
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> send: Cannot assign requested address
>>>>> send: Cannot assign requested address
>>>>> send: Cannot assign requested address
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> netlink_read: recvmsg(): No buffer space available
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>
>>>> This is a problem in babel detecting the channel on a "normal"
>>>> rather than a mesh interface. It's bugged me a long while, but
>>>> haven't got around to finding what triggers it. Might "fix" it by
>>>> acquiring the channel at babel start time from /etc/config/wireless.
>>>>
>>>> It messes up the diversity routing calculation, grump.
>>>>
>>>> There is a possibility a logfile got really big, but this one
>>>> generally doesn't, but I should turn off logging in some
>>>> future release...
>>>
>>>
>>> I believe I've tracked down part of what's going on. It looks like my
>>> tmpfs is filling up 100% and then the device enters a bad state:
>>>
>>> After 24 hours, with tmpfs at 50%, babeld.log is the largest file by far
>>> in tmpfs and the only file that appears to be growing (based on `du`). It
>>> takes about 48 hours from reboot to fill up tmpfs on my device.
>>>
>>> # sort babeld.log | uniq -c |sort -rn |head
>>>
>>> 503236 Couldn't determine channel of interface sw00: Invalid argument.
>>>
>>> 1376 netlink_read: recvmsg(): No buffer space available
>>>
>>> 3 send: Cannot assign requested address
>>>
>>> # wc -l babeld.log
>>>
>>> 504617 babeld.log
>>>
>>> I sped up system failure by using `dd` to fill up tmpfs and the system
>>> became immediately unusable.
>>>
>>> This also explains the luci session store errors as sessions are stored in
>>> tmpfs.
>>>
>>> The other buffer issues may or may not be related to this.
>>>
>>> Best,
>>> Steve
>>
>>
>
>
>
> --
> Dave Täht
>
> Fixing bufferbloat with cerowrt: http://www.teklibre.com/cerowrt/subscribe.html
> _______________________________________________
> Cerowrt-devel mailing list
> Cerowrt-devel@lists.bufferbloat.net
> https://lists.bufferbloat.net/listinfo/cerowrt-devel
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [Cerowrt-devel] cerowrt issues (3.10.24-8)
2014-01-29 12:45 ` Sebastian Moeller
@ 2014-01-29 16:10 ` Dave Taht
2014-01-29 17:44 ` Sebastian Moeller
0 siblings, 1 reply; 16+ messages in thread
From: Dave Taht @ 2014-01-29 16:10 UTC (permalink / raw)
To: Sebastian Moeller; +Cc: cerowrt-devel
On Wed, Jan 29, 2014 at 4:45 AM, Sebastian Moeller <moeller0@gmx.de> wrote:
> Hi Dave,
>
> quick question, how does one turn of logging for babeld? It seems that if daemonized it defaults to logging to /var/log/babeld.log (or similar). Is setting the log file to /dev/null really the answer?
seems so.
> (Since I have no the IPv6 issue not yet resolved, I assume babeld is unhappy) I resorted to stopping babeld completely, but that feels like a crutch...
no daemon in an embedded system should ever write to flash in an
uncontrollable manner.
I will also argue that not being able to find the channel is a bug
that messes with diversity routing
in particular.
>
> Best
> Sebastian
>
>
>
> On Jan 27, 2014, at 22:14 , Dave Taht <dave.taht@gmail.com> wrote:
>
>> certainly turn off the babeld log! I will leave it off in the next release.
>>
>> On Mon, Jan 27, 2014 at 4:10 PM, Steve Jenson <stevej@fruitless.org> wrote:
>>> Looking more, the buffer errors are showing up in syslog well before tmpfs
>>> fills up. Is the memtester openwrt package available for cerowrt? I don't
>>> see it under `Available packages`.
>>>
>>> Thanks,
>>> Steve
>>>
>>>
>>> On Mon, Jan 27, 2014 at 1:06 PM, Steve Jenson <stevej@fruitless.org> wrote:
>>>>
>>>> On Fri, Jan 24, 2014 at 3:23 PM, Dave Taht <dave.taht@gmail.com> wrote:
>>>>>
>>>>> On Fri, Jan 24, 2014 at 6:08 PM, Steve Jenson <stevej@fruitless.org>
>>>>> wrote:
>>>>>> Hi everybody,
>>>>>>
>>>>>> I've been using cerowrt as a secondary wifi network (just a single AP
>>>>>> for
>>>>>> now) for a few weeks now. Recently, my wndr3800 got stuck in a bad
>>>>>> state and
>>>>>> eventually rebooted. I've had this happen a few times now and am
>>>>>> looking for
>>>>>> ways to debug the issue. I'm new to cerowrt and openwrt so any advice
>>>>>> is
>>>>>> appreciated.
>>>>>>
>>>>>> Since I use it as a secondary network, this is no way critical.
>>>>>
>>>>> Yea! I appreciate caution before putting alpha software on your gw.
>>>>>
>>>>>> I'm not
>>>>>> looking for free tech support but I couldn't find anything on the wiki
>>>>>> about
>>>>>> troubleshooting. I'd love to start a page and write some shell scripts
>>>>>> to
>>>>>> diagnose and report issues. I know that a cerowrt router is meant to be
>>>>>> a
>>>>>> research project rather a consumer device but these things seem helpful
>>>>>> regardless.
>>>>>
>>>>> Sure, let me know your wiki account. I have been lax about granting
>>>>> access of late as the signup process is overrun by spammers.
>>>>
>>>>
>>>> My username is stevej on the wiki. Thanks!
>>>>
>>>>
>>>>>
>>>>>> Please let me know if you'd prefer I not email the list with these
>>>>>> issues or
>>>>>> if you'd rather I used trac or a different forum.
>>>>>>
>>>>>
>>>>> The list is where most stuff happens. Also in the irc channel.
>>>>>
>>>>> If it gets to where it needs to be tracked we have a bugtracker at
>>>>>
>>>>> http://www.bufferbloat.net/projects/cerowrt/issues
>>>>>
>>>>> The first question I have is: Are you on comcast? Cerowrt
>>>>> had a dhcpv6-pd implementation that "just worked" from feburary through
>>>>> december. Regrettably they changed the RA announcement interval
>>>>> to a really low number around then... and this triggers a firewall reload
>>>>> every minute on everything prior to the release I point to below.
>>>>>
>>>>> If there is a memory leak somewhere that would have triggered it.
>>>>
>>>>
>>>> I am on AT&T ADSL2+ with a Motorola NVG510 modem.
>>>>
>>>>
>>>>>
>>>>>> In this state, I can connect to the cerowrt base station via wifi but
>>>>>> am
>>>>>> unable to route packets to the internet. I can connect to :81 and see
>>>>>> the
>>>>>> login page but logging in results in a lua error at `/cgi-bin/luci`
>>>>>>
>>>>>>
>>>>>> /usr/lib/lua/luci/dispatcher.lua:448: Failed to execute function
>>>>>> dispatcher target for entry '/'.
>>>>>> The called action terminated with an exception:
>>>>>> /usr/lib/lua/luci/sauth.lua:87: Session data invalid!
>>>>>> stack traceback:
>>>>>> [C]: in function 'assert'
>>>>>> /usr/lib/lua/luci/dispatcher.lua:448: in function 'dispatch'
>>>>>> /usr/lib/lua/luci/dispatcher.lua:195: in function
>>>>>> </usr/lib/lua/luci/dispatcher.lua:194>
>>>>>>
>>>>>> I can ssh into the device and cat various log files until the router
>>>>>> hangs
>>>>>> and reboots. here's a few relevant lines from my terminal history
>>>>>> before the
>>>>>> device rebooted (I'm assuming a watchdog kicked in and rebooted it).
>>>>>>
>>>>>> root@buffy2-1:~# ping google.com
>>>>>> ping: bad address 'google.com'
>>>>>> root@buffy2-1:~# free
>>>>>> total used free shared buffers
>>>>>> Mem: 126336 110332 16004 0 5616
>>>>>> -/+ buffers: 104716 21620
>>>>>> Swap: 0 0 0
>>>>>> root@buffy2-1:~# uptime
>>>>>> 02:08:54 up 2 days, 1:26, load average: 0.10, 0.21, 0.17
>>>>>> root@buffy2-1:~# dmesg
>>>>>> [ 0.000000] Linux version 3.10.24 (cero2@snapon) (gcc version 4.6.4
>>>>>> (OpenWrt/Linaro GCC 4.6-2013.05 r38226) ) #1 Tue Dec 24
>>>>>> 10:50:15 PST 2013
>>>>>> [skipping some lines]
>>>>>>
>>>>>> [ 13.156250] Error: Driver 'gpio-keys-polled' is already registered,
>>>>>> aborting...
>>>>>> [ 19.414062] IPv6: ADDRCONF(NETDEV_UP): ge00: link is not ready
>>>>>> [ 19.421875] ar71xx: pll_reg 0xb8050010: 0x11110000
>>>>>> [ 19.429687] se00: link up (1000Mbps/Full duplex)
>>>>>> [ 22.140625] IPv6: ADDRCONF(NETDEV_UP): sw00: link is not ready
>>>>>> [ 23.351562] IPv6: ADDRCONF(NETDEV_CHANGE): sw00: link becomes ready
>>>>>> [ 23.757812] ar71xx: pll_reg 0xb8050014: 0x11110000
>>>>>> [ 23.757812] ge00: link up (1000Mbps/Full duplex)
>>>>>> [ 23.773437] IPv6: ADDRCONF(NETDEV_CHANGE): ge00: link becomes ready
>>>>>>
>>>>>> root@buffy2-1:~# ifconfig
>>>>>> ge00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B1
>>>>>> inet addr:192.168.1.138 Bcast:192.168.1.255
>>>>>> Mask:255.255.255.0
>>>>>> inet6 addr: fe80::2eb0:5dff:fea0:c5b1/64 Scope:Link
>>>>>> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>>>>> Scope:Global
>>>>>> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>>>>> RX packets:1469670 errors:0 dropped:8 overruns:0 frame:0
>>>>>> TX packets:547733 errors:0 dropped:0 overruns:0 carrier:0
>>>>>> collisions:0 txqueuelen:1000
>>>>>> RX bytes:229243410 (218.6 MiB) TX bytes:57304808 (54.6 MiB)
>>>>>> Interrupt:5
>>>>>>
>>>>>> lo Link encap:Local Loopback
>>>>>> inet addr:127.0.0.1 Mask:255.0.0.0
>>>>>> inet6 addr: ::1/128 Scope:Host
>>>>>> UP LOOPBACK RUNNING MTU:65536 Metric:1
>>>>>> RX packets:23689 errors:0 dropped:0 overruns:0 frame:0
>>>>>> TX packets:23689 errors:0 dropped:0 overruns:0 carrier:0
>>>>>> collisions:0 txqueuelen:0
>>>>>> RX bytes:2612713 (2.4 MiB) TX bytes:2612713 (2.4 MiB)
>>>>>>
>>>>>> pimreg Link encap:UNSPEC HWaddr
>>>>>> 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00
>>>>>> UP RUNNING NOARP MTU:1472 Metric:1
>>>>>> RX packets:0 errors:0 dropped:0 overruns:0 frame:0
>>>>>> TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
>>>>>> collisions:0 txqueuelen:0
>>>>>> RX bytes:0 (0.0 B) TX bytes:0 (0.0 B)
>>>>>>
>>>>>> se00 Link encap:Ethernet HWaddr 2E:B0:5D:A0:C5:B0
>>>>>> inet addr:172.30.42.1 Bcast:172.30.42.31
>>>>>> Mask:255.255.255.224
>>>>>> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>>>>> Scope:Global
>>>>>
>>>>> How are you assigning your ipv6 addresses?
>>>>
>>>>
>>>> It's been a while since I messed with this but I think IPv6 is assigned
>>>> thanks to 6relayd? My modem has IPv6 enabled but no DHCPv6 options that I
>>>> can find. Here's how cerowrt is configured.
>>>>
>>>> root@buffy2-1:/overlay/etc/config# cat 6relayd
>>>> config server 'default'
>>>> option fallback_relay 'rd dhcpv6 ndp'
>>>> list network 'ge00'
>>>> list network 'ge01'
>>>> list network 'gw00'
>>>> list network 'gw01'
>>>> list network 'gw10'
>>>> list network 'gw11'
>>>> list network 'se00'
>>>> list network 'sw00'
>>>> list network 'sw10'
>>>> option rd 'relay'
>>>> option dhcpv6 'relay'
>>>> option ndp 'relay'
>>>> option master 'ge00'
>>>>
>>>>>
>>>>>> inet6 addr: fe80::2cb0:5dff:fea0:c5b0/64 Scope:Link
>>>>>> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>>>>> RX packets:0 errors:0 dropped:0 overruns:0 frame:0
>>>>>> TX packets:191740 errors:0 dropped:0 overruns:0 carrier:0
>>>>>> collisions:0 txqueuelen:1000
>>>>>> RX bytes:0 (0.0 B) TX bytes:42184988 (40.2 MiB)
>>>>>> Interrupt:4
>>>>>>
>>>>>> sw00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B0
>>>>>> inet addr:172.30.42.65 Bcast:172.30.42.95
>>>>>> Mask:255.255.255.224
>>>>>> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>>>>> Scope:Global
>>>>>> inet6 addr: fe80::2eb0:5dff:fea0:c5b0/64 Scope:Link
>>>>>> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>>>>> RX packets:70239 errors:0 dropped:0 overruns:0 frame:0
>>>>>> TX packets:286967 errors:0 dropped:0 overruns:0 carrier:0
>>>>>> collisions:0 txqueuelen:1000
>>>>>> RX bytes:15590189 (14.8 MiB) TX bytes:127357293 (121.4 MiB)
>>>>>>
>>>>>> root@buffy2-1:~# less /var/log/babeld.log
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> send: Cannot assign requested address
>>>>>> send: Cannot assign requested address
>>>>>> send: Cannot assign requested address
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> netlink_read: recvmsg(): No buffer space available
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>
>>>>> This is a problem in babel detecting the channel on a "normal"
>>>>> rather than a mesh interface. It's bugged me a long while, but
>>>>> haven't got around to finding what triggers it. Might "fix" it by
>>>>> acquiring the channel at babel start time from /etc/config/wireless.
>>>>>
>>>>> It messes up the diversity routing calculation, grump.
>>>>>
>>>>> There is a possibility a logfile got really big, but this one
>>>>> generally doesn't, but I should turn off logging in some
>>>>> future release...
>>>>
>>>>
>>>> I believe I've tracked down part of what's going on. It looks like my
>>>> tmpfs is filling up 100% and then the device enters a bad state:
>>>>
>>>> After 24 hours, with tmpfs at 50%, babeld.log is the largest file by far
>>>> in tmpfs and the only file that appears to be growing (based on `du`). It
>>>> takes about 48 hours from reboot to fill up tmpfs on my device.
>>>>
>>>> # sort babeld.log | uniq -c |sort -rn |head
>>>>
>>>> 503236 Couldn't determine channel of interface sw00: Invalid argument.
>>>>
>>>> 1376 netlink_read: recvmsg(): No buffer space available
>>>>
>>>> 3 send: Cannot assign requested address
>>>>
>>>> # wc -l babeld.log
>>>>
>>>> 504617 babeld.log
>>>>
>>>> I sped up system failure by using `dd` to fill up tmpfs and the system
>>>> became immediately unusable.
>>>>
>>>> This also explains the luci session store errors as sessions are stored in
>>>> tmpfs.
>>>>
>>>> The other buffer issues may or may not be related to this.
>>>>
>>>> Best,
>>>> Steve
>>>
>>>
>>
>>
>>
>> --
>> Dave Täht
>>
>> Fixing bufferbloat with cerowrt: http://www.teklibre.com/cerowrt/subscribe.html
>> _______________________________________________
>> Cerowrt-devel mailing list
>> Cerowrt-devel@lists.bufferbloat.net
>> https://lists.bufferbloat.net/listinfo/cerowrt-devel
>
--
Dave Täht
Fixing bufferbloat with cerowrt: http://www.teklibre.com/cerowrt/subscribe.html
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [Cerowrt-devel] cerowrt issues (3.10.24-8)
2014-01-29 16:10 ` Dave Taht
@ 2014-01-29 17:44 ` Sebastian Moeller
2014-01-29 18:21 ` Dave Taht
2014-01-29 18:24 ` Steve Jenson
0 siblings, 2 replies; 16+ messages in thread
From: Sebastian Moeller @ 2014-01-29 17:44 UTC (permalink / raw)
To: Dave Taht; +Cc: cerowrt-devel
On January 29, 2014 5:10:18 PM CET, Dave Taht <dave.taht@gmail.com> wrote:
>On Wed, Jan 29, 2014 at 4:45 AM, Sebastian Moeller <moeller0@gmx.de>
>wrote:
>> Hi Dave,
>>
>> quick question, how does one turn of logging for babeld? It seems
>that if daemonized it defaults to logging to /var/log/babeld.log (or
>similar). Is setting the log file to /dev/null really the answer?
>
>seems so.
Okay, I guess I will try that then...
>
>> (Since I have no the IPv6 issue not yet resolved, I assume
>babeld is unhappy) I resorted to stopping babeld completely, but that
>feels like a crutch...
>
>no daemon in an embedded system should ever write to flash in an
>uncontrollable manner.
What bugs me is that it basically keeps repeating the same error over and over again. If it would rate limit and:or push messages to the system log it would be nicer.
>
>I will also argue that not being able to find the channel is a bug
>that messes with diversity routing
>in particular.
I have actually not yet understood what it wants to tell me ;), since I got your attention, is there an easy way to run a babel client under macosx?
Best Regards
Sebastian
>
>>
>> Best
>> Sebastian
>>
>>
>>
>> On Jan 27, 2014, at 22:14 , Dave Taht <dave.taht@gmail.com> wrote:
>>
>>> certainly turn off the babeld log! I will leave it off in the next
>release.
>>>
>>> On Mon, Jan 27, 2014 at 4:10 PM, Steve Jenson <stevej@fruitless.org>
>wrote:
>>>> Looking more, the buffer errors are showing up in syslog well
>before tmpfs
>>>> fills up. Is the memtester openwrt package available for cerowrt? I
>don't
>>>> see it under `Available packages`.
>>>>
>>>> Thanks,
>>>> Steve
>>>>
>>>>
>>>> On Mon, Jan 27, 2014 at 1:06 PM, Steve Jenson
><stevej@fruitless.org> wrote:
>>>>>
>>>>> On Fri, Jan 24, 2014 at 3:23 PM, Dave Taht <dave.taht@gmail.com>
>wrote:
>>>>>>
>>>>>> On Fri, Jan 24, 2014 at 6:08 PM, Steve Jenson
><stevej@fruitless.org>
>>>>>> wrote:
>>>>>>> Hi everybody,
>>>>>>>
>>>>>>> I've been using cerowrt as a secondary wifi network (just a
>single AP
>>>>>>> for
>>>>>>> now) for a few weeks now. Recently, my wndr3800 got stuck in a
>bad
>>>>>>> state and
>>>>>>> eventually rebooted. I've had this happen a few times now and am
>>>>>>> looking for
>>>>>>> ways to debug the issue. I'm new to cerowrt and openwrt so any
>advice
>>>>>>> is
>>>>>>> appreciated.
>>>>>>>
>>>>>>> Since I use it as a secondary network, this is no way critical.
>>>>>>
>>>>>> Yea! I appreciate caution before putting alpha software on your
>gw.
>>>>>>
>>>>>>> I'm not
>>>>>>> looking for free tech support but I couldn't find anything on
>the wiki
>>>>>>> about
>>>>>>> troubleshooting. I'd love to start a page and write some shell
>scripts
>>>>>>> to
>>>>>>> diagnose and report issues. I know that a cerowrt router is
>meant to be
>>>>>>> a
>>>>>>> research project rather a consumer device but these things seem
>helpful
>>>>>>> regardless.
>>>>>>
>>>>>> Sure, let me know your wiki account. I have been lax about
>granting
>>>>>> access of late as the signup process is overrun by spammers.
>>>>>
>>>>>
>>>>> My username is stevej on the wiki. Thanks!
>>>>>
>>>>>
>>>>>>
>>>>>>> Please let me know if you'd prefer I not email the list with
>these
>>>>>>> issues or
>>>>>>> if you'd rather I used trac or a different forum.
>>>>>>>
>>>>>>
>>>>>> The list is where most stuff happens. Also in the irc channel.
>>>>>>
>>>>>> If it gets to where it needs to be tracked we have a bugtracker
>at
>>>>>>
>>>>>> http://www.bufferbloat.net/projects/cerowrt/issues
>>>>>>
>>>>>> The first question I have is: Are you on comcast? Cerowrt
>>>>>> had a dhcpv6-pd implementation that "just worked" from feburary
>through
>>>>>> december. Regrettably they changed the RA announcement interval
>>>>>> to a really low number around then... and this triggers a
>firewall reload
>>>>>> every minute on everything prior to the release I point to below.
>>>>>>
>>>>>> If there is a memory leak somewhere that would have triggered it.
>>>>>
>>>>>
>>>>> I am on AT&T ADSL2+ with a Motorola NVG510 modem.
>>>>>
>>>>>
>>>>>>
>>>>>>> In this state, I can connect to the cerowrt base station via
>wifi but
>>>>>>> am
>>>>>>> unable to route packets to the internet. I can connect to :81
>and see
>>>>>>> the
>>>>>>> login page but logging in results in a lua error at
>`/cgi-bin/luci`
>>>>>>>
>>>>>>>
>>>>>>> /usr/lib/lua/luci/dispatcher.lua:448: Failed to execute
>function
>>>>>>> dispatcher target for entry '/'.
>>>>>>> The called action terminated with an exception:
>>>>>>> /usr/lib/lua/luci/sauth.lua:87: Session data invalid!
>>>>>>> stack traceback:
>>>>>>> [C]: in function 'assert'
>>>>>>> /usr/lib/lua/luci/dispatcher.lua:448: in function 'dispatch'
>>>>>>> /usr/lib/lua/luci/dispatcher.lua:195: in function
>>>>>>> </usr/lib/lua/luci/dispatcher.lua:194>
>>>>>>>
>>>>>>> I can ssh into the device and cat various log files until the
>router
>>>>>>> hangs
>>>>>>> and reboots. here's a few relevant lines from my terminal
>history
>>>>>>> before the
>>>>>>> device rebooted (I'm assuming a watchdog kicked in and rebooted
>it).
>>>>>>>
>>>>>>> root@buffy2-1:~# ping google.com
>>>>>>> ping: bad address 'google.com'
>>>>>>> root@buffy2-1:~# free
>>>>>>> total used free shared
>buffers
>>>>>>> Mem: 126336 110332 16004 0
> 5616
>>>>>>> -/+ buffers: 104716 21620
>>>>>>> Swap: 0 0 0
>>>>>>> root@buffy2-1:~# uptime
>>>>>>> 02:08:54 up 2 days, 1:26, load average: 0.10, 0.21, 0.17
>>>>>>> root@buffy2-1:~# dmesg
>>>>>>> [ 0.000000] Linux version 3.10.24 (cero2@snapon) (gcc version
>4.6.4
>>>>>>> (OpenWrt/Linaro GCC 4.6-2013.05 r38226) ) #1 Tue Dec 24
>>>>>>> 10:50:15 PST 2013
>>>>>>> [skipping some lines]
>>>>>>>
>>>>>>> [ 13.156250] Error: Driver 'gpio-keys-polled' is already
>registered,
>>>>>>> aborting...
>>>>>>> [ 19.414062] IPv6: ADDRCONF(NETDEV_UP): ge00: link is not
>ready
>>>>>>> [ 19.421875] ar71xx: pll_reg 0xb8050010: 0x11110000
>>>>>>> [ 19.429687] se00: link up (1000Mbps/Full duplex)
>>>>>>> [ 22.140625] IPv6: ADDRCONF(NETDEV_UP): sw00: link is not
>ready
>>>>>>> [ 23.351562] IPv6: ADDRCONF(NETDEV_CHANGE): sw00: link becomes
>ready
>>>>>>> [ 23.757812] ar71xx: pll_reg 0xb8050014: 0x11110000
>>>>>>> [ 23.757812] ge00: link up (1000Mbps/Full duplex)
>>>>>>> [ 23.773437] IPv6: ADDRCONF(NETDEV_CHANGE): ge00: link becomes
>ready
>>>>>>>
>>>>>>> root@buffy2-1:~# ifconfig
>>>>>>> ge00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B1
>>>>>>> inet addr:192.168.1.138 Bcast:192.168.1.255
>>>>>>> Mask:255.255.255.0
>>>>>>> inet6 addr: fe80::2eb0:5dff:fea0:c5b1/64 Scope:Link
>>>>>>> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>>>>>> Scope:Global
>>>>>>> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>>>>>> RX packets:1469670 errors:0 dropped:8 overruns:0
>frame:0
>>>>>>> TX packets:547733 errors:0 dropped:0 overruns:0
>carrier:0
>>>>>>> collisions:0 txqueuelen:1000
>>>>>>> RX bytes:229243410 (218.6 MiB) TX bytes:57304808 (54.6
>MiB)
>>>>>>> Interrupt:5
>>>>>>>
>>>>>>> lo Link encap:Local Loopback
>>>>>>> inet addr:127.0.0.1 Mask:255.0.0.0
>>>>>>> inet6 addr: ::1/128 Scope:Host
>>>>>>> UP LOOPBACK RUNNING MTU:65536 Metric:1
>>>>>>> RX packets:23689 errors:0 dropped:0 overruns:0 frame:0
>>>>>>> TX packets:23689 errors:0 dropped:0 overruns:0
>carrier:0
>>>>>>> collisions:0 txqueuelen:0
>>>>>>> RX bytes:2612713 (2.4 MiB) TX bytes:2612713 (2.4 MiB)
>>>>>>>
>>>>>>> pimreg Link encap:UNSPEC HWaddr
>>>>>>> 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00
>>>>>>> UP RUNNING NOARP MTU:1472 Metric:1
>>>>>>> RX packets:0 errors:0 dropped:0 overruns:0 frame:0
>>>>>>> TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
>>>>>>> collisions:0 txqueuelen:0
>>>>>>> RX bytes:0 (0.0 B) TX bytes:0 (0.0 B)
>>>>>>>
>>>>>>> se00 Link encap:Ethernet HWaddr 2E:B0:5D:A0:C5:B0
>>>>>>> inet addr:172.30.42.1 Bcast:172.30.42.31
>>>>>>> Mask:255.255.255.224
>>>>>>> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>>>>>> Scope:Global
>>>>>>
>>>>>> How are you assigning your ipv6 addresses?
>>>>>
>>>>>
>>>>> It's been a while since I messed with this but I think IPv6 is
>assigned
>>>>> thanks to 6relayd? My modem has IPv6 enabled but no DHCPv6 options
>that I
>>>>> can find. Here's how cerowrt is configured.
>>>>>
>>>>> root@buffy2-1:/overlay/etc/config# cat 6relayd
>>>>> config server 'default'
>>>>> option fallback_relay 'rd dhcpv6 ndp'
>>>>> list network 'ge00'
>>>>> list network 'ge01'
>>>>> list network 'gw00'
>>>>> list network 'gw01'
>>>>> list network 'gw10'
>>>>> list network 'gw11'
>>>>> list network 'se00'
>>>>> list network 'sw00'
>>>>> list network 'sw10'
>>>>> option rd 'relay'
>>>>> option dhcpv6 'relay'
>>>>> option ndp 'relay'
>>>>> option master 'ge00'
>>>>>
>>>>>>
>>>>>>> inet6 addr: fe80::2cb0:5dff:fea0:c5b0/64 Scope:Link
>>>>>>> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>>>>>> RX packets:0 errors:0 dropped:0 overruns:0 frame:0
>>>>>>> TX packets:191740 errors:0 dropped:0 overruns:0
>carrier:0
>>>>>>> collisions:0 txqueuelen:1000
>>>>>>> RX bytes:0 (0.0 B) TX bytes:42184988 (40.2 MiB)
>>>>>>> Interrupt:4
>>>>>>>
>>>>>>> sw00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B0
>>>>>>> inet addr:172.30.42.65 Bcast:172.30.42.95
>>>>>>> Mask:255.255.255.224
>>>>>>> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>>>>>> Scope:Global
>>>>>>> inet6 addr: fe80::2eb0:5dff:fea0:c5b0/64 Scope:Link
>>>>>>> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>>>>>> RX packets:70239 errors:0 dropped:0 overruns:0 frame:0
>>>>>>> TX packets:286967 errors:0 dropped:0 overruns:0
>carrier:0
>>>>>>> collisions:0 txqueuelen:1000
>>>>>>> RX bytes:15590189 (14.8 MiB) TX bytes:127357293 (121.4
>MiB)
>>>>>>>
>>>>>>> root@buffy2-1:~# less /var/log/babeld.log
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> send: Cannot assign requested address
>>>>>>> send: Cannot assign requested address
>>>>>>> send: Cannot assign requested address
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> netlink_read: recvmsg(): No buffer space available
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>
>>>>>> This is a problem in babel detecting the channel on a "normal"
>>>>>> rather than a mesh interface. It's bugged me a long while, but
>>>>>> haven't got around to finding what triggers it. Might "fix" it by
>>>>>> acquiring the channel at babel start time from
>/etc/config/wireless.
>>>>>>
>>>>>> It messes up the diversity routing calculation, grump.
>>>>>>
>>>>>> There is a possibility a logfile got really big, but this one
>>>>>> generally doesn't, but I should turn off logging in some
>>>>>> future release...
>>>>>
>>>>>
>>>>> I believe I've tracked down part of what's going on. It looks like
>my
>>>>> tmpfs is filling up 100% and then the device enters a bad state:
>>>>>
>>>>> After 24 hours, with tmpfs at 50%, babeld.log is the largest file
>by far
>>>>> in tmpfs and the only file that appears to be growing (based on
>`du`). It
>>>>> takes about 48 hours from reboot to fill up tmpfs on my device.
>>>>>
>>>>> # sort babeld.log | uniq -c |sort -rn |head
>>>>>
>>>>> 503236 Couldn't determine channel of interface sw00: Invalid
>argument.
>>>>>
>>>>> 1376 netlink_read: recvmsg(): No buffer space available
>>>>>
>>>>> 3 send: Cannot assign requested address
>>>>>
>>>>> # wc -l babeld.log
>>>>>
>>>>> 504617 babeld.log
>>>>>
>>>>> I sped up system failure by using `dd` to fill up tmpfs and the
>system
>>>>> became immediately unusable.
>>>>>
>>>>> This also explains the luci session store errors as sessions are
>stored in
>>>>> tmpfs.
>>>>>
>>>>> The other buffer issues may or may not be related to this.
>>>>>
>>>>> Best,
>>>>> Steve
>>>>
>>>>
>>>
>>>
>>>
>>> --
>>> Dave Täht
>>>
>>> Fixing bufferbloat with cerowrt:
>http://www.teklibre.com/cerowrt/subscribe.html
>>> _______________________________________________
>>> Cerowrt-devel mailing list
>>> Cerowrt-devel@lists.bufferbloat.net
>>> https://lists.bufferbloat.net/listinfo/cerowrt-devel
>>
Hi Dave,
--
Sent from my Android phone with K-9 Mail. Please excuse my brevity.
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [Cerowrt-devel] cerowrt issues (3.10.24-8)
2014-01-29 17:44 ` Sebastian Moeller
@ 2014-01-29 18:21 ` Dave Taht
2014-01-29 19:51 ` Sebastian Moeller
2014-01-29 18:24 ` Steve Jenson
1 sibling, 1 reply; 16+ messages in thread
From: Dave Taht @ 2014-01-29 18:21 UTC (permalink / raw)
To: Sebastian Moeller; +Cc: cerowrt-devel
On Wed, Jan 29, 2014 at 9:44 AM, Sebastian Moeller <moeller0@gmx.de> wrote:
> On January 29, 2014 5:10:18 PM CET, Dave Taht <dave.taht@gmail.com> wrote:
>>On Wed, Jan 29, 2014 at 4:45 AM, Sebastian Moeller <moeller0@gmx.de>
>>wrote:
>>> Hi Dave,
>>>
>>> quick question, how does one turn of logging for babeld? It seems
>>that if daemonized it defaults to logging to /var/log/babeld.log (or
>>similar). Is setting the log file to /dev/null really the answer?
>>
>>seems so.
>
> Okay, I guess I will try that then...
>
>
>>
>>> (Since I have no the IPv6 issue not yet resolved, I assume
>>babeld is unhappy) I resorted to stopping babeld completely, but that
>>feels like a crutch...
>>
>>no daemon in an embedded system should ever write to flash in an
>>uncontrollable manner.
>
> What bugs me is that it basically keeps repeating the same error over and over again. If it would rate limit and:or push messages to the system log it would be nicer.
>
>
>>
>>I will also argue that not being able to find the channel is a bug
>>that messes with diversity routing
>>in particular.
>
> I have actually not yet understood what it wants to tell me ;), since I got your attention, is there an easy way to run a babel client under macosx?
for coping with the mac I use "macports" to get a compiler and support
for open source software.
I haven't ever tried to run babeld on the mac I have, I will put it on
my list...
>
>
> Best Regards
> Sebastian
>
>
>>
>>>
>>> Best
>>> Sebastian
>>>
>>>
>>>
>>> On Jan 27, 2014, at 22:14 , Dave Taht <dave.taht@gmail.com> wrote:
>>>
>>>> certainly turn off the babeld log! I will leave it off in the next
>>release.
>>>>
>>>> On Mon, Jan 27, 2014 at 4:10 PM, Steve Jenson <stevej@fruitless.org>
>>wrote:
>>>>> Looking more, the buffer errors are showing up in syslog well
>>before tmpfs
>>>>> fills up. Is the memtester openwrt package available for cerowrt? I
>>don't
>>>>> see it under `Available packages`.
>>>>>
>>>>> Thanks,
>>>>> Steve
>>>>>
>>>>>
>>>>> On Mon, Jan 27, 2014 at 1:06 PM, Steve Jenson
>><stevej@fruitless.org> wrote:
>>>>>>
>>>>>> On Fri, Jan 24, 2014 at 3:23 PM, Dave Taht <dave.taht@gmail.com>
>>wrote:
>>>>>>>
>>>>>>> On Fri, Jan 24, 2014 at 6:08 PM, Steve Jenson
>><stevej@fruitless.org>
>>>>>>> wrote:
>>>>>>>> Hi everybody,
>>>>>>>>
>>>>>>>> I've been using cerowrt as a secondary wifi network (just a
>>single AP
>>>>>>>> for
>>>>>>>> now) for a few weeks now. Recently, my wndr3800 got stuck in a
>>bad
>>>>>>>> state and
>>>>>>>> eventually rebooted. I've had this happen a few times now and am
>>>>>>>> looking for
>>>>>>>> ways to debug the issue. I'm new to cerowrt and openwrt so any
>>advice
>>>>>>>> is
>>>>>>>> appreciated.
>>>>>>>>
>>>>>>>> Since I use it as a secondary network, this is no way critical.
>>>>>>>
>>>>>>> Yea! I appreciate caution before putting alpha software on your
>>gw.
>>>>>>>
>>>>>>>> I'm not
>>>>>>>> looking for free tech support but I couldn't find anything on
>>the wiki
>>>>>>>> about
>>>>>>>> troubleshooting. I'd love to start a page and write some shell
>>scripts
>>>>>>>> to
>>>>>>>> diagnose and report issues. I know that a cerowrt router is
>>meant to be
>>>>>>>> a
>>>>>>>> research project rather a consumer device but these things seem
>>helpful
>>>>>>>> regardless.
>>>>>>>
>>>>>>> Sure, let me know your wiki account. I have been lax about
>>granting
>>>>>>> access of late as the signup process is overrun by spammers.
>>>>>>
>>>>>>
>>>>>> My username is stevej on the wiki. Thanks!
>>>>>>
>>>>>>
>>>>>>>
>>>>>>>> Please let me know if you'd prefer I not email the list with
>>these
>>>>>>>> issues or
>>>>>>>> if you'd rather I used trac or a different forum.
>>>>>>>>
>>>>>>>
>>>>>>> The list is where most stuff happens. Also in the irc channel.
>>>>>>>
>>>>>>> If it gets to where it needs to be tracked we have a bugtracker
>>at
>>>>>>>
>>>>>>> http://www.bufferbloat.net/projects/cerowrt/issues
>>>>>>>
>>>>>>> The first question I have is: Are you on comcast? Cerowrt
>>>>>>> had a dhcpv6-pd implementation that "just worked" from feburary
>>through
>>>>>>> december. Regrettably they changed the RA announcement interval
>>>>>>> to a really low number around then... and this triggers a
>>firewall reload
>>>>>>> every minute on everything prior to the release I point to below.
>>>>>>>
>>>>>>> If there is a memory leak somewhere that would have triggered it.
>>>>>>
>>>>>>
>>>>>> I am on AT&T ADSL2+ with a Motorola NVG510 modem.
>>>>>>
>>>>>>
>>>>>>>
>>>>>>>> In this state, I can connect to the cerowrt base station via
>>wifi but
>>>>>>>> am
>>>>>>>> unable to route packets to the internet. I can connect to :81
>>and see
>>>>>>>> the
>>>>>>>> login page but logging in results in a lua error at
>>`/cgi-bin/luci`
>>>>>>>>
>>>>>>>>
>>>>>>>> /usr/lib/lua/luci/dispatcher.lua:448: Failed to execute
>>function
>>>>>>>> dispatcher target for entry '/'.
>>>>>>>> The called action terminated with an exception:
>>>>>>>> /usr/lib/lua/luci/sauth.lua:87: Session data invalid!
>>>>>>>> stack traceback:
>>>>>>>> [C]: in function 'assert'
>>>>>>>> /usr/lib/lua/luci/dispatcher.lua:448: in function 'dispatch'
>>>>>>>> /usr/lib/lua/luci/dispatcher.lua:195: in function
>>>>>>>> </usr/lib/lua/luci/dispatcher.lua:194>
>>>>>>>>
>>>>>>>> I can ssh into the device and cat various log files until the
>>router
>>>>>>>> hangs
>>>>>>>> and reboots. here's a few relevant lines from my terminal
>>history
>>>>>>>> before the
>>>>>>>> device rebooted (I'm assuming a watchdog kicked in and rebooted
>>it).
>>>>>>>>
>>>>>>>> root@buffy2-1:~# ping google.com
>>>>>>>> ping: bad address 'google.com'
>>>>>>>> root@buffy2-1:~# free
>>>>>>>> total used free shared
>>buffers
>>>>>>>> Mem: 126336 110332 16004 0
>> 5616
>>>>>>>> -/+ buffers: 104716 21620
>>>>>>>> Swap: 0 0 0
>>>>>>>> root@buffy2-1:~# uptime
>>>>>>>> 02:08:54 up 2 days, 1:26, load average: 0.10, 0.21, 0.17
>>>>>>>> root@buffy2-1:~# dmesg
>>>>>>>> [ 0.000000] Linux version 3.10.24 (cero2@snapon) (gcc version
>>4.6.4
>>>>>>>> (OpenWrt/Linaro GCC 4.6-2013.05 r38226) ) #1 Tue Dec 24
>>>>>>>> 10:50:15 PST 2013
>>>>>>>> [skipping some lines]
>>>>>>>>
>>>>>>>> [ 13.156250] Error: Driver 'gpio-keys-polled' is already
>>registered,
>>>>>>>> aborting...
>>>>>>>> [ 19.414062] IPv6: ADDRCONF(NETDEV_UP): ge00: link is not
>>ready
>>>>>>>> [ 19.421875] ar71xx: pll_reg 0xb8050010: 0x11110000
>>>>>>>> [ 19.429687] se00: link up (1000Mbps/Full duplex)
>>>>>>>> [ 22.140625] IPv6: ADDRCONF(NETDEV_UP): sw00: link is not
>>ready
>>>>>>>> [ 23.351562] IPv6: ADDRCONF(NETDEV_CHANGE): sw00: link becomes
>>ready
>>>>>>>> [ 23.757812] ar71xx: pll_reg 0xb8050014: 0x11110000
>>>>>>>> [ 23.757812] ge00: link up (1000Mbps/Full duplex)
>>>>>>>> [ 23.773437] IPv6: ADDRCONF(NETDEV_CHANGE): ge00: link becomes
>>ready
>>>>>>>>
>>>>>>>> root@buffy2-1:~# ifconfig
>>>>>>>> ge00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B1
>>>>>>>> inet addr:192.168.1.138 Bcast:192.168.1.255
>>>>>>>> Mask:255.255.255.0
>>>>>>>> inet6 addr: fe80::2eb0:5dff:fea0:c5b1/64 Scope:Link
>>>>>>>> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>>>>>>> Scope:Global
>>>>>>>> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>>>>>>> RX packets:1469670 errors:0 dropped:8 overruns:0
>>frame:0
>>>>>>>> TX packets:547733 errors:0 dropped:0 overruns:0
>>carrier:0
>>>>>>>> collisions:0 txqueuelen:1000
>>>>>>>> RX bytes:229243410 (218.6 MiB) TX bytes:57304808 (54.6
>>MiB)
>>>>>>>> Interrupt:5
>>>>>>>>
>>>>>>>> lo Link encap:Local Loopback
>>>>>>>> inet addr:127.0.0.1 Mask:255.0.0.0
>>>>>>>> inet6 addr: ::1/128 Scope:Host
>>>>>>>> UP LOOPBACK RUNNING MTU:65536 Metric:1
>>>>>>>> RX packets:23689 errors:0 dropped:0 overruns:0 frame:0
>>>>>>>> TX packets:23689 errors:0 dropped:0 overruns:0
>>carrier:0
>>>>>>>> collisions:0 txqueuelen:0
>>>>>>>> RX bytes:2612713 (2.4 MiB) TX bytes:2612713 (2.4 MiB)
>>>>>>>>
>>>>>>>> pimreg Link encap:UNSPEC HWaddr
>>>>>>>> 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00
>>>>>>>> UP RUNNING NOARP MTU:1472 Metric:1
>>>>>>>> RX packets:0 errors:0 dropped:0 overruns:0 frame:0
>>>>>>>> TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
>>>>>>>> collisions:0 txqueuelen:0
>>>>>>>> RX bytes:0 (0.0 B) TX bytes:0 (0.0 B)
>>>>>>>>
>>>>>>>> se00 Link encap:Ethernet HWaddr 2E:B0:5D:A0:C5:B0
>>>>>>>> inet addr:172.30.42.1 Bcast:172.30.42.31
>>>>>>>> Mask:255.255.255.224
>>>>>>>> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>>>>>>> Scope:Global
>>>>>>>
>>>>>>> How are you assigning your ipv6 addresses?
>>>>>>
>>>>>>
>>>>>> It's been a while since I messed with this but I think IPv6 is
>>assigned
>>>>>> thanks to 6relayd? My modem has IPv6 enabled but no DHCPv6 options
>>that I
>>>>>> can find. Here's how cerowrt is configured.
>>>>>>
>>>>>> root@buffy2-1:/overlay/etc/config# cat 6relayd
>>>>>> config server 'default'
>>>>>> option fallback_relay 'rd dhcpv6 ndp'
>>>>>> list network 'ge00'
>>>>>> list network 'ge01'
>>>>>> list network 'gw00'
>>>>>> list network 'gw01'
>>>>>> list network 'gw10'
>>>>>> list network 'gw11'
>>>>>> list network 'se00'
>>>>>> list network 'sw00'
>>>>>> list network 'sw10'
>>>>>> option rd 'relay'
>>>>>> option dhcpv6 'relay'
>>>>>> option ndp 'relay'
>>>>>> option master 'ge00'
>>>>>>
>>>>>>>
>>>>>>>> inet6 addr: fe80::2cb0:5dff:fea0:c5b0/64 Scope:Link
>>>>>>>> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>>>>>>> RX packets:0 errors:0 dropped:0 overruns:0 frame:0
>>>>>>>> TX packets:191740 errors:0 dropped:0 overruns:0
>>carrier:0
>>>>>>>> collisions:0 txqueuelen:1000
>>>>>>>> RX bytes:0 (0.0 B) TX bytes:42184988 (40.2 MiB)
>>>>>>>> Interrupt:4
>>>>>>>>
>>>>>>>> sw00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B0
>>>>>>>> inet addr:172.30.42.65 Bcast:172.30.42.95
>>>>>>>> Mask:255.255.255.224
>>>>>>>> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>>>>>>> Scope:Global
>>>>>>>> inet6 addr: fe80::2eb0:5dff:fea0:c5b0/64 Scope:Link
>>>>>>>> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>>>>>>> RX packets:70239 errors:0 dropped:0 overruns:0 frame:0
>>>>>>>> TX packets:286967 errors:0 dropped:0 overruns:0
>>carrier:0
>>>>>>>> collisions:0 txqueuelen:1000
>>>>>>>> RX bytes:15590189 (14.8 MiB) TX bytes:127357293 (121.4
>>MiB)
>>>>>>>>
>>>>>>>> root@buffy2-1:~# less /var/log/babeld.log
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> send: Cannot assign requested address
>>>>>>>> send: Cannot assign requested address
>>>>>>>> send: Cannot assign requested address
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> netlink_read: recvmsg(): No buffer space available
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>
>>>>>>> This is a problem in babel detecting the channel on a "normal"
>>>>>>> rather than a mesh interface. It's bugged me a long while, but
>>>>>>> haven't got around to finding what triggers it. Might "fix" it by
>>>>>>> acquiring the channel at babel start time from
>>/etc/config/wireless.
>>>>>>>
>>>>>>> It messes up the diversity routing calculation, grump.
>>>>>>>
>>>>>>> There is a possibility a logfile got really big, but this one
>>>>>>> generally doesn't, but I should turn off logging in some
>>>>>>> future release...
>>>>>>
>>>>>>
>>>>>> I believe I've tracked down part of what's going on. It looks like
>>my
>>>>>> tmpfs is filling up 100% and then the device enters a bad state:
>>>>>>
>>>>>> After 24 hours, with tmpfs at 50%, babeld.log is the largest file
>>by far
>>>>>> in tmpfs and the only file that appears to be growing (based on
>>`du`). It
>>>>>> takes about 48 hours from reboot to fill up tmpfs on my device.
>>>>>>
>>>>>> # sort babeld.log | uniq -c |sort -rn |head
>>>>>>
>>>>>> 503236 Couldn't determine channel of interface sw00: Invalid
>>argument.
>>>>>>
>>>>>> 1376 netlink_read: recvmsg(): No buffer space available
>>>>>>
>>>>>> 3 send: Cannot assign requested address
>>>>>>
>>>>>> # wc -l babeld.log
>>>>>>
>>>>>> 504617 babeld.log
>>>>>>
>>>>>> I sped up system failure by using `dd` to fill up tmpfs and the
>>system
>>>>>> became immediately unusable.
>>>>>>
>>>>>> This also explains the luci session store errors as sessions are
>>stored in
>>>>>> tmpfs.
>>>>>>
>>>>>> The other buffer issues may or may not be related to this.
>>>>>>
>>>>>> Best,
>>>>>> Steve
>>>>>
>>>>>
>>>>
>>>>
>>>>
>>>> --
>>>> Dave Täht
>>>>
>>>> Fixing bufferbloat with cerowrt:
>>http://www.teklibre.com/cerowrt/subscribe.html
>>>> _______________________________________________
>>>> Cerowrt-devel mailing list
>>>> Cerowrt-devel@lists.bufferbloat.net
>>>> https://lists.bufferbloat.net/listinfo/cerowrt-devel
>>>
>
> Hi Dave,
> --
> Sent from my Android phone with K-9 Mail. Please excuse my brevity.
--
Dave Täht
Fixing bufferbloat with cerowrt: http://www.teklibre.com/cerowrt/subscribe.html
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [Cerowrt-devel] cerowrt issues (3.10.24-8)
2014-01-29 17:44 ` Sebastian Moeller
2014-01-29 18:21 ` Dave Taht
@ 2014-01-29 18:24 ` Steve Jenson
2014-01-29 19:55 ` Sebastian Moeller
1 sibling, 1 reply; 16+ messages in thread
From: Steve Jenson @ 2014-01-29 18:24 UTC (permalink / raw)
To: Sebastian Moeller; +Cc: cerowrt-devel
[-- Attachment #1: Type: text/plain, Size: 688 bytes --]
On Wed, Jan 29, 2014 at 9:44 AM, Sebastian Moeller <moeller0@gmx.de> wrote:
> On January 29, 2014 5:10:18 PM CET, Dave Taht <dave.taht@gmail.com> wrote:
> >On Wed, Jan 29, 2014 at 4:45 AM, Sebastian Moeller <moeller0@gmx.de>
> >wrote:
> >> Hi Dave,
> >>
> >> quick question, how does one turn of logging for babeld? It seems
> >that if daemonized it defaults to logging to /var/log/babeld.log (or
> >similar). Is setting the log file to /dev/null really the answer?
> >
> >seems so.
>
> Okay, I guess I will try that then...
>
Here's the directive I'm using in /etc/babeld.conf
log-file /dev/null
and then you can restart either via the web gui or `/etc/rc.d/S70babeld
restart`
[-- Attachment #2: Type: text/html, Size: 1325 bytes --]
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [Cerowrt-devel] cerowrt issues (3.10.24-8)
2014-01-29 18:21 ` Dave Taht
@ 2014-01-29 19:51 ` Sebastian Moeller
2014-01-29 19:56 ` David Personette
0 siblings, 1 reply; 16+ messages in thread
From: Sebastian Moeller @ 2014-01-29 19:51 UTC (permalink / raw)
To: Dave Taht; +Cc: cerowrt-devel
Hi Dave,
On Jan 29, 2014, at 19:21 , Dave Taht <dave.taht@gmail.com> wrote:
> On Wed, Jan 29, 2014 at 9:44 AM, Sebastian Moeller <moeller0@gmx.de> wrote:
>> On January 29, 2014 5:10:18 PM CET, Dave Taht <dave.taht@gmail.com> wrote:
>>> On Wed, Jan 29, 2014 at 4:45 AM, Sebastian Moeller <moeller0@gmx.de>
>>> wrote:
>>>> Hi Dave,
>>>>
>>>> quick question, how does one turn of logging for babeld? It seems
>>> that if daemonized it defaults to logging to /var/log/babeld.log (or
>>> similar). Is setting the log file to /dev/null really the answer?
>>>
>>> seems so.
>>
>> Okay, I guess I will try that then...
>>
>>
>>>
>>>> (Since I have no the IPv6 issue not yet resolved, I assume
>>> babeld is unhappy) I resorted to stopping babeld completely, but that
>>> feels like a crutch...
>>>
>>> no daemon in an embedded system should ever write to flash in an
>>> uncontrollable manner.
>>
>> What bugs me is that it basically keeps repeating the same error over and over again. If it would rate limit and:or push messages to the system log it would be nicer.
>>
>>
>>>
>>> I will also argue that not being able to find the channel is a bug
>>> that messes with diversity routing
>>> in particular.
>>
>> I have actually not yet understood what it wants to tell me ;), since I got your attention, is there an easy way to run a babel client under macosx?
>
> for coping with the mac I use "macports" to get a compiler and support
> for open source software.
Ah, same here (even though I would have thought you a homebrew user, no idea why) ;)
Alas, "port search babel" does not find anything babeld related...
>
> I haven't ever tried to run babeld on the mac I have, I will put it on
> my list…
I just thought it would be nice to finally get into the "stay connected while switching between wired and wire-less fun", but this is in no way essential for me.
Best Regards
Sebastian
>
>>
>>
>> Best Regards
>> Sebastian
>>
>>
>>>
>>>>
>>>> Best
>>>> Sebastian
>>>>
>>>>
>>>>
>>>> On Jan 27, 2014, at 22:14 , Dave Taht <dave.taht@gmail.com> wrote:
>>>>
>>>>> certainly turn off the babeld log! I will leave it off in the next
>>> release.
>>>>>
>>>>> On Mon, Jan 27, 2014 at 4:10 PM, Steve Jenson <stevej@fruitless.org>
>>> wrote:
>>>>>> Looking more, the buffer errors are showing up in syslog well
>>> before tmpfs
>>>>>> fills up. Is the memtester openwrt package available for cerowrt? I
>>> don't
>>>>>> see it under `Available packages`.
>>>>>>
>>>>>> Thanks,
>>>>>> Steve
>>>>>>
>>>>>>
>>>>>> On Mon, Jan 27, 2014 at 1:06 PM, Steve Jenson
>>> <stevej@fruitless.org> wrote:
>>>>>>>
>>>>>>> On Fri, Jan 24, 2014 at 3:23 PM, Dave Taht <dave.taht@gmail.com>
>>> wrote:
>>>>>>>>
>>>>>>>> On Fri, Jan 24, 2014 at 6:08 PM, Steve Jenson
>>> <stevej@fruitless.org>
>>>>>>>> wrote:
>>>>>>>>> Hi everybody,
>>>>>>>>>
>>>>>>>>> I've been using cerowrt as a secondary wifi network (just a
>>> single AP
>>>>>>>>> for
>>>>>>>>> now) for a few weeks now. Recently, my wndr3800 got stuck in a
>>> bad
>>>>>>>>> state and
>>>>>>>>> eventually rebooted. I've had this happen a few times now and am
>>>>>>>>> looking for
>>>>>>>>> ways to debug the issue. I'm new to cerowrt and openwrt so any
>>> advice
>>>>>>>>> is
>>>>>>>>> appreciated.
>>>>>>>>>
>>>>>>>>> Since I use it as a secondary network, this is no way critical.
>>>>>>>>
>>>>>>>> Yea! I appreciate caution before putting alpha software on your
>>> gw.
>>>>>>>>
>>>>>>>>> I'm not
>>>>>>>>> looking for free tech support but I couldn't find anything on
>>> the wiki
>>>>>>>>> about
>>>>>>>>> troubleshooting. I'd love to start a page and write some shell
>>> scripts
>>>>>>>>> to
>>>>>>>>> diagnose and report issues. I know that a cerowrt router is
>>> meant to be
>>>>>>>>> a
>>>>>>>>> research project rather a consumer device but these things seem
>>> helpful
>>>>>>>>> regardless.
>>>>>>>>
>>>>>>>> Sure, let me know your wiki account. I have been lax about
>>> granting
>>>>>>>> access of late as the signup process is overrun by spammers.
>>>>>>>
>>>>>>>
>>>>>>> My username is stevej on the wiki. Thanks!
>>>>>>>
>>>>>>>
>>>>>>>>
>>>>>>>>> Please let me know if you'd prefer I not email the list with
>>> these
>>>>>>>>> issues or
>>>>>>>>> if you'd rather I used trac or a different forum.
>>>>>>>>>
>>>>>>>>
>>>>>>>> The list is where most stuff happens. Also in the irc channel.
>>>>>>>>
>>>>>>>> If it gets to where it needs to be tracked we have a bugtracker
>>> at
>>>>>>>>
>>>>>>>> http://www.bufferbloat.net/projects/cerowrt/issues
>>>>>>>>
>>>>>>>> The first question I have is: Are you on comcast? Cerowrt
>>>>>>>> had a dhcpv6-pd implementation that "just worked" from feburary
>>> through
>>>>>>>> december. Regrettably they changed the RA announcement interval
>>>>>>>> to a really low number around then... and this triggers a
>>> firewall reload
>>>>>>>> every minute on everything prior to the release I point to below.
>>>>>>>>
>>>>>>>> If there is a memory leak somewhere that would have triggered it.
>>>>>>>
>>>>>>>
>>>>>>> I am on AT&T ADSL2+ with a Motorola NVG510 modem.
>>>>>>>
>>>>>>>
>>>>>>>>
>>>>>>>>> In this state, I can connect to the cerowrt base station via
>>> wifi but
>>>>>>>>> am
>>>>>>>>> unable to route packets to the internet. I can connect to :81
>>> and see
>>>>>>>>> the
>>>>>>>>> login page but logging in results in a lua error at
>>> `/cgi-bin/luci`
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> /usr/lib/lua/luci/dispatcher.lua:448: Failed to execute
>>> function
>>>>>>>>> dispatcher target for entry '/'.
>>>>>>>>> The called action terminated with an exception:
>>>>>>>>> /usr/lib/lua/luci/sauth.lua:87: Session data invalid!
>>>>>>>>> stack traceback:
>>>>>>>>> [C]: in function 'assert'
>>>>>>>>> /usr/lib/lua/luci/dispatcher.lua:448: in function 'dispatch'
>>>>>>>>> /usr/lib/lua/luci/dispatcher.lua:195: in function
>>>>>>>>> </usr/lib/lua/luci/dispatcher.lua:194>
>>>>>>>>>
>>>>>>>>> I can ssh into the device and cat various log files until the
>>> router
>>>>>>>>> hangs
>>>>>>>>> and reboots. here's a few relevant lines from my terminal
>>> history
>>>>>>>>> before the
>>>>>>>>> device rebooted (I'm assuming a watchdog kicked in and rebooted
>>> it).
>>>>>>>>>
>>>>>>>>> root@buffy2-1:~# ping google.com
>>>>>>>>> ping: bad address 'google.com'
>>>>>>>>> root@buffy2-1:~# free
>>>>>>>>> total used free shared
>>> buffers
>>>>>>>>> Mem: 126336 110332 16004 0
>>> 5616
>>>>>>>>> -/+ buffers: 104716 21620
>>>>>>>>> Swap: 0 0 0
>>>>>>>>> root@buffy2-1:~# uptime
>>>>>>>>> 02:08:54 up 2 days, 1:26, load average: 0.10, 0.21, 0.17
>>>>>>>>> root@buffy2-1:~# dmesg
>>>>>>>>> [ 0.000000] Linux version 3.10.24 (cero2@snapon) (gcc version
>>> 4.6.4
>>>>>>>>> (OpenWrt/Linaro GCC 4.6-2013.05 r38226) ) #1 Tue Dec 24
>>>>>>>>> 10:50:15 PST 2013
>>>>>>>>> [skipping some lines]
>>>>>>>>>
>>>>>>>>> [ 13.156250] Error: Driver 'gpio-keys-polled' is already
>>> registered,
>>>>>>>>> aborting...
>>>>>>>>> [ 19.414062] IPv6: ADDRCONF(NETDEV_UP): ge00: link is not
>>> ready
>>>>>>>>> [ 19.421875] ar71xx: pll_reg 0xb8050010: 0x11110000
>>>>>>>>> [ 19.429687] se00: link up (1000Mbps/Full duplex)
>>>>>>>>> [ 22.140625] IPv6: ADDRCONF(NETDEV_UP): sw00: link is not
>>> ready
>>>>>>>>> [ 23.351562] IPv6: ADDRCONF(NETDEV_CHANGE): sw00: link becomes
>>> ready
>>>>>>>>> [ 23.757812] ar71xx: pll_reg 0xb8050014: 0x11110000
>>>>>>>>> [ 23.757812] ge00: link up (1000Mbps/Full duplex)
>>>>>>>>> [ 23.773437] IPv6: ADDRCONF(NETDEV_CHANGE): ge00: link becomes
>>> ready
>>>>>>>>>
>>>>>>>>> root@buffy2-1:~# ifconfig
>>>>>>>>> ge00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B1
>>>>>>>>> inet addr:192.168.1.138 Bcast:192.168.1.255
>>>>>>>>> Mask:255.255.255.0
>>>>>>>>> inet6 addr: fe80::2eb0:5dff:fea0:c5b1/64 Scope:Link
>>>>>>>>> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>>>>>>>> Scope:Global
>>>>>>>>> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>>>>>>>> RX packets:1469670 errors:0 dropped:8 overruns:0
>>> frame:0
>>>>>>>>> TX packets:547733 errors:0 dropped:0 overruns:0
>>> carrier:0
>>>>>>>>> collisions:0 txqueuelen:1000
>>>>>>>>> RX bytes:229243410 (218.6 MiB) TX bytes:57304808 (54.6
>>> MiB)
>>>>>>>>> Interrupt:5
>>>>>>>>>
>>>>>>>>> lo Link encap:Local Loopback
>>>>>>>>> inet addr:127.0.0.1 Mask:255.0.0.0
>>>>>>>>> inet6 addr: ::1/128 Scope:Host
>>>>>>>>> UP LOOPBACK RUNNING MTU:65536 Metric:1
>>>>>>>>> RX packets:23689 errors:0 dropped:0 overruns:0 frame:0
>>>>>>>>> TX packets:23689 errors:0 dropped:0 overruns:0
>>> carrier:0
>>>>>>>>> collisions:0 txqueuelen:0
>>>>>>>>> RX bytes:2612713 (2.4 MiB) TX bytes:2612713 (2.4 MiB)
>>>>>>>>>
>>>>>>>>> pimreg Link encap:UNSPEC HWaddr
>>>>>>>>> 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00
>>>>>>>>> UP RUNNING NOARP MTU:1472 Metric:1
>>>>>>>>> RX packets:0 errors:0 dropped:0 overruns:0 frame:0
>>>>>>>>> TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
>>>>>>>>> collisions:0 txqueuelen:0
>>>>>>>>> RX bytes:0 (0.0 B) TX bytes:0 (0.0 B)
>>>>>>>>>
>>>>>>>>> se00 Link encap:Ethernet HWaddr 2E:B0:5D:A0:C5:B0
>>>>>>>>> inet addr:172.30.42.1 Bcast:172.30.42.31
>>>>>>>>> Mask:255.255.255.224
>>>>>>>>> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>>>>>>>> Scope:Global
>>>>>>>>
>>>>>>>> How are you assigning your ipv6 addresses?
>>>>>>>
>>>>>>>
>>>>>>> It's been a while since I messed with this but I think IPv6 is
>>> assigned
>>>>>>> thanks to 6relayd? My modem has IPv6 enabled but no DHCPv6 options
>>> that I
>>>>>>> can find. Here's how cerowrt is configured.
>>>>>>>
>>>>>>> root@buffy2-1:/overlay/etc/config# cat 6relayd
>>>>>>> config server 'default'
>>>>>>> option fallback_relay 'rd dhcpv6 ndp'
>>>>>>> list network 'ge00'
>>>>>>> list network 'ge01'
>>>>>>> list network 'gw00'
>>>>>>> list network 'gw01'
>>>>>>> list network 'gw10'
>>>>>>> list network 'gw11'
>>>>>>> list network 'se00'
>>>>>>> list network 'sw00'
>>>>>>> list network 'sw10'
>>>>>>> option rd 'relay'
>>>>>>> option dhcpv6 'relay'
>>>>>>> option ndp 'relay'
>>>>>>> option master 'ge00'
>>>>>>>
>>>>>>>>
>>>>>>>>> inet6 addr: fe80::2cb0:5dff:fea0:c5b0/64 Scope:Link
>>>>>>>>> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>>>>>>>> RX packets:0 errors:0 dropped:0 overruns:0 frame:0
>>>>>>>>> TX packets:191740 errors:0 dropped:0 overruns:0
>>> carrier:0
>>>>>>>>> collisions:0 txqueuelen:1000
>>>>>>>>> RX bytes:0 (0.0 B) TX bytes:42184988 (40.2 MiB)
>>>>>>>>> Interrupt:4
>>>>>>>>>
>>>>>>>>> sw00 Link encap:Ethernet HWaddr 2C:B0:5D:A0:C5:B0
>>>>>>>>> inet addr:172.30.42.65 Bcast:172.30.42.95
>>>>>>>>> Mask:255.255.255.224
>>>>>>>>> inet6 addr: 2602:30a:2cdb:330:2eb0:5dff:fea0:c5b1/64
>>>>>>>>> Scope:Global
>>>>>>>>> inet6 addr: fe80::2eb0:5dff:fea0:c5b0/64 Scope:Link
>>>>>>>>> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
>>>>>>>>> RX packets:70239 errors:0 dropped:0 overruns:0 frame:0
>>>>>>>>> TX packets:286967 errors:0 dropped:0 overruns:0
>>> carrier:0
>>>>>>>>> collisions:0 txqueuelen:1000
>>>>>>>>> RX bytes:15590189 (14.8 MiB) TX bytes:127357293 (121.4
>>> MiB)
>>>>>>>>>
>>>>>>>>> root@buffy2-1:~# less /var/log/babeld.log
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> send: Cannot assign requested address
>>>>>>>>> send: Cannot assign requested address
>>>>>>>>> send: Cannot assign requested address
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> netlink_read: recvmsg(): No buffer space available
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>> Couldn't determine channel of interface sw00: Invalid argument.
>>>>>>>>
>>>>>>>> This is a problem in babel detecting the channel on a "normal"
>>>>>>>> rather than a mesh interface. It's bugged me a long while, but
>>>>>>>> haven't got around to finding what triggers it. Might "fix" it by
>>>>>>>> acquiring the channel at babel start time from
>>> /etc/config/wireless.
>>>>>>>>
>>>>>>>> It messes up the diversity routing calculation, grump.
>>>>>>>>
>>>>>>>> There is a possibility a logfile got really big, but this one
>>>>>>>> generally doesn't, but I should turn off logging in some
>>>>>>>> future release...
>>>>>>>
>>>>>>>
>>>>>>> I believe I've tracked down part of what's going on. It looks like
>>> my
>>>>>>> tmpfs is filling up 100% and then the device enters a bad state:
>>>>>>>
>>>>>>> After 24 hours, with tmpfs at 50%, babeld.log is the largest file
>>> by far
>>>>>>> in tmpfs and the only file that appears to be growing (based on
>>> `du`). It
>>>>>>> takes about 48 hours from reboot to fill up tmpfs on my device.
>>>>>>>
>>>>>>> # sort babeld.log | uniq -c |sort -rn |head
>>>>>>>
>>>>>>> 503236 Couldn't determine channel of interface sw00: Invalid
>>> argument.
>>>>>>>
>>>>>>> 1376 netlink_read: recvmsg(): No buffer space available
>>>>>>>
>>>>>>> 3 send: Cannot assign requested address
>>>>>>>
>>>>>>> # wc -l babeld.log
>>>>>>>
>>>>>>> 504617 babeld.log
>>>>>>>
>>>>>>> I sped up system failure by using `dd` to fill up tmpfs and the
>>> system
>>>>>>> became immediately unusable.
>>>>>>>
>>>>>>> This also explains the luci session store errors as sessions are
>>> stored in
>>>>>>> tmpfs.
>>>>>>>
>>>>>>> The other buffer issues may or may not be related to this.
>>>>>>>
>>>>>>> Best,
>>>>>>> Steve
>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> Dave Täht
>>>>>
>>>>> Fixing bufferbloat with cerowrt:
>>> http://www.teklibre.com/cerowrt/subscribe.html
>>>>> _______________________________________________
>>>>> Cerowrt-devel mailing list
>>>>> Cerowrt-devel@lists.bufferbloat.net
>>>>> https://lists.bufferbloat.net/listinfo/cerowrt-devel
>>>>
>>
>> Hi Dave,
>> --
>> Sent from my Android phone with K-9 Mail. Please excuse my brevity.
>
>
>
> --
> Dave Täht
>
> Fixing bufferbloat with cerowrt: http://www.teklibre.com/cerowrt/subscribe.html
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [Cerowrt-devel] cerowrt issues (3.10.24-8)
2014-01-29 18:24 ` Steve Jenson
@ 2014-01-29 19:55 ` Sebastian Moeller
2014-01-30 16:21 ` Dave Taht
0 siblings, 1 reply; 16+ messages in thread
From: Sebastian Moeller @ 2014-01-29 19:55 UTC (permalink / raw)
To: Steve Jenson; +Cc: cerowrt-devel
Hi Steve,
On Jan 29, 2014, at 19:24 , Steve Jenson <stevej@fruitless.org> wrote:
> On Wed, Jan 29, 2014 at 9:44 AM, Sebastian Moeller <moeller0@gmx.de> wrote:
> On January 29, 2014 5:10:18 PM CET, Dave Taht <dave.taht@gmail.com> wrote:
> >On Wed, Jan 29, 2014 at 4:45 AM, Sebastian Moeller <moeller0@gmx.de>
> >wrote:
> >> Hi Dave,
> >>
> >> quick question, how does one turn of logging for babeld? It seems
> >that if daemonized it defaults to logging to /var/log/babeld.log (or
> >similar). Is setting the log file to /dev/null really the answer?
> >
> >seems so.
>
> Okay, I guess I will try that then...
>
> Here's the directive I'm using in /etc/babeld.conf
>
> log-file /dev/null
>
> and then you can restart either via the web gui or `/etc/rc.d/S70babeld restart`
Ah, thanks. Since I am on 3.10.28-1 this was /etc/init.d/babeld restart. And I opted for putting:
option 'log-file' '/dev/null'
into /etc/config/babeld, since that seemed the more openwork way of doing things; I wonder whether it really is wise to carry both files…
Babeld runs again, and no /var/log/babeld.log appeared, but whether it works I do not know (and I doubt it given that babeld.log was growing due to nasty repeating error messages...)
Best Regards
Sebastian
>
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [Cerowrt-devel] cerowrt issues (3.10.24-8)
2014-01-29 19:51 ` Sebastian Moeller
@ 2014-01-29 19:56 ` David Personette
2014-01-29 20:04 ` Sebastian Moeller
0 siblings, 1 reply; 16+ messages in thread
From: David Personette @ 2014-01-29 19:56 UTC (permalink / raw)
To: Sebastian Moeller; +Cc: cerowrt-devel
[-- Attachment #1: Type: text/plain, Size: 951 bytes --]
On Wed, Jan 29, 2014 at 2:51 PM, Sebastian Moeller <moeller0@gmx.de> wrote:
> >> I have actually not yet understood what it wants to tell me ;),
> since I got your attention, is there an easy way to run a babel client
> under macosx?
> >
> > for coping with the mac I use "macports" to get a compiler and support
> > for open source software.
>
> Ah, same here (even though I would have thought you a homebrew
> user, no idea why) ;)
> Alas, "port search babel" does not find anything babeld related...
>
> >
> > I haven't ever tried to run babeld on the mac I have, I will put it on
> > my list…
>
> I just thought it would be nice to finally get into the "stay
> connected while switching between wired and wire-less fun", but this is in
> no way essential for me.
FYI, it does look like homebrew has babel.
dperson@argos2$ brew search babel
babeld gpsbabel open-babel
--
David P.
[-- Attachment #2: Type: text/html, Size: 1477 bytes --]
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [Cerowrt-devel] cerowrt issues (3.10.24-8)
2014-01-29 19:56 ` David Personette
@ 2014-01-29 20:04 ` Sebastian Moeller
0 siblings, 0 replies; 16+ messages in thread
From: Sebastian Moeller @ 2014-01-29 20:04 UTC (permalink / raw)
To: David Personette; +Cc: cerowrt-devel
Hi David,
On Jan 29, 2014, at 20:56 , David Personette <dperson@gmail.com> wrote:
>
> On Wed, Jan 29, 2014 at 2:51 PM, Sebastian Moeller <moeller0@gmx.de> wrote:
> >> I have actually not yet understood what it wants to tell me ;), since I got your attention, is there an easy way to run a babel client under macosx?
> >
> > for coping with the mac I use "macports" to get a compiler and support
> > for open source software.
>
> Ah, same here (even though I would have thought you a homebrew user, no idea why) ;)
> Alas, "port search babel" does not find anything babeld related...
>
> >
> > I haven't ever tried to run babeld on the mac I have, I will put it on
> > my list…
>
> I just thought it would be nice to finally get into the "stay connected while switching between wired and wire-less fun", but this is in no way essential for me.
>
>
> FYI, it does look like homebrew has babel.
>
> dperson@argos2$ brew search babel
> babeld gpsbabel open-babel
Thanks. The first one seems to be the real McCoy (the others are also in macports, for what it is worth). I am very reluctant to switch to homebrew though since I got macports working well and am weary of the learning urge involved in getting the same proficiency in home-brew….
A different question, did you manage figure out your del link's overhead? If not I would be happy to help (especially if octave/mtlab availability is an issue), just send me the ping log file (best via a file drop).
Best Regards
Sebastian
>
> --
> David P.
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [Cerowrt-devel] cerowrt issues (3.10.24-8)
2014-01-29 19:55 ` Sebastian Moeller
@ 2014-01-30 16:21 ` Dave Taht
2014-01-30 18:46 ` Sebastian Moeller
0 siblings, 1 reply; 16+ messages in thread
From: Dave Taht @ 2014-01-30 16:21 UTC (permalink / raw)
To: Sebastian Moeller; +Cc: cerowrt-devel
On Wed, Jan 29, 2014 at 11:55 AM, Sebastian Moeller <moeller0@gmx.de> wrote:
> Hi Steve,
>
>
> On Jan 29, 2014, at 19:24 , Steve Jenson <stevej@fruitless.org> wrote:
>
>> On Wed, Jan 29, 2014 at 9:44 AM, Sebastian Moeller <moeller0@gmx.de> wrote:
>> On January 29, 2014 5:10:18 PM CET, Dave Taht <dave.taht@gmail.com> wrote:
>> >On Wed, Jan 29, 2014 at 4:45 AM, Sebastian Moeller <moeller0@gmx.de>
>> >wrote:
>> >> Hi Dave,
>> >>
>> >> quick question, how does one turn of logging for babeld? It seems
>> >that if daemonized it defaults to logging to /var/log/babeld.log (or
>> >similar). Is setting the log file to /dev/null really the answer?
>> >
>> >seems so.
>>
>> Okay, I guess I will try that then...
>>
>> Here's the directive I'm using in /etc/babeld.conf
>>
>> log-file /dev/null
>>
>> and then you can restart either via the web gui or `/etc/rc.d/S70babeld restart`
>
> Ah, thanks. Since I am on 3.10.28-1 this was /etc/init.d/babeld restart. And I opted for putting:
> option 'log-file' '/dev/null'
> into /etc/config/babeld, since that seemed the more openwork way of doing things; I wonder whether it really is wise to carry both files...
I stuck it in /etc/config/babeld.
> Babeld runs again, and no /var/log/babeld.log appeared, but whether it works I do not know (and I doubt it given that babeld.log was growing due to nasty repeating error messages...)
It's working. It is just not making an optimal routing decision
between AP-managed networks and meshy ones.
The feature is called diversity routing, and it is key to making
wireless networks scale better. There are (now), quite a few papers on
it, but I like Juliusz's best...
http://www.pps.univ-paris-diderot.fr/~jch/software/babel/wbmv4.pdf
Notably this feature is also in batman, but it's called something else
that I forget.
In an example with two radios on a cerowrt AP:
If you have a packet come in from channel 36, it's best that it goes
out via ethernet if possible, channel 11 if not, and not channel 36.
Even if the number of hops seems less, don't go back out 36, if at all
possible, use a different route.
So right now babel is incorrectly distinquishing between the AP
managed SSIDs (sw00, sw10, gw10, gw00),
so the routing decisions there are sub-optimal. As in most cases you
are going to go out ethernet or one of
the more meshy interfaces, or you have no choice but to send stuff
along on one SSID... it's not very sub-optimal.
still, annoying. rule 22 in embedded design is "never write infinitely
long files as the probability of running out of memory or flash always
hits 100%"
>
> Best Regards
> Sebastian
>
>>
>
--
Dave Täht
Fixing bufferbloat with cerowrt: http://www.teklibre.com/cerowrt/subscribe.html
^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [Cerowrt-devel] cerowrt issues (3.10.24-8)
2014-01-30 16:21 ` Dave Taht
@ 2014-01-30 18:46 ` Sebastian Moeller
0 siblings, 0 replies; 16+ messages in thread
From: Sebastian Moeller @ 2014-01-30 18:46 UTC (permalink / raw)
To: Dave Taht; +Cc: cerowrt-devel
Hi Dave, hi list,
On Jan 30, 2014, at 17:21 , Dave Taht <dave.taht@gmail.com> wrote:
> On Wed, Jan 29, 2014 at 11:55 AM, Sebastian Moeller <moeller0@gmx.de> wrote:
>> Hi Steve,
>>
>>
>> On Jan 29, 2014, at 19:24 , Steve Jenson <stevej@fruitless.org> wrote:
>>
>>> On Wed, Jan 29, 2014 at 9:44 AM, Sebastian Moeller <moeller0@gmx.de> wrote:
>>> On January 29, 2014 5:10:18 PM CET, Dave Taht <dave.taht@gmail.com> wrote:
>>>> On Wed, Jan 29, 2014 at 4:45 AM, Sebastian Moeller <moeller0@gmx.de>
>>>> wrote:
>>>>> Hi Dave,
>>>>>
>>>>> quick question, how does one turn of logging for babeld? It seems
>>>> that if daemonized it defaults to logging to /var/log/babeld.log (or
>>>> similar). Is setting the log file to /dev/null really the answer?
>>>>
>>>> seems so.
>>>
>>> Okay, I guess I will try that then...
>>>
>>> Here's the directive I'm using in /etc/babeld.conf
>>>
>>> log-file /dev/null
>>>
>>> and then you can restart either via the web gui or `/etc/rc.d/S70babeld restart`
>>
>> Ah, thanks. Since I am on 3.10.28-1 this was /etc/init.d/babeld restart. And I opted for putting:
>> option 'log-file' '/dev/null'
>> into /etc/config/babeld, since that seemed the more openwork way of doing things; I wonder whether it really is wise to carry both files...
>
>
> I stuck it in /etc/config/babeld.
>
>> Babeld runs again, and no /var/log/babeld.log appeared, but whether it works I do not know (and I doubt it given that babeld.log was growing due to nasty repeating error messages...)
>
> It's working. It is just not making an optimal routing decision
> between AP-managed networks and meshy ones.
Ah, if babel can work around this issue than there is no good reason to spam the log with repeats (at least not at the frequency it currently does, once per day might be more reasonable...)
>
> The feature is called diversity routing, and it is key to making
> wireless networks scale better. There are (now), quite a few papers on
> it, but I like Juliusz's best...
>
> http://www.pps.univ-paris-diderot.fr/~jch/software/babel/wbmv4.pdf
Interesting, thanks for the pointer.
>
> Notably this feature is also in batman, but it's called something else
> that I forget.
>
> In an example with two radios on a cerowrt AP:
>
> If you have a packet come in from channel 36, it's best that it goes
> out via ethernet if possible, channel 11 if not, and not channel 36.
> Even if the number of hops seems less, don't go back out 36, if at all
> possible, use a different route.
>
> So right now babel is incorrectly distinquishing between the AP
> managed SSIDs (sw00, sw10, gw10, gw00),
> so the routing decisions there are sub-optimal.
Is it really hard to get the radio and frequency for each interface from linux?
> As in most cases you
> are going to go out ethernet or one of
> the more meshy interfaces, or you have no choice but to send stuff
> along on one SSID... it's not very sub-optimal.
Okay, that does not justify the log spam ;)
>
> still, annoying. rule 22 in embedded design is "never write infinitely
> long files as the probability of running out of memory or flash always
> hits 100%"
I agree (and then I would be happier if openwrt would have a preemptive mechanism to avoid this situation, be it log rotation and deletion or similar methods.)
Best Regards & mangy thanks for the explanation.
Sebastian
>
>
>
>>
>> Best Regards
>> Sebastian
>>
>>>
>>
>
>
>
> --
> Dave Täht
>
> Fixing bufferbloat with cerowrt: http://www.teklibre.com/cerowrt/subscribe.html
^ permalink raw reply [flat|nested] 16+ messages in thread
end of thread, other threads:[~2014-01-30 18:46 UTC | newest]
Thread overview: 16+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2014-01-24 23:08 [Cerowrt-devel] cerowrt issues (3.10.24-8) Steve Jenson
2014-01-24 23:23 ` Dave Taht
2014-01-27 21:06 ` Steve Jenson
2014-01-27 21:10 ` Steve Jenson
2014-01-27 21:14 ` Dave Taht
2014-01-29 12:45 ` Sebastian Moeller
2014-01-29 16:10 ` Dave Taht
2014-01-29 17:44 ` Sebastian Moeller
2014-01-29 18:21 ` Dave Taht
2014-01-29 19:51 ` Sebastian Moeller
2014-01-29 19:56 ` David Personette
2014-01-29 20:04 ` Sebastian Moeller
2014-01-29 18:24 ` Steve Jenson
2014-01-29 19:55 ` Sebastian Moeller
2014-01-30 16:21 ` Dave Taht
2014-01-30 18:46 ` Sebastian Moeller
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox