The WiFi testing matrix is huge. My only help is to publish
iperf 2 and pyflows as open source. There are ways to build test rigs using programmable attenuators and variable phase shifters, splitters, combiners etc. Then use statistical process controls (SPC) to automate monitoring for anomalies and regressions. It's a lot of sw work and each test rig costs a lot of money. I find this level of testing is typically beyond to scope of "free and voluntary labor."
I'd agree that there are likely many poorly behaving devices.
Bob