* [Bloat] known buffer sizes on switches
@ 2018-11-28 16:32 Bruno George Moraes
2018-11-28 16:55 ` Dave Taht
0 siblings, 1 reply; 9+ messages in thread
From: Bruno George Moraes @ 2018-11-28 16:32 UTC (permalink / raw)
To: Bufferbloat lists bloat
[-- Attachment #1: Type: text/plain, Size: 836 bytes --]
>
> Nice resource, thanks.
>
> If someone wonders why things look the way they do, so it's all about
> on-die and off-die memory. Either you use off-die or on-die memory, often
> SRAM which requires 6 gates per bit. So spending half a billion gates
> gives you ~10MB buffer on-die. If you're doing off-die memory (DRAM or
> similar) then you'll get the gigabytes of memory seen in some equipment.
> There basically is nothing in between. As soon as you go off-die you might
> as well put at least 2-6 GB in there.
>
>
There are some reasearch on new memory devices with unexpected results...
https://ieeexplore.ieee.org/document/8533260
The HMC memory allows improvements in execution time and consumed energy.
> In some situations, this memory type permits removing the L2 cache from the
> memory hierarchy.
>
HMC parts start at 2GB
[-- Attachment #2: Type: text/html, Size: 1347 bytes --]
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [Bloat] known buffer sizes on switches
2018-11-28 16:32 [Bloat] known buffer sizes on switches Bruno George Moraes
@ 2018-11-28 16:55 ` Dave Taht
2018-11-28 18:25 ` Dave Collier-Brown
` (2 more replies)
0 siblings, 3 replies; 9+ messages in thread
From: Dave Taht @ 2018-11-28 16:55 UTC (permalink / raw)
To: Bruno George Moraes; +Cc: Bufferbloat lists bloat
Bruno George Moraes <brunogm0@gmail.com> writes:
> Nice resource, thanks.
>
> If someone wonders why things look the way they do, so it's all about
> on-die and off-die memory. Either you use off-die or on-die memory, often
> SRAM which requires 6 gates per bit. So spending half a billion gates
> gives you ~10MB buffer on-die. If you're doing off-die memory (DRAM or
> similar) then you'll get the gigabytes of memory seen in some equipment.
> There basically is nothing in between. As soon as you go off-die you might
> as well put at least 2-6 GB in there.
>
> There are some reasearch on new memory devices with unexpected
> results...
> https://ieeexplore.ieee.org/document/8533260
>
> The HMC memory allows improvements in execution time and consumed
> energy. In some situations, this memory type permits removing the
> L2 cache from the memory hierarchy.
>
> HMC parts start at 2GB
Thank you for that. I do have a long standing dream of a single chip
wifi router, with the lowest SNR possible, and the minimum number of
pins coming off of it. I'd settle for 32MB of (static?) ram on chip as
that has proven sufficient to date to drive 802.11n....
which would let you get rid of both the L2 and L1 cache. That said, I
think the cost of 32MB of on-chip static ram remains a bit high, and
plugging it into a mips cpu, kind of silly. Someday there will be a case
to just doing everything on a single chip, but...
>
>
> _______________________________________________
> Bloat mailing list
> Bloat@lists.bufferbloat.net
> https://lists.bufferbloat.net/listinfo/bloat
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [Bloat] known buffer sizes on switches
2018-11-28 16:55 ` Dave Taht
@ 2018-11-28 18:25 ` Dave Collier-Brown
2018-11-28 18:26 ` David Collier-Brown
2018-11-29 2:33 ` Dave Taht
2 siblings, 0 replies; 9+ messages in thread
From: Dave Collier-Brown @ 2018-11-28 18:25 UTC (permalink / raw)
To: bloat
On 2018-11-28 11:55 a.m., Dave Taht wrote:
> Thank you for that. I do have a long standing dream of a single chip
> wifi router, with the lowest SNR possible, and the minimum number of
> pins coming off of it. I'd settle for 32MB of (static?) ram on chip as
> that has proven sufficient to date to drive 802.11n....
>
> which would let you get rid of both the L2 and L1 cache. That said, I
> think the cost of 32MB of on-chip static ram remains a bit high, and
> plugging it into a mips cpu, kind of silly. Someday there will be a case
> to just doing everything on a single chip, but...
I could see 32MB or more of fast memory on-chip as being attractive when
one is fighting with diminishing returns in CPU speed and program
parallelizability.
In the past that might have excited MIPS, but these days less so. Maybe
ARM? IBM?
--dave
--
David Collier-Brown, | Always do right. This will gratify
System Programmer and Author | some people and astonish the rest
dave.collier-brown@indexexchange.com | -- Mark Twain
CONFIDENTIALITY NOTICE AND DISCLAIMER : This telecommunication, including any and all attachments, contains confidential information intended only for the person(s) to whom it is addressed. Any dissemination, distribution, copying or disclosure is strictly prohibited and is not a waiver of confidentiality. If you have received this telecommunication in error, please notify the sender immediately by return electronic mail and delete the message from your inbox and deleted items folders. This telecommunication does not constitute an express or implied agreement to conduct transactions by electronic means, nor does it constitute a contract offer, a contract amendment or an acceptance of a contract offer. Contract terms contained in this telecommunication are subject to legal review and the completion of formal documentation and are not binding until same is confirmed in writing and has been signed by an authorized signatory.
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [Bloat] known buffer sizes on switches
2018-11-28 16:55 ` Dave Taht
2018-11-28 18:25 ` Dave Collier-Brown
@ 2018-11-28 18:26 ` David Collier-Brown
2018-11-28 19:02 ` Dave Taht
2018-11-29 2:33 ` Dave Taht
2 siblings, 1 reply; 9+ messages in thread
From: David Collier-Brown @ 2018-11-28 18:26 UTC (permalink / raw)
To: bloat
On 2018-11-28 11:55 a.m., Dave Taht wrote:
> Thank you for that. I do have a long standing dream of a single chip
> wifi router, with the lowest SNR possible, and the minimum number of
> pins coming off of it. I'd settle for 32MB of (static?) ram on chip as
> that has proven sufficient to date to drive 802.11n....
>
> which would let you get rid of both the L2 and L1 cache. That said, I
> think the cost of 32MB of on-chip static ram remains a bit high, and
> plugging it into a mips cpu, kind of silly. Someday there will be a case
> to just doing everything on a single chip, but...
I could see 32MB or more of fast memory on-chip as being attractive when
one is fighting with diminishing returns in CPU speed and program
parallelizability.
In the past that might have excited MIPS, but these days less so. Maybe
ARM? IBM?
--dave
--
David Collier-Brown, | Always do right. This will gratify
System Programmer and Author | some people and astonish the rest
davecb@spamcop.net | -- Mark Twain
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [Bloat] known buffer sizes on switches
2018-11-28 18:26 ` David Collier-Brown
@ 2018-11-28 19:02 ` Dave Taht
2018-11-28 20:34 ` David Collier-Brown
0 siblings, 1 reply; 9+ messages in thread
From: Dave Taht @ 2018-11-28 19:02 UTC (permalink / raw)
To: David Collier-Brown; +Cc: bloat
I really don't know a whole heck of a lot about where mips is going.
Certainly they remain strong in the embedded market (I do like the
edgerouter X a lot), but as for their current direction or future
product lines, not a clue.
I used to know someone over there, maybe he's restored new directions.
Last I recall he was busy obsoleting a whole lot of instruction space
in order to make room for "new stuff". He'd even asked me if adding an
invsqrt to the instruction set would help, and I sadly replied that
that bit of codel was totally invisible on a trace.....
I really like(d) mips. ton of registers, better instruction set than
arm (IMHO), no foolish processor extensions.
On Wed, Nov 28, 2018 at 10:26 AM David Collier-Brown <davec-b@rogers.com> wrote:
>
> On 2018-11-28 11:55 a.m., Dave Taht wrote:
>
> > Thank you for that. I do have a long standing dream of a single chip
> > wifi router, with the lowest SNR possible, and the minimum number of
> > pins coming off of it. I'd settle for 32MB of (static?) ram on chip as
> > that has proven sufficient to date to drive 802.11n....
> >
> > which would let you get rid of both the L2 and L1 cache. That said, I
> > think the cost of 32MB of on-chip static ram remains a bit high, and
> > plugging it into a mips cpu, kind of silly. Someday there will be a case
> > to just doing everything on a single chip, but...
>
> I could see 32MB or more of fast memory on-chip as being attractive when
> one is fighting with diminishing returns in CPU speed and program
> parallelizability.
>
> In the past that might have excited MIPS, but these days less so. Maybe
> ARM? IBM?
>
> --dave
>
> --
> David Collier-Brown, | Always do right. This will gratify
> System Programmer and Author | some people and astonish the rest
> davecb@spamcop.net | -- Mark Twain
>
> _______________________________________________
> Bloat mailing list
> Bloat@lists.bufferbloat.net
> https://lists.bufferbloat.net/listinfo/bloat
--
Dave Täht
CTO, TekLibre, LLC
http://www.teklibre.com
Tel: 1-831-205-9740
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [Bloat] known buffer sizes on switches
2018-11-28 19:02 ` Dave Taht
@ 2018-11-28 20:34 ` David Collier-Brown
0 siblings, 0 replies; 9+ messages in thread
From: David Collier-Brown @ 2018-11-28 20:34 UTC (permalink / raw)
To: Dave Taht, David Collier-Brown; +Cc: bloat
[-- Attachment #1: Type: text/plain, Size: 2363 bytes --]
That would be really cool: I loved the Mips we had at YorkU.ca
--dave
On 2018-11-28 2:02 p.m., Dave Taht wrote:
> I really don't know a whole heck of a lot about where mips is going.
> Certainly they remain strong in the embedded market (I do like the
> edgerouter X a lot), but as for their current direction or future
> product lines, not a clue.
>
> I used to know someone over there, maybe he's restored new directions.
> Last I recall he was busy obsoleting a whole lot of instruction space
> in order to make room for "new stuff". He'd even asked me if adding an
> invsqrt to the instruction set would help, and I sadly replied that
> that bit of codel was totally invisible on a trace.....
>
> I really like(d) mips. ton of registers, better instruction set than
> arm (IMHO), no foolish processor extensions.
>
> On Wed, Nov 28, 2018 at 10:26 AM David Collier-Brown <davec-b@rogers.com> wrote:
>> On 2018-11-28 11:55 a.m., Dave Taht wrote:
>>
>>> Thank you for that. I do have a long standing dream of a single chip
>>> wifi router, with the lowest SNR possible, and the minimum number of
>>> pins coming off of it. I'd settle for 32MB of (static?) ram on chip as
>>> that has proven sufficient to date to drive 802.11n....
>>>
>>> which would let you get rid of both the L2 and L1 cache. That said, I
>>> think the cost of 32MB of on-chip static ram remains a bit high, and
>>> plugging it into a mips cpu, kind of silly. Someday there will be a case
>>> to just doing everything on a single chip, but...
>> I could see 32MB or more of fast memory on-chip as being attractive when
>> one is fighting with diminishing returns in CPU speed and program
>> parallelizability.
>>
>> In the past that might have excited MIPS, but these days less so. Maybe
>> ARM? IBM?
>>
>> --dave
>>
>> --
>> David Collier-Brown, | Always do right. This will gratify
>> System Programmer and Author | some people and astonish the rest
>> davecb@spamcop.net | -- Mark Twain
>>
>> _______________________________________________
>> Bloat mailing list
>> Bloat@lists.bufferbloat.net
>> https://lists.bufferbloat.net/listinfo/bloat
>
>
--
David Collier-Brown, | Always do right. This will gratify
System Programmer and Author | some people and astonish the rest
davecb@spamcop.net | -- Mark Twain
[-- Attachment #2: Type: text/html, Size: 3461 bytes --]
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [Bloat] known buffer sizes on switches
2018-11-28 16:55 ` Dave Taht
2018-11-28 18:25 ` Dave Collier-Brown
2018-11-28 18:26 ` David Collier-Brown
@ 2018-11-29 2:33 ` Dave Taht
2 siblings, 0 replies; 9+ messages in thread
From: Dave Taht @ 2018-11-29 2:33 UTC (permalink / raw)
To: Dave Täht; +Cc: Bruno George Moraes, bloat
On Wed, Nov 28, 2018 at 8:55 AM Dave Taht <dave@taht.net> wrote:
>
> Bruno George Moraes <brunogm0@gmail.com> writes:
>
> > Nice resource, thanks.
> >
> > If someone wonders why things look the way they do, so it's all about
> > on-die and off-die memory. Either you use off-die or on-die memory, often
> > SRAM which requires 6 gates per bit. So spending half a billion gates
> > gives you ~10MB buffer on-die. If you're doing off-die memory (DRAM or
> > similar) then you'll get the gigabytes of memory seen in some equipment.
> > There basically is nothing in between. As soon as you go off-die you might
> > as well put at least 2-6 GB in there.
> >
> > There are some reasearch on new memory devices with unexpected
> > results...
> > https://ieeexplore.ieee.org/document/8533260
> >
> > The HMC memory allows improvements in execution time and consumed
> > energy. In some situations, this memory type permits removing the
> > L2 cache from the memory hierarchy.
> >
> > HMC parts start at 2GB
That effort actually looks pretty promising. I liked the support for
atomic ops too, offloaded.There are also so many useful operations
that I'd like to see offloaded to ram - like zeroing memory regions as
one example.
http://www.hybridmemorycube.org/
Will probably run hot. But: grump: I still don't "get" why the
traditional division between memory and cpu makers hasn't collapsed
yet. A package like that
with a cpu *in it*, and we're done. 4GB "ought to be enough for everybody".
27? years ago, back when I was attempting to write a SF novel, I had
an idea for a more efficient way to pack cores and memory together.
Basically: shrink the cray 1 design down to about the size of a nickel
(or dime!).
The cray had that rough shape for optimum routing and cooling, but...
the overall shape of the package becomes a hexagon
(https://en.wikipedia.org/wiki/Hexagon) cylinder. That gives you 6 or
12 vertical flat surfaces to mount chips on (or just let them stand in
slots on the package). There's one natural crossbar bus at the center,
connecting the 6 "core" chips more rapidly than the edges. Top, bottom
and sides of the package can be used for I/O, power and so on, and
each hexagonal component wedged tightly together (instead of today's
north-south east-west architectures you get 2 more dimensions
horizontally)
fill the package with some sort of coolant. Seal it up tight. Test the
module as a whole and ship 'em in palletloads. I'm pretty sure the
heat circulates from the center out naturally, in every orientation,
but what the heck, stick in some MEMs fans in there to keep things
pumping along.
that design naturally led to 2 cpu chips and 4 memories. Or 4 cpu
chips and 2 memories. or 2 cpus 2 mems and 2 IOs. Before you started
coming up with things to do with the outer 6 sides.
I I never thought separating ram from cpu by more than a millimeter
was a good idea.....
It's a quite a jump to envision going from the cray-1 (115kw!!!) down
to the size of a nickel!
But everybody has a cray-1 now. They just run too hot. And are often
not suited to task, just like the cray was.
https://en.wikipedia.org/wiki/Cray-1
Don't know if anyone's ever tried to pattern any circuits on a cylinder though!
We are certainly seeing a lot of multi-package modules now (like in
epyc) but I'd like 'em to be taller and not need so many darn pins. A
full blown wifi
router on chip wouldn't need more than... oh... this many pins:
https://www.amazon.com/Makerfocus-ESP8266-Wireless-Transceiver-Compatible/dp/B01EA3UJJ4/ref=asc_df_B01EA3UJJ4/?tag=hyprod-20&linkCode=df0&hvadid=309773039951&hvpos=1o1&hvnetw=g&hvrand=15072864816819105911&hvpone=&hvptwo=&hvqmt=&hvdev=c&hvdvcmdl=&hvlocint=&hvlocphy=9032156&hvtargid=pla-599566692924&psc=1
> Thank you for that. I do have a long standing dream of a single chip
> wifi router, with the lowest SNR possible, and the minimum number of
> pins coming off of it. I'd settle for 32MB of (static?) ram on chip as
> that has proven sufficient to date to drive 802.11n....
>
> which would let you get rid of both the L2 and L1 cache. That said, I
> think the cost of 32MB of on-chip static ram remains a bit high, and
> plugging it into a mips cpu, kind of silly. Someday there will be a case
> to just doing everything on a single chip, but...
>
> >
> >
> > _______________________________________________
> > Bloat mailing list
> > Bloat@lists.bufferbloat.net
> > https://lists.bufferbloat.net/listinfo/bloat
> _______________________________________________
> Bloat mailing list
> Bloat@lists.bufferbloat.net
> https://lists.bufferbloat.net/listinfo/bloat
--
Dave Täht
CTO, TekLibre, LLC
http://www.teklibre.com
Tel: 1-831-205-9740
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [Bloat] known buffer sizes on switches
2018-11-24 23:29 Dave Taht
@ 2018-11-25 6:44 ` Mikael Abrahamsson
0 siblings, 0 replies; 9+ messages in thread
From: Mikael Abrahamsson @ 2018-11-25 6:44 UTC (permalink / raw)
To: Dave Taht; +Cc: bloat
On Sat, 24 Nov 2018, Dave Taht wrote:
> https://people.ucsc.edu/~warner/buffer.html
Nice resource, thanks.
If someone wonders why things look the way they do, so it's all about
on-die and off-die memory. Either you use off-die or on-die memory, often
SRAM which requires 6 gates per bit. So spending half a billion gates
gives you ~10MB buffer on-die. If you're doing off-die memory (DRAM or
similar) then you'll get the gigabytes of memory seen in some equipment.
There basically is nothing in between. As soon as you go off-die you might
as well put at least 2-6 GB in there.
Also, off-die memory takes IO capacity. A forwarding chip might have 4
"sides" with I/O lanes sets. If you put it in a 1RU device with no buffer,
you can connect ports to all of the lanes. This gives you a very high port
density low buffer size device and a very good price point.
Now, if you want more buffer and more route memory (taking one "side"
each) plus connecting it to a backplane (another side), you now only have
a single "side" left for ports. This is why high route-count, high buffer,
modular switches are so much more expensive compared low-route,
low-buffer, fixed configuration ones.
Above is principle, there are of course combinations and optimizations to
be made so not all devices adhere exactly to the above.
--
Mikael Abrahamsson email: swmike@swm.pp.se
^ permalink raw reply [flat|nested] 9+ messages in thread
* [Bloat] known buffer sizes on switches
@ 2018-11-24 23:29 Dave Taht
2018-11-25 6:44 ` Mikael Abrahamsson
0 siblings, 1 reply; 9+ messages in thread
From: Dave Taht @ 2018-11-24 23:29 UTC (permalink / raw)
To: bloat
https://people.ucsc.edu/~warner/buffer.html
--
Dave Täht
CTO, TekLibre, LLC
http://www.teklibre.com
Tel: 1-831-205-9740
^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2018-11-29 2:33 UTC | newest]
Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-11-28 16:32 [Bloat] known buffer sizes on switches Bruno George Moraes
2018-11-28 16:55 ` Dave Taht
2018-11-28 18:25 ` Dave Collier-Brown
2018-11-28 18:26 ` David Collier-Brown
2018-11-28 19:02 ` Dave Taht
2018-11-28 20:34 ` David Collier-Brown
2018-11-29 2:33 ` Dave Taht
-- strict thread matches above, loose matches on Subject: below --
2018-11-24 23:29 Dave Taht
2018-11-25 6:44 ` Mikael Abrahamsson
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox