Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. You know what would be really nice (but nobody is ever going to build)?

You know what would be really nice (but nobody is ever going to build)?

Scheduled Pinned Locked Moved Uncategorized
25 Posts 6 Posters 88 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • funkylab@mastodon.socialF funkylab@mastodon.social

    @azonenberg I'm always kind of weary of silicon manufacturer's proprietary high-speed buses, because they teeeend to be slightly use-case-specific and don't deal well with edge cases outside that. Anyways, when I saw NVLink my first reaction was "wait is this HyperTransport, but with expensive modern transceivers?"; it isn't, but my guess is that from a system's perspective, you'd be better off going for AMD's InfinityFabric, which seems to make stronger coherence statements (not sure). But, you

    funkylab@mastodon.socialF This user is from outside of this forum
    funkylab@mastodon.socialF This user is from outside of this forum
    funkylab@mastodon.social
    wrote last edited by
    #4

    @azonenberg are mostly only optimizing for nvidia GPUs anyways, so that might be a moot point.

    azonenberg@ioc.exchangeA 1 Reply Last reply
    0
    • funkylab@mastodon.socialF funkylab@mastodon.social

      @azonenberg are mostly only optimizing for nvidia GPUs anyways, so that might be a moot point.

      azonenberg@ioc.exchangeA This user is from outside of this forum
      azonenberg@ioc.exchangeA This user is from outside of this forum
      azonenberg@ioc.exchange
      wrote last edited by
      #5

      @funkylab Well I mean I would *like* a ludicrously high bandwidth portable interface, but the vendors aren't building it.

      Realistically, I think the best you can do portably today is 100GbE with RoCE.

      funkylab@mastodon.socialF 1 Reply Last reply
      0
      • azonenberg@ioc.exchangeA azonenberg@ioc.exchange

        For comparison... my 16 GHz LeCroy oscilloscope puts out 40 Gsps * 4 channels * 8 bits of raw ADC samples, not counting the flatness corrections done in gateware/firmware.

        That's 160 GB/s or 1.28 Tbps of raw samples.

        That would even fit in NVLink 2.0 much less the current gen4/5 stuff.

        Imagine four channels of 16 GHz bandwidth waveform data straight into a (very large) GPU nonstop... We'd have to do a hell of a lot of optimization to ngscopeclient to keep up and probably add multi-GPU support but it would be so much fun lol.

        1div0@mastodon.social1 This user is from outside of this forum
        1div0@mastodon.social1 This user is from outside of this forum
        1div0@mastodon.social
        wrote last edited by
        #6

        @azonenberg https://tenstorrent.com/hardware/cards#compare ?

        1 Reply Last reply
        0
        • azonenberg@ioc.exchangeA azonenberg@ioc.exchange

          @funkylab Well I mean I would *like* a ludicrously high bandwidth portable interface, but the vendors aren't building it.

          Realistically, I think the best you can do portably today is 100GbE with RoCE.

          funkylab@mastodon.socialF This user is from outside of this forum
          funkylab@mastodon.socialF This user is from outside of this forum
          funkylab@mastodon.social
          wrote last edited by
          #7

          @azonenberg oh you *can* buy 800 Gb/s interfaces, don't know what their host sides look like, if any for non network-vendor stuff (this is mostly aggregated traffic equipment, i.e. linking racks or DC 1 to DC 2)

          azonenberg@ioc.exchangeA 1 Reply Last reply
          0
          • funkylab@mastodon.socialF funkylab@mastodon.social

            @azonenberg oh you *can* buy 800 Gb/s interfaces, don't know what their host sides look like, if any for non network-vendor stuff (this is mostly aggregated traffic equipment, i.e. linking racks or DC 1 to DC 2)

            azonenberg@ioc.exchangeA This user is from outside of this forum
            azonenberg@ioc.exchangeA This user is from outside of this forum
            azonenberg@ioc.exchange
            wrote last edited by
            #8

            @funkylab yeah exactly. 100G with a normal pcie interface is available today, i have a 100G pipe to my desk and have saturated it with iperf in benchmarks.

            And the nic has RoCE offload capabilities although I'm not using it yet

            azonenberg@ioc.exchangeA 1 Reply Last reply
            0
            • azonenberg@ioc.exchangeA azonenberg@ioc.exchange

              @funkylab yeah exactly. 100G with a normal pcie interface is available today, i have a 100G pipe to my desk and have saturated it with iperf in benchmarks.

              And the nic has RoCE offload capabilities although I'm not using it yet

              azonenberg@ioc.exchangeA This user is from outside of this forum
              azonenberg@ioc.exchangeA This user is from outside of this forum
              azonenberg@ioc.exchange
              wrote last edited by
              #9

              @funkylab You can go all the way up to 800G if you have a host system with PCIe gen6 and sufficiently deep pockets (I do not)

              favicon

              (www.fs.com)

              funkylab@mastodon.socialF 1 Reply Last reply
              0
              • azonenberg@ioc.exchangeA azonenberg@ioc.exchange

                For comparison... my 16 GHz LeCroy oscilloscope puts out 40 Gsps * 4 channels * 8 bits of raw ADC samples, not counting the flatness corrections done in gateware/firmware.

                That's 160 GB/s or 1.28 Tbps of raw samples.

                That would even fit in NVLink 2.0 much less the current gen4/5 stuff.

                Imagine four channels of 16 GHz bandwidth waveform data straight into a (very large) GPU nonstop... We'd have to do a hell of a lot of optimization to ngscopeclient to keep up and probably add multi-GPU support but it would be so much fun lol.

                scribblesonnapkins@mastodon.socialS This user is from outside of this forum
                scribblesonnapkins@mastodon.socialS This user is from outside of this forum
                scribblesonnapkins@mastodon.social
                wrote last edited by
                #10

                @azonenberg so my tek 11801C with it's ability to connect to an external sampling head array might present a problem? Oh no wait it's sampling to slow.

                azonenberg@ioc.exchangeA 1 Reply Last reply
                0
                • scribblesonnapkins@mastodon.socialS scribblesonnapkins@mastodon.social

                  @azonenberg so my tek 11801C with it's ability to connect to an external sampling head array might present a problem? Oh no wait it's sampling to slow.

                  azonenberg@ioc.exchangeA This user is from outside of this forum
                  azonenberg@ioc.exchangeA This user is from outside of this forum
                  azonenberg@ioc.exchange
                  wrote last edited by
                  #11

                  @scribblesonnapkins well equivalent time sampling is easy to handle with today's tech because the number of actual samples acquired per second is low.

                  equally, a scope that acquires high speed data and buffers it in memory before processing at a much slower rate is something we can handle today.

                  But the vision is to be able to do real time or at least lower-dead-time processing at much higher data rates. ThunderScope almost maxes out 10GbE, my vision is to able to keep up with 25/40/100G eventually

                  scribblesonnapkins@mastodon.socialS 1 Reply Last reply
                  0
                  • meph@social.treehouse.systemsM meph@social.treehouse.systems shared this topic
                  • azonenberg@ioc.exchangeA azonenberg@ioc.exchange

                    @funkylab You can go all the way up to 800G if you have a host system with PCIe gen6 and sufficiently deep pockets (I do not)

                    favicon

                    (www.fs.com)

                    funkylab@mastodon.socialF This user is from outside of this forum
                    funkylab@mastodon.socialF This user is from outside of this forum
                    funkylab@mastodon.social
                    wrote last edited by
                    #12

                    @azonenberg I doubt your pockets will be deep enough for NVlink things involving anything but GPUs 🙂

                    azonenberg@ioc.exchangeA 1 Reply Last reply
                    0
                    • funkylab@mastodon.socialF funkylab@mastodon.social

                      @azonenberg I doubt your pockets will be deep enough for NVlink things involving anything but GPUs 🙂

                      azonenberg@ioc.exchangeA This user is from outside of this forum
                      azonenberg@ioc.exchangeA This user is from outside of this forum
                      azonenberg@ioc.exchange
                      wrote last edited by
                      #13

                      @funkylab oh i know, nvlink doesnt even let you get the PHY chiplets (the protocol itself is undocumented) unless you have NDAs and a partnership with nvidia etc.

                      but I can dream...

                      funkylab@mastodon.socialF 1 Reply Last reply
                      0
                      • azonenberg@ioc.exchangeA azonenberg@ioc.exchange

                        @funkylab oh i know, nvlink doesnt even let you get the PHY chiplets (the protocol itself is undocumented) unless you have NDAs and a partnership with nvidia etc.

                        but I can dream...

                        funkylab@mastodon.socialF This user is from outside of this forum
                        funkylab@mastodon.socialF This user is from outside of this forum
                        funkylab@mastodon.social
                        wrote last edited by
                        #14

                        @azonenberg I was assuming that you'd probably (assuming infinite money) could buy an Nvidia server platform that has network->VRAM piping (I assume this because I presume that's what nvidia bought mellanox for)

                        azonenberg@ioc.exchangeA 1 Reply Last reply
                        0
                        • funkylab@mastodon.socialF funkylab@mastodon.social

                          @azonenberg I was assuming that you'd probably (assuming infinite money) could buy an Nvidia server platform that has network->VRAM piping (I assume this because I presume that's what nvidia bought mellanox for)

                          azonenberg@ioc.exchangeA This user is from outside of this forum
                          azonenberg@ioc.exchangeA This user is from outside of this forum
                          azonenberg@ioc.exchange
                          wrote last edited by
                          #15

                          @funkylab That's where RoCE comes in.

                          But Ethernet today tops out at 800 Gbps while the latest NVLink can do 14.4 Tbps

                          azonenberg@ioc.exchangeA 1 Reply Last reply
                          0
                          • azonenberg@ioc.exchangeA azonenberg@ioc.exchange

                            @funkylab That's where RoCE comes in.

                            But Ethernet today tops out at 800 Gbps while the latest NVLink can do 14.4 Tbps

                            azonenberg@ioc.exchangeA This user is from outside of this forum
                            azonenberg@ioc.exchangeA This user is from outside of this forum
                            azonenberg@ioc.exchange
                            wrote last edited by
                            #16

                            @funkylab NVLink is the fantasy, the actually achievable real world implementation is to make the scope speak RoCE, put a mellanox NIC in the client, and RDMA the incoming Ethernet frames straight into VRAM.

                            But it still has to cross over PCIe and get bottlenecked on that bandwidth

                            cliffsesport@mastodon.socialC 1 Reply Last reply
                            0
                            • azonenberg@ioc.exchangeA azonenberg@ioc.exchange

                              @scribblesonnapkins well equivalent time sampling is easy to handle with today's tech because the number of actual samples acquired per second is low.

                              equally, a scope that acquires high speed data and buffers it in memory before processing at a much slower rate is something we can handle today.

                              But the vision is to be able to do real time or at least lower-dead-time processing at much higher data rates. ThunderScope almost maxes out 10GbE, my vision is to able to keep up with 25/40/100G eventually

                              scribblesonnapkins@mastodon.socialS This user is from outside of this forum
                              scribblesonnapkins@mastodon.socialS This user is from outside of this forum
                              scribblesonnapkins@mastodon.social
                              wrote last edited by
                              #17

                              @azonenberg I was trying to make a joke with the 1st part "Oh no wait it's sampling to slow."

                              But the second part about the thunderscope is cool.

                              1 Reply Last reply
                              0
                              • azonenberg@ioc.exchangeA azonenberg@ioc.exchange

                                You know what would be really nice (but nobody is ever going to build)?

                                Oscilloscope that replaces the ungodly slow USB3/1000baseT PC interface port with NVLink.

                                Forget PCIe and Thunderbolt... 900 GB/s of bandwidth straight from the ADC to my GPU? Sign me up.

                                penguin42@mastodon.org.ukP This user is from outside of this forum
                                penguin42@mastodon.org.ukP This user is from outside of this forum
                                penguin42@mastodon.org.uk
                                wrote last edited by
                                #18

                                @azonenberg How about CXL4? That claims 242GB/s and is at least designed for external connectivity.

                                azonenberg@ioc.exchangeA 1 Reply Last reply
                                0
                                • penguin42@mastodon.org.ukP penguin42@mastodon.org.uk

                                  @azonenberg How about CXL4? That claims 242GB/s and is at least designed for external connectivity.

                                  azonenberg@ioc.exchangeA This user is from outside of this forum
                                  azonenberg@ioc.exchangeA This user is from outside of this forum
                                  azonenberg@ioc.exchange
                                  wrote last edited by
                                  #19

                                  @penguin42 If somebody makes a GPU with CXL I'll be all over it.

                                  Until then I'm stuck with what I can get my hands on. Realistically, that's PCIe and RoCE

                                  1 Reply Last reply
                                  0
                                  • azonenberg@ioc.exchangeA azonenberg@ioc.exchange

                                    @funkylab NVLink is the fantasy, the actually achievable real world implementation is to make the scope speak RoCE, put a mellanox NIC in the client, and RDMA the incoming Ethernet frames straight into VRAM.

                                    But it still has to cross over PCIe and get bottlenecked on that bandwidth

                                    cliffsesport@mastodon.socialC This user is from outside of this forum
                                    cliffsesport@mastodon.socialC This user is from outside of this forum
                                    cliffsesport@mastodon.social
                                    wrote last edited by
                                    #20

                                    @azonenberg @funkylab I am curious what fields use Oscilloscopes at level you build and test for? I am guessing radio & perhaps medical? I've only ever used them for basic electronics back in the 90s so the performance of your stuff is just stunning.

                                    azonenberg@ioc.exchangeA 1 Reply Last reply
                                    0
                                    • cliffsesport@mastodon.socialC cliffsesport@mastodon.social

                                      @azonenberg @funkylab I am curious what fields use Oscilloscopes at level you build and test for? I am guessing radio & perhaps medical? I've only ever used them for basic electronics back in the 90s so the performance of your stuff is just stunning.

                                      azonenberg@ioc.exchangeA This user is from outside of this forum
                                      azonenberg@ioc.exchangeA This user is from outside of this forum
                                      azonenberg@ioc.exchange
                                      wrote last edited by
                                      #21

                                      @CliffsEsport @funkylab My focus is mostly on the high speed digital side of things, so networking, high speed buses, etc. Modern digital interfaces are absurdly fast.

                                      Even USB 3.0 was 5 Gbps per pair and that's pretty slow compared to modern stuff. PCIe gen6 runs at 64 Gbps.

                                      DisplayPort goes up to 20 Gbps per lane now.

                                      But understanding complex issues around these buses involves recording a lot of data, processing it fast, looking at packet captures and physical layer signal quality, etc. There's always room to crunch more data faster.

                                      azonenberg@ioc.exchangeA 1 Reply Last reply
                                      0
                                      • azonenberg@ioc.exchangeA azonenberg@ioc.exchange

                                        @CliffsEsport @funkylab My focus is mostly on the high speed digital side of things, so networking, high speed buses, etc. Modern digital interfaces are absurdly fast.

                                        Even USB 3.0 was 5 Gbps per pair and that's pretty slow compared to modern stuff. PCIe gen6 runs at 64 Gbps.

                                        DisplayPort goes up to 20 Gbps per lane now.

                                        But understanding complex issues around these buses involves recording a lot of data, processing it fast, looking at packet captures and physical layer signal quality, etc. There's always room to crunch more data faster.

                                        azonenberg@ioc.exchangeA This user is from outside of this forum
                                        azonenberg@ioc.exchangeA This user is from outside of this forum
                                        azonenberg@ioc.exchange
                                        wrote last edited by
                                        #22

                                        @CliffsEsport @funkylab Much lower speed stuff still benefits from better processing.

                                        Modern cars are switching from CAN bus to Ethernet for talking between modules. 100baseT1 is a common flavor that runs Ethernet frames at 100 Mbps bidirectionally over a single pair of wires.

                                        If you want to simultaneously look at a protocol decode and signal quality view of that signal in both directions you're looking at around a GB/s of data. Which ngscopeclient is able to crunch in real time (barely, this is a recent tech demo)

                                        cliffsesport@mastodon.socialC 1 Reply Last reply
                                        0
                                        • azonenberg@ioc.exchangeA azonenberg@ioc.exchange

                                          @CliffsEsport @funkylab Much lower speed stuff still benefits from better processing.

                                          Modern cars are switching from CAN bus to Ethernet for talking between modules. 100baseT1 is a common flavor that runs Ethernet frames at 100 Mbps bidirectionally over a single pair of wires.

                                          If you want to simultaneously look at a protocol decode and signal quality view of that signal in both directions you're looking at around a GB/s of data. Which ngscopeclient is able to crunch in real time (barely, this is a recent tech demo)

                                          cliffsesport@mastodon.socialC This user is from outside of this forum
                                          cliffsesport@mastodon.socialC This user is from outside of this forum
                                          cliffsesport@mastodon.social
                                          wrote last edited by
                                          #23

                                          @azonenberg @funkylab Oh, wow, I did not know cars were switching to Ethernet.

                                          funkylab@mastodon.socialF 1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups