Re: [Beowulf] AMD and AVX512

2021-06-16 Thread Stu Midgley
I've told AMD brass that we need AVX512 many many times. I've also told them that we need more memory bandwidth and that adding dimms is not the answer. We don't need more capacity - just more bandwidth. We have a stack load of KNL systems and have invested heavily in AVX512 (writing with

Re: [Beowulf] [External] Re: AMD and AVX512

2021-06-16 Thread Prentice Bisbal via Beowulf
i also think you're hpl numbers on the amd chip are low, you should be > 4000 which would put you closer to intel, but intel will still edge out just because it has a higher base clock. I think I could probably get better numbers out of the AMD chip now, too. I've done some testing since

Re: [Beowulf] [External] Re: AMD and AVX512

2021-06-16 Thread Prentice Bisbal via Beowulf
Scott (and Michael and Carlos), Thanks for your excellent feedback. That's the kind of enlightening feedback I was looking for. Interesting that the HBM on Fugaku exceeds the needs of the processor. Prentice On 6/16/21 2:23 PM, Scott Atchley wrote: On Wed, Jun 16, 2021 at 1:15 PM Prentice

Re: [Beowulf] AMD and AVX512

2021-06-16 Thread Scott Atchley
On Wed, Jun 16, 2021 at 1:15 PM Prentice Bisbal via Beowulf < beowulf@beowulf.org> wrote: > Did anyone else attend this webinar panel discussion with AMD hosted by > HPCWire yesterday? It was titled "AMD HPC Solutions: Enabling Your > Success in HPC" > >

Re: [Beowulf] AMD and AVX512

2021-06-16 Thread Michael Di Domenico
AMD's argument is a little unsalesmen like, but i'd buy it as an explanation. avx512 uptake isn't a profound as intel would lead you to believe and the push to GPU's for vectors will probably remove the need for most of these high end vectors sooner or later (but that's my opinion, some chip

Re: [Beowulf] AMD and AVX512

2021-06-16 Thread Carlos Bederián
On Wed, Jun 16, 2021 at 2:16 PM Prentice Bisbal via Beowulf < beowulf@beowulf.org> wrote: > Last fall I evaluated potential new cluster nodes for a large cluster > purchase using the HPL benchmark. I compared a server with dual AMD EPYC > 7H12 processors (128) cores to a server with quad Intel

[Beowulf] AMD and AVX512

2021-06-16 Thread Prentice Bisbal via Beowulf
Did anyone else attend this webinar panel discussion with AMD hosted by HPCWire yesterday? It was titled "AMD HPC Solutions: Enabling Your Success in HPC" https://www.hpcwire.com/amd-hpc-solutions-enabling-your-success-in-hpc/ I attended it, and noticed there was no mention of AMD supporting

Re: [Beowulf] odd vlan issue

2021-06-16 Thread Michael Di Domenico
On Wed, Jun 16, 2021 at 9:25 AM Robert Taylor wrote: > > I’ve seen it happen with udp, in my case it was a syslog server, that hardly > ever “spoke” so eventually it’s MAC address disappears from the Cam table > long before it leaves the arp table and the UDP packets get flooded hoping to >

Re: [Beowulf] odd vlan issue

2021-06-16 Thread Robert Taylor
I’ve seen it happen with udp, in my case it was a syslog server, that hardly ever “spoke” so eventually it’s MAC address disappears from the Cam table long before it leaves the arp table and the UDP packets get flooded hoping to find the server. I’ve also seen this happen in an HA environment,

Re: [Beowulf] odd vlan issue

2021-06-16 Thread Michael Di Domenico
thanks after two days of digging, i think i finally figured out that we have a layer 2 routing problem. i'm not the network guy so i'm not digging into it deeper, but it appears that there are either malfunctioning LACP trunks or more likely a misconfigured VPC connection inside the menagerie of