On 08/10/2016 10:20 AM, Michael Mol wrote:
On Wednesday, August 10, 2016 10:13:29 AM james wrote:
On 08/10/2016 07:45 AM, Michael Mol wrote:
On Tuesday, August 09, 2016 05:22:22 PM james wrote:


I did a quick test with games-arcade/xgalaga. It's an old, quirky game
with sporadic lag variations. On a lightly loaded workstation with 32G
of RAM and (8) 4GHz 64-bit cores, there is no reason for in-game lag.
Your previous settings made it much better and quicker the vast majority
of the time, but not optimal (always responsive). Experience tells me
that if I can tweak a system so that the game stays responsive whilst
the application mix is running concurrently, then the quick test plus
parameter settings are reasonably well behaved. That then becomes a
baseline for further automated tests and fine tuning of a system under
study.

What kind of storage are you running on? What filesystem? If you're still
hitting swap, are you using a swap file or a swap partition?

The system I mostly referenced rarely hits swap in days of uptime. It's
the keyboard latency, while playing the game, that I try to tune away
while other codes are running. I try very hard to keep codes from
swapping out, because ultimately I'm most interested in clusters that
keep everything running (in memory). Aka ultimate utilization of
Apache Spark and other "in-memory" techniques.

Gotcha. dirty_bytes and dirty_background_bytes won't apply to anything that
doesn't call mmap() with a file backing or perform some other file I/O. If
you're not doing those things, they should have little to no impact.
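For anyone following along, a minimal sketch of how those two knobs get set. The 64M/16M figures below are just assumed starting points for illustration, not recommendations; sensible values depend entirely on your storage:

```shell
# Hypothetical starting points -- tune against your own disks and workload.
DIRTY_MB=64   # vm.dirty_bytes: hard cap on dirty pages before writers block
BG_MB=16      # vm.dirty_background_bytes: background writeback starts here

# The *_bytes knobs take bytes, and setting them zeroes the *_ratio knobs
# (the two forms are mutually exclusive).
DIRTY_BYTES=$((DIRTY_MB * 1024 * 1024))
BG_BYTES=$((BG_MB * 1024 * 1024))
echo "vm.dirty_bytes=$DIRTY_BYTES vm.dirty_background_bytes=$BG_BYTES"

# To apply (root required; lasts until reboot -- add to /etc/sysctl.conf to keep):
#   sysctl -w vm.dirty_bytes=$DIRTY_BYTES
#   sysctl -w vm.dirty_background_bytes=$BG_BYTES
```

The actual `sysctl -w` lines are left commented so the snippet is safe to paste on a box you haven't profiled yet.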

Background needed:: I'm one of those idealists who deeply believes the holy grail of computing will soon emerge (nice pun, huh?). That is, clusters, local clusters, will run all the workloads that multicore systems currently do. So a bunch of old crap can become a beautiful computational system, whilst I sit back, sip exotic beverages and enjoy my day; video training to go to the gym and dominate the young studs on the court.... New hardware (aka new computers and cosmetic surgery) will do the rest.

So an incredible variety of memory, storage and file systems will ultimately need to be tested. I try to stay simple and focused (believe it or not). Initially the thought is to run a primitive desktop, like lxde or lxqt, and use those under-utilized resources as computational node contributors, whilst still remaining responsive at the keyboard (xgalaga is a quick and dirty test for this). So you now have a wonderful cover story if the boss catches you noodling around with swords and sorcery: you can tell him you're looking for subtle latency issues......

The game speeds up and slows down, with zero swapping, due to what I suspect are mostly VM and MM issues. The 8-core never goes above 0.2 on the load average and only rarely saturates one core, for a transient instant. Even if xgalaga is a single-threaded game, that does not explain this transient keyboard lag. I'm open to other forms of quick, at-the-keyboard graphical tests as a quick and dirty measurement of overall system attentiveness to pending additional input/workload demands. Once that is happy with a given set of running codes (test vectors), I can get very quick feedback on performance this way.
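As a crude stand-in for the xgalaga "feel" test, attentiveness can be probed from a plain shell: time how far a 1 ms sleep overshoots while the workload runs. (rt-tests' cyclictest does this far more rigorously; this is only a rough sketch, and the iteration count is an arbitrary choice of mine.)

```shell
# Rough userspace jitter probe: repeatedly sleep 1 ms and record the worst
# overshoot. Large overshoots under load line up with perceived input lag.
N=100; INTERVAL_NS=1000000; worst=0
i=0
while [ $i -lt $N ]; do
    t0=$(date +%s%N)            # GNU date: nanoseconds since the epoch
    sleep 0.001
    t1=$(date +%s%N)
    over=$(( t1 - t0 - INTERVAL_NS ))
    [ $over -gt $worst ] && worst=$over
    i=$((i + 1))
done
echo "worst overshoot: $((worst / 1000)) us over $N iterations"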

For deeper studies I like trace-cmd/Ftrace/KernelShark, but those are like Zabbix for utilization and analytical studies. I use xgalaga as a quick and dirty test, but am surely open to new codes for that sort of quick and easy feedback.
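For the trace-cmd route, a sketch of a scheduler-latency capture session. The snippet only assembles and prints the record invocation (so it is safe to paste unprivileged); the printed command needs root, and `sched_switch`/`sched_wakeup` are the standard sched tracepoints:

```shell
# Build a trace-cmd invocation for a 30-second system-wide capture of the
# scheduler events. Run the printed command as root while the game runs,
# then inspect wakeup-to-switch gaps with `trace-cmd report`.
CMD="trace-cmd record"
for ev in sched:sched_switch sched:sched_wakeup; do
    CMD="$CMD -e $ev"
done
CMD="$CMD sleep 30"   # trace for the duration of `sleep 30`
echo "$CMD"
```

The resulting trace.dat also loads straight into KernelShark for the graphical view.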



Ideal values for dirty_bytes and dirty_background_bytes will depend heavily on
the nature of your underlying storage. Dozens of other things might be tweaked
depending on what filesystem you're using. Which is why I was asking about
those things.

A myriad of combinations exist. So picking some common combinations will allow others to test my work, when it is packaged up for sharing and testing. For me, eventually automating a collection of 'test vectors' is what's important, not the first few test vectors themselves. Then the pathway forward for other collections of running processes can become yet another collection of 'test vectors'. No limit on these collectives. Eventually a customer will step forward and define the collective of 'test vectors', so I do hope to work with/for one of the more progressive vendors, eventually, in these efforts. Certainly sharing the work openly is far more important to me. For now, I start with things I like, know and have some familiarity with; no magic in these choices.


Combined codes running simultaneously never hits the HD (no swappiness)
but still there is keyboard lag.

Where are you measuring this lag? How much lag are we talking about?

Remember, I'm an EE and complex-fluids computational kind of guy, so I have no problem drudging down into the sparse or full matrix types of mentally inebriating, adventuresome calculations, like computational chemistry. But since this approach is not yet ready for those sorts of things, I keep things simple, for now. What I want is an automated installation semantic, where folks can download images, run them on their small clusters on a weekly basis, and keep solving the same test-vector collectives over and over. Tweaks and ideas go into the newly released images, and a group of gentoo-users tests things out. But an automated, quick and simple gentoo system flies against what most folks believe in this community (dammit, I have to respect that, so I work on my own scripts I have lifted from others) {wink wink; nudge nudge}.
As you already know....


Not that it is actually affecting the
running codes to any appreciable degree, but it is a test I run so that
the cluster nodes will benefit from still being quickly (low-latency)
attentive to interactions with the cluster master processes, regardless
of workloads on the nodes. Sure, it's not totally accurate, but so far
this semantic approach is pretty darn close. It's not part of this
conversation (on VM etc.), but ultimately getting this right solves one of
the biggest problems for building any cluster; that is, workload
invocation, shedding and management to optimize resource utilization,
regardless of the orchestration(s) used to manage the nodes. Swapping to
disk is verboten in my (ultimate) goals and target scenarios.

No worries, you have given me enough info and ideas to move forward with
testing and tuning. I'm going to evolve these into more precisely
controlled and monitored experiments, noting exact hardware differences;
that should complete the tuning of the memory management tasks, within
acceptable confines. Then automate it for later checking on cluster
test runs with various hardware setups. Eventually these tests will be
extended to a variety of memory and storage hardware, once the
techniques are automated. No worries, I now have enough ideas and
details (thanks to you) to move forward.

You've got me curious, now you're going to go run off and play with your
thought problems and not share! Tease!

Dude, I share too much. If you had not gone on vacation (from gentoo-user) you'd know this. Since I am way too mentally handicapped to do all of this on my own (and too old and wise to even try), I routinely seek guidance and help. I read quite a lot, to remind me of the mistakes from previous distributed parallel computational attempts; and that reading also saddens me a bit, to see so many malformed cluster ideas. Oh well, failure is the most important lesson technical folks learn. Most often ideas just bounce off the wall right back at me, but I have learned to duck (most of the time). YOU and anyone else are most welcome to join my efforts; we all shall benefit from robust, local clusters, as masters of gentoo (or posers of gentoo, just like me). <end philosophy>

So while we are at it, scripts or stage-4 images that can be rapidly booted up on a given small hardware cluster are key to my approach. Memory management is probably the most challenging aspect of building and robustly managing (for efficient resource utilization) these clusters or outsourced clusters (clouds, in vendor speak). I use the same cluster setup to test a myriad of different problem-solution sets on identical hardware, changing only the software, including file systems: both DFS (CephFS/OrangeFS/OpenAFS/BeeGFS) and the local fs (xfs, ext4), as well as hybrids like btrfs and special subsystems like bcache. On top of OpenStack, Hadoop, Mesos, old Beowulf (with a fast DFS replacing NFS) and others.

Once domain-specific problems are moved to a cluster and that solution
set is near-optimal, after robustly testing many codes in the CI fashion
outlined above, it becomes a stage-4 canned solution for somebody to run
on their hardware. If they need more hardware resources, within a specific interval, THEN outsource those resource needs to the cloud vendors. Expecting a cloud vendor to be a champion of your domain-specific need is a roadmap to chapter 11 or 13 for that corporation. I suspect that once AWS and Google and MS and IBM learn what the NSA already knows, there will be a feeding frenzy of acquisitions of old technology companies. That's ultimately where the action is in clusters.

All of this 'smoke and mirrors' marketing centered on social networks is just that: smoke and mirrors. Why do I say this? Simple; there already is enough processing power to solve those problems and needs with current Snoracle-style solutions and by the bloated players on Wall Street.

Now HPC, dude, that's the sweet edge of clustering. There are numerous
gargantuan issues in that sphere, and a few, like D. E. Shaw, are getting RICH off of clusters. He, a single professor, mastered computational chemistry and locked his expertise into ASIC chips. Now he is conquering Wall Street. Domain-specific solutions are where the action is in clusters. It's not that there's no money in the social networking spheres; those are locked up by the 'cost barrier to entry' semantics. OK, I digress. But the important thing is that local clusters, that can be rapidly built and torn down and reconfigured with a few simple keystrokes, are the future of clusters. A given small to mid-sized company had better learn how to build its own clusters, or it'll be in the welfare line, like several billion other folks are.


CoreOS and unikernels are really quite similar to my approach to clusters. A variety of problem-solution sets (aka test vectors) on identical hardware will light the pathway for domain-specific cluster solutions. Mine will be a node cluster on amd64, for now.

So, I'm not sitting on some Stanford level of skills or knowledge base (think AMPLab). I have decades of experience in mostly unfulfilled promises of ubiquitous distributed processing, and only narrowly (very tightly) focused success stories. Still, I am a believer that the current crop of linux clusters will become a utopian computation engine that works from the most modest of needs, like mundane admin taskloads, to the most demanding time-sequence RT simulations of some of the grand challenges in computational dynamics and similar areas.

But, after several years of research, I mostly see kids trying the same crap we tried decades ago, with a new 'fancy-pants' programming language (hence the prediction that the current cluster kids are being manipulated by the VC firms and deep-pocketed folks toward certain failure, whilst they pay off their debts). Same story, different overlord.


I am conflicted as to whether this is intentional or just a repeat of the blind leading the innocent off the cliff. That is, most of the vendor-centric cluster folks (marketers call these clouds) developing new codes are clueless. That said, surely those corps with large collections of existing software can migrate those critical codes to the cloud and only offer new versions of that software with a (cloud-centric) internet-required license. Think Azure/MS, IBM, etc. But that sort of position will just allow competitors to eat away larger chunks of their market share. (But I really don't care about this part of the cloud illusion.) I'm a hard-core hardware type who already knows that the future of clusters is mostly local, with local control. The cloud will become a secondary or tertiary market for cpu cycles and garbage collection (think social networking databases). Sure, folks will put their websites on commercial clouds, but that is already just a natural evolution of co-location of servers and not some breakthrough in technology.




Down this pathway are the developments in the latest versions of Clang, gcc, etc., and EEs making the resources of the GPU (including GDDR5+) into a transparent computational resource for routine compilers. RDMA is going to change (everything). RAM will finally not be the bottleneck, as FPGA and GPU resources can be configured, dynamically, as either highly specialized processors or highly specialized memory (look at CAMs, or Content Addressable Memory, for a teaser). Router vendors have been making billions of dollars by adding CAMs to otherwise mundane processing systems.

No more of those ancient (intel) parallel compilers and shit like that.... Plus an avalanche of re-configurable memory types, mostly transparent to folks that use "emerge" for custom compiling. Then there is the hardened kernel. Few in the cluster world even know such things exist; more sadly, few know why they are necessary and when they are necessary.
Keep puffing on that buntu hookah pipe, brah_heim.....


The flip side to this is that a lot of vendors think that bloated linux operating systems, on top of non-tuned, non-stripped, insecure linux kernels, are going to be commercially viable. If you build your house on turds, when it starts to rain there is a funky smell in the air, before it all washes away. Bloated buntu, debian or RHEL are turds and are not going to work, compared to stripped, minimal linux systems. That's where Docker just "bitch-slapped" their competition, by moving to subsume Alpine linux.....

Your postings and clarity on VM have helped me focus, immensely. It is the current need in my work. Have I shared enough for you, today?

Any other questions or ideas are most welcome, publicly or privately.
I could be wrong about all of this, but my fourth-generation stab at
ubiquitous (distributed || parallel) processing tells me I have the right idea. I do lack current skills in so many areas, though, and that impedes my work.

Without the gentoo community, I could not possess such visions of future-present greatness, nor share them with others.




Perhaps Zabbix + TSdB can get me further down the pathway. Time-sequenced
and analyzed data is overkill for this (xgalaga) test, but
those coalesced test vectors will be most useful for me as I seek a
gentoo-centric pathway for low-latency clusters (on bare metal).

If you're looking to avoid Zabbix interfering with your performance,
you'll want the Zabbix server and web interface on a machine separate
from the machines you're trying to optimize.

agreed.

Thanks Mike,
James

np


Clusters will end up on people's wrist watches, in the trunks of their autos and at their homes, so they control their computational needs and security, sooner rather than later. I think the next president will mandate the opening of the OS to many vendors, and open source for cell phones, apps and such. The current monopolies are excessively more powerful than the old 'robber barons', and that fact is well recognized by lots of deeper thinkers. It's branded under globalization, but its demise is just under the horizon, imho.

True, ubiquitous clusters will be a result of hard work on compilers that take sequential problems, break them down into pieces and reassemble them into a form that can leverage parallel techniques. gcc 5 and 6 and Clang are moving rapidly in this direction. GPU vendors understand the importance of SIMD and MIMD processing for 'systolic' algorithms and similar approaches to massive distributed processing. AMD (Radeon) understands that this power can most effectively be used if it is cheap and open sourced. Nvidia is not so quick to follow (or lead) down this open-source path, imho. Intel purchasing an FPGA company and licensing GPU technologies from many others tells me the hardware vendors are preparing for a revolution. A direct sales channel to the commoners will be their greatest path to ridiculous profitability. Why? Simple: the smaller the core (competitive team) that exists, the more excess processing resources will be purchased, and purchased closer to the retail price.

When hardware vendors partner with a few software companies, the margins on hardware get squeezed. Besides, the hackers of the world are finding any critical barriers to codes and publishing them, so all have fair access to the latest codes (one way or another). The NSA and such entities are not going to stop this, because all of this software espionage justifies governments taxing the snot out of citizens to fight those evil hackers. It's a far superior business model for DoD
types like intel and google than the cold war ever thought about being.
The average tax-payer is too stupid to realize social networking, with an Onion approach, is just feeding data-sets via google, linkedin, facebook
etc., directly to the NSA and other nation-state actors. WE get jobs
and pay taxes. They set the rules and manage the data.

Problem is, eventually the commoners will have sufficient clusters, solar panels, water wells or sources, and greenhouses, and will tell da_man
to stick his taxes on imports. Fine, that works; then everybody gets
a 3D printer and we, the commoners, are self-sufficient.

The simple fact is that this is a great business model for EVERYONE, including the elites, so what are we waiting on? A stupid old man like me? Naw, not at Gentoo. buntu, sure; RHEL, definitely; but not gentoo, brah. WE are the solution to everything!

</>

James

