Re: Walter's DConf 2014 Talks - Topics in Finance

Laeeth Isharc via Digitalmars-d Mon, 22 Dec 2014 19:11:39 -0800

Hi.

Sorry if this is a bit long, but perhaps it may be interesting to one or two.

On Monday, 22 December 2014 at 22:00:36 UTC, Daniel Davidson wrote:

On Monday, 22 December 2014 at 19:25:51 UTC, aldanor wrote:
On Monday, 22 December 2014 at 17:28:39 UTC, Daniel Davidson wrote:
I don't see D attempting to tackle that at this point.
If the bulk of the work for the "data sciences" piece is the maths, which I believe it is, then the attraction of D as a "data sciences" platform is muted. If the bulk of the work is preprocessing data to get to an all numbers world, then in that space D might shine.
That is one of my points exactly -- the "bulk of the work", as you put it, is quite often the data processing/preprocessing pipeline (all the way from raw data parsing, aggregation, validation and storage to data retrieval, feature extraction, and then serialization, various persistency models, etc).
I don't know about low frequency which is why I asked about Winton. Some of this is true in HFT but it is tough to break that pipeline that exists in C++. Take live trading vs backtesting: you require all that data processing before getting to the math of it to be as low latency as possible for live trading which is why you use C++ in the first place. To break into that pipeline with another language like D to add value, say for backtesting, is risky not just because of the duplication of development cost but also the risk of live not matching backtesting.
Maybe you have some ideas in mind where D would help that data processing pipeline, so some specifics might help?

I have been working as a PM for quantish buy side places since 98, after starting in a quant trading role on the sell side in 96, with my first research summer job in 93. Over time I have become less quant and more discretionary, so I am less in touch with the techniques the cool kids are using when it doesn't relate to what I do. But more generally there is a kind of silo mentality, where in a big firm people in different groups don't know much about what the guy sitting at the next bank of desks might be doing, and even within groups the free flow of ideas might be a lot less than you might think. Against that, firms with a pure research orientation may be a touch different, which just goes back again to say that from the outside it may be difficult to make useful generalisations.

A friend of mine who wrote certain parts of the networking stack in Linux is interviewing with HFT firms now, so I may have a better idea about whether D might be of interest. He has heard of D but suggests Java instead. (As a general option, not for HFT.) Even smart people can fail to appreciate beauty ;)

I think it's public that GS use a Python-like language internally, JPM do use Python for what you would expect, and so do AHL (one of the largest lower-frequency quant firms). More generally, in every field, but especially in finance, it seems like the data processing aspect is going to be key - not just a necessary evil. Yes, once you have it up and running you can tick it off, but it is going to be some years before you start to tick off items faster than they appear. Look at what Bridgewater are doing with gauging real time economic activity (and look at Google Flu prediction if one starts to get too giddy - it worked and then didn't).

There is a spectrum of different qualities of data. What is most objective is not necessarily what is most interesting. Yet work on affect, media, and sentiment analysis is in its very early stages. One can do much better than just "affect bad, buy stocks once they stop going down"... Someone who asked me to help with something is close to Twitter, and I have heard the number of firms, and the rough breakdown by sector, taking their full feed. It is shockingly small in the financial services field, and that's probably in part just that it takes people time to figure out something new.

Ravenpack do interesting work from the point of view of a practitioner, and I heard a talk by their former technical architect, and he really seemed to know his stuff. Not sure what they use as a platform.

I can't see why the choice of language will affect your back testing results (except that it is painful to write good algorithms in a clunky language and the risk of bugs is higher - but that isn't what you meant).

Anyway, back to D and finance. I think this mental image people have of back testing as being the originating driver of research may be mistaken. It's funny, but sometimes it seems the moment you take a scientist out of his lab and put him on a trading floor he wants to know if such and such beats transaction costs. But what you are trying to do is understand certain dynamics, and one needs to understand that markets are non-linear and have highly unstable parameters. So one must be careful about just jumping to a back test. (And then of course, questions of risk management and transaction costs really matter also.)

To a certain extent one must recognise that the asset management business has a funny nature. (This does not apply to many HFT firms that manage partners' money.) It doesn't take an army to make a lot of money with good people because of the intrinsic intellectual leverage of the business. But to do that one needs capital, and investors expect to see something tangible for the fees if you are managing size. Warren Buffett gets away with having a tiny organisation because he is Buffett, but that may be harder for a quant firm. So since intelligent enough people are cheap, and investors want you to hire people, it can be tempting to hire that army after all and set them to work on projects that certainly cover their costs but really may not be big determinants of variations in investment outcomes. I.e. one shouldn't mistake the number of projects for what is truly important.

I agree that it is setting up and keeping everything in production running smoothly that creates a challenge. So it's not just a question of doing a few studies in R. And the more ways of looking at the world, the harder you have to think about how to combine them. Spreadsheets don't cut the mustard any more - they haven't for years, yet it emerged even recently with the JPM whale that lack of integrity in the spreadsheet worsened communication problems between departments (risk especially). Maybe pypy and numpy will pick up all of the slack, but I am not so sure.

In spreadsheet world (where one is a user, not a pro), one never finishes and says finally I am done building sheets. One question leads to another in the face of an unfolding and generative reality. It's the same with quant tools for trading. Perhaps that means there is value in tooling suited to rapid iteration and to building robust code that won't later need to be totally rewritten from scratch.

At one very big US hedge fund I worked with, the tools were initially written in Perl (some years back). They weren't pretty, but they worked, and were fast and robust enough. I had many new features I needed for my trading strategy. But the owner - who liked to read about ideas on the internet - came to the conclusion that Perl was not institutional quality and that we should therefore cease new development and rewrite everything in C++. Two years later a new guy took over the larger group, and one way or the other everyone left. I never got my new tools, and that certainly didn't help on the investment front. After he left, a year after that, they scrapped the entire code base and bought Murex as nobody could understand what they had.

If we had had D then, it's possible the outcome might have been different.

So in any case, hard to generalise, and better to pick a few sympathetic people that see in D a possible solution to their pain, and use patterns will emerge organically out of that. I am happy to help where I can, and that is somewhat my own perspective - maybe D can help me solve my pain of tools not up to scratch, because good investment tool design requires investment and technology skills to be combined in one person, whereas each of these two is rarely found on its own. (D makes a vast project closer to brave than foolhardy.)

It would certainly be nice to have matrices, but I also don't think it would be right to say D is dead in the water here because it is so far behind. It also seems like the cost of writing such a library is very small versus the possible benefit.

One final thought. It's very hard to hire good young people. We had 1500 CVs for one job with very impressive backgrounds - French grandes écoles, and the like. But ask a chap how he would sort a list of books without a library, and the results were shocking. It seems like looking amongst D programmers is a nice heuristic, although perhaps the pool is too small for now. Not hiring now, but was thinking about it for the future.
