On Monday, 19 March 2012 at 17:23:36 UTC, Andrei Alexandrescu wrote:
On 3/18/12 11:12 PM, Jay Norwood wrote:
I'm timing operations processing 10 2MB text files in parallel. I haven't gotten to the part where I put the words in the map, but I've done enough up to this point to say a few things about the measurements.
Great work! This prompts quite a few bug reports and enhancement suggestions - please submit them to bugzilla.
I don't know if they are bugs. On D.learn I got the explanation that matches.captures.length() just returns the matches for the expressions surrounded by (), so I don't think it can be used to count lines except by looping over the matches. std.algorithm.count works ok, but I was hoping there was something in ctRegex that would make it work as fast as the hand-coded string scan.
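For concreteness, the variants being compared look roughly like this (a sketch only; txt standing in for a buffer already read into memory is an assumption here):

import std.algorithm : count;
import std.regex : ctRegex, matchAll;

size_t viaRegex(const(char)[] txt)
{
    // captures.length only counts ()-groups per match, so counting
    // lines with a regex means walking every match of the pattern
    size_t n = 0;
    foreach (m; matchAll(txt, ctRegex!`\n`))
        ++n;
    return n;
}

size_t viaCount(const(char)[] txt)
{
    return count(txt, '\n'); // works ok
}

size_t viaScan(const(char)[] txt)
{
    size_t n = 0; // hand-coded scan: the speed baseline
    foreach (c; txt)
        if (c == '\n') ++n;
    return n;
}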
Two quick notes:
On the other end of the spectrum is the byLine version of the read. This is way too slow to be promoting in our examples, and if anyone is using it in their code they should instead read chunks ... maybe 1MB, like in my example later below, and then split up the lines themselves.
import std.stdio;
import std.string;

// read files by line ... yikes! don't want to do this
// finished! time: 485 ms
void wcp_byLine(string fn)
{
    auto f = File(fn);
    foreach (line; f.byLine(std.string.KeepTerminator.yes))
    {
        // no per-line work yet; this times the read alone
    }
}
What OS did you use? (The implementation of byLine varies a lot across OSs.)
I'm doing everything on win7-64 right now.
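The chunked variant I mentioned above is roughly this shape (a minimal sketch assuming a 1MB byChunk buffer; carrying lines that span a chunk boundary over to the next chunk is omitted):

import std.algorithm : splitter;
import std.stdio;

void wcp_byChunk(string fn)
{
    auto f = File(fn);
    foreach (chunk; f.byChunk(1024 * 1024))
    {
        // split the chunk on newlines ourselves instead of using byLine;
        // a line spanning two chunks would need to be carried over
        foreach (line; splitter(chunk, cast(ubyte)'\n'))
        {
            // per-line processing would go here
        }
    }
}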
I have wanted for a long time to improve byLine by allowing it to do its own buffering. That means once you've used byLine, it's not possible to stop it, get back to the original File, and continue reading; using byLine is a commitment. That is what most uses of it do anyway.
Ok, this was the good surprise. Reading by chunks was faster than reading the whole file, by several ms.
What may be at work here is cache effects. Reusing the same 1MB may place it in faster cache memory, whereas reading 20MB at once may spill into slower memory.
Yes, I would guess that's the problem. This corei7 has 8MB cache, and the threadpool creates 7 active tasks by default, as I understand it, so even 1MB blocks are on the border when running in parallel. I'll lower the chunk size to some level that seems reasonable and retest.
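The retest would look something like this (a sketch; the 256KB chunk size is only a guess at "reasonable", chosen so seven concurrent tasks stay well inside the 8MB cache):

import std.parallelism : taskPool;
import std.stdio;

void wcp_parallel(string[] files)
{
    // ~7 tasks x 256KB stays well under the 8MB L3; not a measured optimum
    enum chunkSize = 256 * 1024;
    foreach (fn; taskPool.parallel(files, 1))
    {
        auto f = File(fn);
        foreach (chunk; f.byChunk(chunkSize))
        {
            // per-chunk word/line processing would go here
        }
    }
}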
Andrei