Re: [Haskell-cafe] Re: Greetings

Seth Gordon Sat, 30 Sep 2006 21:05:36 -0700

Paul Johnson wrote:

I've done some stuff with maybe 50k rows at a time.  A few bits and pieces:
1: I've used HSQL(http://sourceforge.net/project/showfiles.php?group_id=65248) to talk toODBC databases. Works fine, but possibly a bit slowly. I'm not surewhere the delay is: it might just be the network I was running it over.One gotcha: the field function takes a field name, but its not randomaccess. Access the fields in query order or it crashes.


Thanks; that's certainly the sort of thing I like knowing in advance.

2: For large data sets laziness is your friend. When reading files"getContents" presents an entire file as a list, but its reallyevaluated lazily. This is implemented using unsafeInterleaveIO. I'venever used this, but in theory you should be able to set up a query thatreturns the entire database as a list and then step through it usinglazy evaluation in the same way.

I assume that the collectRows function in HSQL can produce this kind ofa lazy list...right?

3: You don't say whether these algorithms are just row-by-row algorithmsor whether there is something more sophisticated going on. Either way,try to make things into lists and then apply map, fold and filteroperations. Its much more declarative and high level when you do itthat way.


I'm going to need to do some mapping, folding, partitioning...


Let us know how you get on.


I certainly will.
_______________________________________________
Haskell-Cafe mailing list
[email protected]
http://www.haskell.org/mailman/listinfo/haskell-cafe

Re: [Haskell-cafe] Re: Greetings

Reply via email to