> 1) In the Febrl probabilistic record linkage suite (see
> http://febrl.sf.net) we have a little utility to synthesise names and
> addresses (with or without duplicates with random errors).
Excellent.

> for things like TORCH or GnuMed, testing with 10,000 or 20,000
> patients would not be unrealistic - and better to discover scalability
True enough. I did test a very few aspects of GnuMed with
125,000 patient names already delighted to discover that in
this particular area (pattern match search of patient name)
minor performance tweaks to the SQL involved leapfrogged
response times - on a machine the specs of which I don't dare
mention lest I be ridiculed for retro-computing :-)

> 3) Shareable EHR/EMR test data requires a shareable format which
[...]
> 4) Which raises the issue of the appropriate (as generic as possible)
> schema/data model for such test data. Are there candidate schema
> available - for example, what schema would be required by TORCH?
*fingers crossed, making go-away signs*  I wasn't going to
say it !   :-))

Karsten
-- 
GPG key ID E4071346 @ wwwkeys.pgp.net
E167 67FD A291 2BEA 73BD  4537 78B9 A9F9 E407 1346

Reply via email to