On Mon, 21 Mar 2011, Daniel Hedlund wrote:

> There's a really cool open source project for managing just this kind of
> problem (cleaning up data) called "Google Refine" (they bought Freebase
> and released refine as open source). The introduction videos act as great
> tutorial. http://code.google.com/p/google-refine/
Daniel,

Thanks. I'll take a look. I'll probably have to write individual Python
scripts for each page exported as a .csv (or .txt) file to re-arrange the
data.

Rich
_______________________________________________
PLUG mailing list
[email protected]
http://lists.pdxlinux.org/mailman/listinfo/plug
