On Nov 17, 2010, at 12:10 PM, Ethan Merritt wrote:
> merritt [36] which in_cranio
>             in_cranio:       aliased to pdftotext -layout
> 
> merritt [37] in_cranio pro200700038_s.pdf

This has just taken the data from one visual format to another purely 
text-based visual format. You still have to split it into text files manually 
and then import the data manually. Table S3 is split, so it will have to be 
imported to the spreadsheet program in three steps and then merged. We have to 
be careful when we split "Best Ion Score", "Best Ion C.I. %", and "Coverage". 
Those headings end up with only a single space between them, so we can't use 
runs of whitespace as the column delimiter.
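
To make the spacing problem concrete, here is a minimal Python sketch. The 
header and data line are made-up stand-ins for pdftotext -layout output, not 
the actual contents of Table S3, and the helper names are hypothetical. It 
assumes the headings are known in advance and that every value sits under its 
own heading, which the real output does not guarantee.

```python
import re

# Hypothetical lines in the style of "pdftotext -layout" output.
# These are NOT taken from Table S3; the values are invented.
header = "Protein    Best Ion Score Best Ion C.I. % Coverage"
row    = "P12345     87             99.982          34"

# Headings we expect to find, in left-to-right order (assumed known).
names = ["Protein", "Best Ion Score", "Best Ion C.I. %", "Coverage"]

# Splitting on runs of two or more spaces fuses the last three headings,
# because only a single space separates them.
naive = re.split(r"\s{2,}", header)
print(len(naive))  # 2 fields instead of 4


def column_starts(header, names):
    """Return the character offset of each known heading, left to right."""
    starts, pos = [], 0
    for name in names:
        pos = header.index(name, pos)  # ValueError if a heading is missing
        starts.append(pos)
        pos += len(name)
    return starts


def split_row(line, starts):
    """Slice a line at the heading offsets and trim each field."""
    ends = starts[1:] + [None]
    return [line[s:e].strip() for s, e in zip(starts, ends)]


starts = column_starts(header, names)
record = dict(zip(names, split_row(row, starts)))
print(record)
```

Even when this works, every field is still a string, and any value that 
drifts left of its heading is silently cut in the wrong place, so the import 
still has to be validated by hand.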

Try this: start your stopwatch, begin to convert all of these tables to 
spreadsheets or your favorite database format, validate that the import was 
correct and that the data types are what you expect, clean up extraneous 
information like "(page n/N)", save all the files, stop your stopwatch, and 
then tell us how long it took. That is the real task, not simply reformatting 
the data to pure text.

If you think that this is an unreasonable request, then you are starting to get 
my point.

James

