All,
I've implemented skipping columns using NULL in colClasses, and
logicals are now also read. read.csv reads
"T","F","TRUE","FALSE","True" and "False" as type logical, so I've
followed suit. But I'm wondering about the single letters "T" and
"F". To illustrate, the following might be confusing :
> fread("A,B,C\nD,E,F\n")
A B C
1: D E FALSE
> fread("A,B,C\nD,E,F\nG,H,I\n")
A B C
1: D E F
2: G H I
>
Should fread treat "T" and "F" as logical? Should it read a column of
only 0's and 1's as logical, too? I think I'd prefer that as it's quite
common.
I'm also thinking of increasing the number of rows used for type
detection to the top 500, middle 500 and bottom 500, since that's a
very small extra cost to save the relatively much larger cost of mid
read column bumps. As a parameter, with 500 by default.
Matthew
_______________________________________________
datatable-help mailing list
[email protected]
https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help