I have a 5.9-million line logfile that starts with dates of the format "Jan
23 14:15:16". I am converting these to DateTime via
mkdt(dts::AbstractString) = DateTime(dts, "uuu dd HH:MM:SS") + Dates.Year(
2014)
and calling mkdt via
words = split(l)
dt = mkdt(join(words[1:3]," "))
Processing the file using DateTime takes an exceedingly long time (15
minutes) vs storing the dates as a string (just keeping words[1:3] - 2.5
minutes). A @profile on a 100k line sample file shows most of the time
(9480 / 10251 samples) in the mkdt call above.
Is there anything I can do to speed this up or is it just a given that
creating DateTime types will be the slowest part of this processing?