training data for sentence detector

Tim Miller Fri, 07 Feb 2014 14:25:15 -0800

James,

We were discussing the sentence detector thing in person here the otherday and Pei had a thought that depending on what sources you were usingfor training the sentence detector, we might be able to do somethingequivalent here in Boston by using SHARP, THYME, MIPACQ data which arelargely from Mayo and probably similar to what you use, then augmentingwith the little bit of MIMIC that I annotated. I don't know how thatcompares size-wise to the dataset that you are using. Is it quite largeor do you think if we use derived data from those other projects will webe good? What do you think of this plan? Anyone else?

Tim

training data for sentence detector

Reply via email to