The only requirement is that each sentence be on a separate line in the
training file.
Don't put anything other than sentences in the training file.
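
For illustration, here is a minimal Python sketch of preparing such a file. It is not part of any toolkit: the helper name `write_training_file`, the file name `my-sent.train`, and the sample sentences are all made up for this example. It only enforces the rule above: one sentence per line, nothing else.

```python
import os
import tempfile


def write_training_file(sentences, path):
    """Write one sentence per line to `path` and return the lines written.

    Hypothetical helper for illustration only.
    """
    lines = []
    for sentence in sentences:
        # Collapse line breaks and runs of whitespace so that a single
        # sentence can never spill over onto a second line.
        flat = " ".join(sentence.split())
        # Skip empty entries so that only sentences end up in the file.
        if flat:
            lines.append(flat)
    with open(path, "w", encoding="utf-8") as handle:
        handle.write("\n".join(lines) + "\n")
    return lines


# Made-up sample data; replace with sentences in the target language.
sample = [
    "This is the first sentence.",
    "Here is a second one,\nwrapped across two lines in the source.",
    "   ",
    "Is this the third sentence?",
]

# Hypothetical output location and file name.
train_path = os.path.join(tempfile.gettempdir(), "my-sent.train")
written = write_training_file(sample, train_path)
```

The file is written as UTF-8 here on the assumption that the training tool will be told to read it with the same encoding.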
On 2/8/2013 12:46 PM, Surendra wrote:
Hi,
I am a postgraduate student in computer science. I am working on sentence
boundary detection for a local Indian language. Could you please provide me
with the format of the training file and a sample file like en-sent.train,
which would be helpful for me in creating a model?
regards,
Surendra H
R. V. College of Engineering, Bangalore, India