Does this mean that if I ran lmplz on a tagged corpus (with this format: surface form|POS), the program will automatically generate Part of Speech Language Model?
Or ran lmplz on a parallel corpus of POS tags (parallel to sentences where the tags were generated)? Thank you for your reply. On Sat, Oct 28, 2017 at 10:48 PM, Kenneth Heafield <[email protected]> wrote: > Hi, > > You convert the words to part of speech using an external tagger > (lmplz > does not include POS detection). Then you'll probably need to run lmplz > --discount_fallback because the vocabulary is small. > > Kenneth > > On 10/28/2017 02:06 AM, Aileen Joan Vicente wrote: > > Hi! I am learning Factored Training and the tutorial suggests building a > > part-of-speech language model. I have already tried building one on and > > english training sentences and I wonder if there is an option in lmplz > > to direct the program to look at the sentence's pos tags. I've been > > googling for two days and I haven't found the answer yet. > > > > Thank you for your response. > > > > Best, > > > > Aileen Joan Vicente > > UP Cebu Philippines > > > > > > _______________________________________________ > > Moses-support mailing list > > [email protected] > > http://mailman.mit.edu/mailman/listinfo/moses-support > > > _______________________________________________ > Moses-support mailing list > [email protected] > http://mailman.mit.edu/mailman/listinfo/moses-support >
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
