Re: Surronding tokens of the entity on MaxEnt models

2016-05-02 Thread Russ, Daniel (NIH/CIT) [E]
Of course you can use regex patterns, but it gets pretty complicated. See https://web.stanford.edu/class/cs124/lec/Maximum_Entropy_Classifiers.pdf, where Christopher Manning uses the example of a word that ends in “c” as a feature for the class drug. That could be a regex feature. You could also ha
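A regex feature of the kind described above could be sketched roughly as follows. This is only an illustration in plain Python, not OpenNLP code: the function name `regex_features`, the feature names, and the two extra patterns (`init_cap`, `has_digit`) are assumptions added for the example; only the "ends in c" pattern comes from the message.

```python
import re

# Hypothetical feature names mapped to regex patterns.
# Only "ends_in_c" comes from the thread; the others are illustrative assumptions.
PATTERNS = {
    "ends_in_c": re.compile(r"c$"),
    "init_cap": re.compile(r"^[A-Z]"),
    "has_digit": re.compile(r"\d"),
}


def regex_features(token):
    """Return the names of all regex features that fire for this token."""
    return [name for name, pattern in PATTERNS.items() if pattern.search(token)]


if __name__ == "__main__":
    for tok in ["Zantac", "aspirin", "B12"]:
        print(tok, regex_features(tok))
```

Each returned name would act as a binary feature for the classifier: it is either present for a token or not, and the model learns a weight for it per class.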

Re: Surronding tokens of the entity on MaxEnt models

2016-05-02 Thread Damiano Porta
Hi Daniel! Thank you so much! Unfortunately, I am not sure. I really do not know what the best way is in this case. I have a dataset with patterns like:
    my name is {name}, from {location}
    name: {name}
    full name: {name}
    I am {name}, I was born in {location}
    etc.
I could use regexes too. M

Re: Surronding tokens of the entity on MaxEnt models

2016-05-02 Thread Russ, Daniel (NIH/CIT) [E]
Hi Damiano, why are you so sure that your model will not work? A couple of things to remember:
1. You need quite a bit of training data. Two sentences do not make a training set.
2. You probably need more than a window of words as your features. However, you can see that word-2=“name
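To make the window-of-words idea concrete, here is a minimal sketch in plain Python of how features such as word-2=“name” might be generated. It is not the OpenNLP feature generator: the function name `window_features`, the `<pad>` placeholder, the lowercasing, and the exact feature string format are all assumptions made for illustration.

```python
def window_features(tokens, index, size=2):
    """Build positional word features around tokens[index].

    Positions that fall outside the sentence are filled with a
    hypothetical "<pad>" placeholder.
    """
    features = []
    for offset in range(-size, size + 1):
        pos = index + offset
        if 0 <= pos < len(tokens):
            word = tokens[pos].lower()
        else:
            word = "<pad>"
        # The "+d" format keeps the sign, e.g. word-2, word+0, word+1.
        features.append(f"word{offset:+d}={word}")
    return features


if __name__ == "__main__":
    sentence = ["my", "name", "is", "Damiano"]
    # Features for the token at index 3, the candidate name.
    print(window_features(sentence, 3))
```

For the sentence "my name is Damiano", the token at index 3 gets word-2=name and word-1=is among its features, which is the kind of contextual cue a MaxEnt model can learn to associate with the name class given enough training examples.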