Hi Daniel,

Previous publications suggest features are more important than learning
methods. Before last year, the trend seemed to go towards CRFs, nowadays,
it goes towards deep learning (LSTM, CNN, RNN, etc. and so on).

However, if we do a very quick review of English results for CoNLL 2003
(the most used dataset for NER), published results show that it is possible
to get as good results in NER using Perceptron as those using other
methods. With the additional benefit of training in a few minutes at most
(the OpenNLP implementation at least) and being very easy to configure.

1. Ratinov and Roth (2009): 90.57 F1 Perceptron
2. Passos et al. (2009): 90.90 F1 CRF
3. Chiu and Nichols (2015): 90.77 LSTM-CNN
4. Lample et al. (2016): 90.94 LSTM
5. Luo et al. (2016): 91.20 Extended Semi-CRF
6. Agerri and Rigau (2016): 91.36 Perceptron
7. Yang et al (2016: 91.20 F1 RNN
8. Ma and Hovy (2016): 91.21 LSTM-CNN-CRF

Having said that, +1 to adding CRF and deep learning algorithms in the
future too :)

R


On Tue, Feb 7, 2017 at 3:42 PM, Russ, Daniel (NIH/CIT) [E] <
dr...@mail.nih.gov> wrote:

> Hi Jörn,
>
>
>
>    I think the best entity recognition systems use CRF’s.  At some point
> we might want to consider adding them.  As you know, ME classifiers suffer
> from label bias problem (see Lafferty et. al<http://repository.upenn.
> edu/cgi/viewcontent.cgi?article=1162&context=cis_papers>.) CRF’s deal
> with that issue.  I believe that perceptrons suffer from the same problem.
> If you think the results are better, I have no problem.  I think that our
> long-term goal should be to add a CRF, and make it the default for the
> NameFinder.
>
>
>
> Daniel
>
>
>
>
>
> On 2/6/17, 12:40 PM, "Joern Kottmann" <kottm...@gmail.com> wrote:
>
>
>
>     Hello all,
>
>
>
>     I would like to propose to switch the default training algorithm from
>
>     maxent gis to perceptron for the Name Finder. In all the data sets I
>
>     tried perceptron performs better than maxent gis and I believe that
>
>     would be a much more sensible default.
>
>
>
>     A user can always override the default by providing the algorithms
>
>     parameter for training.
>
>
>
>     What do you think?
>
>
>
>     Jörn
>
>
>
>
>

Reply via email to