LASRI YASSINE wrote:

I have a problem and would appreciate any solution or idea:

I have an external file that contains a list of person names, for example:

adam
smith
lary
page
jorge
wolker
bush


and I need to extract all person names from other sources (text
documents),
for example:
 "Lary Page is the creator of google and Adam Smith is an economist and
jorge wolker bush is the ...."
The annotator should extract <Adam Smith> and <Lary Page> as person names,
and also <Jorge Wolker Bush>.
I have created two annotators:

PersonAnnotator, which records person names from the source
SpacePonctuationAnnotator

I want to have a config file where I will put some rules to extract person
names, such as:

rule 1 : [person]
rule 2 : [person][space][person]
rule 3 : [person][space][person][space][person]

What can I do?

Best regards
-Yassine


Are you looking for a way to specify rules for an annotator?
If so, one possible solution is to create an XML or plain-text file with your own rule syntax to specify the rules you would like to have. This rule file can then be set as an external resource for your annotator, so that when the annotator is loaded it can read the rule file, parse the syntax, and process the specified rules.
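
As a rough illustration of the idea, here is a minimal sketch of reading such a rule file and applying it. It is written in plain Python rather than as a UIMA annotator, and it does not use any UIMA API. The function names (parse_rules, tokenize, extract_names), the "#" comment convention, and the choice to try longer rules first are all assumptions made for this example. The rule syntax and sample text are adapted from the question.

```python
import re

# Hypothetical rule file content, using the syntax proposed in the question.
RULES_TEXT = """
rule 1 : [person]
rule 2 : [person][space][person]
rule 3 : [person][space][person][space][person]
"""

def parse_rules(text):
    """Turn lines like 'rule 2 : [person][space][person]' into token-kind lists."""
    rules = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):  # skip blanks and (assumed) comments
            continue
        _, _, pattern = line.partition(":")   # keep the part after the first colon
        kinds = re.findall(r"\[(\w+)\]", pattern)
        if kinds:
            rules.append(kinds)
    # Assumption: longer rules are tried first so the longest name wins.
    rules.sort(key=len, reverse=True)
    return rules

def tokenize(text, names):
    """Split text into (kind, value) pairs: 'person', 'word', 'space' or 'other'."""
    tokens = []
    for m in re.finditer(r"(?P<word>\w+)|(?P<space>\s+)|(?P<other>[^\w\s])", text):
        kind, value = m.lastgroup, m.group()
        if kind == "word" and value.lower() in names:
            kind = "person"                   # word found in the name list
        tokens.append((kind, value))
    return tokens

def extract_names(text, names, rules):
    """Scan left to right and collect every span that matches a rule."""
    tokens = tokenize(text, names)
    found, i = [], 0
    while i < len(tokens):
        for rule in rules:
            window = tokens[i:i + len(rule)]
            if len(window) == len(rule) and all(
                kind == expected for (kind, _), expected in zip(window, rule)
            ):
                found.append("".join(value for _, value in window))
                i += len(rule)                # continue after the matched span
                break
        else:
            i += 1                            # no rule matched at this position
    return found

NAMES = {"adam", "smith", "lary", "page", "jorge", "wolker", "bush"}
TEXT = ("Lary Page is the creator of google and Adam Smith is an economist "
        "and jorge wolker bush is the ....")

print(extract_names(TEXT, NAMES, parse_rules(RULES_TEXT)))
```

In a real annotator, the name list and the rule file would be loaded once when the annotator is initialized, and each match would be recorded as an annotation with begin and end offsets instead of being returned as a string.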

Does this help, or did I misunderstand your question?

-- Michael






