Thanks, Michael!

I only recently joined the list, so I missed the earlier posting. I like this example a lot. I was able to get it to run using the document analyzer from the uimaj-example. I have some questions, though:

Is testDict.xml just an arbitrary XML file, meaning that any well-formed XML file should work?

How do I get my own XML dictionary files to work without transforming them into the XML format used in your testDict.xml file?

Is there documentation for this, so that I can understand it on my own without bugging the entire list?

Thanks!
Ahmed

On Tue, Jun 17, 2008 at 8:05 PM, Michael Tanenblatt <[EMAIL PROTECTED]> wrote:
> As Thilo mentioned in an email from May 19, 2008, I forgot to include the
> source for uima.tt.TokenAnnotation, but otherwise the code should be fine.
>
> Additionally, the problem you are seeing is with OffsetTokenizer, which is
> just a sample tokenizer--if you have another, more robust tokenizer, you
> don't need this OffsetTokenizer.
