Ah.....yeah, it works now. I think I made mistakes in both of the points you mentioned.
Thank you so much for your help! Fangkai On Fri, Jun 4, 2010 at 8:46 AM, Philip Alexiev <[email protected]> wrote: > Hi again > > > I tried with this file and annotated it with no problems. > > Some things to consider: > > * Do your text files have .txt extension ? > * Do you give the populater the file itself as a parameter, or a directory. > It should be a directory. Doesn't work with files. > > All the best, > Philip > > > On 06/04/2010 04:21 PM, Yang Fangkai wrote: > > Sorry I forgot to attach the file > Fangkai > On Fri, Jun 4, 2010 at 8:20 AM, Yang Fangkai <[email protected]> > wrote: > > > Hi, Philip, > Yesterday I found a software that transformed all .txt file > to .html file and all annotation is done. However, this is not a final > solution because in the future I may have pdf or .doc file to > annotate. > I am sure the attached document is not annotated. I checked > it in this way: I have a html file which contains the same content > with the .txt file, and use toolpopulate to annotate both of them, and > I use keyword "Rice University" in entity pattern search (object, > whose name is exactly equal to "Rice Univerisity"), and in the > resuult, I saw the html doc is retrieved, but .txt not. I think this > convinced me that .txt file is not annotated. > Also, from the panel of toolpopulate, it returns the following > message after I chose .txt file to annotate: > Checking (please wait) ... > Check: SUCCESS! > Processing file(s) ... > Completed: 100% ( 1 of 1 files processed ) > Indices optimized ! > -=[ TOTALS ]=- > Directory files: 1 > Start time: Fri Jun 04 08:13:57 CDT 2010 > End time: Fri Jun 04 08:13:57 CDT 2010 > Total time (ms): 47 > -=[ STATISTICS ]=- > Document count: 1 > Document size (kb): 0 > Create time (ms): 0 > Parse features time (ms): 0 > Annotation time (ms): 0 > Store time (ms): 0 > Index sync time (ms): 0 > Index opt time (ms): 0 > ---------------------------------------------------------------- > End Time: Fri Jun 04 08:13:57 CDT 2010 > ---------------------------------------------------------------- > Finished. > From thie message it doesn't look like the file is annotated. > Thank you very much for your help! > Fangkai > On Fri, Jun 4, 2010 at 6:02 AM, Philip Alexiev > <[email protected]> wrote: > > > Hello Fangkai, > Could you send us some of your txt files that you are sure are not > annotated? This could help us a lot in solving the problem. > Thanks, > Philip > On 06/03/2010 08:00 PM, Yang Fangkai wrote: > > > hi, Anton, > I tried HTML files, and the population works. But this just > doesn't work for txt file... > I checked the populator.xml and found the following configuration: > <INPUT_DOC_EXT>doc,htm,html,txt,page,xml</INPUT_DOC_EXT> > I suspect the populator has already been configured to process > txt file. So where is the problem? Thank you! > Fangkai > 2010/6/3 Yang Fangkai<[email protected]>: > > > Anton, > On Thu, Jun 3, 2010 at 10:39 AM, Anton Andreev > <[email protected]> wrote: > > > Hello Fangkai, > First I would like to point out that the kim-discussion: > http://ontotext.com/mailman/listinfo/kim-discussion is dedicated for > asking > technical questions like this one. Next time please use the > kim-discussion > mailing list, not this one. Thanks. > > > Sorry for the mistake. I will use that list the next time. > > > Now back to your problem: > What version of KIM do you use? KIM 2.4? > > > Yes. I am using KIM2.4 under Windows XP. > > > Are you using the KIMGate hybrid - a GATE developer with KIM's default > pipeline or the tool called "populater" again from the bin folder? > > > I started KIM by running startkim.bat, and the populator by running > toolPopulate.cmd in tool folder. I didn't see the tool "populator" in > the bin folder. > > > The later > only needs a document source folder and uses an already running KIM > instance. Do you see that the documents are being annotated? What > results do > you expect, what is missing? > > > Here is what I expect. I have a corpus containing about 2000 docs, and > I want to query over these docs. So I plan to use toolPopulate to > extract entities over these docs (this is what I am trying to do), and > then query over them. I expect to see the entities populated from > these docs, but I didn't see any meaningful entities when I query the > entity from the KIM GUI. > I don't know if the above makes sense. Thank you! > Fangkai > > > The steps you are doing are correct in general. > Best regards, > Anton Andreev > -- > Anton Andreev > Account Manager > Ontotext AD > Tel: +359 2 875 81 17 > Fax:+359 2 975 32 26 > email: [email protected] > www.ontotext.com > On 3.6.2010 г. 18:17 ч., KIM Platform info newsletter wrote: > > > Dear List, > I am trying to use Populate GUI to populate entities from my > own corpus. I have downloaded the raw file of PennTree bank, i.e., the > articles from Wall Street Journal in plain text form, and refer to the > folder in Populate GUI. However, it seems no entities is populated. I > try to add an .xml file with the same name of the text file, but still > doesn't work. (I check that by first deleting all files from > /context/default/populated, and populate entities from a file, and > check the entities by querying the entities at > http://localhost:8080/kim, but no meaningful entities found). I am > wondering if I miss some steps or important configurations. Thank you > very much! > Best, > Fangkai > _______________________________________________ > interested-in-KIM mailing list > [email protected] > http://ontotext.com/mailman/listinfo/interested-in-kim > > > > > -- > Fangkai Yang, Ph.D student > Taylor Hall 3.150A > Department of Computer Sciences > The University of Texas at Austin > Austin, 78712-0233, Texas > USA > http://www.cs.utexas.edu/~fkyang > email: [email protected] > > > > > -- > Philip Alexiev<[email protected]> > Software Engineer > Ontotext AD > > > -- > Fangkai Yang, Ph.D student > Taylor Hall 3.150A > Department of Computer Sciences > The University of Texas at Austin > Austin, 78712-0233, Texas > USA > http://www.cs.utexas.edu/~fkyang > email: [email protected] > > > > > _______________________________________________ > Kim-discussion mailing list > [email protected] > http://ontotext.com/mailman/listinfo/kim-discussion > > > -- > Philip Alexiev <[email protected]> > Software Engineer > Ontotext AD -- Fangkai Yang, Ph.D student Taylor Hall 3.150A Department of Computer Sciences The University of Texas at Austin Austin, 78712-0233, Texas USA http://www.cs.utexas.edu/~fkyang email: [email protected] _______________________________________________ Kim-discussion mailing list [email protected] http://ontotext.com/mailman/listinfo/kim-discussion
