Re: [KIM-discussion] Fwd: Getting title, author, etc....

borislav popov Mon, 20 Aug 2007 05:49:07 -0700

Hi Alvaro,
KIM does not do any kind of extraction of features on the document level, except for the one you've mentioned. The way this is supposed to work is to provide this kind of metadata additionally. There are two ways to do that: by setting document level features to the kim document; by providing a file with this metadata in XML and it will be automatically assigned to the document. The second option works with the population tool - part of the installation.
The place to find the format of this file is "your kim installation path"/doc/sys-doc/kim-platform-administration/populate-doc-store.html
or through navigating the system documentation.
Somehow our focus never was on finding the document-level features, just because KIM has been designed as a quite general platform for semantic annotation, instead we have focused on finding entities in the content.
On the other hand, a custom IE application can be created if you need this functionality, especially if your input files are with a predictable structure - this way you will be able to automatically generate the features you would need for further processing.
all the best
borislav

Alvaro Hernandez wrote:

Thank you Borislav, but I have problems with this.

I tried to do it, but without success. From where does Kim get this features (title, author, subject, etc..) ?, is it automatically ??

This is the output:

KIM Server connected.

Todo obtained successfully.

Document created from URL and annotated.

title null

[ Document's Features (begin) ]

[key: gate.SourceURL] [feature: file:/kim-platform-1.7.12.15/context/default/documentos/Grupos de Trabajo.htm]

[key: KeyEntities] [feature: ]

[key: MimeType] [feature: text/html]

[ Document's Features (end) ]

I attached my source. The document I used for the example has a tag 'TITLE', does Kim get the title from this tag or from where??

Thank you a lot,

Alvaro

borislav popov <[EMAIL PROTECTED]> wrote:
Dear Alvaro,
in the system documentation available in your installation you find out about different ways of searching and accessing documents.
For the specific issue please look at the JavaDoc of the API and more specifically into the KIMDocument interface documentation.
There you would find predefined constants for the most common feature types on the document level.
The map of features itself is accessible through the getFeatures() method.
all the best
borislav

Alvaro Hernandez wrote:
Any help, advice, tip???

Thank you.

Note: forwarded message attached.

Be a better Heartthrob. Get better relationship answers from someone who knows.
Yahoo! Answers - Check it out.

Subject:
Getting title, author, etc....

From:
Alvaro Hernandez <[EMAIL PROTECTED]>

Date:
Fri, 17 Aug 2007 06:26:17 -0700 (PDT)

To:
[email protected]

To:
[email protected]

Hi everybody,

I want to get features about a document, like title, author, etc...,

Is there any way using the API ???

Thanks a lot,

Alvaro

Need a vacation? Get great deals to amazing places on Yahoo! Travel.
  _______________________________________________  NOTE: Please REPLY TO ALL to ensure that your reply reaches all members of this mailing list.    KIM-discussion mailing list  [email protected]  http://ontotext.com/mailman/listinfo/kim-discussion_ontotext.com    
  No virus found in this incoming message.  Checked by AVG Free Edition.   Version: 7.5.484 / Virus Database: 269.12.0/961 - Release Date: 8/19/2007 7:27 AM   
 
Fussy? Opinionated? Impossible to please? Perfect. Join Yahoo!'s user panel and lay it on us.
No virus found in this incoming message.
Checked by AVG Free Edition. 
Version: 7.5.484 / Virus Database: 269.12.0/961 - Release Date: 8/19/2007 7:27 AM

_______________________________________________
NOTE: Please REPLY TO ALL to ensure that your reply reaches all members of this 
mailing list.

KIM-discussion mailing list
[email protected]
http://ontotext.com/mailman/listinfo/kim-discussion_ontotext.com

Re: [KIM-discussion] Fwd: Getting title, author, etc....

Reply via email to