Hi Jenny

Thanks for your interest in Stanbol. I will try to give you some more
information
on the topic you are interested in. See my comments inline.

On Tue, Mar 4, 2014 at 3:35 PM, Zhu Qiuxiang <jenny.qiuxi...@gmail.com> wrote:
> Hi all,
>
> My full name is Qiuxiang Zhu (you can call me Jenny for short),  who is a
> Chinese student interested in participating GSoC 2014. In recent years,
> I've been working on semantic web related projects, most of which are small
> student projects funded by my university, with a big project of finance
> knowledge base (RDF/OWL) development from my tutor. I'm quite experienced
> with RDF, Topbraid Composer, Jena, SPARQL and Linked Data.
>
> Apache Stabol attracts me because it adopts semantic technologies for
> content management, especially the Enhancer component to process semantic
> data in a chain. I've read the related documents [1]. I can also understand
> the source code of the Tika Engine [2].
>
> In GSoC 2014, I'd like to work on a similar engine of "Face Detection
> Engine based on OpenIMAJ" [3] (STANBOL-1006) which also deals with Blobs.
> Could you please tell me more about the details of the project? Here're my
> questions:
>
> 1) The input of the Face Detection Engine can be a ContentItem containing
> the original images. Are the extracted face images registered with
> predefined URIs as Content parts (Blobs) in the ContentItem?

In Stanbol Content is accessible as Blobs. The Blob provides the Content-Type
and an InputStream to read the data. Both images and videos are possible
inputs for a Face Detection Engine

>
> 2) What metadata can be enhanced for Face Detection Engine? Are there any
> Face Detection related ontologies to be reused?

Extending the Stanbol Enhancement Structure for Image and Video Annotations
is covered by STANBOL-1005. There are several existing ontologies and even
Recommendations like MediaFragments [1], the ontology for Media Resources [2]
that should be considered.


[1] http://www.w3.org/TR/media-frags/
[2] http://www.w3.org/TR/mediaont-10/

>
> 3) How to deal with videos? It looks like that we should firstly (1)
> extract images/frames from the videos, and then (2) apply Face Detection
> Engine for face recognition. Shall we separate (1) from (2), to make a
> Video Frame Extraction Engine?

AFAIK OpenIMAJ provides all the required functionality. Separation of
functionality in different engines is a good thing as it allows users more
flexibility of configuring chains.

I see a lot of possible engines

* scene detection: to set detected faces in a context. This could be
used to group different faces found within the same scene)
* face detection. Video segments showing a face could be marked with
MediaFragments URIs so the clients can easily play back the annotated
section in the browser.
* extraction of images showing detected faces. This would be nice for
Clients as they can easily show detected faces to users


best
Rupert

>
>
> Yours truly,
> Jenny
>
>
>
>
> [1] http://stanbol.apache.org/docs/trunk/components/enhancer/
> [2]
> https://svn.apache.org/repos/asf/stanbol/trunk/enhancement-engines/tika/src/main/java/org/apache/stanbol/enhancer/engines/tika/TikaEngine.java
> [3] https://issues.apache.org/jira/browse/STANBOL-1006



-- 
| Rupert Westenthaler             rupert.westentha...@gmail.com
| Bodenlehenstraße 11                             ++43-699-11108907
| A-5500 Bischofshofen

Reply via email to