Hi all, My full name is Qiuxiang Zhu (you can call me Jenny for short), who is a Chinese student interested in participating GSoC 2014. In recent years, I've been working on semantic web related projects, most of which are small student projects funded by my university, with a big project of finance knowledge base (RDF/OWL) development from my tutor. I'm quite experienced with RDF, Topbraid Composer, Jena, SPARQL and Linked Data.
Apache Stabol attracts me because it adopts semantic technologies for content management, especially the Enhancer component to process semantic data in a chain. I've read the related documents [1]. I can also understand the source code of the Tika Engine [2]. In GSoC 2014, I'd like to work on a similar engine of "Face Detection Engine based on OpenIMAJ" [3] (STANBOL-1006) which also deals with Blobs. Could you please tell me more about the details of the project? Here're my questions: 1) The input of the Face Detection Engine can be a ContentItem containing the original images. Are the extracted face images registered with predefined URIs as Content parts (Blobs) in the ContentItem? 2) What metadata can be enhanced for Face Detection Engine? Are there any Face Detection related ontologies to be reused? 3) How to deal with videos? It looks like that we should firstly (1) extract images/frames from the videos, and then (2) apply Face Detection Engine for face recognition. Shall we separate (1) from (2), to make a Video Frame Extraction Engine? Yours truly, Jenny [1] http://stanbol.apache.org/docs/trunk/components/enhancer/ [2] https://svn.apache.org/repos/asf/stanbol/trunk/enhancement-engines/tika/src/main/java/org/apache/stanbol/enhancer/engines/tika/TikaEngine.java [3] https://issues.apache.org/jira/browse/STANBOL-1006