Hi,
My name is Zuozhi Wang and I'm a third year undergraduate student in
University of California, Irvine. DBpedia instantly draws my attention when
I'm browsing through the long list of organizations. I just did some
information extraction experiments using SystemT, a commercial product from
IBM. It's so good to have a chance to contribute to an open source
community on this domain. I just get started on getting familiar with
DBpedia, reading some ideas, and working on some warm-up tasks. Hope it's
not too late, I see many other people already working on them for a while
:)
I want to share a little bit about my current research project with a
professor because I feel it might relate to DBpedia as well. Our program
builds index on the input data, takes a systemT query and filters the input
data based on dictionary and regular expressions. It only feeds the files
that may contain an entry in dictionary/match a regular expression into
further extraction processes. It will speed up the process a lot because
the input after filtering is often very small. The prototype for systemT is
almost done and works really well. And we also have further plans to build
larger text data management system targeting other different information
extraction packages.
I'm also definitely interested in other ideas and I feel really excited
about joining the community, a big hi to all the community members and gsoc
students!
Cheers,
Zuozhi Wang
------------------------------------------------------------------------------
Transform Data into Opportunity.
Accelerate data analysis in your applications with
Intel Data Analytics Acceleration Library.
Click to learn more.
http://pubads.g.doubleclick.net/gampad/clk?id=278785231&iu=/4140
_______________________________________________
Dbpedia-gsoc mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc