Hi Max,

I've already read the code, but it seems to deal with files that are
generated by pigproc after preprocessing. Should I try to preprocess the
wiki-links files and generate files like the ones pigproc produces?


Best regards,
Zhiwei


On Sun, Apr 21, 2013 at 4:40 PM, Max Jakob <[email protected]> wrote:

> Hi Zhiwei,
>
> On Sat, Apr 20, 2013 at 1:51 PM, Cai Zhiwei <[email protected]> wrote:
> > Good news! We don't have to crawl pages anymore. I tried to contact Sameer
> > as Max suggested, and he told me that their code for handling the
> > Google corpus and the expanded corpus with context has been released here.
>
> That's great! Did you already have a look at CreateSpotlightModel [1]
> and at the *Source classes [2] like Jo suggested? Do you want to give
> it a shot to implement additional methods to extract the respective
> data utilizing the code from wiki-links? If, in the process, you think
> of a good way to introduce interfaces (traits) here, you are strongly
> encouraged to discuss it with us, but that has lower priority than the
> actual implementations.
>
> Cheers,
> Max
>
> [1]
> https://github.com/dbpedia-spotlight/dbpedia-spotlight/blob/master/index/src/main/scala/org/dbpedia/spotlight/db/CreateSpotlightModel.scala
> [2]
> https://github.com/dbpedia-spotlight/dbpedia-spotlight/tree/master/index/src/main/scala/org/dbpedia/spotlight/db/io
>
>
> On Tue, Apr 16, 2013 at 9:57 AM, Joachim Daiber
> <[email protected]> wrote:
> > have a look at CreateSpotlightModel. You would only have to implement
> > the *Source objects for this corpus. While we are at this, making those
> > interfaces more general is of course something we would like to see
> > eventually (e.g. not having Scala objects but classes with different
> > implementations).
>
_______________________________________________
Dbpedia-gsoc mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc
