Hi Zhiwei, On Sat, Apr 20, 2013 at 1:51 PM, Cai Zhiwei <[email protected]> wrote: > Good news!We don't have to crawl pages anymore.I tried to contact Sameer > according to Max's suggestion and he told me that their code to deal with > google corpus and expanded corpus with context had released here.
That's great! Did you already have a look at CreateSpotlightModel [1] and at the *Source classes [2] like Jo suggested? Do you want to give it a shot to implement additional methods to extract the respective data utilizing the code from wiki-links? If, in the process, you think of a good way to introduce interfaces (traits) here, you are strongly encouraged to discuss it with us, but has lower priority than the actual implementations. Cheers, Max [1] https://github.com/dbpedia-spotlight/dbpedia-spotlight/blob/master/index/src/main/scala/org/dbpedia/spotlight/db/CreateSpotlightModel.scala [2] https://github.com/dbpedia-spotlight/dbpedia-spotlight/tree/master/index/src/main/scala/org/dbpedia/spotlight/db/io On Tue, Apr 16, 2013 at 9:57 AM, Joachim Daiber <[email protected]> wrote: > have a look at > CreateSpotlightModel. You would only have to implement the *Source objects > for this corpus. While we are at this, making those interfaces more general > is of course something we would like to see eventually (e.g. not having > Scala objects but classes with different implementations). ------------------------------------------------------------------------------ Precog is a next-generation analytics platform capable of advanced analytics on semi-structured data. The platform includes APIs for building apps and a phenomenal toolset for data science. Developers can use our toolset for easy data analysis & visualization. Get a free account! http://www2.precog.com/precogplatform/slashdotnewsletter _______________________________________________ Dbpedia-gsoc mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc
