Hi Max,
Sorry if I didn't make myself clear. Do you mean I can implement the task
in whatever way works and improve the design later?
Best regards,
Zhiwei
On Sun, Apr 21, 2013 at 10:49 PM, Cai Zhiwei <[email protected]> wrote:
> Hi Max,
>
> I've already read the code, but it seems to deal with files that are
> generated by pigproc after preprocessing. Should I try to preprocess the
> wiki-links file and generate files like the ones pigproc generates?
>
>
> Best regards,
> Zhiwei
>
>
> On Sun, Apr 21, 2013 at 4:40 PM, Max Jakob <[email protected]> wrote:
>
>> Hi Zhiwei,
>>
>> On Sat, Apr 20, 2013 at 1:51 PM, Cai Zhiwei <[email protected]>
>> wrote:
>> > Good news! We don't have to crawl pages anymore. I contacted Sameer,
>> > as Max suggested, and he told me that their code for handling the
>> > Google corpus and the expanded corpus with context has been released here.
>>
>> That's great! Did you already have a look at CreateSpotlightModel [1]
>> and at the *Source classes [2] like Jo suggested? Do you want to give
>> it a shot to implement additional methods to extract the respective
>> data utilizing the code from wiki-links? If, in the process, you think
>> of a good way to introduce interfaces (traits) here, you are strongly
>> encouraged to discuss it with us, but that has lower priority than the
>> actual implementations.
>>
>> Cheers,
>> Max
>>
>> [1]
>> https://github.com/dbpedia-spotlight/dbpedia-spotlight/blob/master/index/src/main/scala/org/dbpedia/spotlight/db/CreateSpotlightModel.scala
>> [2]
>> https://github.com/dbpedia-spotlight/dbpedia-spotlight/tree/master/index/src/main/scala/org/dbpedia/spotlight/db/io
>>
>>
>> On Tue, Apr 16, 2013 at 9:57 AM, Joachim Daiber
>> <[email protected]> wrote:
>> > have a look at CreateSpotlightModel. You would only have to implement
>> > the *Source objects for this corpus. While we are at this, making those
>> > interfaces more general is of course something we would like to see
>> > eventually (e.g. not having Scala objects but classes with different
>> > implementations).
>>
>
>
_______________________________________________
Dbpedia-gsoc mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc