I found the stopword files from the Carrot2 project [1] to be of great
help. Stopword files for all major languages are listed at the bottom
of that page.

Cheers
David

[1] http://download.carrot2.org/head/classes/


On 23 February 2012 23:17, Pablo Mendes <[email protected]> wrote:
>
> Oh, nice! Very helpful community. :)
>
> Daniel,
> Would you be able to share the patches? I'd like to make sure they get fixed
> for 0.6 if they haven't been shipped in 0.5.
>
> Also, do you have a copy of stopwords.de.txt that we can put up for download
> on our site? I have already shared your blacklistedURIPatterns and
> acknowledged your contribution:
> http://spotlight.dbpedia.org/download/README.txt
>
> By the way, do we have a running German endpoint?
>
> Cheers,
> Pablo
>
> On Thu, Feb 23, 2012 at 1:35 PM, Jimmy O'Regan <[email protected]> wrote:
>>
>> On 23 February 2012 11:45, Gerber Daniel
>> <[email protected]> wrote:
>> > Hi Reinhard,
>> > I've created the German surface form file myself. You can download it
>> > from here [1]. Please note that this file is probably 2-3 months old. If
>> > you want to create it on your own, you would need to install the DBpedia
>> > extraction framework, which is pretty easy, and make sure that it creates
>> > the redirects/disambiguations/labels files which are needed by Spotlight
>> > (I think only the redirects file is not downloaded from [2]).
>>
>> The redirects file is provided by de.dbpedia:
>> http://de.dbpedia.org/datens%C3%A4tze
>>

_______________________________________________
Dbp-spotlight-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users
