Hi Joachim,

a first run has created the language model for german with 7 gb.
the machine is becoming very slow and may be swapping...
but i have used an outdated index_db.sh and an outdated(?) gittree.
(after it has been installed i could no longer build pignlproc, it was
complaining of missing classes)

git clone --depth 1 https://github.com/jodaiber/dbpedia-spotlight.git

changed the index/pom.xml there and called the launcher because i have
thought there is a memory issue when launching etc...

                         <launcher>
                            <id>CreateSpotlightModel</id>
                           
<mainClass>org.dbpedia.spotlight.db.CreateSpotlightModel</mainClass>
                            <jvmArgs>
                                <jvmArg>-Xmx7g</jvmArg>
                            </jvmArgs>
                            <!--args>
                                <arg>../conf/indexing-3.8.properties</arg>
                            </args-->
                        </launcher>

i have had 1.3-1.4 mio resources to index.

i have now synced to the officially dbpedia spotlight trunk and i will
redo all steps.
the script index_db.sh has changed a lot compared to the old one i have
used.
i think i have got it from

https://raw.github.com/jodaiber/dbpedia-spotlight/master/bin/index_db.sh

which is linked here
https://github.com/dbpedia-spotlight/dbpedia-spotlight/wiki/Internationalization-%28DB-backed-core%29

in the chapter "The detailed way"

/This section describes the detailed steps for creating the Spotlight
model. These steps are all performed by index_db.sh
<https://raw.github.com/jodaiber/dbpedia-spotlight/master/bin/index_db.sh>.

/best regards
reinhard

Am 19.05.2013 12:03, schrieb Joachim Daiber:
> Hey Reinhard,
>
> I have usually run English with 24G, so German should probably fit in
> 12-14G. However, with 8G you will probably not have much chance to
> run CreateSpotlightModel for anything but the smallest languages.
>
> Best,
> Jo
>
>
> On Sun, May 19, 2013 at 10:21 AM, reinhard schwab
> <[email protected] <mailto:[email protected]>> wrote:
>
>     I have just discovered a maven setting at the begin of index_db.sh
>
>     
> https://github.com/dbpedia-spotlight/dbpedia-spotlight/blob/master/bin/index_db.sh
>
>     export MVN_OPTS="-Xmx26G"
>
>     26G is needed?
>     my notebook only has 8 G.
>
>     I am using another index_db.sh, not the one in the master git tree,
>     there is no such setting.
>
>     regards
>     reinhard
>
>     Am 19.05.2013 09:52, schrieb reinhard schwab:
>     > hi,
>     >
>     > i try to build the a language model using index_db.sh and i am
>     at the
>     > last step.
>     >
>     > mvn -pl index exec:java
>     > -Dexec.mainClass=org.dbpedia.spotlight.db.CreateSpotlightModel
>     > -Dexec.args="$2 $WDIR $TARGET_DIR $opennlp $STOPWORDS $4Stemmer";
>     >
>     > it has failed with an OutOfMemoryError.
>     > how much java heap is needed or what are the memory setting you have
>     > used to build the language model for german?
>     >
>     > best regards
>     > reinhard
>     >
>     >
>     
> ------------------------------------------------------------------------------
>     > AlienVault Unified Security Management (USM) platform delivers
>     complete
>     > security visibility with the essential security capabilities.
>     Easily and
>     > efficiently configure, manage, and operate all of your security
>     controls
>     > from a single console and one unified framework. Download a free
>     trial.
>     > http://p.sf.net/sfu/alienvault_d2d
>     > _______________________________________________
>     > Dbp-spotlight-users mailing list
>     > [email protected]
>     <mailto:[email protected]>
>     > https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users
>     >
>     >
>
>
>     
> ------------------------------------------------------------------------------
>     AlienVault Unified Security Management (USM) platform delivers
>     complete
>     security visibility with the essential security capabilities.
>     Easily and
>     efficiently configure, manage, and operate all of your security
>     controls
>     from a single console and one unified framework. Download a free
>     trial.
>     http://p.sf.net/sfu/alienvault_d2d
>     _______________________________________________
>     Dbp-spotlight-users mailing list
>     [email protected]
>     <mailto:[email protected]>
>     https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users
>
>

------------------------------------------------------------------------------
AlienVault Unified Security Management (USM) platform delivers complete
security visibility with the essential security capabilities. Easily and
efficiently configure, manage, and operate all of your security controls
from a single console and one unified framework. Download a free trial.
http://p.sf.net/sfu/alienvault_d2d
_______________________________________________
Dbp-spotlight-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users

Reply via email to