On Thu, Nov 15, 2012 at 12:16 AM, Mauro Dragoni <[email protected]> wrote:
> how much memory I need for running the service with the full set of resources?

About 24G for loading the context index, candidate index and spotting
dictionary into memory, which will give you the best time performance.

> is it possible to choose, easily, only a subset of resources?

There was some work done in this direction. I'm not sure if it is
integrated yet.

What exists for sure, and is useful for the extraction process, is a
file that contains regular expressions that filter out bad URIs (for
example List_of_.*). Property org.dbpedia.spotlight.data.badURIs.* in
indexing.properties.
Another hacky way is to interrupt the ExtractCandidateMap process
after it finished creating the conceptURIs file. You can then adjust
this file to only include URIs that you want and then continue with
the indexing.

Cheers,
Max

------------------------------------------------------------------------------
Monitor your physical, virtual and cloud infrastructure from a single
web console. Get in-depth insight into apps, servers, databases, vmware,
SAP, cloud infrastructure, etc. Download 30-day Free Trial.
Pricing starts from $795 for 25 servers or applications!
http://p.sf.net/sfu/zoho_dev2dev_nov
_______________________________________________
Dbp-spotlight-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users

Reply via email to