Please try to also specify spotter and disambiguator params
&spotter=LingPipeSpotter&disambiguator=Document
Cheers,
Pablo
On Fri, Mar 1, 2013 at 12:05 PM, Neil Ireson <[email protected]> wrote:
> All I do is fire off the "Example 1: without type restriction" query from
> the http://dbpedia.org/spotlight/usersmanual page.
>
>
> http://spotlight.dbpedia.org/rest/annotate?text=President%20Obama%20called%20Wednesday%20on%20Congress%20to%20extend%20a%20tax%20break%20for%20students%20included%20in%20last%20year%27s%20economic%20stimulus%20package,%20arguing%20that%20the%20policy%20provides%20more%20generous%20assistance.&confidence=0.2&support=20
>
> Obviously changing the host for my local installation. Unfortunately
> there's no external access to that host as it is running on a local cluster
> node. However I've included the log output below if that helps
>
> N
>
>
> 28-Feb-2013 17:14:44 org.dbpedia.spotlight.model.SpotterConfiguration
> <init>
> INFO: Will load spotters: [LingPipeSpotter, WikiMarkupSpotter].
> 28-Feb-2013 17:14:44
> org.dbpedia.spotlight.model.DisambiguatorConfiguration <init>
> INFO: Will load disambiguators: [Default, Document].
> 28-Feb-2013 17:14:44 org.dbpedia.spotlight.model.SpotlightConfiguration
> <init>
> WARNING: Cannot find stopwords file 'data/stopwords.en.list'. Using
> default Lucene Analyzer StopWords.
> 28-Feb-2013 17:14:44 org.dbpedia.spotlight.model.SpotlightConfiguration
> <init>
> INFO: Core database from Lucene: data/index-withSF-withTypes-compressed
> 28-Feb-2013 17:14:45 org.dbpedia.spotlight.lucene.similarity.TermCache
> setMaxCacheSize
> INFO: Setting the SurrogateCache.maxCacheSize to 15000
> 28-Feb-2013 17:14:45 org.apache.jcs.engine.control.CompositeCacheManager
> getUnconfiguredInstance
> INFO: Instance is null, creating with provided config
> 28-Feb-2013 17:14:45 org.apache.jcs.utils.threadpool.ThreadPoolManager
> loadConfig
> INFO: thread_pool.default PoolConfiguration = useBoundary = [true]
> boundarySize = [2000] maximumPoolSize = [150] minimumPoolSize = [4]
> keepAliveTime = [300000] whenBlockedPolicy = [RUN] startUpSize = [4]
> 28-Feb-2013 17:14:45
> org.apache.jcs.engine.control.CompositeCacheConfigurator setDefaultAuxValues
> INFO: Setting default auxiliaries to
> 28-Feb-2013 17:14:45
> org.apache.jcs.engine.control.CompositeCacheConfigurator
> setDefaultCompositeCacheAttributes
> INFO: setting defaultCompositeCacheAttributes to [ useLateral = true,
> useRemote = true, useDisk = true, maxObjs = 100, maxSpoolPerRun = -1,
> diskUsagePattern = 0 ]
> 28-Feb-2013 17:14:45
> org.apache.jcs.engine.control.CompositeCacheConfigurator
> parseElementAttributes
> INFO: No special ElementAttribute class defined for key
> [jcs.default.elementattributes], using default class.
> 28-Feb-2013 17:14:45
> org.apache.jcs.engine.control.CompositeCacheConfigurator
> setDefaultElementAttributes
> INFO: setting defaultElementAttributes to [ IS_LATERAL = true, IS_SPOOL =
> true, IS_REMOTE = true, IS_ETERNAL = true, MaxLifeSeconds = -1, IdleTime =
> -1, CreateTime = 1362071685981, LastAccessTime = 1362071685981,
> getTimeToLiveSeconds() = -1, createTime = 1362071685981 ]
> 28-Feb-2013 17:14:45
> org.apache.jcs.engine.control.CompositeCacheConfigurator parseRegions
> INFO: Parsed regions []
> 28-Feb-2013 17:14:45
> org.apache.jcs.engine.control.CompositeCacheConfigurator doConfigure
> INFO: Finished configuration in 479 ms.
> 28-Feb-2013 17:14:45
> org.apache.jcs.engine.control.CompositeCacheConfigurator
> parseElementAttributes
> INFO: No special ElementAttribute class defined for key
> [jcs.region.termCache.elementattributes], using default class.
> 28-Feb-2013 17:14:46 org.apache.jcs.engine.memory.lru.LRUMemoryCache
> initialize
> INFO: initialized LRUMemoryCache for termCache
> 28-Feb-2013 17:14:46 org.apache.jcs.engine.control.CompositeCache <init>
> INFO: Constructed cache with name [termCache] and cache attributes [
> useLateral = true, useRemote = true, useDisk = true, maxObjs = 100,
> maxSpoolPerRun = -1, diskUsagePattern = 0 ]
> 28-Feb-2013 17:14:46 org.dbpedia.spotlight.lucene.search.BaseSearcher
> <init>
> INFO: Creating in-memory lucene searcher... (may take 2-3 minutes and
> several GB of RAM)
> 28-Feb-2013 17:26:07 org.dbpedia.spotlight.lucene.search.BaseSearcher
> <init>
> INFO: Using index at:
> org.apache.lucene.store.RAMDirectory@7745c859lockFactory=org.apache.lucene.store.SingleInstanceLockFactory@664883c
> 28-Feb-2013 17:26:09
> org.dbpedia.spotlight.lucene.search.LuceneCandidateSearcher <init>
> INFO: Creating in-memory LuceneCandidateSearcher.
> 28-Feb-2013 17:27:13
> org.dbpedia.spotlight.lucene.search.LuceneCandidateSearcher <init>
> INFO: Opened LuceneCandidateSearcher from
> org.apache.lucene.store.RAMDirectory@469a9b65lockFactory=org.apache.lucene.store.SingleInstanceLockFactory@50502819.
> 28-Feb-2013 17:27:13
> org.dbpedia.spotlight.model.Factory$CandidateSearcher$ fromLuceneIndex
> INFO: CandidateSearcher initiated
> (inMemory=org.apache.lucene.store.MMapDirectory@/mnt/fastdata/ac1nsi/dbpedia/dbpedia-spotlight-quickstart-0.6.5/data/candidateIndexTitRedDis
> lockFactory=org.apache.lucene.store.NativeFSLockFactory@525c7734) from
> true
> 28-Feb-2013 17:27:16 org.dbpedia.spotlight.model.SpotlightFactory <init>
> INFO: Initiating spotters...
> 28-Feb-2013 17:27:51 org.dbpedia.spotlight.spot.lingpipe.LingPipeSpotter
> <init>
> INFO: Initiating LingPipeSpotter ... (Dictionary[String])
> 28-Feb-2013 17:28:16 org.dbpedia.spotlight.spot.lingpipe.LingPipeSpotter
> <init>
> INFO: Done.
> 28-Feb-2013 17:28:16 org.dbpedia.spotlight.spot.ChainedSelector <init>
> INFO: Creating a chained surface form selector with
> List(org.dbpedia.spotlight.spot.ShortSurfaceFormSelector@1816daa9).
> 28-Feb-2013 17:28:16 org.dbpedia.spotlight.model.SpotlightFactory <init>
> INFO: Initiating disambiguators...
> 28-Feb-2013 17:28:16
> org.dbpedia.spotlight.disambiguate.DefaultDisambiguator <init>
> INFO: Initializing disambiguator object ...
> 28-Feb-2013 17:28:16
> org.dbpedia.spotlight.disambiguate.DefaultDisambiguator <init>
> INFO: Done.
> 28-Feb-2013 17:28:16
> org.dbpedia.spotlight.disambiguate.TwoStepDisambiguator <init>
> INFO: Initializing disambiguator object ...
> 28-Feb-2013 17:28:16
> org.dbpedia.spotlight.disambiguate.TwoStepDisambiguator <init>
> INFO: Done.
> 28-Feb-2013 17:28:16
> org.dbpedia.spotlight.disambiguate.DefaultDisambiguator <init>
> INFO: Initializing disambiguator object ...
> 28-Feb-2013 17:28:16
> org.dbpedia.spotlight.disambiguate.DefaultDisambiguator <init>
> INFO: Done.
> 28-Feb-2013 17:28:16 org.dbpedia.spotlight.model.SpotlightFactory <init>
> INFO: Done.
> 28-Feb-2013 17:28:16 org.dbpedia.spotlight.web.rest.Server main
> INFO: Initiated 2 disambiguators.
> 28-Feb-2013 17:28:16 org.dbpedia.spotlight.web.rest.Server main
> INFO: Initiated 3 spotters.
> 28-Feb-2013 17:28:16 com.sun.grizzly.Controller logVersion
> INFO: GRIZZLY0001: Starting Grizzly Framework 1.9.48 - 28/02/13 17:28
> Server started in
> /mnt/fastdata/ac1nsi/dbpedia/dbpedia-spotlight-quickstart-0.6.5 listening
> on http://localhost:2222/rest/
> Could not open browser. java.awt.HeadlessException:
> No X11 DISPLAY variable was set, but this program performed an operation
> which requires it.
> 28-Feb-2013 17:50:35 com.sun.grizzly.http.servlet.ServletContextImpl log
> INFO: null: init
> 28-Feb-2013 17:50:35 com.sun.jersey.api.core.PackagesResourceConfig init
> INFO: Scanning for root resource and provider classes in the packages:
> org.dbpedia.spotlight.web.rest.resources
> 28-Feb-2013 17:50:39 com.sun.jersey.api.core.ScanningResourceConfig
> logClasses
> INFO: Root resource classes found:
> class org.dbpedia.spotlight.web.rest.resources.Feedback
> class org.dbpedia.spotlight.web.rest.resources.Disambiguate
> class org.dbpedia.spotlight.web.rest.resources.Annotate
> class org.dbpedia.spotlight.web.rest.resources.Candidates
> class org.dbpedia.spotlight.web.rest.resources.Spot
> 28-Feb-2013 17:50:39 com.sun.jersey.api.core.ScanningResourceConfig init
> INFO: No provider classes found.
> 28-Feb-2013 17:50:39
> com.sun.jersey.server.impl.application.WebApplicationImpl _initiate
> INFO: Initiating Jersey application, version 'Jersey: 1.10 11/02/2011
> 04:41 PM'
> 28-Feb-2013 17:50:40 com.sun.jersey.api.wadl.config.WadlGeneratorLoader
> loadWadlGenerator
> INFO: Loading wadlGenerator
> org.dbpedia.spotlight.web.rest.wadl.ExternalUriWadlGenerator
> 28-Feb-2013 17:50:41 org.apache.any23.Any23 <init>
> INFO:
> ======================= Configuration Properties =======================
> any23.http.client.max.connections=5
> any23.extraction.metadata.timesize=off
> any23.rdfa.extractor.xslt=rdfa.xslt
> any23.extraction.csv.comment=#
> any23.extraction.head.meta=off
> any23.extraction.csv.field=,
> any23.extraction.rdfa.programmatic=on
> any23.microdata.strict=off
> any23.http.client.timeout=10000
> any23.extraction.metadata.nesting=on
> any23.core.version=0.7.0-incubating
> (tags/any23-0.7.0-incubating/core@r1358077; 2012-07-06 10:41:50+0200)
> any23.http.user.agent.default=Any23-CLI
> any23.extraction.context.uri=?
> any23.extraction.metadata.domain.per.entity=off
> any23.plugin.dirs=./plugins
> any23.microdata.ns.default=http://rdf.data-vocabulary.org/
> ========================================================================
>
> 28-Feb-2013 17:50:41 org.apache.any23.rdf.PopularPrefixes getPrefixes
> INFO: Loading prefixes from /org/apache/any23/prefixes/prefixes.properties
> 28-Feb-2013 17:50:41 org.dbpedia.spotlight.web.rest.SpotlightInterface
> announce
>
> 28-Feb-2013 18:01:33 org.dbpedia.spotlight.web.rest.SpotlightInterface
> announce
> INFO: ******************************** Parameters
> ********************************
> 28-Feb-2013 18:01:33 org.dbpedia.spotlight.web.rest.SpotlightInterface
> announce
> INFO: API: /annotate
> 28-Feb-2013 18:01:33 org.dbpedia.spotlight.web.rest.SpotlightInterface
> announce
> INFO: client ip: 192.168.1.252
> 28-Feb-2013 18:01:33 org.dbpedia.spotlight.web.rest.SpotlightInterface
> announce
> INFO: text: President Obama called Wednesday on Congress to extend a tax
> break for students included in last year's economic stimulus package,
> arguing that the policy provides more generous assistance.
> 28-Feb-2013 18:01:33 org.dbpedia.spotlight.web.rest.SpotlightInterface
> announce
> INFO: text length in chars: 189
> 28-Feb-2013 18:01:33 org.dbpedia.spotlight.web.rest.SpotlightInterface
> announce
> INFO: confidence: 0.2
> 28-Feb-2013 18:01:33 org.dbpedia.spotlight.web.rest.SpotlightInterface
> announce
> INFO: support: 20
> 28-Feb-2013 18:01:33 org.dbpedia.spotlight.web.rest.SpotlightInterface
> announce
> INFO: types:
> 28-Feb-2013 18:01:33 org.dbpedia.spotlight.web.rest.SpotlightInterface
> announce
> INFO: sparqlQuery:
> 28-Feb-2013 18:01:33 org.dbpedia.spotlight.web.rest.SpotlightInterface
> announce
> INFO: policy: false
> 28-Feb-2013 18:01:33 org.dbpedia.spotlight.web.rest.SpotlightInterface
> announce
> INFO: coreferenceResolution: true
> 28-Feb-2013 18:01:33 org.dbpedia.spotlight.web.rest.SpotlightInterface
> announce
> INFO: spotter: Default
> 28-Feb-2013 18:01:33 org.dbpedia.spotlight.web.rest.SpotlightInterface
> announce
> INFO: disambiguator: Default
> 28-Feb-2013 18:01:33 org.dbpedia.spotlight.spot.SpotterWithSelector extract
> INFO: Selecting candidates...
> 28-Feb-2013 18:01:33 org.dbpedia.spotlight.spot.SpotterWithSelector extract
> INFO: Removed 1 (8 percent) spots using spotSelector ChainedSelector
> 28-Feb-2013 18:01:57
> org.dbpedia.spotlight.lucene.disambiguate.MergedOccurrencesDisambiguator
> disambiguate
> INFO: Total time of all disambiguations: 24.126 s
> 28-Feb-2013 18:01:57
> org.dbpedia.spotlight.lucene.disambiguate.MergedOccurrencesDisambiguator
> disambiguate
> INFO: Average time of one disambiguation: 2.0105 s
> 28-Feb-2013 18:01:57 org.dbpedia.spotlight.filter.annotations.TypeFilter
> <init>
> INFO: types are empty: showing all types
> 28-Feb-2013 18:01:57
> org.dbpedia.spotlight.filter.annotations.ConfidenceFilter touchOcc
> INFO: (c=0.2) filtered out by similarity score threshold (0.022<0.046):
> SurfaceForm[called] -0.022->
> DBpediaResource[Fifty_State_Initiative(Freebase:/fictional_universe/fictional_organization,Freebase:/fictional_universe)]
> - at position *16* in - Text[... President Obama called Wednesday on
> Congress to extend a tax break ...]
> 28-Feb-2013 18:01:57
> org.dbpedia.spotlight.filter.annotations.ConfidenceFilter touchOcc
> INFO: (c=0.2) filtered out by similarity score threshold (0.041<0.046):
> SurfaceForm[Wednesday] -0.041->
> DBpediaResource[Wednesday(Freebase:/time/day_of_week,Freebase:/time)] - at
> position *23* in - Text[... President Obama called Wednesday on Congress to
> extend a tax break for st ...]
> 28-Feb-2013 18:01:57
> org.dbpedia.spotlight.filter.annotations.ConfidenceFilter touchOcc
> INFO: (c=0.2) filtered out by similarity score threshold (0.037<0.046):
> SurfaceForm[year] -0.037->
> DBpediaResource[University(Freebase:/organization/organization_type,Freebase:/organization,Freebase:/business/company_type,Freebase:/business,Freebase:/tv/tv_subject,Freebase:/tv,Freebase:/education/school_type,Freebase:/education,Freebase:/book/book_subject,Freebase:/book,Freebase:/fictional_universe/type_of_fictional_setting,Freebase:/fictional_universe,Freebase:/architecture/building_function,Freebase:/architecture)]
> - at position *97* in - Text[... extend a tax break for students included
> in last year's economic stimulus package, arguing that the ...]
> 28-Feb-2013 18:01:57
> org.dbpedia.spotlight.filter.annotations.ConfidenceFilter touchOcc
> INFO: (c=0.2) filtered out by similarity score threshold (0.030<0.046):
> SurfaceForm[package] -0.030-> DBpediaResource[Package_holiday] - at
> position *122* in - Text[... tudents included in last year's economic
> stimulus package, arguing that the policy provides more gen ...]
> 28-Feb-2013 18:01:57
> org.dbpedia.spotlight.filter.annotations.ConfidenceFilter touchOcc
> INFO: (c=0.2) filtered out by similarity score threshold (0.036<0.046):
> SurfaceForm[generous] -0.036->
> DBpediaResource[Generous_%28horse%29(Freebase:/biology/organism,Freebase:/biology,Freebase:/biology/pedigreed_animal,Freebase:/biology/owned_animal)]
> - at position *169* in - Text[... us package, arguing that the policy
> provides more generous assistance. ...]
> 28-Feb-2013 18:01:57 org.dbpedia.spotlight.web.rest.SpotlightInterface
> getHTML
> INFO: HTML format
>
>
--
Pablo N. Mendes
http://pablomendes.com
------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_d2d_feb
_______________________________________________
Dbp-spotlight-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users