Author: buildbot
Date: Fri May 23 07:57:24 2014
New Revision: 909691
Log:
Staging update by buildbot for stanbol
Modified:
websites/staging/stanbol/trunk/content/ (props changed)
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/chains/weightedchain.html
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/entityhubdereference.html
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/list.html
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/enhancementproperties.html
websites/staging/stanbol/trunk/content/docs/trunk/components/entityhub/managedsite.html
websites/staging/stanbol/trunk/content/docs/trunk/utils/marmotta-kiwi-repository-service.html
Propchange: websites/staging/stanbol/trunk/content/
------------------------------------------------------------------------------
--- cms:source-revision (original)
+++ cms:source-revision Fri May 23 07:57:24 2014
@@ -1 +1 @@
-1597019
+1597023
Modified:
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/chains/weightedchain.html
==============================================================================
---
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/chains/weightedchain.html
(original)
+++
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/chains/weightedchain.html
Fri May 23 07:57:24 2014
@@ -100,7 +100,32 @@
</pre></div>
+<p>The following figure shows the configuration dialog of a WeightedChain
configured with two required engines and one optional engine.</p>
<p><img alt="Configuration dialog for the WeightedChain"
src="enhancer-weightedchain-config.png" title="Screenshot of the configuration
dialog for a WeightedChain with two required and one optional engine" /></p>
+<h2 id="enhancement-properties-support">Enhancement Properties Support</h2>
+<p><strong>since <code>0.12.1</code></strong></p>
+<p>Starting from <code>0.12.1</code> the Weighted Chain supports the
configuration of <a
href="../enhancementproperties">EnhancementProperties</a>:</p>
+<ul>
+<li>
+<p><strong>chain and engine</strong> scoped properties are defined as
parameters to the engines with the syntax <code>{engine-name};
{property-name-1}={value-1},{value-2}; {property-name-2}={value-1};</code> </p>
+</li>
+<li>
+<p><strong>chain</strong> scoped properties can be configured by using the
OSGI property key <code>stanbol.enhancer.chain.chainproperties</code> with the
syntax <code>{property-name-1}={value-1},{value-2}</code>. NOTE that
<code>;</code> is NOT supported as a separator between multiple properties,
because OSGI configurations already define their own way of specifying multiple
values.</p>
+</li>
+</ul>
+<p>All EnhancementProperties configured with a <a href="chains">Chain</a> are
written as RDF to the <a href="chains/executionplan">ExecutionPlan</a>.
<em>Chain</em> scoped properties are directly added to the
<code>ep:ExecutionPlan</code> instance while <em>chain and engine</em> scoped
properties are added to the <code>ep:ExecutionNode</code> of the corresponding
engine.</p>
+<p>The following figure and listing provide an example.</p>
+<p><img alt="WeightedChain including some Enhancement Properties"
src="enhancer-weightedchain-enhprop-config.png" /></p>
+<p>The figure shows that for the <code>dbpedia-fst</code> engine the maximum
number of suggestions is set to <code>10</code> and the minimum confidence
value is set to <code>0.8</code>. For the <code>dbpedia-dereference</code>
engine the dereferenced languages are set to English, German and Spanish.
Finally, a <em>chain</em> scoped property is used to set the maximum number of
suggestions for the whole chain to <code>5</code>. However, this has no effect
on the <code>dbpedia-fst</code> engine, because its own configuration
overrides the chain-wide property.</p>
+<p>The following listing shows the exact same configuration in the
<code>.cfg</code> format.</p>
+<div class="codehilite"><pre><span class="n">stanbol</span><span
class="p">.</span><span class="n">enhancer</span><span class="p">.</span><span
class="n">chain</span><span class="p">.</span><span class="n">name</span><span
class="p">=</span>"<span class="n">dbpedia</span><span
class="o">-</span><span class="n">linking</span>"
+<span class="n">stanbol</span><span class="p">.</span><span
class="n">enhancer</span><span class="p">.</span><span
class="n">chain</span><span class="p">.</span><span
class="n">weighted</span><span class="p">.</span><span
class="n">chain</span><span class="p">=[</span>"<span
class="n">tika</span><span class="p">;</span><span
class="n">optional</span>"<span class="p">,</span>"<span
class="n">opennlp</span><span class="o">-</span><span
class="n">sentence</span>"<span class="p">,</span>"<span
class="n">opennlp</span><span class="o">-</span><span
class="n">token</span>"<span class="p">,</span>"<span
class="n">opennlp</span><span class="o">-</span><span
class="n">pos</span>"<span class="p">,</span>"<span
class="n">opennlp</span><span class="o">-</span><span
class="n">chunker</span>"<span class="p">,</span>
+ "<span class="n">dbpedia</span><span class="o">-</span><span
class="n">fst</span><span class="p">;</span><span class="o">\</span> <span
class="n">enhancer</span><span class="p">.</span><span
class="n">max</span><span class="o">-</span><span
class="n">suggestions</span><span class="o">\</span><span
class="p">=</span>10<span class="p">;</span><span class="o">\</span> <span
class="n">enhancer</span><span class="p">.</span><span
class="n">min</span><span class="o">-</span><span
class="n">confidence</span><span class="o">\</span><span
class="p">=</span>0<span class="p">.</span>8"<span class="p">,</span>
+ "<span class="n">dbpedia</span><span class="o">-</span><span
class="n">dereference</span><span class="p">;</span><span class="o">\</span>
<span class="n">enhancer</span><span class="p">.</span><span
class="n">engines</span><span class="p">.</span><span
class="n">dereference</span><span class="p">.</span><span
class="n">languages</span><span class="o">\</span><span class="p">=</span><span
class="n">en</span><span class="p">,</span><span class="n">de</span><span
class="p">,</span><span class="n">es</span>"<span class="p">]</span>
+<span class="n">stanbol</span><span class="p">.</span><span
class="n">enhancer</span><span class="p">.</span><span
class="n">chain</span><span class="p">.</span><span
class="n">chainproperties</span><span class="p">=[</span>"<span
class="n">enhancer</span><span class="p">.</span><span
class="n">max</span><span class="o">-</span><span
class="n">suggestions</span><span class="o">\</span><span
class="p">=</span>5"<span class="p">]</span>
+</pre></div>
+
+
<h2 id="calculation-of-the-executionplan">Calculation of the ExecutionPlan</h2>
<p>It is important to note that the ordering of the list has no influence on
the ExecutionPlan because the order of execution of the configured <a
href="../engines">EnhancementEngines</a> is calculated only by using the value
of the "org.apache.stanbol.enhancer.engine.order" property provided by the
EnhancementEngine:</p>
<ul>
Modified:
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/entityhubdereference.html
==============================================================================
---
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/entityhubdereference.html
(original)
+++
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/entityhubdereference.html
Fri May 23 07:57:24 2014
@@ -119,7 +119,8 @@
<h3 id="shared-thread-pool-configuration">Shared Thread Pool Configuration</h3>
<p>The Shared Thread Pool is a singleton Component used by all Entityhub
Dereference Engines with the <em>'Use Shared Thread Pool'</em> option enabled.
It has only a single configuration option
<em>(enhancer.engines.dereference.entityhub.sharedthreadpool.size)</em> that
sets the size of the thread pool.</p>
<p><img alt="Shared Thread Pool Configuration"
src="entityhub-dereference-engine-shared-threadpool-config.png" /></p>
-<h3 id="field-mapping-support">Field Mapping Support</h3>
+<h3 id="advanced-dereference-configurations">Advanced Dereference
Configurations</h3>
+<h4 id="entityhub-field-mapping-support">Entityhub Field Mapping Support</h4>
<p>The <em>enhancer.engines.dereference.fields</em> configuration does support
the Entityhub Field Mapping language.</p>
<p>FieldMappings do use the following syntax:</p>
<div class="codehilite"><pre>[!]FieldPattern [| Filter] [> Mapping]
@@ -127,18 +128,39 @@
<ul>
-<li>an optional Exclusion indicated by '!' as the first character of the
mapping used to exclude fields that are matched by the pattern.</li>
+<li>an optional Exclusion indicated by '!' as the first character of the
mapping used to exclude fields that are matched by the
<code>FieldPattern</code> part (e.g. <code>!foaf:*</code> will exclude all
properties of the FOAF namespace). Exclusions are only useful if a wildcard is
used (e.g. <code>foaf:*</code> together with <code>!foaf:mbox</code>).</li>
<li>the required <em>FieldPattern</em> supports the definition of prefixes
such as <code>http://xmlns.com/foaf/0.1/*</code> or <code>foaf:*</code></li>
<li>the optional <em>Filter</em> part allows filtering by language
(e.g. <code>@=null;en;de;</code> will only dereference English and German
literals as well as literals with no language tag), by typed literals (e.g.
<code>d=xsd:dateTime;xsd:date</code>) or by URI values (e.g.
<code>d=entityhub:ref</code>). Filters will also try to convert values to the
configured data type (e.g. <code>d=xsd:double</code> would convert
<code>xsd:float</code> values to <code>xsd:double</code>. String literals
that can be parsed as double would also be converted).</li>
<li>an optional <em>Mapping</em> can be used to copy values to another field
(e.g. <code>foaf:name > schema:name</code> would copy all FOAF names to the
schema.org name field)</li>
</ul>
-<p><strong>NOTE</strong>: Field Mappings configured for the
EntityhubDerefereceEngine are overridden by Field Mappings parsed as <a
href="../enhancementproperties">Enhancement Properties</a>.</p>
+<p><strong>NOTE</strong> that Field Mappings configured for the
EntityhubDereferenceEngine are overridden by Field Mappings passed as <a
href="../enhancementproperties">Enhancement Properties</a>.</p>
+<h3 id="ldpath-support">LDPath support</h3>
+<p>The use of the <a href="http://marmotta.apache.org/ldpath/language.html">LD Path
Language</a> is an alternative to most of the features supported by the
Entityhub Field Mapping language. Especially <em>Filters</em> and
<em>Mappings</em> SHOULD BE expressed using LD Path. </p>
+<p>The only advantage of the Field Mapping language is that it supports the
use of wildcards and exclusions. So in cases where one wants to dereference all
properties of a specific namespace, this can only be specified by using
the Field Mapping language.</p>
+<p>The following example shows a configuration that dereferences all
schema.org properties and also uses LD Path to align some non-schema.org
properties with schema.org:</p>
+<div
class="codehilite"><pre>enhancer.engines.dereference.fields="schema:*"
+enhancer.engines.dereference.ldpath=["@prefix schema
<http://schema.org/>;",
+ "@prefix dct <http://purl.org/dc/terms/>;",
+ "schema:name = (rdfs:label | dct:title | dc:title | foaf:name |
skos:prefLabel);",
+ "schema:alternateName = skos:altLabel;",
+ "schema:image = foaf:depiction;",
+ "schema:homepage = foaf:homepage;"]
+</pre></div>
+
+
+<p><em>NOTE</em>: when used in an OSGI <code>*.cfg</code> file, spaces and
<code>=</code> need to be escaped with <code>\</code> and all line breaks need
to be removed.</p>
<h2 id="supported-enhancement-properties">Supported Enhancement Properties</h2>
<p>The following Enhancement Properties are supported by the Entityhub
Dereference Engine</p>
<ul>
<li><strong>Dereference Languages</strong>
<em>(enhancer.engines.dereference.languages)</em>: A set of languages that are
dereferenced. Even if <em>'Dereference only Content Language Literals'</em> is
active, explicitly configured languages will still get dereferenced.</li>
<li><strong>Dereferenced Fields</strong>
<em>(enhancer.engines.dereference.fields)</em>: The fields (in RDF
terminology 'properties') to be dereferenced. QNames (e.g.
<code>rdfs:label</code>) can be used for the configuration. This Engine supports
the use of FieldMappings for the configuration. Dereferenced Fields passed as
EnhancementProperty will override values configured for the Engine.</li>
<li><strong>Dereference LD Path</strong>
<em>(enhancer.engines.dereference.ldpath)</em>: The <a
href="http://marmotta.apache.org/ldpath/language.html">LD Path Language</a>
allows the definition of powerful selectors for dereferenced Entities. An LD Path
program passed as EnhancementProperty will be executed in addition to those
configured for the engine.</li>
</ul>
+<p>As an example, the following query parameters would instruct all Entityhub
Dereference engines used in the enhancement chain to dereference only English
and German literals.</p>
+<div class="codehilite"><pre>curl -X POST -H <span class="s2">"Accept: text/turtle"</span> <span class="se">\</span>
+ -H <span class="s2">"Content-type: text/plain"</span> <span class="se">\</span>
+ --data <span class="s2">"The Eiffel Tower is located in Paris."</span> <span class="se">\</span>
+ <span class="s2">"http://localhost:8080/enhancer?enhancer.engines.dereference.languages=en&enhancer.engines.dereference.languages=de"</span>
+</pre></div>
</div>
<div id="footer">
Modified:
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/list.html
==============================================================================
---
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/list.html
(original)
+++
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/list.html
Fri May 23 07:57:24 2014
@@ -357,11 +357,9 @@
</ul>
</li>
<li>
-<p><em>Solr More-like-This Disambiguation Engine:</em> __under development_
(see <a
href="https://issues.apache.org/jira/browse/STANBOL-723">STANBOL-723</a>)</p>
+<p><strong>Solr More-like-This Disambiguation Engine:</strong> (see <a
href="https://issues.apache.org/jira/browse/STANBOL-723">STANBOL-723</a>)</p>
<ul>
<li>disambiguates Entities managed by the Stanbol Entityhub by using Solr MLT
queries</li>
-<li>only available via the <a
href="http://svn.apache.org/repos/asf/stanbol/branches/disambiguation-engine/">disambiguation-engine</a>
branch</li>
-<li>adjusts the fise:confidence of existing fise:EntityAnnotations</li>
</ul>
</li>
</ul>
Modified:
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/enhancementproperties.html
==============================================================================
---
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/enhancementproperties.html
(original)
+++
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/enhancementproperties.html
Fri May 23 07:57:24 2014
@@ -197,11 +197,11 @@ of suggestions for all engines to five.
<h3 id="parsing-request-properties-via-the-enhancer-restful-service">Passing
Request Properties via the Enhancer RESTful Service</h3>
<p>Starting with <code>0.12.1</code> Enhancement Properties can be passed as
query parameters of Enhancement Requests. For request scoped properties the
property name is used as parameter. Request and engine scoped properties need
to use <code>{engine-name}:{property-name}</code> as parameter.</p>
<p>The following shows the curl request generating the equivalent of the
example used in the above section:</p>
-<div class="codehilite"><pre><span class="n">curl</span> <span
class="o">-</span><span class="n">X</span> <span class="n">POST</span> <span
class="o">-</span><span class="n">H</span> "<span
class="n">Accept</span><span class="p">:</span> <span
class="n">text</span><span class="o">/</span><span
class="n">turtle</span>" <span class="o">-</span><span class="n">H</span>
"<span class="n">Content</span><span class="o">-</span><span
class="n">type</span><span class="p">:</span> <span class="n">text</span><span
class="o">/</span><span class="n">plain</span>" <span class="o">\</span>
- <span class="o">--</span><span class="n">data</span> "<span
class="n">The</span> <span class="n">Eifeltower</span> <span
class="n">is</span> <span class="n">located</span> <span class="n">in</span>
<span class="n">Paris</span><span class="p">.</span>"
- <span class="n">http</span><span class="p">:</span><span
class="o">//</span><span class="n">localhost</span><span
class="p">:</span>8080<span class="o">/</span><span
class="n">enhancer</span>?<span class="n">enhancer</span><span
class="p">.</span><span class="n">max</span><span class="o">-</span><span
class="n">suggestions</span><span class="p">=</span>5<span
class="o">&\</span>
- <span class="n">dbpedia</span><span class="o">-</span><span
class="n">linking</span><span class="p">:</span><span
class="n">enhancer</span><span class="p">.</span><span
class="n">min</span><span class="o">-</span><span
class="n">confidence</span><span class="p">=</span>0<span
class="p">.</span>33<span class="o">&\</span>
- <span class="n">conf</span><span class="o">-</span><span
class="n">filter</span><span class="p">:</span><span
class="n">enhancer</span><span class="p">.</span><span
class="n">min</span><span class="o">-</span><span
class="n">confidence</span><span class="p">=</span>0<span class="p">.</span>85
+<div class="codehilite"><pre>curl -X POST <span class="se">\</span>
+ -H <span class="s2">"Accept: text/turtle"</span> <span class="se">\</span>
+ -H <span class="s2">"Content-type: text/plain"</span> <span class="se">\</span>
+ --data <span class="s2">"The Eiffel Tower is located in Paris."</span> <span class="se">\</span>
+ <span class="s2">"http://localhost:8080/enhancer?enhancer.max-suggestions=5&dbpedia-linking:enhancer.min-confidence=0.33&conf-filter:enhancer.min-confidence=0.85"</span>
</pre></div>
Modified:
websites/staging/stanbol/trunk/content/docs/trunk/components/entityhub/managedsite.html
==============================================================================
---
websites/staging/stanbol/trunk/content/docs/trunk/components/entityhub/managedsite.html
(original)
+++
websites/staging/stanbol/trunk/content/docs/trunk/components/entityhub/managedsite.html
Fri May 23 07:57:24 2014
@@ -165,7 +165,7 @@
<h4 id="configuration-of-a-sesame-yard-site">Configuration of a Sesame Yard
Site</h4>
<p>With <a
href="https://issues.apache.org/jira/browse/STANBOL-1169">STANBOL-1169</a>
(since version <code>0.12.1</code>) a Sesame Repository registered as OSGI
service can be used as Entityhub Yard.</p>
-<p>The following figure shows a Apache Marmotta Kiwi Repository registered as
OSGI service. </p>
+<p>The following figure shows an <a
href="/docs/trunk/utils/marmotta-kiwi-repository-service">Apache Marmotta Kiwi
Repository</a> registered as an OSGI service. </p>
<p><img alt="Marmotta Kiwi Repository Service"
src="marmotta-kiwi-repository-service.png" /></p>
<p>The highlighted <code>org.openrdf.repository.Repository.id</code> key is
used to link a specific Sesame Repository to a Sesame Yard Site. All the other
keys are implementation specific and not used by the Entityhub Sesame Yard
Site.</p>
<p>When configuring a SesameYard one needs to set the Repository
(<code>org.openrdf.repository.Repository.id</code> key) to the value of the
Sesame Repository one would like to use as backend. This is especially
important if multiple Sesame Repositories are registered as OSGI services.</p>
Modified:
websites/staging/stanbol/trunk/content/docs/trunk/utils/marmotta-kiwi-repository-service.html
==============================================================================
---
websites/staging/stanbol/trunk/content/docs/trunk/utils/marmotta-kiwi-repository-service.html
(original)
+++
websites/staging/stanbol/trunk/content/docs/trunk/utils/marmotta-kiwi-repository-service.html
Fri May 23 07:57:24 2014
@@ -101,14 +101,18 @@ configuration. The following figure show
<li><code>org.openrdf.repository.Repository.id</code>: The id of the
Repository. Intended to be used by
other components to track a specific repository instance.</li>
<li><code>marmotta.kiwi.dialect</code>: The KiWi Database dialect. Currently
Marmotta supports the
-H2Dialect, PostgreSQLDialect and MySQLDialect. Note that the selected dialect
will select
+<code>H2Dialect</code>, <code>PostgreSQLDialect</code> and
<code>MySQLDialect</code>. Note that the selected dialect determines which
database driver is used. If that driver is not available the activation will
throw an
exception. The PostgreSQL driver is embedded. H2 drivers are included in the
default
-Bundlelist used by Stanbol.</li>
+<a
href="http://svn.apache.org/repos/asf/stanbol/branches/release-0.12/launchers/bundlelists/marmotta/kiwi/src/main/bundles/list.xml">Marmotta
Kiwi Bundlelist</a> used by Stanbol. For MySQL the corresponding dependency
needs to be uncommented in
+the <a
href="http://svn.apache.org/repos/asf/stanbol/branches/release-0.12/launchers/bundlelists/marmotta/kiwi/src/main/bundles/list.xml">Marmotta
Kiwi Bundlelist</a>.</li>
<li><code>marmotta.kiwi.dburl</code>: This property can be used to directly
configure the DB URL. If
-present this is preferred over the configuration of the
<code>marmotta.kiwi.host</code>,
-<code>marmotta.kiwi.port</code>, <code>marmotta.kiwi.database</code> and
<code>marmotta.kiwi.options</code> parameters.</li>
-<li><code>marmotta.kiwi.user</code> and <code>marmotta.kiwi.password</code>
for the database</li>
+present this is preferred over the configuration of the
<code>host</code>, <code>port</code>, <code>database</code> and
<code>options</code> parameters.</li>
+<li><code>marmotta.kiwi.host</code>: The host of the database (a file path in
case of H2)</li>
+<li><code>marmotta.kiwi.port</code>: The port of the database (ignored in case
of H2)</li>
+<li><code>marmotta.kiwi.user</code>: The database user</li>
+<li><code>marmotta.kiwi.password</code>: The password for the configured
user</li>
+<li><code>marmotta.kiwi.options</code>: Additional database options</li>
<li><code>marmotta.kiwi.cluster</code>: defines the name of the cluster.
Different KiWi Repositories
might use clusters with different names. If not present or empty clustering
will be
deactivated.</li>
@@ -133,8 +137,9 @@ registered as OSGI service with the para
<p><img alt="Marmotta KiWi Repository Service"
src="marmotta-kiwi-repository-service.png" /></p>
<p>The marked <code>org.openrdf.repository.Repository.id</code> property is of
special interest as it
can be used to track a Sesame Repository with a specific name. As an
example the
-Repository with the name <code>dummy</code> can be tracked with the Filter
-<code>(&(objectClass=org.openrdf.repository.Repository)(org.openrdf.repository.Repository.id=dummy))</code></p>
+Repository with the name <code>dummy</code> can be tracked with the following filter:</p>
+<div
class="codehilite"><pre>(&(objectClass=org.openrdf.repository.Repository)(org.openrdf.repository.Repository.id=dummy))
+</pre></div>
</div>
<div id="footer">