Author: buildbot
Date: Fri May 23 07:57:24 2014
New Revision: 909691

Log:
Staging update by buildbot for stanbol

Modified:
    websites/staging/stanbol/trunk/content/   (props changed)
    
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/chains/weightedchain.html
    
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/entityhubdereference.html
    
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/list.html
    
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/enhancementproperties.html
    
websites/staging/stanbol/trunk/content/docs/trunk/components/entityhub/managedsite.html
    
websites/staging/stanbol/trunk/content/docs/trunk/utils/marmotta-kiwi-repository-service.html

Propchange: websites/staging/stanbol/trunk/content/
------------------------------------------------------------------------------
--- cms:source-revision (original)
+++ cms:source-revision Fri May 23 07:57:24 2014
@@ -1 +1 @@
-1597019
+1597023

Modified: 
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/chains/weightedchain.html
==============================================================================
--- 
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/chains/weightedchain.html
 (original)
+++ 
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/chains/weightedchain.html
 Fri May 23 07:57:24 2014
@@ -100,7 +100,32 @@
 </pre></div>
 
 
+<p>The following figure shows the configuration dialog of a WeightedCahin 
configured with two required and an optional engine.</p>
 <p><img alt="Configuration dialog for the WeightedCahin" 
src="enhancer-weightedchain-config.png" title="Screenshot of the configuration 
dialog for a WeightedChain with two required and one optional engine" /></p>
+<h2 id="enhancement-properties-support">Enhancement Properties Support</h2>
+<p><strong>since <code>0.12.1</code></strong></p>
+<p>Starting from <code>0.12.1</code> the Weighted Chain allows to configure <a 
href="../enhancementproperties">EnhancementProperties</a></p>
+<ul>
+<li>
+<p><strong>chain and engine</strong> scoped properties are defined as 
parameters to the engines with the syntax <code>{engine-name}; 
{property-name-1}={value-1},{value-2}; {property-name-2}={value-1};</code> </p>
+</li>
+<li>
+<p><strong>chain</strong> scoped properties can be configured by using the 
osgi property key <code>stanbol.enhancer.chain.chainproperties</code> by the 
syntax <code>{property-name-1}={value-1},{value-2}</code>. NOTE that 
<code>;</code> is NOT supported as separator for parsing multiple properties as 
OSGI configurations already define a way for parsing multiple values</p>
+</li>
+</ul>
+<p>All EnhancementProperties configured with a <a href="chains">Chain</a> are 
written as RDF to the <a href="chains/executionplan">ExecutionPlan</a>. 
<em>Chain</em> scoped properties are directly added to the 
<code>ep:ExecutionPlan</code> instance while <em>chain and engine</em> scoped 
properties are added to the <code>ep:ExecutionNode</code> of the according 
engine.</p>
+<p>The following figure and listing provide an example</p>
+<p><img alt="WeightedChain including some Enhancement Properties" 
src="enhancer-weightedchain-enhprop-config.png" /></p>
+<p>The figure shows that for the <code>dbpedia-fst</code> engine the maximum 
number of suggestions are set to <code>10</code>. Also the minimum confidence 
value is set to <code>0.8</code>. For the <code>dbpedia-dereference</code> 
engine the dereferenced languages are set to English, German and Spanish. 
Finally a <em>chain</em> scoped property is used to set the maximum number of 
suggestions for the whole chain to <code>5</code>. However this has no effect 
for the <code>dbpedia-fst</code> engine as its custom configuration will 
override this chain wide property.</p>
+<p>The following listing shows the exact same configuration in the 
<code>.cfg</code> format.</p>
+<div class="codehilite"><pre><span class="n">stanbol</span><span 
class="p">.</span><span class="n">enhancer</span><span class="p">.</span><span 
class="n">chain</span><span class="p">.</span><span class="n">name</span><span 
class="p">=</span>&quot;<span class="n">dbpedia</span><span 
class="o">-</span><span class="n">linking</span>&quot;
+<span class="n">stanbol</span><span class="p">.</span><span 
class="n">enhancer</span><span class="p">.</span><span 
class="n">chain</span><span class="p">.</span><span 
class="n">weighted</span><span class="p">.</span><span 
class="n">chain</span><span class="p">=[</span>&quot;<span 
class="n">tika</span><span class="p">;</span><span 
class="n">optional</span>&quot;<span class="p">,</span>&quot;<span 
class="n">opennlp</span><span class="o">-</span><span 
class="n">sentence</span>&quot;<span class="p">,</span>&quot;<span 
class="n">opennlp</span><span class="o">-</span><span 
class="n">token</span>&quot;<span class="p">,</span>&quot;<span 
class="n">opennlp</span><span class="o">-</span><span 
class="n">pos</span>&quot;<span class="p">,</span>&quot;<span 
class="n">opennlp</span><span class="o">-</span><span 
class="n">chunker</span>&quot;<span class="p">,</span>
+    &quot;<span class="n">dbpedia</span><span class="o">-</span><span 
class="n">fst</span><span class="p">;</span><span class="o">\</span> <span 
class="n">enhancer</span><span class="p">.</span><span 
class="n">max</span><span class="o">-</span><span 
class="n">suggestions</span><span class="o">\</span><span 
class="p">=</span>10<span class="p">;</span><span class="o">\</span> <span 
class="n">enhancer</span><span class="p">.</span><span 
class="n">min</span><span class="o">-</span><span 
class="n">confidence</span><span class="o">\</span><span 
class="p">=</span>0<span class="p">.</span>8&quot;<span class="p">,</span>
+    &quot;<span class="n">dbpedia</span><span class="o">-</span><span 
class="n">dereference</span><span class="p">;</span><span class="o">\</span> 
<span class="n">enhancer</span><span class="p">.</span><span 
class="n">engines</span><span class="p">.</span><span 
class="n">dereference</span><span class="p">.</span><span 
class="n">languages</span><span class="o">\</span><span class="p">=</span><span 
class="n">en</span><span class="p">,</span><span class="n">de</span><span 
class="p">,</span><span class="n">es</span>&quot;<span class="p">]</span>
+<span class="n">stanbol</span><span class="p">.</span><span 
class="n">enhancer</span><span class="p">.</span><span 
class="n">chain</span><span class="p">.</span><span 
class="n">chainproperties</span><span class="p">=[</span>&quot;<span 
class="n">enhancer</span><span class="p">.</span><span 
class="n">max</span><span class="o">-</span><span 
class="n">suggestions</span><span class="o">\</span><span 
class="p">=</span>5&quot;<span class="p">]</span>
+</pre></div>
+
+
 <h2 id="calculation-of-the-executionplan">Calculation of the ExecutionPlan</h2>
 <p>It is important to note that the ordering of the list has no influence on 
the ExecutionPlan because the order of execution of the configured <a 
href="../engines">EnhancementEngines</a> is calculated only by using the value 
of the "org.apache.stanbol.enhancer.engine.order" property provided by the 
EnhancementEngine:</p>
 <ul>

Modified: 
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/entityhubdereference.html
==============================================================================
--- 
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/entityhubdereference.html
 (original)
+++ 
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/entityhubdereference.html
 Fri May 23 07:57:24 2014
@@ -119,7 +119,8 @@
 <h3 id="shared-thread-pool-configuration">Shared Thread Pool Configuration</h3>
 <p>The Shared Thread Pool is a singelton Component used by all Entityhub 
Dereference Engines with the <em>'Use Shared Thread Pool'</em> option enabled. 
It has only a single configuration option 
<em>(enhancer.engines.dereference.entityhub.sharedthreadpool.size)</em> that 
allows to set the size of the thread pool.</p>
 <p><img alt="Shared Thread Pool Configuration" 
src="entityhub-dereference-engine-shared-threadpool-config.png" /></p>
-<h3 id="field-mapping-support">Field Mapping Support</h3>
+<h3 id="advanced-dereference-configurations">Advanced Dereference 
Configurations</h3>
+<h4 id="entityhub-field-mapping-support">Entityhub Field Mapping Support</h4>
 <p>The <em>enhancer.engines.dereference.fields</em> configuration does support 
the Entityhub Field Mapping language.</p>
 <p>FieldMappings do use the following syntax:</p>
 <div class="codehilite"><pre>[!]FieldPattern [| Filter] [&gt; Mapping]
@@ -127,18 +128,39 @@
 
 
 <ul>
-<li>an optional Exclusion indicated by '!' as the first character of the 
mapping used to exclude fields that are matched by the pattern.</li>
+<li>an optional Exclusion indicated by '!' as the first character of the 
mapping used to exclude fields that are matched by the 
<code>FieldPattern</code> part (e.g. <code>!foaf:*</code> will exclude all 
properties of the FOAF namespace). Exclusions are only useful if a wildcard is 
used (e.g. <code>foaf:*</code> together with <code>!foaf:mbox</code>).</li>
 <li>the required <em>FieldPattern</em> supports the definition of prefixes 
such as <code>http://xmlns.com/foaf/0.1/*</code> or <code>foaf:*</code></li>
 <li>the optional <em>Filter</em> part allows to filter specific languages 
(e.g. <code>@=null;en;de;</code> will only dereference English and German 
literals as well as literals with no language tag), typed literals (e.g. 
<code>d=xsd:dateTime;xsd:date</code>) or URI values (e.g. 
<code>d=entityhub:ref</code>). Filters will also try to convert values to the 
parsed data type (e.g. <code>d=xsd:double</code> would convert 
<code>xsd:float</code> values to <code>xsd:doule</code>. Also string literals 
that can be parsed as double would be converted).</li>
 <li>an optional <em>Mapping</em> can be used to copy values to an other field 
(e.g. <code>foaf:name &gt; schema:name</code> would copy all FOAF names to the 
schema.org name field)</li>
 </ul>
-<p><strong>NOTE</strong>: Field Mappings configured for the 
EntityhubDerefereceEngine are overridden by Field Mappings parsed as <a 
href="../enhancementproperties">Enhancement Properties</a>.</p>
+<p><strong>NOTE</strong> that Field Mappings configured for the 
EntityhubDerefereceEngine are overridden by Field Mappings parsed as <a 
href="../enhancementproperties">Enhancement Properties</a>.</p>
+<h3 id="ldpath-support">LDPath support</h3>
+<p>The use of<a href="http://marmotta.apache.org/ldpath/language.html";>LD Path 
Language</a> is an alternative to most of the features supported by the 
Entityhub Field Mapping language. Especially <em>Filters</em> and 
<em>Mapping</em> SHOULD BE expressed using LD Path. </p>
+<p>The only advantage of the Field Mapping language is that is supports the 
use of wildcards and exclusions. So in cases where one once to dereference all 
properties of a specific namespace it is only possible to specify this by using 
the Field Mapping language.</p>
+<p>The following Example shows a configuration that dereferences all 
schema.org properties and also uses LD Path to align soem none schema.org 
properties</p>
+<div 
class="codehilite"><pre>enhancer.engines.dereference.fields=&quot;schema:*&quot;
+enhancer.engines.dereference.ldpath=[&quot;@prefix schema 
&lt;http://schema.org/&gt;;&quot;,
+    &quot;@prefix dct &lt;http://purl.org/dc/terms/&gt;;&quot;,
+    &quot;schema:name = (rdfs:label | dct:title | dc:title | foaf:name | 
skos:prefLabel);&quot;,
+    &quot;schema:alternateName = skos:altLabel;&quot;
+    &quot;schema:image = foaf:depiction;&quot;,
+    &quot;schema:homepage = foaf:homepage;&quot;]
+</pre></div>
+
+
+<p><em>NOTE</em> when used in a OSGI <code>*.cfg</code> file one would need to 
escape spaces and <code>=</code> with <code>\</code> and remove all line 
breaks.</p>
 <h2 id="supported-enhancement-properties">Supported Enhancement Properties</h2>
 <p>The following Enhancement Properties are supported by the Entityhub 
Dereference Engine</p>
 <ul>
 <li><strong>Dereference Languages</strong> 
<em>(enhancer.engines.dereference.languages)</em>: A set of languages that are 
dereferenced. Even if <em>'Dereference only Content Language Literals'</em> is 
active explicitly configured languages will still get dereferenced. * 
<strong>Dereferenced Fields</strong> 
<em>(enhancer.engines.dereference.fields)</em>: The dereferenced fields - in 
RDF terminology 'properties' - to be dereferenced. QNames (e.g. 
<code>rdf:label</code>) can be used for the configuration. This Engine supports 
the use of FieldMappings for the configuration. Dereferenced Fields parsed as 
EnhancementProperty will override values configured for the Engine.</li>
 <li><strong>Dereference LD Path</strong> 
<em>(enhancer.engines.dereference.ldpath)</em>: The <a 
href="http://marmotta.apache.org/ldpath/language.html";>LD Path Language</a> 
allows to define powerful selectors for dereferenced Entities. An LD Path 
program parsed as EnhancementProperty will be executed in addition to those 
configured for the engine.</li>
 </ul>
+<p>As an example the following query parameter would instruct all Entityhub 
Dereference engines used in an enhancement engine to just dereference English 
and German literals.</p>
+<div class="codehilite"><pre>curl -X POST -H <span class="s2">&quot;Accept: 
text/turtle&quot;</span> -H <span class="s2">&quot;Content-type: 
text/plain&quot;</span> <span class="se">\</span>
+    --data <span class="s2">&quot;The Eifeltower is located in 
Paris.&quot;</span> 
+    http://localhost:8080/enhancer?enhancer.engines.dereference.languages<span 
class="o">=</span>en&amp;<span class="se">\</span>
+    enhancer.engines.dereference.languages<span class="o">=</span>de
+</pre></div>
   </div>
   
   <div id="footer">

Modified: 
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/list.html
==============================================================================
--- 
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/list.html
 (original)
+++ 
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/engines/list.html
 Fri May 23 07:57:24 2014
@@ -357,11 +357,9 @@
 </ul>
 </li>
 <li>
-<p><em>Solr More-like-This Disambiguation Engine:</em> __under development_ 
(see <a 
href="https://issues.apache.org/jira/browse/STANBOL-723";>STANBOL-723</a>)</p>
+<p><strong>Solr More-like-This Disambiguation Engine:</strong> (see <a 
href="https://issues.apache.org/jira/browse/STANBOL-723";>STANBOL-723</a>)</p>
 <ul>
 <li>disambiguates Entities managed by the Stanbol Entityhub by using Solr MLT 
queries</li>
-<li>only available via the <a 
href="http://svn.apache.org/repos/asf/stanbol/branches/disambiguation-engine/";>disambiguation-engine</a>
 branch</li>
-<li>adjusts the fise:confidence of existing fise:EntityAnnotations</li>
 </ul>
 </li>
 </ul>

Modified: 
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/enhancementproperties.html
==============================================================================
--- 
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/enhancementproperties.html
 (original)
+++ 
websites/staging/stanbol/trunk/content/docs/trunk/components/enhancer/enhancementproperties.html
 Fri May 23 07:57:24 2014
@@ -197,11 +197,11 @@ of suggestions for all engines to five. 
 <h3 id="parsing-request-properties-via-the-enhancer-restful-service">Parsing 
Request Properties via the Enhancer RESTful Service</h3>
 <p>Starting with <code>0.12.1</code> Enhancement Properties can be parsed as 
query parameter of Enhancement Requests. For request scoped properties the 
property name is used as parameter. Request and engine scoped properties need 
to use <code>{engine-name}:{property-name}</code> as parameter.</p>
 <p>The following shows the curl request generating the equivalent of the 
example used in the above section:</p>
-<div class="codehilite"><pre><span class="n">curl</span> <span 
class="o">-</span><span class="n">X</span> <span class="n">POST</span> <span 
class="o">-</span><span class="n">H</span> &quot;<span 
class="n">Accept</span><span class="p">:</span> <span 
class="n">text</span><span class="o">/</span><span 
class="n">turtle</span>&quot; <span class="o">-</span><span class="n">H</span> 
&quot;<span class="n">Content</span><span class="o">-</span><span 
class="n">type</span><span class="p">:</span> <span class="n">text</span><span 
class="o">/</span><span class="n">plain</span>&quot; <span class="o">\</span>
-    <span class="o">--</span><span class="n">data</span> &quot;<span 
class="n">The</span> <span class="n">Eifeltower</span> <span 
class="n">is</span> <span class="n">located</span> <span class="n">in</span> 
<span class="n">Paris</span><span class="p">.</span>&quot; 
-    <span class="n">http</span><span class="p">:</span><span 
class="o">//</span><span class="n">localhost</span><span 
class="p">:</span>8080<span class="o">/</span><span 
class="n">enhancer</span>?<span class="n">enhancer</span><span 
class="p">.</span><span class="n">max</span><span class="o">-</span><span 
class="n">suggestions</span><span class="p">=</span>5<span 
class="o">&amp;\</span>
-    <span class="n">dbpedia</span><span class="o">-</span><span 
class="n">linking</span><span class="p">:</span><span 
class="n">enhancer</span><span class="p">.</span><span 
class="n">min</span><span class="o">-</span><span 
class="n">confidence</span><span class="p">=</span>0<span 
class="p">.</span>33<span class="o">&amp;\</span>
-    <span class="n">conf</span><span class="o">-</span><span 
class="n">filter</span><span class="p">:</span><span 
class="n">enhancer</span><span class="p">.</span><span 
class="n">min</span><span class="o">-</span><span 
class="n">confidence</span><span class="p">=</span>0<span class="p">.</span>85
+<div class="codehilite"><pre>curl -X POST -H <span class="s2">&quot;Accept: 
text/turtle&quot;</span> -H <span class="s2">&quot;Content-type: 
text/plain&quot;</span> <span class="se">\</span>
+    --data <span class="s2">&quot;The Eifeltower is located in 
Paris.&quot;</span> 
+    http://localhost:8080/enhancer?enhancer.max-suggestions<span 
class="o">=</span>5&amp;<span class="se">\</span>
+    dbpedia-linking:enhancer.min-confidence<span 
class="o">=</span>0.33&amp;<span class="se">\</span>
+    conf-filter:enhancer.min-confidence<span class="o">=</span>0.85
 </pre></div>
 
 

Modified: 
websites/staging/stanbol/trunk/content/docs/trunk/components/entityhub/managedsite.html
==============================================================================
--- 
websites/staging/stanbol/trunk/content/docs/trunk/components/entityhub/managedsite.html
 (original)
+++ 
websites/staging/stanbol/trunk/content/docs/trunk/components/entityhub/managedsite.html
 Fri May 23 07:57:24 2014
@@ -165,7 +165,7 @@
 
 <h4 id="configuration-of-a-sesame-yard-site">Configuration of a Sesame Yard 
Site</h4>
 <p>With <a 
href="https://issues.apache.org/jira/browse/STANBOL-1169";>STANBOL-1169</a> 
(since version <code>0.12.1</code>) a Sesame Repository registered as OSGI 
service can be used as Entityhub Yard.</p>
-<p>The following figure shows a Apache Marmotta Kiwi Repository registered as 
OSGI service. </p>
+<p>The following figure shows a <a 
href="/docs/trunk/utils/marmotta-kiwi-repository-service">Apache Marmotta Kiwi 
Repository</a> registered as OSGI service. </p>
 <p><img alt="Marmotta Kiwi Repository Service" 
src="marmotta-kiwi-repository-service.png" /></p>
 <p>The highlighted <code>org.openrdf.repository.Repository.id</code> key is 
used to link a specific Sesame Repository to a Sesame Yard Site. All the other 
keys are implementation specific and not used by the Entityhub Sesame Yard 
Site.</p>
 <p>When configuring a SesameYard one need to set the Repository 
(<code>org.openrdf.repository.Repository.id</code> key) to the value of the 
Sesame Repository one would like to use as backend. This is especially 
important if multiple Sesame Repositories are registered as OSGI services.</p>

Modified: 
websites/staging/stanbol/trunk/content/docs/trunk/utils/marmotta-kiwi-repository-service.html
==============================================================================
--- 
websites/staging/stanbol/trunk/content/docs/trunk/utils/marmotta-kiwi-repository-service.html
 (original)
+++ 
websites/staging/stanbol/trunk/content/docs/trunk/utils/marmotta-kiwi-repository-service.html
 Fri May 23 07:57:24 2014
@@ -101,14 +101,18 @@ configuration. The following figure show
 <li><code>org.openrdf.repository.Repository.id</code>: The id of the 
Repository. Intended to be used by
 other components to track a specific repository instance.</li>
 <li><code>marmotta.kiwi.dialect</code>: The KiWi Database dialect. Currently 
Marmotta supports the
-H2Dialect, PostgreSQLDialect and MySQLDialect. Note that the selected dialect 
will select
+<code>H2Dialect</code>, <code>PostgreSQLDialect</code> and 
<code>MySQLDialect</code>. Note that the selected dialect will select
 different database driver. If those are not available the activation will 
throw an
 exception. PostgreSQL driver are embedded. H2 drivers are included in the 
default
-Bundlelist used by Stanbol.</li>
+<a 
href="http://svn.apache.org/repos/asf/stanbol/branches/release-0.12/launchers/bundlelists/marmotta/kiwi/src/main/bundles/list.xml";>Marmotta
 Kiwi Bundlelist</a> used by Stanbol. For MySQL the according dependency needs 
to be uncommented in
+the <a 
href="http://svn.apache.org/repos/asf/stanbol/branches/release-0.12/launchers/bundlelists/marmotta/kiwi/src/main/bundles/list.xml";>Marmotta
 Kiwi Bundlelist</a>.</li>
 <li><code>marmotta.kiwi.dburl</code>: This property can be used to directly 
configure the DB URL. If
-present this is preferred over the configuration of the 
<code>marmotta.kiwi.host</code>, 
-<code>marmotta.kiwi.port</code>, <code>marmotta.kiwi.database</code> and 
<code>marmotta.kiwi.options</code> parameters.</li>
-<li><code>marmotta.kiwi.user</code> and <code>marmotta.kiwi.password</code> 
for the database</li>
+present this is preferred over the configuration of the 
<code>host</code>,<code>port</code>, <code>database</code> and 
<code>options</code> parameters.</li>
+<li><code>marmotta.kiwi.host</code>: The host of the database (a file path in 
case of H2)</li>
+<li><code>marmotta.kiwi.port</code>: The port of the database (ignored in case 
of H2)</li>
+<li><code>marmotta.kiwi.user</code>: The database user</li>
+<li><code>marmotta.kiwi.password</code>: The password for the configured 
user</li>
+<li><code>marmotta.kiwi.options</code>: Additional database options</li>
 <li><code>marmotta.kiwi.cluster</code>: defines the name of the cluster. 
Different KiWi Repositories
 might use clusters with different names. If not present or empty clustering 
will be
 deactivated.</li>
@@ -133,8 +137,9 @@ registered as OSGI service with the para
 <p><img alt="Marmotta KiWi Repository Service" 
src="marmotta-kiwi-repository-service.png" /></p>
 <p>The marked <code>org.openrdf.repository.Repository.id</code> property is of 
special interest as it
 can be used to track for a Sesame Repository with a specific name. As an 
Example the
-Repository with the name <code>dummy</code> can be tracked with the Filter
-<code>(&amp;(objectClass=org.openrdf.repository.Repository)(org.openrdf.repository.Repository.id=dummy))</code></p>
+Repository with the name <code>dummy</code> can be tracked with the Filter</p>
+<div 
class="codehilite"><pre>(&amp;(objectClass=org.openrdf.repository.Repository)(org.openrdf.repository.Repository.id=dummy))
+</pre></div>
   </div>
   
   <div id="footer">


Reply via email to