Thanks Alessandro,

Find hereby:

- the text you are submitting for enhancement

text.txt attached.

- the recipe(s) you are using

seo_rules.sem attached.

- your enhancement chain configuration


   - tika ( required , TikaEngine)
   - langid ( required , LangIdEnhancementEngine)
   - ner ( required , NamedEntityExtractionEnhancementEngine)
   - dbpediaLinking ( required , NamedEntityTaggingEngine)
   - entityhubExtraction ( required , KeywordLinkingEngine)
   - seo_refactoring ( required , RefactorEnhancementEngine)

The seo_refactoring configuration is attached
(refactor-engine-configuration.png).

- anything else?

We're using a local DBpedia index (in sling/datafiles):
http://dev.iks-project.eu/downloads/stanbol-indices/dbpedia-3.6-insideOut10/dbpedia.solrindex.zip

Thanks,
David

On Fri, Mar 16, 2012 at 12:58 PM, Alessandro Adamou <[email protected]>wrote:

> Hi David,
>
>
> On 3/16/12 10:49 AM, David Riccitelli wrote:
>
>> It's important to note that I am the only user on this stanbol instance
>> and
>> the error is raised at the second analysis.
>>
>> I think I can easily help you reproduce this issue in case.
>>
>
> Great, so I guess we would need:
>
> - the text you are submitting for enhancement
> - the recipe(s) you are using
> - your enhancement chain configuration
> - anything else?
>
> I am not the main Rules/Refactor Engine head, but perhaps I can help the
> engine create fewer persistent graphs.
>
> Best,
>
> Alessandro
>
>
>  On Fri, Mar 16, 2012 at 11:44 AM, Rupert Westenthaler<
>> [email protected]>  wrote:
>>
>>  Hi David, all
>>>
>>> this could be the explanation for the failed build on the Jenkins server
>>> when the SEO configuration for the Refactor engine was used in the
>>> default
>>> configuration of the Full launcher
>>>
>>> see 
>>> http://markmail.org/message/**sprwklaobdjankig<http://markmail.org/message/sprwklaobdjankig>for
>>>  details.
>>>
>>> For me that looks like as if the RefactorEngine does create multiple Jena
>>> TDB instances for various created MGraphs. One needs to know the even for
>>> an empty graph Jena TDB creates ~200MByte of index files. So it is
>>> important to map multiple MGraphs to different named graphs of the same
>>> Jena TDB store.
>>>
>>> I have no Idea how Clerezza manages this or how Ontonet creates MGraphs,
>>> but I hope this can help in tracing this down.
>>>
>>> best
>>> Rupert
>>>
>>> On 16.03.2012, at 10:30, David Riccitelli wrote:
>>>
>>>  Dears,
>>>>
>>>> As I ran into disk issues, I found that this folder:
>>>> sling/felix/bundleXXX/data/**tdb-data/mgraph
>>>>
>>>> where XX is the bundle of:
>>>> Clerezza - SCB Jena TDB Storage Provider
>>>> org.apache.clerezza.rdf.jena.**tdb.storage
>>>>
>>>> took almost 70 gbytes of disk space (then the disk space has been
>>>> exhausted).
>>>>
>>>> These are some of the files I found inside:
>>>> 193M ./ontonet%3A%3Ainputstream%**3Aontology889
>>>> 193M ./ontonet%3A%3Ainputstream%**3Aontology1041
>>>> 193M ./ontonet%3A%3Ainputstream%**3Aontology395
>>>> 193M ./ontonet%3A%3Ainputstream%**3Aontology363
>>>> 193M ./ontonet%3A%3Ainputstream%**3Aontology661
>>>> 193M ./ontonet%3A%3Ainputstream%**3Aontology786
>>>> 193M ./ontonet%3A%3Ainputstream%**3Aontology608
>>>> 193M ./ontonet%3A%3Ainputstream%**3Aontology213
>>>> 193M ./ontonet%3A%3Ainputstream%**3Aontology188
>>>> 193M ./ontonet%3A%3Ainputstream%**3Aontology602
>>>>
>>>>
>>>> Any clues?
>>>>
>>>> Thanks,
>>>> David Riccitelli
>>>>
>>>>
>>>>  ****************************************************************
>>> ********************
>>>
>>>> InsideOut10 s.r.l.
>>>> P.IVA: IT-11381771002
>>>> Fax: +39 0110708239
>>>> ---
>>>> LinkedIn: 
>>>> http://it.linkedin.com/in/**riccitelli<http://it.linkedin.com/in/riccitelli>
>>>> Twitter: ziodave
>>>> ---
>>>> Layar Partner Network<
>>>>
>>> http://www.layar.com/**publishing/developers/list/?**
>>> page=1&country=&city=&keyword=**insideout10&lpn=1<http://www.layar.com/publishing/developers/list/?page=1&country=&city=&keyword=insideout10&lpn=1>
>>>
>>>>
>>>>  ****************************************************************
>>> ********************
>>>
>>>
>>>
>>
>
> --
> M.Sc. Alessandro Adamou
>
> Alma Mater Studiorum - Università di Bologna
> Department of Computer Science
> Mura Anteo Zamboni 7, 40127 Bologna - Italy
>
> Semantic Technology Laboratory (STLab)
> Institute for Cognitive Science and Technology (ISTC)
> National Research Council (CNR)
> Via Nomentana 56, 00161 Rome - Italy
>
>
> "I will give you everything, so long as you do not demand anything."
> (Ettore Petrolini, 1930)
>
> Not sent from my iSnobTechDevice
>
>


-- 
David Riccitelli

********************************************************************************
InsideOut10 s.r.l.
P.IVA: IT-11381771002
Fax: +39 0110708239
---
LinkedIn: http://it.linkedin.com/in/riccitelli
Twitter: ziodave
---
Layar Partner 
Network<http://www.layar.com/publishing/developers/list/?page=1&country=&city=&keyword=insideout10&lpn=1>
********************************************************************************
The future is called smart grids, both in the US and Europe

New technologies, the enhancing of energy efficiency and especially the 
development of smart grids are necessary to avoid the increase of electricity 
bills in the US and to achieve  a number of important energy policy targets.

The US electricity grid will have to face significant challenges over the 
coming two decades, according to a recent study compiled by the Center for 
Energy and Environmental Research (CEEPR) at the MIT (Massachusetts Institute 
of Technology), of which Enel is an associate.

New technologies, the enhancing of energy efficiency and especially the 
development of smart grids are necessary to avoid the increase of electricity 
bills in the US and to achieve  a number of important energy policy targets.

One of the major challenges regards the need to integrate the increasing 
renewableproduction into the grid. A consistent portion of this growth will 
depend on solar and wind power, and the output will therefore be discontinuous 
and not fully programmable. Also, the need to place these plants in the most 
suitable locations  in terms of solar radiation and wind force, far from 
inhabited settlements will necessarily produce an expansion of the transmission 
system.

Unless corrective measures are adopted, the widespread use of electric vehicles 
will increase electricity demand. This means that energy prices will rise and 
the diffusion of electric transport will be hindered. In this context, a policy 
that is focused on retail sale pricing should be applied, which will require 
enhanced efficiency to be achieved by the widespread installation of smart 
meters.

Naturally, investments in technological innovation and development are 
imperative to renovate the US electricity grid, and will affect the actions 
needed to achieve the targets that have been set.

Smart grid technology is a field in which Enel has been a pioneer and has an 
acknowledged leadership at an international level. At the same time it has 
enhanced and integrated renewable sources and has developed electric mobility, 
helping update Europe’s infrastructure system like required by the Roadmap 2050.

Enel Green Power North America, Inc. (EGP-NA), part of Enel Green Power, is a 
leading owner and operator of renewable energy plants in North America with 
projects operating and under development in 21 U.S. states and three Canadian 
provinces. EGP-NA owns and operates over 70 plants with an installed capacity 
of around 800 MW powered by renewable hydropower, wind, geothermal, solar and 
biomass energy.

Reply via email to