Re: [tdf-discuss] Huge problems with big documents

2013-02-07 Thread Cor Nouws

Hi Giorgio,

Giorgio Migliaccio wrote (07-02-13 09:07)


We’re using LibreOffice (since v2.x up to 4.0 beta) inside our
product LetterSketch, which was started 7 years ago. [...]


Good to hear from you, after we met in Gouda last year!


[...]
Is there any kind of suggestion you can give us to overcome this
problem/limitation, since this is going to be a major showstopper for
our project.


We know that there have already been considerable memory usage
improvements in LibreOffice after the making free ...


Looking at your use case, I would seriously suggest that you get in
contact with a developer who knows 'that code area', or with a company
sponsoring some of those guys (or girls). We have several of them active in TDF.


It's one of the many advantages of LibreOffice / TDF: you can
choose a route of your choice to get improvements in, especially
when there is an interesting, big business case that is healthy
for both your product and the LibreOffice project.


Kind regards,
Cor

--
 - Cor
 - http://nl.libreoffice.org
 - www.librelex.org


--
Unsubscribe instructions: E-mail to discuss+h...@documentfoundation.org
Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.documentfoundation.org/www/discuss/
All messages sent to this list will be publicly archived and cannot be deleted



RE: [tdf-discuss] Huge problems with big documents

2013-02-07 Thread Giorgio Migliaccio
Our customers are just normal, non-technical end users who are used to working
with MS Word for creating documents.
We provide them with a solution where they can continue to work with something
like what they already know, with LibreOffice replacing MS Word, and extend it
with all kinds of automation possibilities.
Any other offering wouldn't be accepted by our customers, because it would mean
that major training is needed for hundreds to thousands of employees. I'm sure I
don't have to tell you how much money that would cost, right?
You just have to learn to live with the fact that not every person on this
planet has sufficient technical background to start working with applications
like LyX, however powerful such applications might be.
Conditions, loops and variables are just point-and-click stuff in LetterSketch,
so that most non-technical people can build such templates.

And believe it or not, we are just one of a couple of hundred (thousands?)
companies worldwide doing this, with competitors like HP, IBM and DOC1 being
market leaders, and such products are being used by any (really *any*) major
bank, insurance company, or whatever company in the world for customer or
internal communication... that's just a fact.

We, and our customers, use XML for transferring data. The data in the XML,
however, can come from databases, Salesforce.com, Excel files, web forms,
Navision, and then some.
Some of them might have a DB behind them, but most of the large banks or
insurance companies won't give external parties direct access to their
DBs... you must surely know that.
Using SAX XML processing, complex documents can be generated in milliseconds,
making it possible to generate millions of documents in 24 hours. Not too bad, I
would say.
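
For readers unfamiliar with the approach, here is a minimal sketch of
SAX-style streaming, written in Python with the standard xml.sax module.
It is only an illustration, not LetterSketch code: the element names
(batch, record, name), the RecordCounter class and the parse_records
helper are all hypothetical. The point is that the parser hands over one
event at a time, so the whole XML input never has to sit in memory as a tree.

```python
import io
import xml.sax


class RecordCounter(xml.sax.ContentHandler):
    """Counts <record> elements and collects the text of <name> elements.

    The element names are hypothetical; a real data feed would differ.
    """

    def __init__(self):
        super().__init__()
        self.records = 0
        self.names = []
        self._in_name = False
        self._buffer = []

    def startElement(self, tag, attrs):
        if tag == "record":
            self.records += 1
        elif tag == "name":
            self._in_name = True
            self._buffer = []

    def characters(self, content):
        # Text may arrive in several chunks, so buffer it until endElement.
        if self._in_name:
            self._buffer.append(content)

    def endElement(self, tag):
        if tag == "name":
            self.names.append("".join(self._buffer).strip())
            self._in_name = False


def parse_records(xml_bytes):
    """Streams the given XML bytes through the handler and returns it."""
    handler = RecordCounter()
    xml.sax.parse(io.BytesIO(xml_bytes), handler)
    return handler


SAMPLE = (
    b"<batch>"
    b"<record><name>Alice</name></record>"
    b"<record><name>Bob</name></record>"
    b"</batch>"
)
result = parse_records(SAMPLE)
print(result.records, result.names)
```

In a real pipeline the handler would presumably fill a compiled template
per record instead of collecting names, but that part is specific to the
product and not shown here.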

The 113 pages are just the template from which generation starts. This
can result in a 5-page document or even a 10,000-page document, depending on
the data and business rules.

Regards,
Giorgio





Re: [tdf-discuss] Huge problems with big documents

2013-02-07 Thread Wolfgang Keller
> LetterSketch is a document authoring tool with which authors can
> create documents and extend it with conditions, loops, variables, sub
> documents and other ornaments. 



Sounds like *the* use case where LyX/LaTeX is beyond competition.

Because it has all this already built-in.



> Finally this template gets compiled to an internal format and finally
> it can be generated on a server in high volume by feeding it data in
> XML format.

Data is best fed from a database (LyX/LaTeX has "native" solutions for
this as well). XML was never made for data and it's hopelessly
inefficient (both in terms of volume as well as required processing
bandwidth) as a format for large amounts of data. 

> At the customer we created a template/document containing 113 pages,
> built up with plain text, various objects like comments and frames,
> and dozens of sub-documents and where it takes some 10-20 minutes to
> just open the template.

113 pages is not at all a "big" document. Unless that's just the "root"
document.
 
> Please find attached the concerning (main) document.

This list doesn't allow attachments.

Sincerely,

Wolfgang




Re: [tdf-discuss] Huge problems with big documents

2013-02-07 Thread Florian Reisinger

Hi,

Bug filed: https://bugs.freedesktop.org/show_bug.cgi?id=60418

Yours

Florian

On 07.02.2013 10:58, Giorgio Migliaccio wrote:

Hi,

Please find attached the document (one of many alike).

With kind regards,

Giorgio Migliaccio
Product Development Manager

Telephone : +32 3 450 89 84
Mobile : +32 486 92 43 41
Email : giorgio.migliac...@lettergen.be
LetterGen
Groene Hofstraat 13, 2850 Boom, Belgium
www.lettergen.com



-Original Message-
From: Florian Reisinger [mailto:reisi...@gmail.com]
Sent: Thursday, 7 February 2013 10:48
To: Giorgio Migliaccio
Cc: discuss@documentfoundation.org
Subject: Re: [tdf-discuss] Huge problems with big documents

Hi,

The document is not attached. Please send the document to me as a PM...

Yours,

Florian

On 07.02.2013 09:07, Giorgio Migliaccio wrote:

Hi all,

   
We’re using LibreOffice (since v2.x up to 4.0 beta) inside our product LetterSketch, which was started 7 years ago.


LetterSketch is a document authoring tool with which authors can create 
documents and extend it with conditions, loops, variables, sub documents and 
other ornaments.

This template then gets compiled to an internal format, and finally it can be
generated on a server in high volume by feeding it data in XML format.

Now we have our biggest implementation to date, a major European bank. And
they need to create pretty complex document structures, containing multiple
sub-documents.

And here we reached the limits of the OpenDocument format, or at least the 
LibreOffice/OpenOffice internal object representation.

   
At the customer we created a template/document containing 113 pages, built up with plain text, various objects like comments and frames, and dozens of sub-documents, and it takes some 10-20 minutes just to open the template.


   
We finally pinpointed the problem.


When we just take the top-level odt file, without resolving any of the
sub-documents, LibreOffice needs 1 minute at 100% CPU (Intel i5) and 500 MB
of memory (!!!) for this document alone.
When we also start resolving the sub-documents, LibreOffice goes up to 1.8
GB of memory and then just crashes, disappears or freezes; there seems to be
some kind of invisible ceiling.

Closing the document takes another 1-2 minutes and some additional memory is 
needed!

When saving the main document to the Word (.doc) format and then opening it in 
Word, MS Word only needs some 30 MB of memory to visualize this SAME document.

Please find attached the (main) document concerned.


Is there any kind of suggestion you can give us to overcome this
problem/limitation, since this is going to be a major showstopper for our
project?

Thank you very much for your suggestions.


With kind regards,

Giorgio Migliaccio








Re: [tdf-discuss] Huge problems with big documents

2013-02-07 Thread Italo Vignoli
On 2/7/13 9:07 AM, Giorgio Migliaccio wrote:

> Please find attached the concerning (main) document.

Mailing lists do not accept attachments. You should file a bug on
Bugzilla, where you can describe the issue and attach the document.

-- 
Italo Vignoli - italo.vign...@gmail.com
mob +39.348.5653829 - VoIP 5316...@messagenet.it
skype italovignoli - gtalk italo.vign...@gmail.com




Re: [tdf-discuss] Huge problems with big documents

2013-02-07 Thread Florian Reisinger

Hi,

The document is not attached. Please send the document to me as a PM...

Yours,

Florian
