[Mayan EDMS: 348] Re: How I check that document is complete parsed

Roberto Rosario Fri, 02 Nov 2012 08:04:52 -0700

Hi Martin,

Awesome!  Thanks for pointing this out.  The documents app or the Document 
model have no way to report when a document is ready for usage because this 
functionality is done by other apps.  The thumbnail is generated by the 
converter app and the text OCR/parsing is done by the ocr app (
https://github.com/rosarior/mayan/blob/hotfix/v0.12.3/apps/ocr/__init__.py#L78).
 
 Both of these apps in turn call operating system binaries to handle their 
tasks.  I would go about this by adding a .is_parsed() property to the 
Document model that is handled by the ocr app that returns True when the 
document was been processed without errors from the OCR queue and a 
.has_thumbnail() or similar to the Document model handled by the converter 
app if the appropiate preview is ready in the image_cache folder.  Would 
these help you out?


--Roberto

On Monday, October 29, 2012 6:26:33 AM UTC-4, Martin Kubát wrote:
>
> Hi,
> I would like to create API for Mayan (it will be public after will be 
> complete). And I have problem with check state, when is document succesfuly 
> parsed (know good document's page, parsed plain text from document, 
> generated thumbnails).
> Example of development version is here: 
> http://simpledwarf.appspot.com/paste/view/841016 . With other application 
> communicate this API with python Pyro library (
> http://pypi.python.org/pypi/Pyro4).
>
> When I call API upload_document method too fast (more documents in - for 
> example - 10 seconds), method will return wrong value of pages-count. And 
> when I can get plaintext from uploaded document, I must wait (10-? seconds) 
> before is document ready.
> Is there some "document is complete ready" indicator or method?
> Have somebody some hint?
>
> Thanks 
> Martin Kubát, 
> COEX, Czech Republic
>

--

[Mayan EDMS: 348] Re: How I check that document is complete parsed

Reply via email to