Hello friends,
On a Nuxeo 5.3 application with the 'nuxeo-platform-importer-core'
plug-in installed, I am trying out this small package (an importer for
OCR output) and running into some problems.
I set up a config file for the plug-in like this:
<importerConfig>
<!-- folder that holds the data to be imported -->
<sourcePath>/scan_import/data_in</sourcePath>
<!-- folder where xml files will be moved when processed
(files will be deleted if directory is not set or does not exist)-->
<processedPath>/scan_import/data_out</processedPath>
<!-- number of threads used by the importer : keep it to 1
if using H2 or you will break H2's lucene index -->
<nbThreads>1</nbThreads>
<!-- define how many documents are imported between 2
commits -->
<batchSize>10</batchSize>
<targetPath>/default-domain/workspaces</targetPath>
</importerConfig>
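To double-check which values a config like the one above actually contains, here is a minimal standalone sketch that parses it with the JDK's XML parser and prints each setting. It is not how the plug-in itself loads its configuration; the class name ImporterConfigCheck and the method readConfig are hypothetical names used only for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

// Hypothetical helper, independent of Nuxeo: it only uses the JDK.
class ImporterConfigCheck {

    // Reads the direct child elements of the root element into a
    // name -> trimmed text map, skipping comments and whitespace nodes.
    static Map<String, String> readConfig(InputStream in) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(in);
        Element root = doc.getDocumentElement();
        Map<String, String> values = new LinkedHashMap<String, String>();
        NodeList children = root.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            Node node = children.item(i);
            if (node.getNodeType() == Node.ELEMENT_NODE) {
                values.put(node.getNodeName(), node.getTextContent().trim());
            }
        }
        return values;
    }

    public static void main(String[] args) throws Exception {
        // Same settings as in the config file from this post.
        String xml = "<importerConfig>"
                + "<sourcePath>/scan_import/data_in</sourcePath>"
                + "<processedPath>/scan_import/data_out</processedPath>"
                + "<nbThreads>1</nbThreads>"
                + "<batchSize>10</batchSize>"
                + "<targetPath>/default-domain/workspaces</targetPath>"
                + "</importerConfig>";
        InputStream in = new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8));
        for (Map.Entry<String, String> entry : readConfig(in).entrySet()) {
            System.out.println(entry.getKey() + " = " + entry.getValue());
        }
    }
}
```

Running it prints one "name = value" line per setting, which makes a typo in an element name or a stray space in a path easy to spot.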
As soon as the application has started and the scheduler has run once, I
get this output in the console (the lines prefixed with 'added--' are my
own debug output):
2010-01-22 14:19:00,225 INFO [org.nuxeo.ecm.platform.scanimporter.processor.ScannedFileImporter] All Threads terminated
2010-01-22 14:19:00,225 INFO [org.nuxeo.ecm.platform.scanimporter.processor.ScannedFileImporter] 1 docs created
2010-01-22 14:19:00,225 INFO [org.nuxeo.ecm.platform.scanimporter.processor.ScannedFileImporter] 4.9261084 docs/s
2010-01-22 14:19:00,225 INFO [org.nuxeo.ecm.platform.scanimporter.processor.ScannedFileImporter] pool-40-thread-1-T0 --> 1
2010-01-22 14:19:00,225 DEBUG [org.nuxeo.ecm.core.api.ejb.DocumentManagerBean] @Remove
2010-01-22 14:19:00,225 DEBUG [org.nuxeo.ecm.core.api.ejb.DocumentManagerBean] @PreDestroy
2010-01-22 14:19:00,225 DEBUG [org.nuxeo.ecm.core.api.CoreSession] Destroying core session ...
2010-01-22 14:19:00,225 INFO [org.nuxeo.ecm.platform.scanimporter.processor.ScannedFileImporter] Fininish moving files
added--2010-01-22 14:19:00,225 DEBUG [org.nuxeo.ecm.platform.scanimporter.processor.ScannedFileImporter] doCleanup called
added--2010-01-22 14:19:00,225 DEBUG [org.nuxeo.ecm.platform.scanimporter.processor.ScannedFileImporter] config not null
added--2010-01-22 14:19:00,225 DEBUG [org.nuxeo.ecm.platform.scanimporter.processor.ScannedFileImporter] outPath not null
2010-01-22 14:19:00,225 INFO [org.nuxeo.ecm.platform.scanimporter.processor.ScannedFileImporter] Ending import process
In the document repository (GED), under the targetPath
(/default-domain/workspaces), I can see a new folder named 'data_in'.
However, a new one is created on every scheduler run, so after a while I
end up with many 'data_in' folders.
If I rename this folder, I get this error:
14:29:00,017 ERROR [IngestionTrigger] Error during import
org.nuxeo.ecm.core.api.ClientException: Unable to access source folder \scan_import\data_in
Based on this, I believe my config is correct.
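The "Unable to access source folder" message suggests that the folder at the configured sourcePath could not be found or read at that moment. As a rough, standalone way to check this outside the importer, the sketch below reports whether a path exists, is a directory, and is readable. It is not the importer's own check; SourceFolderCheck and diagnose are hypothetical names. The backslashes in the error message are consistent with a Windows host, where a path such as /scan_import/data_in with no drive letter is resolved against the current drive, so printing the absolute path shows where Java really looks.

```java
import java.io.File;

// Hypothetical diagnostic helper, independent of Nuxeo.
class SourceFolderCheck {

    // Returns a short diagnosis followed by the resolved absolute path.
    static String diagnose(File dir) {
        String resolved = dir.getAbsolutePath();
        if (!dir.exists()) {
            return "missing: " + resolved;
        }
        if (!dir.isDirectory()) {
            return "not a directory: " + resolved;
        }
        if (!dir.canRead()) {
            return "not readable: " + resolved;
        }
        return "ok: " + resolved;
    }

    public static void main(String[] args) {
        // Defaults to the sourcePath from the config in this post.
        String path = args.length > 0 ? args[0] : "/scan_import/data_in";
        System.out.println(diagnose(new File(path)));
    }
}
```

Running it as the same OS user as the application server gives the most meaningful result, since readability depends on that user's permissions.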
Now, I'm not sure whether this is the intended behaviour. Is it?
If not, how can I solve this problem (many folders in my repository)?
Any ideas?
Kr,
Stefan Paitoni
_______________________________________________
ECM mailing list
http://lists.nuxeo.com/mailman/listinfo/ecm