Hello again! So, I'm doing some investigation to my new ML project and I'm finding out some weird things happening:
-When updates are done, it creates additional documents (creating rather than updating). We would get new documents when we try to update a particular field that is part of the Data Access Descriptor (DAD) object, such as the URI that points to the document's product. What would cause this? I'm thinking some logic on their code. -The time to ingest documents takes about one per second which seems really slow to me (average size of document is approximately 15Kb). On my last project, we would batch 1,000 documents in one file and that seems to work better for us then. The only drawback is that if one document rejects during the ingest, the entire batch of 1,000 doesn't get sent. It sends up to the point when the corrupted document comes up. For example, the 999th document fails, only 998 get sent through. Is batch processing something we should consider now? Any thoughts? Any suggestions is appreciated. Thanks in advance! ~~Bob O.
_______________________________________________ General mailing list [email protected] http://developer.marklogic.com/mailman/listinfo/general
