Hello again!

So, I'm investigating my new ML project and I'm seeing some weird
things happen:

-When updates are done, additional documents are created (creating rather
than updating). We get new documents when we try to update a particular
field that is part of the Data Access Descriptor (DAD) object, such as the
URI that points to the document's product. What would cause this? I
suspect some logic in their code.

-Ingesting documents takes about one second per document, which seems
really slow to me (average document size is approximately 15 KB). On my
last project, we batched 1,000 documents in one file, and that seemed to
work better for us. The only drawback is that if one document is rejected
during the ingest, the rest of the batch of 1,000 doesn't get sent: only the
documents before the corrupted one go through. For example, if the 999th
document fails, only 998 get sent. Is batch processing something we should
consider now?
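
To make the batching question concrete, here is a minimal Python sketch of a
batch loop that records a rejected document and keeps going, instead of
stopping at the first failure. Everything in it is an assumption for
illustration: send_document and fake_send are hypothetical stand-ins for
whatever call actually writes one document (they are not a MarkLogic API),
and the /docs/N.xml URIs are made up.

```python
from typing import Callable, Iterable, List, Tuple


def ingest_batch(
    docs: Iterable[Tuple[str, str]],
    send_document: Callable[[str, str], None],
) -> Tuple[List[str], List[Tuple[str, str]]]:
    """Send every (uri, body) pair; collect failures instead of stopping."""
    sent: List[str] = []
    failed: List[Tuple[str, str]] = []
    for uri, body in docs:
        try:
            send_document(uri, body)
            sent.append(uri)
        except Exception as exc:  # assumption: a real client may raise something narrower
            # Remember which document was rejected and why, then continue.
            failed.append((uri, str(exc)))
    return sent, failed


def fake_send(uri: str, body: str) -> None:
    """Hypothetical stand-in for the real write call; rejects empty bodies."""
    if not body:
        raise ValueError("empty document")


# 1,000 documents where the 999th is corrupted (empty body).
batch = [(f"/docs/{i}.xml", "" if i == 999 else "<doc/>") for i in range(1, 1001)]
sent, failed = ingest_batch(batch, fake_send)
print(len(sent), len(failed))
```

In this sketch the 1,000th document still goes through and the rejected URI
is kept for a later retry. Whether the real ingest path allows per-document
error handling like this is something that would need to be confirmed.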

Any thoughts?

Any suggestions are appreciated. Thanks in advance!

~~Bob O.
_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general
