kasakrisz opened a new pull request #8: AMBARI-24792 - Infra Manager: not all the documents archived
URL: https://github.com/apache/ambari-infra/pull/8

## What changes were proposed in this pull request?

1. Infra Manager did not use the `sort_column` properties when reading the documents to archive. As a result, some documents were not exported when the read block size was smaller than the number of documents to export and no `sort_column` values were specified in the job parameters. Fix: copy the `sort_column` properties to the actual `ArchivingParameters` instance if the job parameters do not specify them.
2. Add an integration test to check that all the documents are archived.

## How was this patch tested?

UTs and ITs passed.

Manually:
1. Deploy Ambari and a cluster including Infra Solr, ZooKeeper and Log Search to a Vagrant environment.
2. Once the service logs collection contains some documents, stop Log Search so that no more service log documents are produced.
3. Run Infra Manager from IDEA.
4. Start the `archive_service_logs` job, specifying an interval end value of now, a read block size smaller than the service logs collection size, and a local destination only.
5. Check that the files in the local folder contain all the documents that were stored in the service logs collection, and that the service logs collection is empty in Solr.
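
The fallback described in item 1 of the proposed changes can be sketched roughly as follows. This is a minimal illustration, not the code from the patch: the class `SortColumnFallbackSketch`, the type `EffectiveParameters`, the method `resolve`, and the column names `logtime` and `id` are all hypothetical and do not reflect the real `ArchivingParameters` API. The idea shown is only that sort columns from the job parameters take precedence, and the configured `sort_column` properties are copied in when the job supplies none, so that block-wise reads always have a defined order.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Hypothetical sketch; names below are illustrative, not Infra Manager's API.
final class SortColumnFallbackSketch {

    /** Hypothetical stand-in for the effective parameters of one archiving run. */
    static final class EffectiveParameters {
        final int readBlockSize;
        final List<String> sortColumns;

        EffectiveParameters(int readBlockSize, List<String> sortColumns) {
            this.readBlockSize = readBlockSize;
            this.sortColumns = sortColumns;
        }
    }

    /**
     * Sort columns given by the job parameters win; when the job gives none,
     * the configured sort_column properties are copied instead.
     */
    static EffectiveParameters resolve(int readBlockSize,
                                       List<String> configuredSortColumns,
                                       List<String> jobSortColumns) {
        boolean jobGaveNone = jobSortColumns == null || jobSortColumns.isEmpty();
        List<String> chosen = jobGaveNone ? configuredSortColumns : jobSortColumns;
        // Defensive copy so later changes to the inputs cannot affect this run.
        return new EffectiveParameters(
                readBlockSize,
                Collections.unmodifiableList(new ArrayList<>(chosen)));
    }

    public static void main(String[] args) {
        // Assumed example values: two configured sort columns, none from the job.
        EffectiveParameters p = resolve(2, Arrays.asList("logtime", "id"), null);
        System.out.println(p.sortColumns);
    }
}
```

Without such a fallback, a run whose read block size is smaller than the collection would page through the documents with no defined order, which matches the situation the manual test in step 4 sets up.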
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

With regards,
Apache Git Services