Ali Nazemian created METRON-1677:
------------------------------------

             Summary: UUIDv4 GUID is not Lucene friendly
                 Key: METRON-1677
                 URL: https://issues.apache.org/jira/browse/METRON-1677
             Project: Metron
          Issue Type: Bug
            Reporter: Ali Nazemian


Generating the GUID as a UUIDv4 via UUID.randomUUID() in Java is not Lucene 
friendly: it hurts Elasticsearch and Solr indexing and search performance and 
can make that performance unpredictable.

http://blog.mikemccandless.com/2014/05/choosing-fast-unique-identifier-uuid.html
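
To illustrate the kind of alternative the linked post points toward, here is a
minimal sketch of an ID with a time-ordered prefix. The class TimeOrderedId and
its layout (16 hex characters of timestamp followed by 16 hex characters of
randomness) are assumptions made for illustration only; this is not Metron code
and not a format the post prescribes. IDs created close together share a long
common prefix, whereas two UUIDv4 values share essentially nothing.

```java
import java.security.SecureRandom;
import java.util.UUID;

public final class TimeOrderedId {
    private static final SecureRandom RANDOM = new SecureRandom();

    private TimeOrderedId() {}

    /**
     * Hypothetical helper: zero-padded hex timestamp followed by a
     * zero-padded hex random suffix, 32 characters in total.
     */
    public static String next(long epochMillis) {
        return String.format("%016x%016x", epochMillis, RANDOM.nextLong());
    }

    public static void main(String[] args) {
        long now = System.currentTimeMillis();
        // UUIDv4: consecutive values have no predictable shared prefix.
        System.out.println("UUIDv4:       " + UUID.randomUUID());
        System.out.println("UUIDv4:       " + UUID.randomUUID());
        // Time-ordered: consecutive values differ only near the end of the prefix.
        System.out.println("time-ordered: " + next(now));
        System.out.println("time-ordered: " + next(now + 1));
    }
}
```

Whether such a scheme actually helps depends on the workload and should be
measured; the sketch only shows the structural difference between the two
kinds of ID.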

Moreover, specifying the doc ID on the client side reduces indexing 
throughput, because Elasticsearch must check whether a document with that ID 
already exists, which effectively turns each insert into an upsert. Hence, 
indexing throughput can be increased by providing an option to disable ID 
generation on the client side. Currently, the way the ID is generated can be 
overridden at the config level by replacing the Metron default guid via 
Stellar, but it is not possible to disable it completely and let Elasticsearch 
choose the ID for the corresponding document.
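
As a rough sketch of what the requested option amounts to at the request
level: in the Elasticsearch bulk format, an index action that omits "_id" lets
Elasticsearch generate the ID itself. The class BulkActionLine, the method
indexAction and the index name "bro_index" are hypothetical names chosen for
this example and are not part of Metron.

```java
import java.util.Optional;

public final class BulkActionLine {
    private BulkActionLine() {}

    /**
     * Builds the action line of a bulk index request.
     * Assumes index and docId contain no characters that need JSON escaping.
     * An empty docId leaves "_id" out, so Elasticsearch assigns the ID.
     */
    public static String indexAction(String index, Optional<String> docId) {
        StringBuilder sb = new StringBuilder("{\"index\":{\"_index\":\"")
                .append(index)
                .append('"');
        docId.ifPresent(id -> sb.append(",\"_id\":\"").append(id).append('"'));
        return sb.append("}}").toString();
    }

    public static void main(String[] args) {
        // Client-side ID: Elasticsearch has to look the ID up before writing.
        System.out.println(indexAction("bro_index", Optional.of("abc")));
        // No ID: Elasticsearch generates one and can skip the lookup.
        System.out.println(indexAction("bro_index", Optional.empty()));
    }
}
```

A config-level switch for this behaviour would still need to be designed; the
sketch only shows the difference in the request that such a switch would
produce.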



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
