[jira] [Commented] (METRON-1829) Large Error Message Causes Slow Search Performance

ASF GitHub Bot (JIRA) Fri, 19 Oct 2018 10:15:28 -0700


    [ 
https://issues.apache.org/jira/browse/METRON-1829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16657116#comment-16657116
 ]


ASF GitHub Bot commented on METRON-1829:
----------------------------------------

Github user nickwallen commented on a diff in the pull request:

    https://github.com/apache/metron/pull/1239#discussion_r226720064
  
    --- Diff: 
metron-platform/metron-writer/src/main/java/org/apache/metron/writer/BulkWriterComponent.java
 ---
    @@ -118,12 +119,18 @@ public void commit(BulkWriterResponse response) {
     
       public void error(String sensorType, Throwable e, Iterable<Tuple> 
tuples, MessageGetStrategy messageGetStrategy) {
         LOG.error(format("Failing %d tuple(s); sensorType=%s", 
Iterables.size(tuples), sensorType), e);
    -    MetronError error = new MetronError()
    -            .withSensorType(Collections.singleton(sensorType))
    -            .withErrorType(Constants.ErrorType.INDEXING_ERROR)
    -            .withThrowable(e);
    -    tuples.forEach(t -> error.addRawMessage(messageGetStrategy.get(t)));
    -    handleError(tuples, error);
    +    tuples.forEach(t -> {
    +      MetronError error = new MetronError()
    +              .withSensorType(Collections.singleton(sensorType))
    +              .withErrorType(Constants.ErrorType.INDEXING_ERROR)
    +              .withThrowable(e)
    +              .addRawMessage(messageGetStrategy.get(t));
    --- End diff --
    
    While testing this, I am seeing a problem.  I am trying to decide whether 
we should call that a pre-existing condition and fix that problem after this PR 
goes in or whether the change here makes things any worse.  Let me know what 
you think.
    
    I have described the problem in 
[METRON-1832](https://issues.apache.org/jira/browse/METRON-1832).  If any index 
destination goes down, error messages are continually recycled and grow larger 
and larger.  In addition, long sequence of escape characters are created.
    



> Large Error Message Causes Slow Search Performance
> --------------------------------------------------
>
>                 Key: METRON-1829
>                 URL: https://issues.apache.org/jira/browse/METRON-1829
>             Project: Metron
>          Issue Type: Bug
>            Reporter: Ryan Merriman
>            Priority: Major
>
> Errors that occur during batch writes in the index topologies (batch and RA) 
> are written to Elasticsearch as a single, large error message, with a field 
> for each failed message. For example, if the batch size is 5000, a single 
> error message will be created with 5000 fields `raw_message_0`, 
> `raw_message_1`, .., `raw_message_4999`. With such large messages, searching 
> the error index in Elasticsearch is excessively slow.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Commented] (METRON-1829) Large Error Message Causes Slow Search Performance

Reply via email to