[ 
https://issues.apache.org/jira/browse/METRON-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16444588#comment-16444588
 ] 

ASF GitHub Bot commented on METRON-1526:
----------------------------------------

Github user justinleet commented on the issue:

    https://github.com/apache/metron/pull/995
  
    I agree, I'm not seeing how partial updates solves the general problem of 
"We need to avoid updates against derived fields".  End users aren't going to 
necessarily know what a generated field is.  Say a data scientist writes a 
machine learning model, and an analyst wants to update some output of the 
model.  They may not know, or even be able to easily discover, what fields are 
poly types and what aren't. They'll just get an error.  I'm not even convinced 
it'll be an easily understandable error to anyone not familiar with Solr.
    
    It seems like there one real, solution to this problem that doesn't 
compromise usability: Don't pass sub fields back in queries.
    
    This can be something in the schema itself (e.g. does docValues change 
work).  It could also be something determined while running (e.g. LukeRequest 
if it turns out to be viable or some other request / query).


> Location field types cause DocValuesField appear more than once error
> ---------------------------------------------------------------------
>
>                 Key: METRON-1526
>                 URL: https://issues.apache.org/jira/browse/METRON-1526
>             Project: Metron
>          Issue Type: Bug
>            Reporter: Ryan Merriman
>            Assignee: Ryan Merriman
>            Priority: Major
>
> While testing [https://github.com/apache/metron/pull/970] I get this error 
> when creating a meta alert:
> {code:java}
> Error from server at http://10.0.2.15:8983/solr/bro: Exception writing 
> document id bbc150f5-92f8-485d-93cc-11730c1edf31 to the index; possible 
> analysis error: DocValuesField 
> \"enrichments.geo.ip_dst_addr.location_point_0_coordinate\" appears more than 
> once in this document (only one value is allowed per field){code}
> I tracked it down to the fact that multiple fields are returned for a 
> location field.  For example when a field named 
> "enrichments.geo.ip_dst_addr.location_point" is configured in a schema, these 
> fields are returned in a query:
> {code:java}
> {
> "enrichments.geo.ip_dst_addr.location_point_0_coordinate": "33.4499",
> "enrichments.geo.ip_dst_addr.location_point_1_coordinate": "-112.0712",
> "enrichments.geo.ip_dst_addr.location_point": "33.4499,-112.0712"
> }
> {code}
>  We need a way to either suppress these extra fields when querying or remove 
> them before updating a document. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to