[
https://issues.apache.org/jira/browse/METRON-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16452723#comment-16452723
]
ASF GitHub Bot commented on METRON-1526:
----------------------------------------
Github user mmiklavc commented on a diff in the pull request:
https://github.com/apache/metron/pull/995#discussion_r184143875
--- Diff: metron-platform/metron-solr/README.md ---
@@ -95,3 +95,19 @@ As of now, we have mapped out the Schemas in
`src/main/config/schema`.
Ambari will eventually install these, but at the moment it's manual and
you should refer to the Solr documentation
[https://lucene.apache.org/solr/guide/6_6](here) in general
and
[here](https://lucene.apache.org/solr/guide/6_6/documents-fields-and-schema-design.html)
if you'd like to know more about schemas in Solr.
+
+In Metron's Solr DAO implementation, document updates involve reading a
document, applying the update and reindexing the whole document.
+Indexing LatLon and PointType field types stores data in internal fields
that should not be returned in search results. For these fields a dynamic
field type matching the suffix needs to be added to store the data points.
--- End diff --
We went through a lot of discussion on this PR to figure out the exact
fields and versions of Solr that the LatLon and Point types are problematic
for. Can you be a bit more explicit about that here? It's not just "for
example" it's literally "these types are a problem bc \<link to problem\> in
Solr version X.X, but have been corrected in Solr version X.Y"
> Location field types cause DocValuesField appear more than once error
> ---------------------------------------------------------------------
>
> Key: METRON-1526
> URL: https://issues.apache.org/jira/browse/METRON-1526
> Project: Metron
> Issue Type: Bug
> Reporter: Ryan Merriman
> Assignee: Ryan Merriman
> Priority: Major
>
> While testing [https://github.com/apache/metron/pull/970] I get this error
> when creating a meta alert:
> {code:java}
> Error from server at http://10.0.2.15:8983/solr/bro: Exception writing
> document id bbc150f5-92f8-485d-93cc-11730c1edf31 to the index; possible
> analysis error: DocValuesField
> \"enrichments.geo.ip_dst_addr.location_point_0_coordinate\" appears more than
> once in this document (only one value is allowed per field){code}
> I tracked it down to the fact that multiple fields are returned for a
> location field. For example when a field named
> "enrichments.geo.ip_dst_addr.location_point" is configured in a schema, these
> fields are returned in a query:
> {code:java}
> {
> "enrichments.geo.ip_dst_addr.location_point_0_coordinate": "33.4499",
> "enrichments.geo.ip_dst_addr.location_point_1_coordinate": "-112.0712",
> "enrichments.geo.ip_dst_addr.location_point": "33.4499,-112.0712"
> }
> {code}
> We need a way to either suppress these extra fields when querying or remove
> them before updating a document.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)