[ https://issues.apache.org/jira/browse/SOLR-1690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12795737#action_12795737 ]
Ryan McKinley commented on SOLR-1690:
-------------------------------------

I have been using it to store structured data in a single field, kind of like a less cryptic version of:
http://wiki.apache.org/solr/UserTagDesign

I'm not sure it belongs in /trunk, but I wanted to post it here so that others can use it if they want...

> JSONKeyValueTokenizerFactory -- JSON Tokenizer
> ----------------------------------------------
>
>                 Key: SOLR-1690
>                 URL: https://issues.apache.org/jira/browse/SOLR-1690
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: noggit-1.0-A1.jar, SOLR-1690-JSONKeyValueTokenizerFactory.patch
>
>
> Sometimes it is nice to group structured data into a single field.
> This (rough) patch takes JSON input and indexes tokens based on the key/value pairs in the JSON.
> {code:xml|title=schema.xml}
> <!-- JSON Field Type -->
> <fieldtype name="json" class="solr.TextField" positionIncrementGap="100" omitNorms="true">
>   <analyzer type="index">
>     <tokenizer class="solr.JSONKeyValueTokenizerFactory" keepArray="true" hierarchicalKey="false"/>
>     <filter class="solr.TrimFilterFactory"/>
>     <filter class="solr.LowerCaseFilterFactory"/>
>   </analyzer>
>   <analyzer type="query">
>     <tokenizer class="solr.KeywordTokenizerFactory"/>
>     <filter class="solr.TrimFilterFactory"/>
>     <filter class="solr.LowerCaseFilterFactory"/>
>   </analyzer>
> </fieldtype>
> {code}
> Given text:
> {code}
> { "hello": "world", "rank":5 }
> {code}
> is indexed as two tokens:
> || term position | 1 | 2 |
> || term text | hello:world | rank:5 |
> || term type | word | word |
> || source start,end | 12,17 | 27,28 |
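
For readers who want to see how the token table in the issue description comes about, here is a rough Python sketch of the key/value tokenization. It is not the code from the attached patch (which is Java and uses noggit); the function name json_key_value_tokens is made up for illustration. It assumes a flat JSON object, skips nested arrays and objects because the issue does not spell out what keepArray and hierarchicalKey do, and applies trimming and lowercasing inline to mimic the two filters in the index analyzer.

```python
import json
import re

_WS = re.compile(r"\s*")


def _skip_ws(text, i):
    """Return the index of the first non-whitespace character at or after i."""
    return _WS.match(text, i).end()


def json_key_value_tokens(text):
    """Return (term, start, end) for each key/value pair of a flat JSON object.

    Hypothetical helper: start/end are character offsets of the value in the
    source text (quotes excluded for strings), as in the issue's table.
    """
    decoder = json.JSONDecoder()
    i = _skip_ws(text, 0)
    if i >= len(text) or text[i] != "{":
        raise ValueError("expected a JSON object")
    i = _skip_ws(text, i + 1)
    tokens = []
    while i < len(text) and text[i] != "}":
        # Parse the key; raw_decode returns the value and the index just past it.
        key, i = decoder.raw_decode(text, i)
        if not isinstance(key, str):
            raise ValueError("object keys must be strings")
        i = _skip_ws(text, i)
        if i >= len(text) or text[i] != ":":
            raise ValueError("expected ':' after key")
        i = _skip_ws(text, i + 1)
        value, end = decoder.raw_decode(text, i)
        if isinstance(value, str):
            start, stop, value_text = i + 1, end - 1, value  # drop the quotes
        else:
            start, stop, value_text = i, end, text[i:end]    # raw literal, e.g. 5
        # Assumption: nested arrays/objects are skipped in this sketch.
        if not isinstance(value, (dict, list)):
            # strip() and lower() stand in for TrimFilter and LowerCaseFilter.
            tokens.append((f"{key}:{value_text}".strip().lower(), start, stop))
        i = _skip_ws(text, end)
        if i < len(text) and text[i] == ",":
            i = _skip_ws(text, i + 1)
    return tokens


if __name__ == "__main__":
    for position, token in enumerate(
        json_key_value_tokens('{ "hello": "world", "rank":5 }'), start=1
    ):
        print(position, token)
```

Run against the sample input from the issue, this yields the terms hello:world and rank:5 with the same source offsets as the table (12,17 and 27,28). The query analyzer in the schema uses KeywordTokenizerFactory, so a query such as rank:5 against this field would be matched as one whole term.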