Can you give some information about the mapper attachment setup you used successfully?
There is no good reason why this should not be possible with JDBC river. Jörg On Sun, Feb 22, 2015 at 5:20 PM, Jiri Pik <[email protected]> wrote: > I need to index a HTML column (nvarchar(MAX)) in a MS SQL Server database. > I have set up a JDBC river > https://github.com/jprante/elasticsearch-river-jdbc and the database is > indexed. > > Using > > "settings":{ > > "analysis":{ > > "analyzer":{ > > "default":{ > > "type":"custom", > > "tokenizer":"standard", > > "filter":[ "standard", "lowercase" ], > > "char_filter" : ["html_strip"] > > } > > } > > } > > } > > is good for searching but not for the highlighter as that returns > sometimes trimmed unpaired html tags. > > I have played with the Mapper Attachments with HTML attachments and then > the highlighter works well - all original html tags are gone - but I am > unable to get the river push the column directly to the Mapper Attachments. > > Questions: > > 1. what is the best practice for indexing HTML columns? I am aware of the > possibility of a manual removal of HTML tags using Agility Pack but do not > like that as it's too much extra maintenance. > > 2. is there any better highlighter for html data which doesn't cut off any > original html tags? > > 3. How to plug in the JDBC river to Mapper Attachments? > > 4. Any better ideas how to achieve my goals? > > > Thanks! > > -- > You received this message because you are subscribed to the Google Groups > "elasticsearch" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To view this discussion on the web visit > https://groups.google.com/d/msgid/elasticsearch/f175734b-0889-40a9-96d1-d46702e56666%40googlegroups.com > <https://groups.google.com/d/msgid/elasticsearch/f175734b-0889-40a9-96d1-d46702e56666%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAKdsXoH6Ei%2B23bRKrL0Z7WkQALengfhaZeJRBq5gK1F22yxJfg%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.
