David:

Do I need to use copy_to to a new dummy column in order for the highlighting to work?

From: [email protected] [mailto:[email protected]] On Behalf Of David Pilato
Sent: Sunday, February 22, 2015 6:15 PM
To: [email protected]
Subject: Re: Indexing of HTML Column in an MS SQL Server 2014 database

Hi Jörg

A bit off topic: I wonder if you are indexing blobs as base64 encoded fields in JDBC river? (I did not look at the doc)

--
David ;-)
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

On 22 Feb 2015, at 18:11, "[email protected]" <[email protected]> wrote:

Can you give some information about the mapper attachment setup you used successfully? There is no good reason why this should not be possible with JDBC river.

Jörg

On Sun, Feb 22, 2015 at 5:20 PM, Jiri Pik <[email protected]> wrote:

I need to index an HTML column (nvarchar(MAX)) in an MS SQL Server database. I have set up a JDBC river https://github.com/jprante/elasticsearch-river-jdbc and the database is indexed.

Using

    "settings": {
      "analysis": {
        "analyzer": {
          "default": {
            "type": "custom",
            "tokenizer": "standard",
            "filter": [ "standard", "lowercase" ],
            "char_filter": [ "html_strip" ]
          }
        }
      }
    }

is good for searching but not for the highlighter, as that sometimes returns trimmed, unpaired HTML tags.

I have played with the Mapper Attachments with HTML attachments, and then the highlighter works well - all original HTML tags are gone - but I am unable to get the river to push the column directly to the Mapper Attachments.

Questions:

1. What is the best practice for indexing HTML columns? I am aware of the possibility of manually removing HTML tags using Agility Pack, but I do not like that as it's too much extra maintenance.
2. Is there any better highlighter for HTML data which doesn't cut off any original HTML tags?
3. How do I plug the JDBC river into Mapper Attachments?
4. Any better ideas on how to achieve my goals?

Thanks!
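For question 1 above, one option is to strip the markup before the text reaches Elasticsearch and to index the resulting plain text in its own field, so that highlight fragments can never contain cut-off tags. The following is only a minimal sketch of that idea in Python, using the standard library alone. The names strip_html and BLOCK_TAGS, and the idea of a separate plain-text field, are assumptions made for illustration; nothing here is part of the JDBC river or of Mapper Attachments.

```python
from html.parser import HTMLParser

# Hypothetical, non-exhaustive set of tags that should separate words.
BLOCK_TAGS = {"p", "div", "br", "li", "tr", "td", "th",
              "h1", "h2", "h3", "h4", "h5", "h6"}


class _TextExtractor(HTMLParser):
    """Collects text nodes; drops tags and <script>/<style> content."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._chunks = []
        self._skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag in BLOCK_TAGS:
            self._chunks.append(" ")

    def handle_endtag(self, tag):
        if tag in ("script", "style"):
            if self._skip_depth > 0:
                self._skip_depth -= 1
        elif tag in BLOCK_TAGS:
            self._chunks.append(" ")

    def handle_data(self, data):
        if self._skip_depth == 0:
            self._chunks.append(data)

    def text(self):
        # Collapse the whitespace left behind by removed tags.
        return " ".join("".join(self._chunks).split())


def strip_html(html):
    """Return the visible text of an HTML fragment (illustrative helper)."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()
```

The output of strip_html could be stored next to the original HTML column, with search and highlighting pointed at the plain-text copy. Whether that is less maintenance than the Agility Pack route depends on where in the pipeline such a step can run.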
--
You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/f175734b-0889-40a9-96d1-d46702e56666%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
