David:

 

David: Do I need to use copy_to a new dummy column in order the highlighting to 
work???

 

From: [email protected] [mailto:[email protected]] On 
Behalf Of David Pilato
Sent: Sunday, February 22, 2015 6:15 PM
To: [email protected]
Subject: Re: Indexing of HTML Column in an MS SQL Server 2014 database

 

Hi Jörg 

 

A bit out of topic: I wonder if you are indexing blobs as base64 encoded fields 
in JDBC river?

(I did not look at the doc)

--

David ;-)

Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs


Le 22 févr. 2015 à 18:11, "[email protected] <mailto:[email protected]> 
" <[email protected] <mailto:[email protected]> > a écrit :

Can you give some information about the mapper attachment setup you used 
successfully?

 

There is no good reason why this should not be possible with JDBC river.

 

Jörg

 

On Sun, Feb 22, 2015 at 5:20 PM, Jiri Pik <[email protected] 
<mailto:[email protected]> > wrote:

I need to index a HTML column (nvarchar(MAX)) in a MS SQL Server database. I 
have set up a JDBC river https://github.com/jprante/elasticsearch-river-jdbc 
and the database is indexed.

Using 

  "settings":{

    "analysis":{

      "analyzer":{

        "default":{

          "type":"custom",

          "tokenizer":"standard",

          "filter":[ "standard", "lowercase" ], 

          "char_filter" : ["html_strip"]

        }

      }

    }

  }

is good for searching but not for the highlighter as that returns sometimes 
trimmed unpaired html tags. 

I have played with the Mapper Attachments with HTML attachments and then the 
highlighter works well - all original html tags are gone - but I am unable to 
get the river push the column directly to the Mapper Attachments.

Questions:

1. what is the best practice for indexing HTML columns? I am aware of the 
possibility of a manual removal of HTML tags using Agility Pack but do not like 
that as it's too much extra maintenance.

2. is there any better highlighter for html data which doesn't cut off any 
original html tags?

3. How to plug in the JDBC river to Mapper Attachments?

4. Any better ideas how to achieve my goals?

 

Thanks!

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected] 
<mailto:[email protected]> .
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/f175734b-0889-40a9-96d1-d46702e56666%40googlegroups.com
 
<https://groups.google.com/d/msgid/elasticsearch/f175734b-0889-40a9-96d1-d46702e56666%40googlegroups.com?utm_medium=email&utm_source=footer>
 .
For more options, visit https://groups.google.com/d/optout.

 

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected] 
<mailto:[email protected]> .
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAKdsXoH6Ei%2B23bRKrL0Z7WkQALengfhaZeJRBq5gK1F22yxJfg%40mail.gmail.com
 
<https://groups.google.com/d/msgid/elasticsearch/CAKdsXoH6Ei%2B23bRKrL0Z7WkQALengfhaZeJRBq5gK1F22yxJfg%40mail.gmail.com?utm_medium=email&utm_source=footer>
 .
For more options, visit https://groups.google.com/d/optout.

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected] 
<mailto:[email protected]> .
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/09317C08-E397-4044-91F2-072A5FA4A3DF%40pilato.fr
 
<https://groups.google.com/d/msgid/elasticsearch/09317C08-E397-4044-91F2-072A5FA4A3DF%40pilato.fr?utm_medium=email&utm_source=footer>
 .
For more options, visit https://groups.google.com/d/optout.

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/eeeabbac7ce6425abc9edc47698d3413%40Ex13DAG10-N1.dataoncloud.net.
For more options, visit https://groups.google.com/d/optout.

Attachment: smime.p7s
Description: S/MIME cryptographic signature

Reply via email to