Attachments are pretty aggressively stripped by the e-mail server, so
there's nothing to see,
you'll have to paste it somewhere else and provide a link.
Usually, though, this is a character set issue with the browser using a
different charset than
Solr, it's really the same character, just
Looks like images did not come through. Here's the text...
I'm ingesting a .TXT file with HTML content into Solr. The content has the
following character highlighted below:
The file we get from CRM (also attached):
td align=center style=text-align:center;font size=3span
I'm ingesting a .TXT file with HTML content into Solr. The content has the
following character highlighted below:
The file we get from CRM (also attached):
[cid:image001.png@01D0B972.75BE23F0]
After ingesting into solr, I see a different character. This is query response
from solr management
Shawn - Stupid coding error in my java code. Used default charset. Changed to
UTF-8 and problem fixed.
Thanks again!
-Original Message-
From: Tarala, Magesh
Sent: Wednesday, July 08, 2015 8:11 PM
To: solr-user@lucene.apache.org
Subject: RE: Solr Encoding Issue?
Wow, that makes total
...@gmail.com]
Sent: Wednesday, July 08, 2015 1:36 PM
To: solr-user@lucene.apache.org
Subject: Re: Solr Encoding Issue?
Attachments are pretty aggressively stripped by the e-mail server, so there's
nothing to see, you'll have to paste it somewhere else and provide a link.
Usually, though
Wow, that makes total sense. Thanks Shawn!!
I'll go down this path.
Thanks,
Magesh
-Original Message-
From: Shawn Heisey [mailto:apa...@elyograg.org]
Sent: Wednesday, July 08, 2015 7:24 PM
To: solr-user@lucene.apache.org
Subject: Re: Solr Encoding Issue?
On 7/8/2015 6:09 PM, Tarala
On 7/8/2015 6:09 PM, Tarala, Magesh wrote:
I believe the issue is in solr. The character “à” is getting stored in solr
as “Ã ”. Notice the space after Ã.
I'm using solrj to ingest the documents into solr. So, one of those could be
the culprit?
Solr accepts and outputs text in UTF-8. The