Also, this is the first line of what's posted along the river
{ "index": {"_index":"resumes","_type":"resume","_id":"2158912"}}
Things can get truncated when they're as big as a Base64 encoded file :)
On Wednesday, November 19, 2014 6:01:29 PM UTC-5, Raymond Giorgi wrote:
>
> Hey all,
>
> I'm hoping someone can help me out with something I'm having an issue with.
>
> The short: I'm trying to extract plaintext from the attachment-mapper.
>
> The long: I'm posting the contents of a file Base64 encoded to RabbitMQ
> which is feeding an ElasticSearch river plugin. Querying against the field
> works fine, but it only seems to store the Base64 encoding of the file
> instead of the plaintext. I'd like to extract the contents as plaintext and
> have that be returnable (i.e. query for the text of a docx). I'm feeding it
> from a PHP front end, so there are places in the app where I'd like to rely
> on Elasticsearch's built in Tika processor.
>
> Thanks!
>
--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/68456ac0-14b9-49f8-a0a0-b930223004f8%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.