Try the Apache Tika mailing list.

--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com

> 2. aug. 2019 kl. 05:01 skrev Zheng Lin Edwin Yeo <edwinye...@gmail.com>:
> 
> Hi,
> 
> Does anyone knows if this can be done on the Solr side?
> Or it has to be done on the Tika side?
> 
> Regards,
> Edwin
> 
> On Thu, 1 Aug 2019 at 09:38, Zheng Lin Edwin Yeo <edwinye...@gmail.com>
> wrote:
> 
>> Hi,
>> 
>> Would like to check, Is there anyway which we can detect the number of
>> attachments and their names during indexing of EML files in Solr, and index
>> those information into Solr?
>> 
>> Currently, Solr is able to use Tika and Tesseract OCR to extract the
>> contents of the attachments. However, I could not find the information
>> about the number of attachments in the EML file and what are their filename.
>> 
>> I am using Solr 7.6.0 in production, and also trying out on the new Solr
>> 8.2.0.
>> 
>> Regards,
>> Edwin
>> 

Reply via email to