No, analyzers are plain-text processors. You will need to transform binary
formats to plain text yourself. Tika is a great starting point.

--

Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Lucene.NET committer and PMC member

On Wed, Dec 14, 2016 at 5:22 PM, Francesco Abbruzzese <
[email protected]> wrote:

> Hi Itamar ,
>
> Are there analyzers (or text filters)  for all more common type of
> documents,ie word, pdf, ppt, etc one may use with the new lucene .net
> version?
>
>
> 2016-12-13 13:59 GMT+01:00 Itamar Syn-Hershko <[email protected]>:
>
>> We are about to release Lucene.NET 4.8, and it's time to show what it can
>> do, and how it can be done.
>>
>> I just published a walkthrough video on Channel 9, you can watch it here:
>> https://channel9.msdn.com/Blogs/MVP-VisualStudio-Dev/LuceneN
>> ET-48-a-pre-release-introduction
>>
>> The Demo application can be found at https://github.com/synhershko/
>> LuceneNetDemo
>>
>> nuget packages can be downloaded from https://myget.org/gallery/luce
>> ne-net
>>
>> Comments? questions? reach out to us on our mailing lists:
>> http://lucenenet.apache.org/community.html
>>
>> Enjoy!
>>
>> --
>>
>> Itamar Syn-Hershko
>> http://code972.com | @synhershko <https://twitter.com/synhershko>
>> Freelance Developer & Consultant
>> Lucene.NET committer and PMC member
>>
>>
>
>
> --
> Francesco Abbruzzese
> [email protected]
> http://www.dotnet-programming.com/
> https://github.com/MvcControlsToolkit
> http://mvccontrolstoolkit.codeplex.com/
>
>
>
>
>
>
>
>
>
>

Reply via email to