No, analyzers are plain-text processors. You will need to transform binary formats to plain text yourself. Tika is a great starting point.
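To make the "transform it yourself" step concrete, here is a minimal sketch in Python, using only the standard library, that pulls plain text out of a single binary format (.docx, which is a zip archive containing XML). This is not Lucene.NET or Tika code, and `docx_to_text` is a hypothetical helper name. It only illustrates that extraction happens before the text ever reaches an analyzer. For real-world coverage of PDF, PPT and so on, a dedicated extraction library such as Tika is the more realistic route.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used inside word/document.xml
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_to_text(data: bytes) -> str:
    """Return the plain text of a .docx file given its raw bytes.

    Hypothetical helper for illustration only: it ignores headers,
    footers, footnotes and anything outside the main document body.
    """
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))
    paragraphs = []
    for para in root.iter(W_NS + "p"):
        # Join the text runs (w:t) that make up each paragraph (w:p)
        paragraphs.append("".join(t.text or "" for t in para.iter(W_NS + "t")))
    return "\n".join(paragraphs)
```

The resulting string is what you would then put into an indexed text field, at which point the analyzer takes over.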
--

Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Lucene.NET committer and PMC member

On Wed, Dec 14, 2016 at 5:22 PM, Francesco Abbruzzese <[email protected]> wrote:

> Hi Itamar,
>
> Are there analyzers (or text filters) for all more common type of
> documents, ie word, pdf, ppt, etc one may use with the new lucene .net
> version?
>
> 2016-12-13 13:59 GMT+01:00 Itamar Syn-Hershko <[email protected]>:
>
>> We are about to release Lucene.NET 4.8, and it's time to show what it can
>> do, and how it can be done.
>>
>> I just published a walkthrough video on Channel 9, you can watch it here:
>> https://channel9.msdn.com/Blogs/MVP-VisualStudio-Dev/LuceneNET-48-a-pre-release-introduction
>>
>> The Demo application can be found at
>> https://github.com/synhershko/LuceneNetDemo
>>
>> nuget packages can be downloaded from https://myget.org/gallery/lucene-net
>>
>> Comments? questions? reach out to us on our mailing lists:
>> http://lucenenet.apache.org/community.html
>>
>> Enjoy!
>>
>> --
>>
>> Itamar Syn-Hershko
>> http://code972.com | @synhershko <https://twitter.com/synhershko>
>> Freelance Developer & Consultant
>> Lucene.NET committer and PMC member
>
> --
> Francesco Abbruzzese
> [email protected]
> http://www.dotnet-programming.com/
> https://github.com/MvcControlsToolkit
> http://mvccontrolstoolkit.codeplex.com/
