Someone with a good experience in programming and a good knowledge of Lucene and IR.
best, reda > On 1 Dec. 2016, at 14:33, Chris Manu <chrisman...@hotmail.com> wrote: > > Thank you for responding. So, theoretically, I would need to hire someone > with Apache programing experience to do this correct (given that I know > nothing about programing)? What type of experience should I look for? > > > ________________________________ > From: Xavier Morera <xav...@familiamorera.com > <mailto:xav...@familiamorera.com>> > Sent: December 1, 2016 2:23 AM > To: general@lucene.apache.org <mailto:general@lucene.apache.org> > Subject: Re: Feasability > > The answer is yes, but you would need to do some programming and > configuring. > > On Wed, Nov 30, 2016 at 7:54 PM, Chris Manu <chrisman...@hotmail.com> wrote: > >> Hello, >> >> >> I want to start off by saying that I am not a programmer...and have very >> little knowledge in this area. >> >> >> What I would like to know if Apache would be capable of doing the >> following: >> >> Take an extensive list (A) of strings of unique words (these are titles - >> anywhere from 4 words to 30) saved in either an Excel worksheet or in a >> text file and search for instances (B) where these can be found in PDF >> files saved on a hard drive (over 100k files). The search would need to be >> done using a fuzzy logic rather than exact matching and the output would be >> in an Excel file list the unique string found (A), the file name in which >> the match was made (B), the page number where the match was made and the >> surrounding text on either side of As well, would this be a complicated >> program, usable by novices coached in the process necessary to input the >> title file (A) and direct the search to the relevant folder containing the >> PDF files (B). >> >> >> I eagerly await (hopefully) an affirmative answer. >> >> >> Cheers! >> >> > > > -- > > *Xavier Morera* > > Entrepreneur | Author & Trainer | Consultant | Developer & Scrum Master > > *www.xaviermorera.com <http://www.xaviermorera.com/>* > [https://i2.wp.com/www.xaviermorera.com/wp-content/uploads/2016/06/xavier-morera.jpg?resize=150%2C150 > > <https://i2.wp.com/www.xaviermorera.com/wp-content/uploads/2016/06/xavier-morera.jpg?resize=150%2C150>]<http://www.xaviermorera.com/ > <http://www.xaviermorera.com/>> > > Xavier Morera<http://www.xaviermorera.com/ <http://www.xaviermorera.com/>> > www.xaviermorera.com <http://www.xaviermorera.com/> > I have been working with Solr for a while, mainly from the .NET world and I > basically love it. I use SolrNet which I think it is a very mature and stable > library. > > > > office: (305) 600-4919 > > cel: +506 8849-8866 > > skype: xmorera > Twitter <https://twitter.com/xmorera <https://twitter.com/xmorera>> | LinkedIn > [https://pbs.twimg.com/profile_images/464050157344940033/7AA_lsgC_400x400.jpeg > > <https://pbs.twimg.com/profile_images/464050157344940033/7AA_lsgC_400x400.jpeg>]<https://twitter.com/xmorera > <https://twitter.com/xmorera>> > > xmorera (@xmorera) | Twitter<https://twitter.com/xmorera > <https://twitter.com/xmorera>> > twitter.com <http://twitter.com/> > The latest Tweets from xmorera (@xmorera). Eternal optimist, entrepreneur, > lifelong learner, passionate about technology. Costa Rica > > > <https://www.linkedin.com/in/xmorera <https://www.linkedin.com/in/xmorera>> | > Pluralsight Author > [https://media.licdn.com/mpr/mpr/shrinknp_200_200/p/5/005/07f/033/28fdf8e.jpg > <https://media.licdn.com/mpr/mpr/shrinknp_200_200/p/5/005/07f/033/28fdf8e.jpg>]<https://www.linkedin.com/in/xmorera > <https://www.linkedin.com/in/xmorera>> > > Xavier Morera | LinkedIn<https://www.linkedin.com/in/xmorera > <https://www.linkedin.com/in/xmorera>> > www.linkedin.com <http://www.linkedin.com/> > Xavier Morera is an entrepreneur, project manager, Pluralsight author, > speaker, trainer, Certified Scrum Master & Professional and Certified > Microsoft professional ... > > > <http://www.pluralsight.com/author/xavier-morera > <http://www.pluralsight.com/author/xavier-morera>> > Xavier Morera - .Net Author | > Pluralsight<http://www.pluralsight.com/author/xavier-morera > <http://www.pluralsight.com/author/xavier-morera>> > www.pluralsight.com <http://www.pluralsight.com/> > Xavier is an entrepreneur, project manager, technical author, trainer, > Certified Scrum Professional & Scrum Master, and Certified Microsoft > Professional.