I have two courses and a book on Solr, aimed for getting started. If you watch the first part of both of them you could get a better idea of what needs to be done. They are in Pluralsight:
Getting Started with Enterprise Search Using Apache Solr <https://www.pluralsight.com/courses/enterprise-search-using-apache-solr> Implementing Search in .NET Applications <https://www.pluralsight.com/courses/implementing-search-dotnet-applications> On Wed, Nov 30, 2016 at 10:33 PM, Chris Manu <chrisman...@hotmail.com> wrote: > Thank you for responding. So, theoretically, I would need to hire someone > with Apache programing experience to do this correct (given that I know > nothing about programing)? What type of experience should I look for? > > > ________________________________ > From: Xavier Morera <xav...@familiamorera.com> > Sent: December 1, 2016 2:23 AM > To: general@lucene.apache.org > Subject: Re: Feasability > > The answer is yes, but you would need to do some programming and > configuring. > > On Wed, Nov 30, 2016 at 7:54 PM, Chris Manu <chrisman...@hotmail.com> > wrote: > > > Hello, > > > > > > I want to start off by saying that I am not a programmer...and have very > > little knowledge in this area. > > > > > > What I would like to know if Apache would be capable of doing the > > following: > > > > Take an extensive list (A) of strings of unique words (these are titles - > > anywhere from 4 words to 30) saved in either an Excel worksheet or in a > > text file and search for instances (B) where these can be found in PDF > > files saved on a hard drive (over 100k files). The search would need to > be > > done using a fuzzy logic rather than exact matching and the output would > be > > in an Excel file list the unique string found (A), the file name in which > > the match was made (B), the page number where the match was made and the > > surrounding text on either side of As well, would this be a complicated > > program, usable by novices coached in the process necessary to input the > > title file (A) and direct the search to the relevant folder containing > the > > PDF files (B). > > > > > > I eagerly await (hopefully) an affirmative answer. > > > > > > Cheers! > > > > > > > -- > > *Xavier Morera* > > Entrepreneur | Author & Trainer | Consultant | Developer & Scrum Master > > *www.xaviermorera.com <http://www.xaviermorera.com/>* > [https://i2.wp.com/www.xaviermorera.com/wp-content/ > uploads/2016/06/xavier-morera.jpg?resize=150%2C150]<http:// > www.xaviermorera.com/> > > Xavier Morera<http://www.xaviermorera.com/> > www.xaviermorera.com > I have been working with Solr for a while, mainly from the .NET world and > I basically love it. I use SolrNet which I think it is a very mature and > stable library. > > > > office: (305) 600-4919 > > cel: +506 8849-8866 > > skype: xmorera > Twitter <https://twitter.com/xmorera> | LinkedIn > [https://pbs.twimg.com/profile_images/464050157344940033/7AA_lsgC_ > 400x400.jpeg]<https://twitter.com/xmorera> > > xmorera (@xmorera) | Twitter<https://twitter.com/xmorera> > twitter.com > The latest Tweets from xmorera (@xmorera). Eternal optimist, entrepreneur, > lifelong learner, passionate about technology. Costa Rica > > > <https://www.linkedin.com/in/xmorera> | Pluralsight Author > [https://media.licdn.com/mpr/mpr/shrinknp_200_200/p/5/005/ > 07f/033/28fdf8e.jpg]<https://www.linkedin.com/in/xmorera> > > Xavier Morera | LinkedIn<https://www.linkedin.com/in/xmorera> > www.linkedin.com > Xavier Morera is an entrepreneur, project manager, Pluralsight author, > speaker, trainer, Certified Scrum Master & Professional and Certified > Microsoft professional ... > > > <http://www.pluralsight.com/author/xavier-morera> > Xavier Morera - .Net Author | Pluralsight<http://www. > pluralsight.com/author/xavier-morera> > www.pluralsight.com > Xavier is an entrepreneur, project manager, technical author, trainer, > Certified Scrum Professional & Scrum Master, and Certified Microsoft > Professional. > > > -- *Xavier Morera* Entrepreneur | Author & Trainer | Consultant | Developer & Scrum Master *www.xaviermorera.com <http://www.xaviermorera.com/>* office: (305) 600-4919 cel: +506 8849-8866 skype: xmorera Twitter <https://twitter.com/xmorera> | LinkedIn <https://www.linkedin.com/in/xmorera> | Pluralsight Author <http://www.pluralsight.com/author/xavier-morera>