Re: Strict Search in Apache Solr

2014-05-05 Thread Reyes, Mark
/14, 3:17 PM, "Ahmet Arslan" wrote: >Hi Reyes, > >I think it is not clear your question. >Please see : https://wiki.apache.org/solr/UsingMailingLists > >Ahmet > >On Tuesday, May 6, 2014 12:23 AM, "Reyes, Mark" >wrote: >How could Solr accomplish a

Strict Search in Apache Solr

2014-05-05 Thread Reyes, Mark
How could Solr accomplish an end-user behavior like a strict search? Let’s say an end-user decides to use quotation marks in their keywords to provide specificity in their search results. Current: If you were to query: your future, then 10 results would return and print to the page. Expected:
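
For reference, a minimal sketch of the quoting behavior asked about above: with Solr's standard query syntax, wrapping the keywords in double quotes makes them a phrase query instead of separate terms. The helper names, the base URL, and the `collection1` core are assumptions for illustration only, not part of the original message.

```python
from urllib.parse import urlencode


def phrase_query(keywords: str) -> str:
    """Wrap user keywords in double quotes so they are sent as one phrase."""
    # Escape backslashes and embedded quotes so the phrase stays well-formed.
    escaped = keywords.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def select_url(base: str, keywords: str, strict: bool) -> str:
    """Build a /select URL; `base` is a hypothetical core URL."""
    q = phrase_query(keywords) if strict else keywords
    return f"{base}/select?{urlencode({'q': q, 'wt': 'json'})}"


if __name__ == "__main__":
    base = "http://localhost:8983/solr/collection1"  # assumed core name
    print(select_url(base, "your future", strict=False))
    print(select_url(base, "your future", strict=True))
```

The strict variant sends q="your future", so only documents containing the two words as an adjacent phrase should match, subject to how the queried field is analyzed.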

Re: Indexing URLs for Binaries

2014-01-03 Thread Reyes, Mark
Check suffix-urlfilter.txt in your conf directory for Nutch. You might be prohibiting those filetypes from the crawl. - Mark On 1/3/14, 10:29 AM, "Teague James" wrote: >I am using Nutch 1.7 with Solr 4.6.0 to index websites that have links to >binary files, such as Word, PDF, etc. The craw

Proxy.php tutorials for AJAX Solr

2013-12-02 Thread Reyes, Mark
Are there any good tutorials that cover how to integrate the suggested PHP proxy for the JavaScript framework AJAX Solr? Here is the proxy, https://gist.github.com/evolvingweb/298580 Also on Stackoverflow, http://stackoverflow.com/questions/20338073/proxy-php-tutorials-for-ajax-solr IMPORT

Re: Client-side proxy for Solr 4.5.0

2013-11-27 Thread Reyes, Mark
What about using some JSONP techniques since the results in the Solr instance rest as key/value pairs? On 11/26/13, 10:53 AM, "Markus Jelsma" wrote: >I don't think you mean client-side proxy. You need a server side layer >such as a normal web application or good proxy. We use Nginx, it is very
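
A rough sketch of the JSONP idea raised above, assuming Solr's JSON response writer and its `json.wrf` parameter, which wraps the response in a named callback function. The function name, base URL, and callback name are hypothetical. Note that JSONP still requires the browser to reach Solr directly, so it does not by itself hide port 8983.

```python
from urllib.parse import urlencode


def jsonp_url(base: str, query: str, callback: str) -> str:
    """Build a /select URL whose JSON response is wrapped in `callback(...)`."""
    # json.wrf names the JavaScript function that will wrap the JSON body.
    params = {"q": query, "wt": "json", "json.wrf": callback}
    return f"{base}/select?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical core URL and callback name, for illustration only.
    print(jsonp_url("http://localhost:8983/solr/collection1", "doctorate", "handleResults"))
```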

Client-side proxy for Solr 4.5.0

2013-11-26 Thread Reyes, Mark
Are there any GOOD client-side solutions to proxy a Solr 4.5.0 instance so that the end-user can see their queries w/o being able to directly access :8983? Applications/frameworks used: - Solr 4.5.0 - AJAX Solr (javascript library) Thank you, Mark
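
One way to picture the server-side layer suggested in the reply is a small filter that forwards only known-safe query parameters to the /select handler and drops everything else. This is a sketch under stated assumptions, not a complete proxy: the whitelist contents, the function name, and the upstream URL are all hypothetical, and a real deployment would sit behind a web server or application framework.

```python
from urllib.parse import parse_qsl, urlencode

# Hypothetical whitelist: read-only search parameters the front end may send.
ALLOWED_PARAMS = {
    "q", "fq", "fl", "start", "rows", "sort",
    "wt", "json.wrf", "facet", "facet.field",
}


def upstream_url(solr_base: str, raw_query: str) -> str:
    """Rebuild the query string with only whitelisted parameters,
    always targeting /select so update handlers are never reachable."""
    pairs = parse_qsl(raw_query, keep_blank_values=True)
    kept = [(key, value) for key, value in pairs if key in ALLOWED_PARAMS]
    return f"{solr_base}/select?{urlencode(kept)}"


if __name__ == "__main__":
    incoming = "q=programs&rows=10&qt=/update&stream.body=x"
    print(upstream_url("http://localhost:8983/solr/collection1", incoming))
```

The end user only ever talks to the proxy's public address; the Solr port stays closed to outside traffic.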

Indexing data to a specific collection in Solr 4.5.0

2013-11-21 Thread Reyes, Mark
Hi all: I’m currently on a Solr 4.5.0 instance and running this tutorial, http://lucene.apache.org/solr/4_5_0/tutorial.html My question is specific to indexing data as proposed from this tutorial, $ java -jar post.jar solr.xml monitor.xml The tutorial advises to validate from your localhost, h

Re: Indexing data to a specific collection in Solr 4.5.0

2013-11-21 Thread Reyes, Mark
So then, $ java -Durl=http://localhost:8983/solr/collection2/update -jar post.jar solr.xml monitor.xml On 11/21/13, 8:14 AM, "xiezhide" wrote: > >add -Durl=http://localhost:8983/solr/collection2/update when running post.jar, >This message was sent from 189 Mail > >"Reyes, Mark" wrote
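
A small sketch of how the command line is put together: post.jar reads the target update URL from the `url` Java system property, and system properties must appear before `-jar`, because anything after the jar name is passed to the program as an ordinary argument. The helper name is hypothetical; the URL and file names are the ones from the thread.

```python
def post_command(update_url: str, files: list[str]) -> list[str]:
    """Return the argv for posting `files` to a specific collection's update handler."""
    # -Durl=... is a JVM option, so it goes before "-jar post.jar".
    return ["java", f"-Durl={update_url}", "-jar", "post.jar", *files]


if __name__ == "__main__":
    argv = post_command(
        "http://localhost:8983/solr/collection2/update",
        ["solr.xml", "monitor.xml"],
    )
    print(" ".join(argv))
    # To actually run it from the exampledocs directory:
    # import subprocess; subprocess.run(argv, check=True)
```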

Nutch 1.7 solrdedup error

2013-11-17 Thread Reyes, Mark
When trying to delete duplicates after crawl I get the following, http://pastebin.com/aQbqmPLm When running this command on terminal: $ bin/nutch solrdedup http://localhost:8983/solr/rockies Here is my setup: - Nutch 1.7 - Solr 4.5.0 - java version "1.6.0_51" On Stackoverflow as well, http://s

Nutch 1.7 + AJAX Solr returning ALL contents vs. SPECIFIC

2013-11-11 Thread Reyes, Mark
Hi: I was encouraged to explore the Solr mailing list, specifically regarding the fl parameter. What is that parameter for and can it accomplish my original task of crawling/indexing specific HTML components versus parsing the entire page? My original question is listed below (previously on the
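
For context, `fl` ("field list") is a query-time parameter: it limits which stored fields come back in each result and does not change what gets crawled or indexed. A minimal sketch follows; the function name, base URL, and the field names `title` and `url` are assumptions and may not match the schema in use.

```python
from urllib.parse import urlencode


def select_fields_url(base: str, query: str, fields: list[str]) -> str:
    """Build a /select URL that asks only for the listed stored fields."""
    # fl takes a comma-separated list of field names.
    params = {"q": query, "fl": ",".join(fields), "wt": "json"}
    return f"{base}/select?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical core URL and field names.
    print(select_fields_url("http://localhost:8983/solr/collection1", "programs", ["title", "url"]))
```

Restricting what is extracted from each page would have to happen earlier, on the crawling and parsing side.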

Setting up Multiple Cores on Solr 4.5.0

2013-11-10 Thread Reyes, Mark
Any good/recent documentation that I can reference on setting up multiple cores in Solr 4.5.0? Thanks all, Mark IMPORTANT NOTICE: This e-mail message is intended to be received only by persons entitled to receive the confidential information it may contain. E-mail messages sent from Bridgepoi

Re: Exclude urls without 'www' from Nutch 1.7 crawl

2013-11-01 Thread Reyes, Mark
Noted and will do (that goes for both the suggestions and putting this on the Nutch list instead). Thanks all, Mark On 11/1/13, 10:53 AM, "Furkan KAMACI" wrote: >As Markus pointed out, Nutch has a feature for this kind of situation. This is >the Solr list, but one more thing for you: www.mywebsite.com

Exclude urls without 'www' from Nutch 1.7 crawl

2013-11-01 Thread Reyes, Mark
I'm currently using Nutch 1.7 to crawl my domain. My issue is specific to URLs being indexed as www vs. non-www. Specifically, after firing the crawl and index to Solr 4.5 then validating the results on the front-end with AJAX Solr, the search results page lists results/pages that are both 'www
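
To illustrate the kind of normalization that collapses the www and non-www variants into one form, here is a standalone sketch. It is not Nutch configuration; inside Nutch this would more likely be done with URL filter or normalizer rules. The function name is hypothetical, and the sketch assumes a single bare domain (it would also prefix other subdomains) and drops any user:password part of the URL.

```python
from urllib.parse import urlsplit, urlunsplit


def force_www(url: str) -> str:
    """Rewrite a URL so its host always starts with 'www.'."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    if host and not host.startswith("www."):
        netloc = "www." + host
        if parts.port:
            # Preserve an explicit port if one was given.
            netloc += f":{parts.port}"
        parts = parts._replace(netloc=netloc)
    return urlunsplit(parts)


if __name__ == "__main__":
    for candidate in ("http://mywebsite.com/page", "http://www.mywebsite.com/page"):
        print(force_www(candidate))
```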

Re: AJAX Solr returning the default wildcard *:* and not what I query

2013-10-31 Thread Reyes, Mark
I just tweaked the reuters.js example to listen to the window.location object and it resolved the wildcard returns. I put it on pastebin, http://pastebin.com/GyC4RMva Thanks for the reply everyone, Mark --- P.
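
The fix described above lives in JavaScript, and the pastebin contents are not reproduced here. As a language-neutral illustration of the same idea, the sketch below (in Python) reads the search term from the page URL and falls back to the `*:*` wildcard only when no term is present. The parameter name `q` and the example URLs are assumptions.

```python
from urllib.parse import parse_qs, urlsplit


def query_from_location(href: str, param: str = "q") -> str:
    """Return the search term found in the URL, or '*:*' if there is none."""
    values = parse_qs(urlsplit(href).query).get(param, [])
    term = values[0].strip() if values else ""
    # Only use the match-all wildcard when the user supplied nothing.
    return term or "*:*"


if __name__ == "__main__":
    print(query_from_location("http://example.com/search?q=doctorate"))
    print(query_from_location("http://example.com/search"))
```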

Re: AJAX Solr returning the default wildcard *:* and not what I query

2013-10-30 Thread Reyes, Mark
solr.log file per Solr 4.5 http://pastebin.com/zSpERJZA Thanks Shawn, Mark On 10/30/13, 12:44 PM, "Shawn Heisey" wrote: >On 10/30/2013 1:26 PM, Reyes, Mark wrote: >> I am currently integrating JavaScript framework AJAX Solr to my domain. >>I am trying to query wor

AJAX Solr returning the default wildcard *:* and not what I query

2013-10-30 Thread Reyes, Mark
I am currently integrating the JavaScript framework AJAX Solr into my domain. I am trying to query words such as 'doctorate' or 'programs', but the console reports only the default wildcard '*:*'. Just curious if anyone has any helpful hints? The problem can be seen in detail on Stackoverflow, ht