Delete Documents

2015-07-18 Thread naga sharathrayapati
Hey, I understand that DocExpirationUpdateProcessorFactory in the config.xml can be specified to delete documents based on the expiration. I would like to understand whether there is a chance of these deleted documents getting re-indexed? Solr 5.2 Thanks

How to Directly Create indexes from Files in HDFS?

2015-07-18 Thread ??????
Hello everyone, Good day. I'm new to solr and have some questions. Could anyone help me? I want to index files in hdfs using solr. And I know that we can use solr.extraction.ExtractingRequestHandler to directly index files in local file system. But this doesn't work for the files in

Re: solr blocking and client timeout issue

2015-07-18 Thread Jeremy Ashcraft
I turned on GC logging and verified that its definitely being caused by a GC pause. I tried the tuning option from the article and get this warning: OpenJDK 64-Bit Server VM warning: Failed to reserve shared memory (errno = 1). any recommendations on how to get rid of that warning, and

Re: Delete Documents

2015-07-18 Thread Erick Erickson
No idea what you mean by chance of these deleted docs getting re-indexed. Solr shouldn't be doing this by itself. Certainly if your indexing process sends them in again they'll be re-indexed, there's no notion of never index this doc again. Why? Are you seeing some symptom or are you just

Re: Easy way to test index/query filters?

2015-07-18 Thread Alexandre Rafalovitch
I haven't found one. I have a project plan for something just like this but it is one of many Solr-related ideas. Mine is actually around the idea of small multiples with several similar stacks next to each other and seeing how the same text/query run differently with minor variations. If people

Re: Extracting article keywords using tf-idf algorithm

2015-07-18 Thread Diego Ceccarelli
Dear Ali, I'm not sure I understand what you are trying to do, please correct me if I misunderstood: given a document indexed into lucene you want to retrieve the top-k terms with highest tf-idf right? Could you please post your code somewhere? I don't understand what is mlt :) Cheers, Diego

Re: solr blocking and client timeout issue

2015-07-18 Thread Shawn Heisey
On 7/18/2015 12:42 AM, Jeremy Ashcraft wrote: I turned on GC logging and verified that its definitely being caused by a GC pause. I tried the tuning option from the article and get this warning: OpenJDK 64-Bit Server VM warning: Failed to reserve shared memory (errno = 1). any

?????? How to create indexes for files in hdfs using tika embeded insolr?

2015-07-18 Thread ??????
Thank you very much. I'm trying to use MapReduceIndexerTool and have downloaded the search-mr-1.1.0.jar. Another question comes that how can I download the dependent jar of search-mr-1.1.0? I tried to use mvn command to download the dependences with the pom in

Re: Range Facet queries for date ranges with with non-constant gaps

2015-07-18 Thread JoeSmith
Thank you. That helped On Tue, Jul 14, 2015 at 5:02 PM, Chris Hostetter hossman_luc...@fucit.org wrote: : Are there any examples/documentation for IntervalFaceting using dates that : I could refer to? You just specify the interval set start end as properly formated date values. This

Easy way to test index/query filters?

2015-07-18 Thread Robert Oschler
I know the Solr Admin panel has a way to test the current index and query filters already in place in a schema file, but I was wondering if there is a convenient playground for testing index and query filters? I'm imagining a utility where you can select a set of index and query filters, and

Re: Easy way to test index/query filters?

2015-07-18 Thread Nagasharath
I like the idea. I am hearing nowadays about solrmeter, can't we accomplish this in solrmeter? Thanks, Naga On 18-Jul-2015, at 8:51 am, Alexandre Rafalovitch arafa...@gmail.com wrote: I haven't found one. I have a project plan for something just like this but it is one of many

How to create indexes for files in hdfs using tika embeded in solr?

2015-07-18 Thread ??????
I need help. I have several hundreds of GB files in hdfs and I want to creat indexes for these files so that I can search quickly. How can I create indexes for these files in hdfs? I know tika embeded in solr could extact the content of files in local file system and then solr would create

Re: How to create indexes for files in hdfs using tika embeded in solr?

2015-07-18 Thread Raja Pothuganti
would MapReduceIndexerTool option ? http://www.cloudera.com/content/cloudera/en/documentation/cloudera-search/v 1-latest/Cloudera-Search-User-Guide/csug_mapreduceindexertool.html On 7/18/15, 9:38 AM, 步青云 mailliup...@qq.com wrote: I need help. I have several hundreds of GB files in hdfs and

Re: Help me under the pros/cons of dynamic fields and schema-less mode

2015-07-18 Thread Shawn Heisey
On 7/18/2015 9:49 AM, Charlie Hubbard wrote: So I want to allow people to upload any CSV/XML/JSON to solr they want so having a predefined schema isn't going to cut it. After reading about my options I figured my choices were schema-less mode and dynamic fields using the * with a type other

Re: Getting query logs?

2015-07-18 Thread Erick Erickson
If you fire up thing in cloud mode in the example method, you should be seeing something like ./example/cloud/node1/logs, ./example/cloud/node2/logs etc. If you're in non-cloud mode you should see something like ./server/logs. There is some logging on the admin/ui, see the logging selection.

Getting query logs?

2015-07-18 Thread Karl Kildén
Hello! before when start.jar was used the requests rolled in the console. When I start with the new method (bin/solr) it does not. There is no relevant log file anywhere either... How can log / watch requests? Best would be to get them in the admin GUI cheers

Help me under the pros/cons of dynamic fields and schema-less mode

2015-07-18 Thread Charlie Hubbard
So I want to allow people to upload any CSV/XML/JSON to solr they want so having a predefined schema isn't going to cut it. After reading about my options I figured my choices were schema-less mode and dynamic fields using the * with a type other than ignore. I know the docs say schema-less

Re: Issue with using createNodeSet in Solr Cloud

2015-07-18 Thread Savvas Andreas Moysidis
Thanks Eric, The strange thing is that although I have set the log level to ALL I see no error messages in the logs (apart from the line saying that the response is a 400 one). I'm quite confident the configset does exist as the collection gets created fine if I don't specify the createNodeSet

Re: Delete Documents

2015-07-18 Thread Nagasharath
Cool, just curious Thanks Eric Sent from my iPhone On 18-Jul-2015, at 10:23 am, Erick Erickson erickerick...@gmail.com wrote: No idea what you mean by chance of these deleted docs getting re-indexed. Solr shouldn't be doing this by itself. Certainly if your indexing process sends them in

Re: solr blocking and client timeout issue

2015-07-18 Thread Jeremy Ashcraft
I will try that. traffic is pretty dead over the weekend, so I probably won't be able to tell if its effective or not until monday. thanks again! On 7/18/2015 8:16 AM, Shawn Heisey wrote: On 7/18/2015 12:42 AM, Jeremy Ashcraft wrote: I turned on GC logging and verified that its definitely

Re: Help me under the pros/cons of dynamic fields and schema-less mode

2015-07-18 Thread Erick Erickson
bq: So I want to allow people to upload any CSV/XML/JSON to solr they want so having a predefined schema isn't going to cut it Piling on to Shawn's excellent comments I would really advise agains this. Sure, you could make everything a text field using the * catch-all, but. If a field

Re: Issue with using createNodeSet in Solr Cloud

2015-07-18 Thread Erick Erickson
Could you post your clusterstate.json? Or at least the live nodes section of your ZK config? (adminUIcloudtreelive_nodes. The addresses of my nodes are things like 192.168.1.201:8983_solr. I'm wondering if you're taking your node names from the information ZK records or assuming it's 127.0.0.1

Re: Issue with using createNodeSet in Solr Cloud

2015-07-18 Thread Erick Erickson
P.S. It ain't the things ya don't know that'll kill ya, it's the things ya _do_ know that ain't so... On Sat, Jul 18, 2015 at 12:46 PM, Erick Erickson erickerick...@gmail.com wrote: Could you post your clusterstate.json? Or at least the live nodes section of your ZK config?

Basic auth

2015-07-18 Thread solr . user . 1507
SOLR-4470 is about: Support for basic auth in internal Solr requests. What is wrong with the internal requests? Can someone help simplify, would it ever be possible to run with basic auth? What work arounds? Regards