Fwd: Am I allowed to generate, enhance and republish a JavaDoc of an Apache project?

2014-03-06 Thread Alexandre Rafalovitch
I asked this on Apache legal list but got no reply. So, I thought I'll try again for the group it will affect directly (project not mentioned below is Solr). Any opinion on legality, usefulness or possibly underlying causes of the original problem would be appreciated. Regards, Alex.

Re: Fwd: Am I allowed to generate, enhance and republish a JavaDoc of an Apache project?

2014-03-07 Thread Alexandre Rafalovitch
...@elyograg.org wrote: On 3/6/2014 8:42 PM, Alexandre Rafalovitch wrote: I asked this on Apache legal list but got no reply. So, I thought I'll try again for the group it will affect directly (project not mentioned below is Solr). Any opinion on legality, usefulness or possibly underlying

Any reason Solr Jira still lists 4.7 as unreleased?

2014-03-14 Thread Alexandre Rafalovitch
I was doing some searching on issues and noticed 4.7 is listed in Unreleased versions. Also, I have a couple of open issues that (possibly due to my mistake) are marked as open but target at 4.7. Was not sure if this is a process that normally lags the actual version release or something to

Re: Analyzing primitive types, why can't we do this in Solr?

2014-03-20 Thread Alexandre Rafalovitch
Do you mean like: http://lucene.apache.org/solr/4_6_1/solr-core/org/apache/solr/update/processor/ParseDateFieldUpdateProcessorFactory.html ? https://github.com/apache/lucene-solr/blob/lucene_solr_4_7_0/solr/example/example-schemaless/solr/collection1/conf/solrconfig.xml#L1570 Regards, Alex.

Re: Google Summer of Code

2014-03-20 Thread Alexandre Rafalovitch
What does it take to be a mentor? I have a couple of Solr ideas I would be happy to mentor someone on. But do mentors have to sign agreements, be part of Apache formally, etc? Regards, Alex. Personal website: http://www.outerthoughts.com/ LinkedIn:

Re: Analyzing primitive types, why can't we do this in Solr?

2014-03-20 Thread Alexandre Rafalovitch
what I _thought_ I remembered! It looks like I found all the extends for UpdateProcessorFactory, but didn't follow the chain through FieldMutatingUpdateProcessorFactory which would have found that one for me. Siiihhh. Thanks again, Erick On Thu, Mar 20, 2014 at 5:44 PM, Alexandre

Re: Analyzing primitive types, why can't we do this in Solr?

2014-03-20 Thread Alexandre Rafalovitch
outline of what's there similar to the Analyzers and Tokenizers page If I find the time... Siigggh. Erick On Thu, Mar 20, 2014 at 5:53 PM, Alexandre Rafalovitch arafa...@gmail.com wrote: That chain issue is exactly why I built the web page above. That, plus the Javadoc links all over

Re: Google Summer of Code

2014-03-21 Thread Alexandre Rafalovitch
And I think I missed the dates too anyway. Oh well, maybe I can get into that position for next year. Something to target (committer status and all). :-) Thanks, Alex. Personal website: http://www.outerthoughts.com/ LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch - Time is the

Solr: Are tokenizers/analysers/filters self descriptory?

2014-03-22 Thread Alexandre Rafalovitch
Hello, I am thinking of new interfaces to compose analyzer chains. And I already have all the valid analyzers/tokenizers/filters/etc ( http://www.solr-start.com/info/analyzers/ ). But I just realized that I also need the parameters that actually define the behavior. Normally, they are defined in

Wiki: Solr API Javadoc magic redirector

2014-03-27 Thread Alexandre Rafalovitch
Hello, On the following page, there is a Javadoc link that redirects to 404: https://wiki.apache.org/solr/DataImportHandler#EventListeners The link is: http://lucene.apache.org/solr/api/org/apache/solr/handler/dataimport/EventListener.html It redirects (incorrectly) to:

RE: Wiki: Solr API Javadoc magic redirector

2014-03-27 Thread Alexandre Rafalovitch
that I'm not aware of. Anyone have any other ideas? Cassandra On Thu, Mar 27, 2014 at 4:26 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: Hello, On the following page, there is a Javadoc link that redirects to 404: https://wiki.apache.org/solr/DataImportHandler#EventListeners

Re: Solr Ref Guide vs. Wiki

2014-04-04 Thread Alexandre Rafalovitch
+1 on consolidating to the Reference Guide and figuring out the way to make wiki a lot less visible. But for a completely different set of reasons than discussed already. [[rant-start]] I think an interesting side-effect issue here is user perception. I feel that ElasticSearch (yet, them) get a

Solr components' src points to branch, should it be tag?

2014-04-06 Thread Alexandre Rafalovitch
Hello, I am looking at Solr JMX UI (In Web Admin) and I notice that the source code links are pointing at the source code under the 4.7 branch. Except I am running 4.7.1, so this is both confusing and somewhat misleading if the user is actually trying to troubleshooting source code. Should the

Re: Solr Ref Guide vs. Wiki

2014-04-06 Thread Alexandre Rafalovitch
On Sun, Apr 6, 2014 at 5:12 PM, Grant Ingersoll gsing...@apache.org wrote: While I somewhat agree with both the points of Furkan and Alexandre, I am not sure which way you are leaning: If somebody had time/money/permission, that's what I would do 1. Migrate wiki to an archive area in bulk and

Re: Solr Ref Guide vs. Wiki

2014-04-07 Thread Alexandre Rafalovitch
Follow-up in-line. On Mon, Apr 7, 2014 at 1:35 PM, Shalin Shekhar Mangar shalinman...@gmail.com wrote: On Mon, Apr 7, 2014 at 9:58 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: On Sun, Apr 6, 2014 at 5:12 PM, Grant Ingersoll gsing...@apache.org wrote: 2. Mark it as Google non-crawlable

Re: Solr Ref Guide vs. Wiki

2014-04-07 Thread Alexandre Rafalovitch
Right. Now I just need to become a committer. :-) Regards, Alex On 07/04/2014 8:01 pm, Grant Ingersoll gsing...@apache.org wrote: On Apr 6, 2014, at 11:35 PM, Shalin Shekhar Mangar shalinman...@gmail.com wrote: 7. Setup proper analytics (is there any?), so we could at least tell what

Re: 4.7.2

2014-04-08 Thread Alexandre Rafalovitch
Let's hope nobody is trying to finish any books right now. :-) Personal website: http://www.outerthoughts.com/ Current project: http://www.solr-start.com/ - Accelerating your Solr proficiency On Tue, Apr 8, 2014 at 1:55 PM, Simon Willnauer simon.willna...@gmail.com wrote: +1 to both 4.7.3 and

Open Source Gardening - Lucene/Solr relevance?

2014-04-16 Thread Alexandre Rafalovitch
I don't normally repost to the mailng lists, but this is better than my own attempts :-) http://words.steveklabnik.com/how-to-be-an-open-source-gardener I would be curious to know how this topic applies to Lucene/Solr. I have a feeling that the old-timers have a particular

Why does Solr bundle all the licences

2014-04-16 Thread Alexandre Rafalovitch
Hello, I am curious what the legal reason for bundling all the .sha and license files (plus the NOTICE.txt plus the LICENSE.txt)? I am looking at ElasticSearch by comparison and they have only one. Even when they download something like Tika as a plugin, they do not seem download the license

Re: Why does Solr bundle all the licences

2014-04-16 Thread Alexandre Rafalovitch
Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Alexandre Rafalovitch [mailto:arafa...@gmail.com] Sent: Wednesday, April 16, 2014 1:57 PM To: dev@lucene.apache.org Subject: Why does Solr bundle all the licences Hello, I am curious what the legal reason

Re: Why does Solr bundle all the licences

2014-04-16 Thread Alexandre Rafalovitch
.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Alexandre Rafalovitch [mailto:arafa...@gmail.com] Sent: Wednesday, April 16, 2014 2:08 PM To: dev@lucene.apache.org Subject: Re: Why does Solr bundle all the licences Even

Solr 5 Jetty going away - which JIRA?

2014-04-17 Thread Alexandre Rafalovitch
I've read a couple of times now that Jetty/Tomcat/Container is going away in Solr 5. Can somebody tell me where to check the detailed discussion about that. Is that a mailing list thread? A JIRA? Something else? I found SOLR-5091, but it does not feel quite right. Regards, Alex. Personal

Solr: Serving Javadoc from Jetty server

2014-04-17 Thread Alexandre Rafalovitch
Hello, The binary Solr distribution includes Javadoc, but it just sits there. I just tested adding second Jetty context that makes that Javadoc served under /javadoc handle. I think it is useful as sometimes Javadoc breaks when it is loaded from local filesystem (I think), plus it opens up

Re: Solr: Serving Javadoc from Jetty server

2014-04-17 Thread Alexandre Rafalovitch
, 2014 at 6:54 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: Hello, The binary Solr distribution includes Javadoc, but it just sits there. I just tested adding second Jetty context that makes that Javadoc served under /javadoc handle. I think it is useful as sometimes Javadoc breaks when

Re: CJKBigramFilter - position bug with outputUnigrams?

2014-04-20 Thread Alexandre Rafalovitch
Have you looked at commercial offerings? At some point, it becomes an ROI issue. If it is becoming such a serious issue: http://www.basistech.com/text-analytics/rosette/base-linguistics/asian-languages/ Regards, Alex. P.s. This is a link, not a recommendation. I haven't tested either their

Re: Solr Admin UI and SolrCloud

2014-05-13 Thread Alexandre Rafalovitch
I think there was a discussion/JIRA on moving to AngularJS (or ReactJS?). Maybe this should be a part of that discussion. What is the process for discussion UI? Was this a heroic effort by one/two individual or was there a subgroup of some sort? Regards, Alex. Personal website:

Re: Unrelated Features within Data at Solr

2014-05-22 Thread Alexandre Rafalovitch
Can you have a custom UpdateRequestProcessor in the chain that just skips processing such requests? Regards, Alex. Personal website: http://www.outerthoughts.com/ Current project: http://www.solr-start.com/ - Accelerating your Solr proficiency On Thu, May 22, 2014 at 6:34 PM, Furkan KAMACI

Request for discussion: Solr plugins

2014-05-28 Thread Alexandre Rafalovitch
Hi, I would like to (re-)initiate a discussion about Solr support for plugin life-cycle (publish, discover, download, dependency management). Triggered by a discussion on the Solr mailing list: http://search-lucene.com/m/QTPaIv50e1subj=Re+Contribute+QParserPlugin My main points: 1)

Re: Request for discussion: Solr plugins

2014-05-29 Thread Alexandre Rafalovitch
On Thu, May 29, 2014 at 1:17 PM, Shawn Heisey s...@elyograg.org wrote: On 5/28/2014 10:01 PM, Alexandre Rafalovitch wrote: 5) Solr, frankly, is getting rather pudgy. Or possibly beyond mere pudgy. This is becoming especially noticeable by comparison with ElasticSearch but also

Re: Querying all docs

2014-06-03 Thread Alexandre Rafalovitch
I guess, there are two questions there: 1) Does SOLR allocate memory based on requested rows (not actual one)? 2) Is there a way to get all the records? For the second question, would paging with deep-paging support (added in 4.8) solve the problem in the meantime? Or is use-case more specific

Re: Lucene/Solr 5?

2014-06-05 Thread Alexandre Rafalovitch
I think Solr 5 should be a modularized deploy with plugins and plugin registry. Either that or we will have to steal the Elephant logo from the sister project and have to setup torrent-based software distribution. And I really wish HUE was a Java-based project. But, alas, Regards, Alex.

Re: Trappy behavior with default search field

2014-06-05 Thread Alexandre Rafalovitch
How about a warning section in Admin UI that shows possible issues with the configuration. Could start with something absolutely basic like missing default field (we can detect that from configuration, right?) but I am sure there are other issues we could think of. Regards, Alex. Personal

Adding Morphline support to DIH - worth the effort?

2014-06-07 Thread Alexandre Rafalovitch
Hello, I had a preliminary look around and it might be possible to plug Morphline (already shipped with Solr) into DIH by creating a bridging EntityProcessor. Two questions: 1) Do people see value in it? 2) DIH is not very supported, so any addition seems to be a bit stuck in rickety bridge,

Re: Adding Morphline support to DIH - worth the effort?

2014-06-07 Thread Alexandre Rafalovitch
Solr proficiency On Sun, Jun 8, 2014 at 12:23 PM, Shalin Shekhar Mangar shalinman...@gmail.com wrote: I do not know much about morphlines but I'd like to know what use-cases would be possible/easier/faster with such an integration? On Sun, Jun 8, 2014 at 10:32 AM, Alexandre Rafalovitch arafa

Re: Adding Morphline support to DIH - worth the effort?

2014-06-08 Thread Alexandre Rafalovitch
consider DIH a solid foundation or a weak link in Solr that desperately needs firming up. -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Sunday, June 8, 2014 1:40 AM To: dev@lucene.apache.org Subject: Re: Adding Morphline support to DIH - worth the effort

Re: Adding Morphline support to DIH - worth the effort?

2014-06-08 Thread Alexandre Rafalovitch
Message- From: Alexandre Rafalovitch Sent: Sunday, June 8, 2014 10:16 AM To: dev@lucene.apache.org Subject: Re: Adding Morphline support to DIH - worth the effort? I see DIH as something that offers a quick way to get things done, as long as they fit into DIH's couple of basic scenarios

Re: Adding Morphline support to DIH - worth the effort?

2014-06-10 Thread Alexandre Rafalovitch
-Original Message- From: Alexandre Rafalovitch Sent: Sunday, June 8, 2014 1:40 AM To: dev@lucene.apache.org Subject: Re: Adding Morphline support to DIH - worth the effort? Well, it's the same core scenario as DIH supports (apart from actual data sources), but actively supported

Re: solr magento e-commerce plugin/extension

2014-11-19 Thread Alexandre Rafalovitch
I have only two resources on this (both for Magento): *) http://www.magentocommerce.com/magento-connect/solr-bridge-search.html *) http://inchoo.net/ecommerce/install-apache-solr/ (has links in the comments) Regards, Alex Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and

Re: Solr with LAMP/ XAMPP/ WAMP/MAMP

2014-11-19 Thread Alexandre Rafalovitch
On 19 November 2014 13:21, Shawn Heisey apa...@elyograg.org wrote: I *do* think it might be a good idea for us to write and maintain supported Solr clients for languages beyond Java I believe this has been announced as one of the focus items at the Lucene/Solr Revolution. Including the call to

Re: Lucene ancient greek normalization

2014-11-21 Thread Alexandre Rafalovitch
Are you sure that's not something that's already addressed by the ICU Filter? http://www.solr-start.com/javadoc/solr-lucene/org/apache/lucene/analysis/icu/ICUTransformFilterFactory.html If you follow the links to what's possible, the page talks about Greek, though not ancient:

Testing Solr 5

2014-11-21 Thread Alexandre Rafalovitch
Hi, I am writing something that - will - depend on Solr 5. As I usually work with released versions, I am not entirely sure of the correct workflow. I can check out branch_5x and do my research against that. I assume that's the correct source for what will land in version 5. But if I find an

Re: Lucene ancient greek normalization

2014-11-21 Thread Alexandre Rafalovitch
On 21 November 2014 16:10, paolo anghileri paolo.anghil...@codegeneration.it wrote: The need is being able to search with simple strings without grammatical details and retrieve data with grammatical details. I am pretty sure that this is what I did for a Thai dome. Actually, I went another two

Re: Testing Solr 5

2014-11-21 Thread Alexandre Rafalovitch
as possible. -- Jack Krupansky -Original Message- From: Shawn Heisey Sent: Friday, November 21, 2014 3:36 PM To: dev@lucene.apache.org Subject: Re: Testing Solr 5 On 11/21/2014 1:14 PM, Alexandre Rafalovitch wrote: I am writing something that - will - depend on Solr 5. As I usually

Solr 5: Is anybody working on better basic example/tutorial?

2014-11-22 Thread Alexandre Rafalovitch
Hello, I can't find a relevant Jira/discussion space if this exists. I strongly feel that the basic example is still far from basic and there needs to be a subgroup of people discussing of what can be cut-off to demonstrate a true minimal configuration. I am happy to take a lead on that if

Re: Solr 5: Is anybody working on better basic example/tutorial?

2014-11-23 Thread Alexandre Rafalovitch
://www.linkedin.com/groups?gid=6713853 On 22 November 2014 at 17:06, Shawn Heisey apa...@elyograg.org wrote: On 11/22/2014 10:41 AM, Alexandre Rafalovitch wrote: I can't find a relevant Jira/discussion space if this exists. I strongly feel that the basic example is still far from basic

Newbie help: How do I debug?

2014-11-23 Thread Alexandre Rafalovitch
So, I think I found a bug in Solr 5 build: curl http://localhost:8983/solr/schemaless/select?indent=trueechoParams=all; - returns echoParams twice. So, how do I debug it? Enable remote debugging on Jetty and trace it through the 1024 levels of indirections? Grab a specific test case (which one?)

Re: solr client sdk's/libraries for native platforms

2014-11-24 Thread Alexandre Rafalovitch
Well, a start would be to actually have an up-to-date list of Solr clients. I have the list, if somebody knows where it should go (Ref Guide). I don't want to contribute this to WIKI as we are trying to get rid of it. Then somebody (Summer of Code project?) would derive from that a list of

Re: solr client sdk's/libraries for native platforms

2014-11-24 Thread Alexandre Rafalovitch
Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Mon, Nov 24, 2014 at 10:35 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: Well, a start would be to actually have an up-to-date list of Solr clients. I have the list, if somebody knows where it should go

Re: solr client sdk's/libraries for native platforms

2014-11-24 Thread Alexandre Rafalovitch
community driven since things change so quickly. Eric On Nov 24, 2014, at 10:35 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: Well, a start would be to actually have an up-to-date list of Solr clients. I have the list, if somebody knows where it should go (Ref Guide). I don't want

Re: Newbie help: How do I debug?

2014-11-24 Thread Alexandre Rafalovitch
On 24 November 2014 at 10:56, Erick Erickson erickerick...@gmail.com wrote: It's certainly something of a challenge when you're totally unfamiliar with the relevant code. And then the next one you tackle means you start the process all over again ;). That does not bode well, since I am

Re: Where is the SVN repository only for Lucene project ?

2014-11-26 Thread Alexandre Rafalovitch
With Git and GitHub it is possible to do a shallow fetch which will only get the files without much history. Maybe with SVN as well, but I haven't tried. Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and newsletter: http://www.solr-start.com/ and

Re: solr client sdk's/libraries for native platforms

2014-12-01 Thread Alexandre Rafalovitch
On 1 December 2014 at 09:51, Jan Høydahl jan@cominvent.com wrote: I once asked the SolrNET developers if they would like to move their effort to Apache and become a certified client library, but the response was lukewarm.

Re: solr client sdk's/libraries for native platforms

2014-12-01 Thread Alexandre Rafalovitch
On 1 December 2014 at 10:02, david.w.smi...@gmail.com david.w.smi...@gmail.com wrote: I meant to reply earlier... On Mon, Nov 24, 2014 at 11:37 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: They are super-stale Yup but it’s a wiki so feel free to freshen it up. I’ll be doing

Re: solr client sdk's/libraries for native platforms

2014-12-01 Thread Alexandre Rafalovitch
...@opensourceconnections.com wrote: I think in the vein of a “do-it-tocracy”, getting the Wiki updated is a perfectly good first step, and then if there is a better approach, hopefully that occurs.… ;-) On Dec 1, 2014, at 10:51 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: On 1 December 2014

Re: solr client sdk's/libraries for native platforms

2014-12-02 Thread Alexandre Rafalovitch
! ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Mon, Dec 1, 2014 at 7:35 PM, Alexandre Rafalovitch arafa...@gmail.com wrote: What would be the reasonable cutoff for the client library last update? Say if it was not updated

Re: Running tests incrementally based on changed files

2014-12-02 Thread Alexandre Rafalovitch
A tangent, but a relevant one (to the issue of speed). Have you tried running the tests with Lucene/Solr code being in the RAM disk? I found that compiling source on RAMDisk is a lot faster than even with SSD drive. Must be just frequency of access. It might be the same with tests. Regards,

Re: solr client sdk's/libraries for native platforms

2014-12-03 Thread Alexandre Rafalovitch
? Eric On Dec 3, 2014, at 12:23 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: Ok, Done: https://wiki.apache.org/solr/IntegratingSolr Also: https://wiki.apache.org/solr/SolPython I am not sure what to do with the stuff at the bottom of the client list, though I've put the dates

Re: Running tests incrementally based on changed files

2014-12-03 Thread Alexandre Rafalovitch
On 2 December 2014 at 18:10, Shawn Heisey apa...@elyograg.org wrote: The test dependency tree for this particular class is *EXTENSIVE* and not very easy to track down. How about a black box approach? Run each test individually once with a debugging interface enabled and the code that records

Re: Adding Morphline support to DIH - worth the effort?

2014-06-11 Thread Alexandre Rafalovitch
On Thu, Jun 12, 2014 at 7:43 AM, Wolfgang Hoschek whosc...@cloudera.com wrote: On Hadoop, even the JDBC/SQL portion of DIH now seems mostly covered by a combination of Sqoop and MapReduceIndexerTool, and perhaps a bit of Hive. I appreciate that if you are in the Big Data space, you already

Re: Language detection for solr 3.6.1

2014-07-07 Thread Alexandre Rafalovitch
I've had an example in my book: https://github.com/arafalov/solr-indexing-book/blob/master/published/languages/conf/solrconfig.xml , though it was for Solr 4.2+. Solr in Action also has a section on multilingual indexing. There is no generic advice, as everybody seems to have slightly different

Hints on constructing/running Solr analyzer chains standalone

2014-07-12 Thread Alexandre Rafalovitch
Hello, I am interested in creating and running Solr analyzer chains outside of normal process (no live Solr). Just construct a chain, feed it tokens and see what happens. I would appreciate any hints on what that takes and whether there are any hidden/weird dependencies (e.g. for resource

Re: Hints on constructing/running Solr analyzer chains standalone

2014-07-12 Thread Alexandre Rafalovitch
wrote: Tracing through indexing or query parsing is... a challenge. Start with something simpler like the analysis admin API. See: http://lucene.apache.org/solr/4_9_0/solr-core/org/apache/solr/handler/FieldAnalysisRequestHandler.html -- Jack Krupansky -Original Message- From: Alexandre

Re: Hints on constructing/running Solr analyzer chains standalone

2014-07-12 Thread Alexandre Rafalovitch
at 6:59 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: Hello, I am interested in creating and running Solr analyzer chains outside of normal process (no live Solr). Just construct a chain, feed it tokens and see what happens. I would appreciate any hints on what that takes

Re: Hints on constructing/running Solr analyzer chains standalone

2014-07-12 Thread Alexandre Rafalovitch
. Of course _you're_ the one doing the work, so whatever you think best. Erick On Sat, Jul 12, 2014 at 10:16 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: I don't want to read the schema.xml, but I do want to create factories using the same parameters they use in schema. So

Why does Solr binary distribution include test-framework?

2014-07-22 Thread Alexandre Rafalovitch
Hello, What is the logic/benefit of shipping test framework with Solr distribution? Is something actually using it outside of build/test cycle? It's 12Mb of libraries and documentations. Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources:

Re: Running tests incrementally based on changed files

2014-12-04 Thread Alexandre Rafalovitch
: https://www.linkedin.com/groups?gid=6713853 On 3 December 2014 at 01:14, Shawn Heisey apa...@elyograg.org wrote: On 12/2/2014 10:53 PM, Alexandre Rafalovitch wrote: A tangent, but a relevant one (to the issue of speed). Have you tried running the tests with Lucene/Solr code being in the RAM disk

Re: solr client sdk's/libraries for native platforms

2014-12-04 Thread Alexandre Rafalovitch
://www.linkedin.com/in/davidwsmiley On Wed, Dec 3, 2014 at 8:36 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: +1 on merging those two. But also needs a bit of a 'design' of what goes into it. I have probably another 30 links of various Solr-related products. I didn't touch SolrPython

Minimum test set for idempotent changes in schema.xml

2014-12-06 Thread Alexandre Rafalovitch
Hi, I want to do some minor cleanup in (multiple) schema.xml. What's the minimum set of tests I need to run before I submit a patch? 'ant test' seems to be a heavy thing. Is there a way to say, just schema-related ones? Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov

Re: Minimum test set for idempotent changes in schema.xml

2014-12-07 Thread Alexandre Rafalovitch
/Developer http://www.linkedin.com/in/davidwsmiley On Sat, Dec 6, 2014 at 2:45 PM, Alexandre Rafalovitch arafa...@gmail.com wrote: Hi, I want to do some minor cleanup in (multiple) schema.xml. What's the minimum set of tests I need to run before I submit a patch? 'ant test' seems

Re: what is the rule for updating CHANGES.txt

2014-12-07 Thread Alexandre Rafalovitch
Hi Michael, Awesome to see you are still working on that tool. It's enough outside of the box, I can see it can cause other people without context to become confused. But I think we will learn something from it, whether it will ultimately work perfectly or not. On the specific question: I was

Re: ref guide cleanup: (example - server) + (bin/solr -e) + (no more war) - (implicit collection)

2014-12-10 Thread Alexandre Rafalovitch
How are we doing the feedback? On the list? On the page in comments? Before looking too hard at the links (will later): 1) If we are talking about solrconfig.xml - there are now general properties that apply to multiple update handlers at once. And multiple of them may do that in order of the

What's the purpose of scripts.conf in the release?

2014-12-20 Thread Alexandre Rafalovitch
I just found that DIH and techproducts examples ship with scripts.conf file in the conf directory (4.x, 5 and trunk). It does not seem to be used at all anywhere in the packaged version. The only mention I seem to find is in the scripts-util in the _source_ directory where it is used as some

Re: how to do auto suggestion using apache lucene?

2014-10-01 Thread Alexandre Rafalovitch
Any reason it's Lucene directly and not Solr or ElasticSearch? Here is an example with Solr, Spring Data and Select2: https://github.com/arafalov/Solr-Javadoc/tree/master/SearchServer (the first version was built from scratch in 3 hours). Regards, Alex. Personal:

Re: how to do auto suggestion using apache lucene?

2014-10-01 Thread Alexandre Rafalovitch
popularizers community: https://www.linkedin.com/groups?gid=6713853 On 1 October 2014 12:18, david.w.smi...@gmail.com david.w.smi...@gmail.com wrote: On Wed, Oct 1, 2014 at 9:19 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: https://github.com/arafalov/Solr-Javadoc/tree/master

Google Summer of Code 2015

2014-10-07 Thread Alexandre Rafalovitch
Hello, Google Summery of Code 2015 has been announced. https://developers.google.com/open-source/soc/ I know it is quite far away, but I also seem to remember that last time we did not use it to as much advantage as possible. Is there some sort of organized effort around it for this year? I

Re: Moving to Java 7 on debian breaks ICU Normalizer2 on Solr 4.7.2 and Solr 4.10.1

2014-10-09 Thread Alexandre Rafalovitch
On 9 October 2014 13:39, Uwe Schindler u...@thetaphi.de wrote: What happens is explained here: https://issues.apache.org/jira/browse/SOLR-6188?focusedCommentId=14039350page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14039350 Is this one of the things that Solr 5

Re: An experience and some thoughts about solr/example - solr/server

2014-11-02 Thread Alexandre Rafalovitch
That's interesting. I did not realize we were going away from ElasticSearch on that. So, do we need to update the tutorial or some other super-obvious way of what the next step is? (I haven't checked). Because one difference between Solr and the Database is that create table is a standard SQL

History question: contribution from Solr to Lucene

2014-11-02 Thread Alexandre Rafalovitch
Hi, I am trying to understand what used to be in Solr pre-merge and got moved into Lucene packages after the projects merged. For example analyzers/tokenizers, were they always in Lucene or all originally in Solr? I am not sure where to check this quickly, so I am hoping people can do a short

Re: History question: contribution from Solr to Lucene

2014-11-02 Thread Alexandre Rafalovitch
on this subject, more or less: http://heliosearch.org/lucene-solr-history/ ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Sun, Nov 2, 2014 at 8:36 PM, Alexandre Rafalovitch arafa...@gmail.com wrote: Hi, I am trying

Re: CFP: FOSDEM 2015 - Open Source Search Dev Room

2014-11-03 Thread Alexandre Rafalovitch
May I suggest the next time they do it, they mention event date and location :-) It's 31st of January/1st Feb, Brussels if I found the right web page. Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and newsletter: http://www.solr-start.com/ and @solrstart

Is there going to be Lucene/Solr 4.11?

2014-10-12 Thread Alexandre Rafalovitch
Hello, I found two JIRAs I was interested in marked as merged in 4.x branch (SOLR-5097 and SOLR-5098). Got excited and then discovered they were merged after 4.10 and the info added to the 4.11 README section. Does it mean there _will_ be a Solr 4.11? I thought the discussion was that 4.10.x was

Re: Is there going to be Lucene/Solr 4.11?

2014-10-13 Thread Alexandre Rafalovitch
On 13 October 2014 10:37, Jack Krupansky j...@basetechnology.com wrote: Now... I'll have to admit that maybe there might be clarity among the Lucene dev/user crowd, but mostly I'm referring to the Solr user crowd, who aren't up on Lucene internals. I do admit having troubles envisioning the

Re: Is there going to be Lucene/Solr 4.11?

2014-10-13 Thread Alexandre Rafalovitch
On 13 October 2014 15:27, Shai Erera ser...@gmail.com wrote: Point is (at least on my part) - *our* (this community's) major releases are mostly about index backwards compatibility support. This is really interesting. I've read a lot of material on Solr and some on Lucene and I never saw the

Re: Next Solr release (5.0)

2014-10-15 Thread Alexandre Rafalovitch
On 15 October 2014 21:24, Yonik Seeley yo...@heliosearch.com wrote: I imagine Heliosearch will stick with the 4.10 branch for a little while I always thought the fun part about running my own company would be not needing to say I imagine X will happen and just being able to decisively say X

Re: [VOTE] Release 4.10.3 RC1

2014-12-25 Thread Alexandre Rafalovitch
What happened with 4.10.3? There seem to be a downloadable on the download archives, but the home page download link still talks about 4.10.2. Regards, Alex. Sign up for my Solr resources newsletter at http://www.solr-start.com/ On 15 December 2014 at 20:39, Mark Miller

Re: Solr 5 release

2015-01-23 Thread Alexandre Rafalovitch
I would do the following on the Solr's notes top: 1) Usability improvements 2) Distributed IDF 3) No war (though we do ship one anyway) 4) SolrJ support for collection 5) Stats component 6) New handlers/API all together as a block The rest. Just a weak opinion though. Regards, Alex.

Re: How to ant package lucene 4.10 branch

2015-02-04 Thread Alexandre Rafalovitch
I think you just need to check out the right branch. You definitely got the trunk there somehow. I'd start from scratch and make sure you are using SVN to check out the right branch from SVN (not GitHub). There is a read-only access point. That's what I am doing for 5.0 build and it's working.

Interesting resource for Unix shell script cleanup

2015-02-05 Thread Alexandre Rafalovitch
Hi, Just saw a link to http://www.shellcheck.net/ . I run Solr start script and it picked up a couple of interesting issues around variable escaping and deprecated shell commands. Is that something that's worth making JIRA about? Regards, Alex. Sign up for my Solr resources newsletter

Re: Exception while unmarshalling response in SolrJ

2015-01-18 Thread Alexandre Rafalovitch
It's very unlikely that you have exactly the same problem. There are too many unusual pieces in there. Could you perhaps ignore that email and restate your problem. Please include: *) Solr version *) SolrJ version, if different *) Exception stack trace *) Do you actually have a proxy in a middle

Re: Lucene/Solr 5.0 release branch created

2015-01-17 Thread Alexandre Rafalovitch
Could we make sure to fix SOLR-6960, please! Otherwise, Solr is basically lying about its configuration OOTB. Regards, Alex. Sign up for my Solr resources newsletter at http://www.solr-start.com/ On 17 January 2015 at 13:36, Anshum Gupta ans...@anshumgupta.net wrote: Thanks Noble. I'm

Re: SVN server certificate validation

2015-01-14 Thread Alexandre Rafalovitch
. I'll deal with certificates later. Thanks again, Alex. Sign up for my Solr resources newsletter at http://www.solr-start.com/ On 14 January 2015 at 01:18, Shawn Heisey apa...@elyograg.org wrote: On 1/13/2015 10:37 PM, Alexandre Rafalovitch wrote: This hasn't happened to me before

Re: Normailzing Job Title

2015-01-22 Thread Alexandre Rafalovitch
Why can't it be just a Solr search against the collection of titles (and counts)? It will manage all the approximate matches and ranking for you, you just need to tell it the rules (e.g. with mm parameter). Regards, Alex. Sign up for my Solr resources newsletter at

Re: Enforce reasonable field names in Solr?

2015-02-12 Thread Alexandre Rafalovitch
I wonder if the people who are using dynamic schema care about having the fields indexed without _them_ doing pre-processing, but don't mind if they have to use cleaned-up names during search. Like, when you index from Tika and you just have no clue what possible metadata names are in various

Re: Broken link on Solr web site for source code view

2015-01-10 Thread Alexandre Rafalovitch
https://issues.apache.org/jira/browse/SOLR-6824 Still not fixed, I guess. Regards, Alex. Sign up for my Solr resources newsletter at http://www.solr-start.com/ On 10 January 2015 at 17:50, Jack Krupansky jack.krupan...@gmail.com wrote: On this page:

Re: [Possibly spoofed] Re: Anybody having troubles building trunk?

2015-01-08 Thread Alexandre Rafalovitch
certificate. I checked the certificate on the site and switched back to using https and it seems to be fine now... Regards, Luc -Original Message- From: Alexandre Rafalovitch [mailto:arafa...@gmail.com] Sent: donderdag 8 januari 2015 6:32 To: dev@lucene.apache.org Subject

Any way to tell number of documents commited but not visible?

2015-01-08 Thread Alexandre Rafalovitch
Hi, I am trying to check if there are any documents in Solr but they are not visible yet. If there were no commit at all, I know I can see it in the stats for UpdateHandler under @docsPending. But if there was a hard commit with openSearcher=false (as per example configuration), then that

Re: Apache Lucene/Solr 5.0 (alpha)?

2015-01-06 Thread Alexandre Rafalovitch
+1 (If I get to vote :-) ) 5.0 is looking like a lot bigger deal on the usability features than when the original jump was discussed several months ago. It would be nice to have users to actually try those features before RC1 of Final. Which requires some articles to drive adoption, etc.

SVN server certificate validation

2015-01-13 Thread Alexandre Rafalovitch
This hasn't happened to me before. Building (new) 5.0 branch: package-src-tgz: [exec] Error validating server certificate for 'https://svn.apache.org:443': [exec] - The certificate is not issued by a trusted authority. Use the [exec]fingerprint to validate the certificate

Re: Any way to tell number of documents commited but not visible?

2015-01-13 Thread Alexandre Rafalovitch
On 13 January 2015 at 18:50, Chris Hostetter hossman_luc...@fucit.org wrote: once the segment is committe written to desk, w/o a reopen on the searcher, there isn't any object in Solr/Lucene keeping a record of hte number of docs in that segment - something would need to open that segment to

Re: Lucene/Solr 5.0 release branch created

2015-01-13 Thread Alexandre Rafalovitch
Do you mean, svn_5x is now 5.1? Trunk is 6, right. Regards, Alex. Sign up for my Solr resources newsletter at http://www.solr-start.com/ On 13 January 2015 at 21:48, Anshum Gupta ans...@anshumgupta.net wrote: Also, I forgot to mention, the version# has been bumped on 5x and trunk to

  1   2   3   4   5   6   7   8   9   10   >