Re: VelocityResponseWriter's future

2011-12-27 Thread Jan Høydahl
patches to fix bugs and make it more usable. So unless a new and better alternative is in already in place (I love the idea of AJAX-ifying things), I vote for keeping VRW, but lazy loading it not to annoy people copying example/ around. -- Jan Høydahl, search solution architect Cominvent

Re: Looking for a good commit/merge strategy

2011-12-27 Thread Jan Høydahl
Have a look at http://wiki.apache.org/solr/NearRealtimeSearch which will help you (in TRUNK/4.0) with an efficient in-memory handling of NRT changes. Combine this with CommitWithin for persisting to disk: http://wiki.apache.org/solr/CommitWithin. -- Jan Høydahl, search solution architect

Re: Custom content extractor for Solr Cell

2011-12-27 Thread Jan Høydahl
contents or similar. If you want to override the behaviour of Tika's parsing of certain MIME types, you can do this by specifying -Dtika.config=path-to-your-tika-config when starting Solr (3.5 or later), and it will obey your config. See Tika's web page for how to write your own parsers. -- Jan Høydahl

Re: lower score for synonyms

2011-12-27 Thread Jan Høydahl
Hi, Also see discussion in https://issues.apache.org/jira/browse/LUCENE-3130 for possible future way to do this with one field. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 6. des. 2011, at 13:47, Marc SCHNEIDER wrote

Re: spellcheck-index is rebuilt on commit

2012-01-02 Thread Jan Høydahl
Olivier, your log snippets did not make it into the mail. I think the mailing list strips attachments. Did you reload core or restart Jetty/Tomcat after your changes? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 2. jan. 2012

Re: spellcheck-index is rebuilt on commit

2012-01-02 Thread Jan Høydahl
. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 2. jan. 2012, at 22:22, Simon Willnauer wrote: hey, is it possible that during those commits nothing has changed in the index? I mean are you committing nevertheless there are changes

Re: Using SOLR Autocomplete for addresses (i.e. multiple terms)

2012-01-03 Thread Jan Høydahl
tokens and weigh these as you choose, as well as implement phonetic normalization and other useful tricks. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 3. jan. 2012, at 00:52, Dave wrote: Hi, I'm reposting my StackOverflow

Re: Sorting results within the fields

2012-01-17 Thread Jan Høydahl
? Hope you see that being concrete is necessary for such questions. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 17. jan. 2012, at 19:38, aronitin wrote: It's been almost a week and there is no response to the question that I

Re: Facet auto-suggest

2012-01-17 Thread Jan Høydahl
would disable buttons with 0 hits in the facet count. For more in depth, see http://java.dzone.com/news/complex-solr-faceting -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 17. jan. 2012, at 23:38, Jon Drukman wrote: I don't even

Re: index-time over boosted

2012-01-18 Thread Jan Høydahl
long fields. Perhaps you instead can try to tell us your use-case. What kind of raning are you trying to achieve? Then we can help suggest how to get there. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com

Re: Takes a while to see changes in data even after comit

2012-01-18 Thread Jan Høydahl
Hi, What Solr version? How many docs? What do you use as qutowarm count? If it's too high, it may take time. Do you use spellcheck and buildOnCommit? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 18. jan. 2012, at 23:45

Re: How to boost the relevancy of a field

2012-01-18 Thread Jan Høydahl
And using dismax query parser makes this easier: http://wiki.apache.org/solr/DisMaxQParserPlugin Example: q=solrdefType=edismaxqf=title^10 body^0.5 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 19. jan. 2012, at 01:29

Re: conditional field weighting

2012-01-19 Thread Jan Høydahl
weight :) Because of the TIE parameter, other matching fields will score 0.5X -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 19. jan. 2012, at 05:00, Jack Kanaska wrote: Hi Tim, Unfortunately that's not what I am looking

Re: index-time over boosted

2012-01-19 Thread Jan Høydahl
Hi, Can you paste exactly both fieldType and field definitions from your schema? omitNorms=true should kill norms. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 19. jan. 2012, at 08:18, remi tassing wrote: Hi, just

Re: Tika0.10 language identifier in Solr3.5.0

2012-01-19 Thread Jan Høydahl
into the search box. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 19. jan. 2012, at 09:22, nibing wrote: Hi, all, I am using Solr3.5.0 which applies Tika0.10 to do language detection, and I have a couple of questions

Re: index-time over boosted

2012-01-19 Thread Jan Høydahl
. For your search application to work well you will have to invest some time in designing a schema, working with your queries, perhaps exploring DisMax query parser etc etc. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 19. jan

Re: Takes a while to see changes in data even after comit

2012-01-19 Thread Jan Høydahl
Hi, Try lowering your autowarm to, say, 25, and see if it helps. How often do you call commit? If you have too much warming so it takes longer time than time between commits, you're lost... You can check the stats admin page to see the autowarm time. -- Jan Høydahl, search solution architect

Re: 3.5.0 troubles

2012-01-19 Thread Jan Høydahl
Shouldn't it be literal.uid=foo, not ext.literal.uid ?? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 19. jan. 2012, at 23:08, Wayne W wrote: HI, I'm trying to setup the latest version of Solr. Currently we're running 1.3

Re: How to import data from xml files to solr

2012-01-20 Thread Jan Høydahl
search result XML through XSLT update handler. See http://dl.dropbox.com/u/20080302/updateXml.xsl The curl command is: curl http://localhost:8983/solr/update/xslt?commit=truetr=updateXml.xsl; -H Content-Type: text/xml; charset=utf-8 --data-binary @searchres.xml -- Jan Høydahl, search solution

Re: Tika0.10 language identifier in Solr3.5.0

2012-01-20 Thread Jan Høydahl
of reduction or stemming - then you can easily support full linguistics for 100 languages, indexed in the same field. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 20. jan. 2012, at 18:15, Ted Dunning wrote: I think you

Re: Filtering search results by an external set of values

2012-01-23 Thread Jan Høydahl
it yourself or have some framework/plugin to it for you, see http://wiki.apache.org/solr/SolrSecurity#Document_Level_Security -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 23. jan. 2012, at 11:49, John, Phil (CSS) wrote: Hi

Re: ExractionHandler/Cell ignore just 2 fields defined in schema 3.5.0

2012-01-23 Thread Jan Høydahl
://wiki.apache.org/solr/ExtractingRequestHandler#Input_Parameters -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 23. jan. 2012, at 22:26, Wayne W wrote: Hi, Im been trying to figure this out now for a few days and I'm just

Re: hot deploy of newer version of solr schema in production

2012-01-24 Thread Jan Høydahl
is to do a more thorough upgrade of schema including new functionality and breaking changes, and then do a full reindex. The exception is if my index is huge and the reason for Solr upgrade or schema change is to fix a bug, not to use new functionality. -- Jan Høydahl, search solution architect

Re: index-time over boosted

2012-01-24 Thread Jan Høydahl
That looks right. Can you restart your Solr, do a new search with debugQuery=true and copy/paste the full EXPLAIN output for your query? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 24. jan. 2012, at 13:22, remi tassing wrote

Re: index-time over boosted

2012-01-24 Thread Jan Høydahl
/solr/conf/schema.xml and add omitNorms=true to these two fields. Then stop Solr, delete your index, start Solr, re-index the docs and try again. fieldNorm is now 1.0. Once you get that working you can start debugging where you got it wrong in your own setup. -- Jan Høydahl, search solution

Re: Difference between #indexed documents and #results in *:* query

2012-01-25 Thread Jan Høydahl
Hi, No, it's not normal :) Have you tried to hit SHIFT-F5 to make sure you're not getting tricked by browser caching? Or try a slightly different query like id:* You can also visit the Schema browser page of Solr admin and check the stats on how many docs are in the index. -- Jan Høydahl

Re: Using SOLR Autocomplete for addresses (i.e. multiple terms)

2012-01-25 Thread Jan Høydahl
an example to GitHub. See http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr/ -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 3. jan. 2012, at 20:41, Dave wrote: I've got another question for anyone that might

Re: Commit and sessions

2012-01-27 Thread Jan Høydahl
, but not there yet: https://issues.apache.org/jira/browse/SOLR-2700 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 27. jan. 2012, at 13:05, Per Steffensen wrote: Hi If I have added some document to solr, but not done explicit

Re: hot deploy of newer version of solr schema in production

2012-02-01 Thread Jan Høydahl
and frontend changes to take place at the same time as a major search engine upgrade, so it's not always as easy as switching the backend, you need to synchronize things. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 1. feb. 2012

Re: StopFilterFactory and qf containing some fields that use it and some that do not

2012-02-01 Thread Jan Høydahl
be a nice way to circumvent this problem in a future release... I created a JIRA issue to investigate if it is possible to implement this. See https://issues.apache.org/jira/browse/SOLR-3085 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training

Re: Copy Index from Solr 3.2.0 to Solr 3.4.0

2012-02-04 Thread Jan Høydahl
, previous versions of Solr will no longer be able to read your indices. In a master/slave configuration, all searchers/slaves should be upgraded before the master. If the master were to be updated first, the older searchers would not be able to read the new index format. -- Jan Høydahl

Re: Help:Solr can't put all pdf files into index

2012-02-09 Thread Jan Høydahl
Hi, Are you 100% sure that the filename is globally unique, since you use it as the uniqueKey? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 9. feb. 2012, at 08:30, 荣康 wrote: Hey , I am using solr as my search engine

Re: Range facet - Count in facet menu != Count in search results

2012-02-09 Thread Jan Høydahl
Hi, If you use trunk (4.0) version, you can say fq=price:[10 TO 20} and have the upper bound be exclusive. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 10. feb. 2012, at 00:58, Yuhao wrote: I've changed

Re: Solr / Tika Integration

2012-02-10 Thread Jan Høydahl
I think you need to control the parameter enableAutoSpace in PDFBox. There's a JIRA for it, but it depends on some Tika1.1 stuff as far I can understand https://issues.apache.org/jira/browse/SOLR-2930 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training

Re: solr search speed is so slow.

2012-02-11 Thread Jan Høydahl
, newer versions of Solr may have optimizations for faster highlighting.. Another thing from your reqHandler config. You use maxAnalyzedChars=-1. The correct should be hl.maxAnalyzedChars and this will only work for the original highlighter, not fvh. -- Jan Høydahl, search solution architect

Re: Setting up logging for a Solr project that isn't in tomcat/webapps/solr

2012-02-11 Thread Jan Høydahl
You can unpack your war (jar -xvf solr.war), change logging.properties and then pack it again (jar -cvf solr.war) You can also try to specify a new folder in a lib... tag in solrconfig.xml and put your prop file there. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com

Re: Do we need reindexing from solr 1.4.1 to 3.5.0?

2012-02-13 Thread Jan Høydahl
Hi, If you keep the old schema and solrconfig, you should be able to simply upgrade your solr.war. Solr 3.5 will read the old index format and write the new one. Once you've upgraded the index will not be readable by 1.4.1. Test this on a test environment to verify. -- Jan Høydahl, search

Re: Solr binary response for C#?

2012-02-15 Thread Jan Høydahl
Hi, I just created a JIRA to investigate an Avro based serialization format for Solr: https://issues.apache.org/jira/browse/SOLR-3135 You're welcome to contribute. Guess we'll first need to define schemas, then create an AvroResponseWriter and then support in the C# Solr client. -- Jan Høydahl

Re: Stemming and accents (HunspellStemFilterFactory)

2012-02-15 Thread Jan Høydahl
to apply when parsing the dictionary into memory. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 14. feb. 2012, at 16:27, Chantal Ackermann wrote: Hi Bráulio, I don't know about HunspellStemFilterFactory especially but concerning

Re: Semantic autocomplete with Solr

2012-02-15 Thread Jan Høydahl
Check out http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr/ You can feed it anything, such as a log of previous searches, or a pre-computed dictionary of item + color combinations that exist in your DB etc. -- Jan Høydahl, search solution architect Cominvent

Re: Solr edismax clarification

2012-02-17 Thread Jan Høydahl
Please provide your full query, including your qf parameter and all other request parameters, and also the relevant fields/field-types from schema. Do you use stopwords? Can you also add debugQuery=true and paste in the parsedQuery? -- Jan Høydahl, search solution architect Cominvent

Re: Date search by specific month and day

2012-03-05 Thread Jan Høydahl
individually store multiple versions in the index, perhaps using poly-field? It could work exactly like DateField for full date input, but also allow queries like myDate:2012, myDate:2012-03, myDate:2012-03-05, myDate:[1991 TO 2012] etc. -- Jan Høydahl, search solution architect Cominvent

Re: How to define a multivalued string type langid.langsField in solrconfig.xml

2012-03-05 Thread Jan Høydahl
. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 27. feb. 2012, at 05:09, bing wrote: Hi, all, I am using tika language detection. It is said that, if langid.langsField is set as multivalued string, and then a list

Re: errata for solr tutorial

2012-03-05 Thread Jan Høydahl
Hi, Thanks for reporting. This is fixed now on the staging site, will be set live soon. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 1. mars 2012, at 16:50, Nicolai Scheer wrote: Hi! Having just worked through the solr

Re: How to Index Custom XML structure

2012-03-09 Thread Jan Høydahl
You could setup a ManifoldCF job to fetch the XMLs and then setup a new SolrOutputConnection for /solr/update/xslt?tr=myStyleSheet.xsl where myStyleSheet.xsl is the stylesheet to use for that kind of XML. See http://wiki.apache.org/solr/XsltUpdateRequestHandler -- Jan Høydahl, search solution

Re: Solr 4.0

2012-03-12 Thread Jan Høydahl
Hi Robert, See http://wiki.apache.org/solr/Solr4.0 The developer community is working towards a 4.0-Alpha release expected in a few months, however no dates are fixed. Many already use a snapshot version of TRUNK. You are free to do so, at your own risk. -- Jan Høydahl, search solution

Re: solr 3.5 and indexing performance

2012-03-13 Thread Jan Høydahl
the profiler to see what part of the code takes up the time http://docs.oracle.com/javase/6/docs/technotes/tools/share/jvisualvm.html -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 12. mars 2012, at 16:42, Agnieszka Kukałowicz

Re: Can solr-langid(Solr3.5.0) detect multiple languages in one text?

2012-03-13 Thread Jan Høydahl
the text into chunks and then do langid on each chunk. If you'd like to open a JIRA for this, it will not be forgotten... -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 13. mars 2012, at 04:25, bing wrote: Hi, all, I am using

Re: solr 3.5 and indexing performance

2012-03-13 Thread Jan Høydahl
to the issue your polish hunspell dictionaries. Then we'll try to reproduce the error. I wonder if this performance decrease is also seen for English dictionaries? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 13. mars 2012, at 16

Re: Solr Monitoring / Stats

2012-03-13 Thread Jan Høydahl
And here is a page on how to wire Solr's JMX info into OpenNMS monitoring tool. Have not tried it, but as soon as a collector config is defined once I'd guess it could be re-used, maybe shipped with Solr. http://www.opennms.org/wiki/JMX_Collector -- Jan Høydahl, search solution architect

Re: Field Value Substitution

2012-03-16 Thread Jan Høydahl
at query time? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 15. mars 2012, at 08:13, tosenthu wrote: Hi I have a scenario, where I store a field which is an Id, ID field -- 1 3 4 Descrption mapping

Re: whethere solr 3.3 index file is compatable with solr 4.0

2012-03-22 Thread Jan Høydahl
That's correct. Solr4 will read your existing index and let you use it with the feature set it already has. But in order for you to use new fieldTypes, you need to re-index your data. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com

Re: Is it possible to grouping solr results by their domain ?

2012-04-09 Thread Jan Høydahl
Sure! http://wiki.apache.org/solr/FieldCollapsing -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 9. apr. 2012, at 07:27, hadi wrote: I have crawled many site with nutch and using solr 3.4 to browse the results but i want

Re: Cloud-aware request processing?

2012-04-09 Thread Jan Høydahl
Hi, Instead of using Solr, you may want to have a look at Hadoop or another framework for distributed computation, see e.g. http://java.dzone.com/articles/comparison-gridcloud-computing -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training

Re: Solr 3.5 takes very long to commit gradually

2012-04-12 Thread Jan Høydahl
What operating system? Are you using spellchecker with buildOnCommit? Anything special in your Update Chain? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 12. apr. 2012, at 06:45, Rohit wrote: We recently migrated from

Re: Options for automagically Scaling Solr (without needing distributed index/replication) in a Hadoop environment

2012-04-13 Thread Jan Høydahl
responsible for the Reduce part building the indexes? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 13. apr. 2012, at 04:23, Otis Gospodnetic wrote: Hello Ali, I'm trying to setup a large scale *Crawl + Index + Search

Re: Options for automagically Scaling Solr (without needing distributed index/replication) in a Hadoop environment

2012-04-14 Thread Jan Høydahl
Hi, This won't give you the performance you need, unless you have enough RAM on the Solr box to cache the whole index in memory. Have you tested this yourself? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 12. apr. 2012

Re: Location Based Searching – Long Lat?

2012-04-16 Thread Jan Høydahl
Hi, There is no geocoding API in Solr as per now so you could handle that in your PHP app. But check out https://issues.apache.org/jira/browse/SOLR-2833 for potential solution. I have a first version of the processor which I can upload to that JIRA if you're interested. -- Jan Høydahl, search

Re: Issue with Solr 3.5 while using TikaEntityProcessor on .docx files

2012-04-16 Thread Jan Høydahl
Hi, Solr3.6 is just out with Tika 1.0. Can you try that? Also, Solr TRUNK now has Tika 1.1... I recommend downloading Tika-App and testing your offending files directly with that http://tika.apache.org/1.1/gettingstarted.html -- Jan Høydahl, search solution architect Cominvent

Re: Can Solr solve this simple problem?

2012-04-16 Thread Jan Høydahl
Hi everyone :) Hi :) So, these are my 3 questions: 1. Does Solr provide searching among different count fields with different types like in WHERE condition? Yes. As long as these are not full-text you should use filter queries for these, e.g. q=*:* fq=country:USA fq=language:SPA fq=age:[30

Re: Options for automagically Scaling Solr (without needing distributed index/replication) in a Hadoop environment

2012-04-17 Thread Jan Høydahl
the realtime shards as today, but indexing the static shards will be handled by Katta. If Katta adds a shard it will tell SolrCloud by updating the ZK tree, and SolrCloud will pick up the shard and start serving search for it.. -- Jan Høydahl, search solution architect Cominvent

Re: Can I use Field Aliasing/Renaming on Solr3.3?

2012-04-17 Thread Jan Høydahl
You'll have to upgrade to 3.6. Upgrading is really easy and should be 100% back-compat. Just keep your old config and drop in the new solr.war, then you'll get the new features. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com

Re: Can Solr solve this simple problem?

2012-04-17 Thread Jan Høydahl
ways to do what you REALLY wanted instead of limiting yourself to the ORDER BY syntax etc. One of Solr's strengths is relevancy and FunctionQueries and it can do amazing things :) Further answers below.. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training

Re: Jira 1540

2012-04-17 Thread Jan Høydahl
Simply try using Solr3.6 to read your old 1.3 indices. Chances are that it will work - without the exceptions :) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 17. apr. 2012, at 11:08, Ramprakash Ramamoorthy wrote: I am using

Re: Can Solr solve this simple problem?

2012-04-17 Thread Jan Høydahl
. On http://wiki.apache.org/solr/PublicServers the only one saying so explicitly is Jeeran - I'm sure others can fill in with more examples -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 17. apr. 2012, at 12:10, Alexandr Bocharov

Re: Difference between Search result from Admin console and solr/browse

2012-04-18 Thread Jan Høydahl
. A simple way to prototype transforms is to write an XSL and using the XSLTUpdateRequestHandler at solr/update/xslt instead of the XML handler. See http://wiki.apache.org/solr/XsltUpdateRequestHandler -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training

Re: minimum match and not matched words / term frequency in query result

2012-04-18 Thread Jan Høydahl
for instance. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 18. apr. 2012, at 17:34, giovanni.bricc...@banzai.it wrote: Hi I have a dismax query with a mininimum match settings, this allows some terms to be missing in query results

Re: Solr 3.6 parsing and extraction files

2012-04-18 Thread Jan Høydahl
/solrconfig.xml which lib ../ directives are included near the top of the file, this should give you a hint of how to configure your own solrconfig.xml depending on where you put those libs. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training

Re: Problems with edismax parser and solr3.6

2012-04-18 Thread Jan Høydahl
Hi, Thanks for reporting this. I've created a bug ticket for this at https://issues.apache.org/jira/browse/SOLR-3377 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 18. apr. 2012, at 17:00, Bernd Fehling wrote: I just

Re: Language Identification

2012-04-20 Thread Jan Høydahl
Hi, Solr just reuses Tika's language identifier. But you are of course free to do your language detection on the Nutch side if you choose and not invoke the one in Solr. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 20. apr

Re: 'Error 404: missing core name in path' in Solr

2012-04-23 Thread Jan Høydahl
Hi, Perhaps your search server uses a multi core setup? In that case you need your core name as part of the URL http://wiki.apache.org/solr/CoreAdmin#Example -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 23. apr. 2012, at 01

Re: Language Identification

2012-04-23 Thread Jan Høydahl
I think nothing has moved. We just offer Solr users to do language detection inside of Solr, using any of these two libs. If you choose to do language detection on client side instead, using any of these, what is stopping you? -- Jan Høydahl, search solution architect Cominvent

Re: Using Customized sorting in Solr

2012-04-26 Thread Jan Høydahl
group.offset=1 etc etc. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 26. apr. 2012, at 08:10, solr user wrote: Hi, We are planning to move the search of one of our listing based portal to solr/lucene search server from sphinx

Re: Weird query results with edismax and boolean operator +

2012-04-28 Thread Jan Høydahl
Hi, What is your qf parameter? Can you run the three queries with debugQuery=trueechoParams=all and attach parsed query and all params? It will probably explain what is happening. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com

Re: Weird query results with edismax and boolean operator +

2012-04-30 Thread Jan Høydahl
Hi, I see that you have already commented on SOLR-2649 MM ignored in edismax queries with operators. So let's continue the way towards resolution there... -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 30. apr. 2012, at 14:28

Re: 1MB file to Zookeeper

2012-05-05 Thread Jan Høydahl
) method chops up the file and stores it spanning multiple 1M ZK nodes, and the get(key) method assembles all parts and returns the object. It would be good enough for most, but if you require something better you can easily impl support for CouchDb, Voldemort or whatever. -- Jan Høydahl, search

Re: Solritas in production

2012-05-06 Thread Jan Høydahl
and never expose Solr directly to the world. So you should feel totally comfortable continuing to use Solr over HTTP from PHP! -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 6. mai 2012, at 14:02, András Bártházi wrote: Hi

Re: Partition Question

2012-05-06 Thread Jan Høydahl
get back with some more details? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 6. mai 2012, at 17:07, Yuval Dotan wrote: Hi All We have an index of ~2,000,000,000 Documents and the query and facet times are too slow for us

Re: Solritas in production

2012-05-07 Thread Jan Høydahl
as he has 1.000.000+ pageviews a day. Again, no PHP in this scenario. What's your opinion about both solutions? Thanks in advance, Marcelo Carvalho Fernandes +55 21 8272-7970 +55 21 2205-2786 On Sun, May 6, 2012 at 7:42 PM, Jan Høydahl jan@cominvent.com wrote: Hi

Re: Can one determine which results are good enough to alert users about?

2012-05-09 Thread Jan Høydahl
such that the alerts themselves contain only top-N hits, with a link in the email to see all (of course only those which are new or updated since last alert). -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 9. mai 2012, at 10:50, Chris Harris

Re: Edismax lowercaseOperators=false does not ignore lowercase and

2012-05-09 Thread Jan Høydahl
a global stopwords list somewhere, would probably make sense at QP level, alternatively as a SearchComponent. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 9. mai 2012, at 11:49, jpascua wrote: Hi, I could not prevent users from

Re: Is it possible to limit the bandwidth of replication

2012-05-09 Thread Jan Høydahl
I think we have to add this for java based rep. +1

Re: Can one determine which results are good enough to alert users about?

2012-05-10 Thread Jan Høydahl
query for Software License -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 9. mai 2012, at 22:55, Otis Gospodnetic wrote: Hi Chris, I think there is some confusion here. When people say things about relevance scores they talk

Re: SOLR Security

2012-05-11 Thread Jan Høydahl
Hi, There is nothing stopping you from pointing Ajax-SOLR to a URL on your app-server, which acts as a security insulation layer between the Solr backend and the world. In this (thin) layer you can analyze the input and choose carefully what to let through and not. -- Jan Høydahl, search

Re: Editing long Solr URLs - Chrome Extension

2012-05-11 Thread Jan Høydahl
to the size of the content -- Jan Høydahl, search solution architect Cominvent AS - www.facebook.com/Cominvent Solr Training - www.solrtraining.com On 11. mai 2012, at 07:32, Amit Nithian wrote: Hey all, I don't know about you but most of the Solr URLs I issue are fairly lengthy full

Re: query with DATE FIELD AND RANGE query using dismax

2012-05-15 Thread Jan Høydahl
Hi, You can't. Try eDisMax instead: http://wiki.apache.org/solr/ExtendedDisMax -- Jan Høydahl, search solution architect Cominvent AS - www.facebook.com/Cominvent Solr Training - www.solrtraining.com On 15. mai 2012, at 11:05, ayyappan wrote: Hi My queries are working with standard

Re: Fault tolerant Solr replication architecture

2012-05-21 Thread Jan Høydahl
places. [1] https://builds.apache.org/job/Solr-trunk/lastSuccessfulBuild/artifact/artifacts/ [2] http://wiki.apache.org/solr/SolrCloud -- Jan Høydahl, search solution architect Cominvent AS - www.facebook.com/Cominvent Solr Training - www.solrtraining.com On 21. mai 2012, at 09:25, Parvin Gasimzade

Re: Remote streaming - posting a URL which is password protected

2012-05-21 Thread Jan Høydahl
stream.url=http://username:passw...@somewebsite.com/docs/DOC2609 but I have not tested it. Why can't you download the file locally first? If you're looking for a production grade HTTP crawler you could look at ManifoldCF. -- Jan Høydahl, search solution architect Cominvent AS - www.facebook.com

Re: How can i search site name

2012-05-22 Thread Jan Høydahl
the UrlClassifyProcessor, see SOLR-2826. -- Jan Høydahl, search solution architect Cominvent AS - www.facebook.com/Cominvent Solr Training - www.solrtraining.com On 22. mai 2012, at 08:03, Shameema Umer wrote: Sorry, Please let me know how can I search site name using the solr query syntax. My

Re: How can i search site name

2012-05-22 Thread Jan Høydahl
the domain field separately with higher weight. -- Jan Høydahl, search solution architect Cominvent AS - www.facebook.com/Cominvent Solr Training - www.solrtraining.com On 22. mai 2012, at 12:23, Shameema Umer wrote: Thanks Li Li and Jan. Yes, if url is www.company.com/foo/bar/index.html, I

Re: System requirements in my case?

2012-05-22 Thread Jan Høydahl
searches include everything. -- Jan Høydahl, search solution architect Cominvent AS - www.facebook.com/Cominvent Solr Training - www.solrtraining.com On 22. mai 2012, at 11:06, Bruno Mannina wrote: My choice: http://www.ovh.com/fr/serveurs_dedies/eg_best_of.xml 24 Go DDR3 Le 22/05/2012 10:26

Highlighting: When alternateField does not exist

2013-01-09 Thread Jan Høydahl
other workarounds for this problem on the solr side? In this case we cannot control the source DB to make sure the teaser exists. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com

Re: indexing Text file in solr

2013-01-29 Thread Jan Høydahl
it -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 27. jan. 2013 kl. 11:23 skrev hadyelsahar hadyelsa...@gmail.com: i have a large Arabic Text File that contains Tweets each line contains one tweet , that i want to index in solr

Re: Minimum word length for stemming

2013-01-31 Thread Jan Høydahl
#Customizing_Stemming and the KeywordMarkerFilterFactory which lets you list a bunch of words you do not want the stemmers to touch. I guess you could easily implement your own TokenLengthMarkerFilterFactory which keeps words from being stemmed based on length. -- Jan Høydahl, search solution architect Cominvent

Re: Solr 4 - Missing Field Collapsing Components

2013-02-04 Thread Jan Høydahl
Hi, Field collapsing is built-in and is also called Result Grouping: http://wiki.apache.org/solr/FieldCollapsing You simply enable it with ...group=truegroup.field=myfield If that does not work for you, please respond with detailed error messages so we can help you further. -- Jan Høydahl

Re: Multi-threaded post.jar?

2013-02-04 Thread Jan Høydahl
from scratch is something to discuss. Please bring on your ideas of how you plan to extend it, perhaps even simplifying the code in the process? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 3. feb. 2013 kl. 17:19 skrev Upayavira u

Re: Multi-threaded post.jar?

2013-02-05 Thread Jan Høydahl
that details the features of the tool, and also explains that its role is educational, rather than anything else. Upayavira On Mon, Feb 4, 2013, at 09:10 PM, Jan Høydahl wrote: Hi, Hmm, the tool is getting bloated for a one-class no-deps tool already :) Guess it would be useful too

Re: Multi-threaded post.jar?

2013-02-06 Thread Jan Høydahl
With dependencies I meant external jar dependencies. Perhaps extensions could have deps while leaving the core compilable without? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 5. feb. 2013 kl. 17:10 skrev Upayavira u

Re: Multi-select faceting is not working when facet fields are configured in default request handler.

2013-02-07 Thread Jan Høydahl
If you want to override facet.field through the query, you have to override ALL facet.field's defined as default in reqeust handler, else those other facets are gone. You say But it's not working. without specifying WHAT is not working. -- Jan Høydahl, search solution architect Cominvent

Re: SolrCloud new zookeper node on different ip/ replicate between two clasters

2013-02-07 Thread Jan Høydahl
You should run replicated ZK: http://zookeeper.apache.org/doc/trunk/zookeeperStarted.html#sc_RunningReplicatedZooKeeper Give Solr the list of all ZK's and you're good to go -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 7. feb

  1   2   3   4   5   6   7   8   9   10   >