Re: New PHP API for Solr (Logic Solr API)

2011-03-30 Thread Lukas Kahwe Smith
to explore and learn the API via tab completion, which is of course not so easy to offer via a C extension API :-/ regards, Lukas Kahwe Smith m...@pooteeweet.org [1] http://pooteeweet.org/blog/1796

Re: New PHP API for Solr (Logic Solr API)

2011-03-07 Thread Lukas Kahwe Smith
(namespaces etc). btw there is already another PHP 5.3 based API, though it tries to also unify other Lucene based API's as much as possible: https://github.com/dstendardi/Ariadne regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: [POLL] Where do you get Lucene/Solr from? Maven? ASF Mirrors?

2011-01-18 Thread Lukas Kahwe Smith
website) [] Maven repository (whether you use Maven, Ant+Ivy, Buildr, etc.) [X] I/we build them from source via an SVN/Git checkout. [] Other (someone in your company mirrors them internally or via a downstream project) regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: PHP app not communicating with Solr

2011-01-12 Thread Lukas Kahwe Smith
with--curl-wrappers. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: PHP PECL solr API library

2011-01-09 Thread Lukas Kahwe Smith
. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: Dataimport performance

2010-12-19 Thread Lukas Kahwe Smith
On 19.12.2010, at 23:30, Alexey Serba wrote: Also Ephraim proposed a really neat solution with GROUP_CONCAT, but I'm not sure that all RDBMS-es support that. Thats MySQL only syntax. But if you google you can find similar solution for other RDBMS. regards, Lukas Kahwe Smith m

Re: facetting when using field collapsing

2010-11-13 Thread Lukas Kahwe Smith
On 13.11.2010, at 10:30, Yonik Seeley wrote: On Wed, Nov 10, 2010 at 9:12 AM, Lukas Kahwe Smith m...@pooteeweet.org wrote: The above wiki page seems to be out of date. Reading the comments in https://issues.apache.org/jira/browse/SOLR-236 it seems like group should be replaced

facet+shingle in autosuggest

2010-11-11 Thread Lukas Kahwe Smith
I can sensibly bring in a stop word filter here? Actually in theory the stop words could appear as the first or second word as well. So I guess when producing shingle's I want to skip any stop word from being part of any shingle. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: facet+shingle in autosuggest

2010-11-11 Thread Lukas Kahwe Smith
states of america. it would then generate a shingle with united states america which in turn wouldnt generate a proper phrase search string. one option of course would be to restrict the shingles to 2 words and then using the stop word filter would work as expected. regards, Lukas Kahwe Smith m

Re: facetting when using field collapsing

2010-11-10 Thread Lukas Kahwe Smith
On 07.11.2010, at 20:13, Lukas Kahwe Smith wrote: Hi, I am pondering making use of field collapsing. I am currently indexing clauses (sections) inside UN documents: http://resolutionfinder.org/search/unifiedResults?q=africa=t[22]=medicationdc=st=clause Now since right now my data set

facetting when using field collapsing

2010-11-07 Thread Lukas Kahwe Smith
the right counts. So I am wondering if field collapsing in its current form supports faceting, since its not mentioned on the wiki page: http://wiki.apache.org/solr/FieldCollapsing regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: Which version of Solr to use?

2010-10-14 Thread Lukas Kahwe Smith
On 14.10.2010, at 19:50, Jonathan Rochkind rochk...@jhu.edu wrote: I'm kind of confused about Solr development plans in general, highlighted by this thread. I think 1.4.1 is the latest officially stable release, yes? Why is there both a 1.5 and a 3.x, anyway? Not to mention a 4.x?

Re: Which version of Solr to use?

2010-10-14 Thread Lukas Kahwe Smith
however its Y. but no information means users are essentially without any information about the future. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: Solr PHP PECL Extension going to Stable Release - Wishing for Any New Features?

2010-10-11 Thread Lukas Kahwe Smith
tokenizing lucene queries to enable validation of the syntax used etc. see here for a use case and a user land implementation: http://pooteeweet.org/blog/1796 regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: DIH: alternative approach to deltaQuery

2010-09-22 Thread Lukas Kahwe Smith
On 20.09.2010, at 08:32, Lukas Kahwe Smith wrote: Hi, ok since it didnt seem like there was interest to document this approach on the wiki i have simply documented it on my blog: http://pooteeweet.org/blog/1827 sorry for the spam. Lance (and Erik) did think it would be good to add

Re: DIH: alternative approach to deltaQuery

2010-09-20 Thread Lukas Kahwe Smith
Hi, ok since it didnt seem like there was interest to document this approach on the wiki i have simply documented it on my blog: http://pooteeweet.org/blog/1827 regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: DataImportHandler with multiline SQL

2010-09-16 Thread Lukas Kahwe Smith
split up your queries like that? regards, Lukas Kahwe Smith m...@pooteeweet.org

DIH: alternative approach to deltaQuery

2010-09-16 Thread Lukas Kahwe Smith
out by any sane RDBMS. And if clean = false it basically triggers the delta query part to be evaluated. Is there any downside to this approach? Should this be added to the wiki? regards. Lukas Kahwe Smith m...@pooteeweet.org

Re: DIH: alternative approach to deltaQuery

2010-09-16 Thread Lukas Kahwe Smith
be available. Lukas Kahwe Smith wrote: Hi, I think i have mentioned this approach before on this list, but I really think that the deltaQuery approach which is currently explained as the way to do updates is far from ideal. It seems to add a lot of redundant queries. I therefore propose

roadmap

2010-09-09 Thread Lukas Kahwe Smith
as to whats going to happen in the near future would make it all the more easier for us users to bet our futures on solr :) regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: roadmap

2010-09-09 Thread Lukas Kahwe Smith
park when to expect them would go a long way. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: Delta Import with something other than Date

2010-09-08 Thread Lukas Kahwe Smith
. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: jetty logging

2010-07-19 Thread Lukas Kahwe Smith
On 17.07.2010, at 15:39, Lukas Kahwe Smith wrote: Hi, I am following: http://wiki.apache.org/solr/LoggingInDefaultJettySetup All works fine except defining the logging properties files from jetty.xml Does this approach work for anyone else? problem solved i had to remove

jetty logging

2010-07-17 Thread Lukas Kahwe Smith
Hi, I am following: http://wiki.apache.org/solr/LoggingInDefaultJettySetup All works fine except defining the logging properties files from jetty.xml Does this approach work for anyone else? regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: Less convoluted way to query for an empty string?

2010-07-14 Thread Lukas Kahwe Smith
/select?debugQuery=onq={!raw f=foo_s} regards, Lukas Kahwe Smith m...@pooteeweet.org

optional vs. probhibited aka standard vs. dismax handler

2010-06-29 Thread Lukas Kahwe Smith
, and +/- can be used to denote mandatory and optional clauses ... but all other Lucene query parser special characters are escaped to simplify the user experience. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: optional vs. probhibited aka standard vs. dismax handler

2010-06-29 Thread Lukas Kahwe Smith
Kahwe Smith m...@pooteeweet.org

Re: optional vs. probhibited aka standard vs. dismax handler

2010-06-29 Thread Lukas Kahwe Smith
On 29.06.2010, at 13:38, Lukas Kahwe Smith wrote: On 29.06.2010, at 13:24, Jan Høydahl / Cominvent wrote: Hi, In DisMax the mm parameter controls whether terms are required or optional. The default is 100% which means all terms required, i.e. you do not need to add +. You can change

Re: optional vs. probhibited aka standard vs. dismax handler

2010-06-29 Thread Lukas Kahwe Smith
search and boolean operators, if you need more advanced user-facing query syntax. yeah .. i am keeping an eye on that already. thx! regards, Lukas Kahwe Smith m...@pooteeweet.org

phrase highlighting

2010-06-26 Thread Lukas Kahwe Smith
security and council are still highlighted separately. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: phrase highlighting

2010-06-26 Thread Lukas Kahwe Smith
On 26.06.2010, at 16:22, Koji Sekiguchi wrote: (10/06/26 22:19), Lukas Kahwe Smith wrote: Hi, Form googling and looking at jira tickets it seems like phrase highlighting should work out of the box, but even enabling it manually didnt get me the desired result: http

Re: phrase highlighting

2010-06-26 Thread Lukas Kahwe Smith
On 26.06.2010, at 16:30, Lukas Kahwe Smith wrote: On 26.06.2010, at 16:22, Koji Sekiguchi wrote: (10/06/26 22:19), Lukas Kahwe Smith wrote: Hi, Form googling and looking at jira tickets it seems like phrase highlighting should work out of the box, but even enabling it manually didnt

Re: Recommended MySQL JDBC driver

2010-06-25 Thread Lukas Kahwe Smith
to the 5.1.12 connector. I just saw the message on the list about branch_3x in SVN, which looks like a better option than trunk. Any news on this topic? regards, Lukas Kahwe Smith m...@pooteeweet.org

finding out why a document is in the result

2010-06-18 Thread Lukas Kahwe Smith
the given fields to stored? regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: finding out why a document is in the result

2010-06-18 Thread Lukas Kahwe Smith
looked at. it doesnt really contain the infos required, plus its obviously quite slow too. regards, Lukas Kahwe Smith m...@pooteeweet.org

dealing with dash chars in fields when using dismax

2010-06-13 Thread Lukas Kahwe Smith
rather just remove support for prohibiting words. regards, Lukas Kahwe Smith m...@pooteeweet.org

error running dih import

2010-06-13 Thread Lukas Kahwe Smith
) ... 12 more Jun 13, 2010 4:13:47 PM org.apache.solr.update.DirectUpdateHandler2 rollback INFO: start rollback Jun 13, 2010 4:13:47 PM org.apache.solr.update.DirectUpdateHandler2 rollback INFO: end_rollback regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: dealing with dash chars in fields when using dismax

2010-06-13 Thread Lukas Kahwe Smith
is there some clean solution that doesnt mean a lot of coding work on my end to handle dash both as a special and as a normal char. something like doing the search twice both with the dash escaped and not escaped and then some intelligent scoring to produce the final result set. regards, Lukas Kahwe

Re: dealing with dash chars in fields when using dismax

2010-06-13 Thread Lukas Kahwe Smith
might end up doing though is not escape dashes only in specific cases: foo-bar (escape) foo - bar (escape) foo -bar (not escape, aka probihit bar) This should enable power users and should rarely hit non power users. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: error running dih import

2010-06-13 Thread Lukas Kahwe Smith
On 13.06.2010, at 16:46, Lukas Kahwe Smith wrote: Hi, what could cause this issue? i cannot reproduce it on my dev machine, but i am pretty sure its not an access control issue in either the file system nor the database. INFO: Creating a connection for entity entitydelete with URL

Re: minpercentage vs. mincount

2010-06-02 Thread Lukas Kahwe Smith
to say. ok i have created it: https://issues.apache.org/jira/browse/SOLR-1937 regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: Subclassing DIH

2010-06-01 Thread Lukas Kahwe Smith
evalutes to true and the updated_at would be ignored in pretty much any decent RDBMS. if its false, then the updated_at is checked as per usual. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: Sites with Innovative Presentation of Tags and Facets

2010-05-31 Thread Lukas Kahwe Smith
time, obviously less flexible. regards, Lukas Kahwe Smith m...@pooteeweet.org

matching only empty fields

2010-05-29 Thread Lukas Kahwe Smith
an untokenized string. Of course I could just turn an empty string into none at index time, but I am wondering how to do it in general :) I tried using just or [ TO ] to match for empty strings, but this matches everything and due to the negation removes everything. regards, Lukas Kahwe Smith m

Re: Sites with Innovative Presentation of Tags and Facets

2010-05-28 Thread Lukas Kahwe Smith
://www.edwardtufte.com/bboard/q-and-a-fetch-msg?msg_id=0001OR kayak.com uses a double slider to handle the flight departure range: http://screencast.com/t/ZjExMTE5 regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: Sites with Innovative Presentation of Tags and Facets

2010-05-27 Thread Lukas Kahwe Smith
the count without and with those additional checkbox filters applied (actually i only display two numbers of they are not the same): http://screencast.com/t/MWUzYWZkY2Yt regards, Lukas Kahwe Smith m...@pooteeweet.org (*) if anyone has a slider that can do the above i would love to integrate

minpercentage vs. mincount

2010-05-26 Thread Lukas Kahwe Smith
request or should stuff like this rather be done in userland (I have noticed for example that Solr prefers to have users normalize the scores in userland too)? regards, Lukas Kahwe Smith m...@pooteeweet.org

Tagging and excluding Filters

2010-05-25 Thread Lukas Kahwe Smith
}information_type_id:(-1)rows=21} Can someone give me a hint? regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: Tagging and excluding Filters

2010-05-25 Thread Lukas Kahwe Smith
On 25.05.2010, at 08:55, Lukas Kahwe Smith wrote: Now when I deselect one of the checkboxes I add an fq parameters: facet=truefl=*,scoresort=score+descstart=0q=(tag_ids:(23))facet.field={!ex%3Ddt}organisation_idfacet.field={!ex%3Ddt}tag_idsfacet.field={!ex%3Ddt}addressee_idsfacet.field={!ex

Re: date slider

2010-05-16 Thread Lukas Kahwe Smith
On 16.05.2010, at 21:01, Ahmet Arslan iori...@yahoo.com wrote: http://wiki.apache.org/solr/StatsComponent can give you min and max values. Sorry my bad, I just tested StatsComponent with tdate field. And it is not working for date typed fields. Wiki says it is for numeric fields. ok

date slider

2010-05-15 Thread Lukas Kahwe Smith
points for the slider. The user can then move the sliders to further filter the result set. How can I best go about fetching just those min and max values, ideally without having to add a separate query just for this? regards, Lukas Kahwe Smith m...@pooteeweet.org [1] http://wiki.apache.org/solr

Re: Recommended MySQL JDBC driver

2010-05-14 Thread Lukas Kahwe Smith
Hi, just FYI I am using mysql-connector-java-5.1.10-bin.jar and I my full import takes about 3 hours and I am not experiencing crashes. regards, Lukas

Re: Recommended MySQL JDBC driver

2010-05-14 Thread Lukas Kahwe Smith
with about 16M entities. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: resolutions and chapters

2010-04-27 Thread Lukas Kahwe Smith
On 26.04.2010, at 12:48, Lukas Kahwe Smith wrote: Hi, I am currently putting together a search for a DB where I have resolutions along with their metadata as well as chapters, its text and metadata. Most of the searching will actually be done on the metadata. The plan atm is to support

resolutions and chapters

2010-04-26 Thread Lukas Kahwe Smith
I could just as well use one core. grouping: how do I best group the scores for the (a) type search? should I just do two searches and combine the results? then again this will make paging tricky. regards, Lukas Kahwe Smith m...@pooteeweet.org

deploying nightly updates to slaves

2010-04-07 Thread Lukas Kahwe Smith
. Plus if we run into any issues we can also easily rollback by just swapping the data around again. I would appreciate any comments you guys might have on this concept. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: deploying nightly updates to slaves

2010-04-07 Thread Lukas Kahwe Smith
On 07.04.2010, at 14:24, Lukas Kahwe Smith wrote: For Solr the idea is also just copy the index files into a new directory and then use http://wiki.apache.org/solr/CoreAdmin#RELOAD after updating the config file (I assume its not possible to hot swap like with MySQL). Since I want to keep

Re: One item, multiple fields, and range queries

2010-03-29 Thread Lukas Kahwe Smith
not some crazy use case .. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: DIH questions

2010-03-18 Thread Lukas Kahwe Smith
However you can even also define default for these parameters inside your solrconfig.xml request handler configuration. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: SQL and $deleteDocById

2010-03-17 Thread Lukas Kahwe Smith
On 16.03.2010, at 15:42, Lukas Kahwe Smith wrote: Hi, I am trying to use $deleteDocById to delete rows based on an SQL query in my db-data-config.xml. The following tag is a top level tag in the document tag. entity name=company_del query=SELECT e.id AS `$deleteDocById` ROM

Re: SQL and $deleteDocById

2010-03-17 Thread Lukas Kahwe Smith
On 17.03.2010, at 11:36, Lukas Kahwe Smith wrote: On 16.03.2010, at 15:42, Lukas Kahwe Smith wrote: Hi, I am trying to use $deleteDocById to delete rows based on an SQL query in my db-data-config.xml. The following tag is a top level tag in the document tag. entity name

DIH request parameters

2010-03-16 Thread Lukas Kahwe Smith
(for example to pass in the password)? Furthermore is there some way to define default values for these request parameters in case no value is passed in? regards, Lukas Kahwe Smith m...@pooteeweet.org

SQL and $deleteDocById

2010-03-16 Thread Lukas Kahwe Smith
the rows, its not actually issuing any index deletes. regards, Lukas Kahwe Smith m...@pooteeweet.org

filtering documents by multiple coordinates per document

2010-03-15 Thread Lukas Kahwe Smith
to be index supported in order to perform decently): for (i=0, icount_values(x_coord),i++) x_coord[i] x_upper_left_coord AND x_coord[i] x_lower_right_coord AND y_coord[i] y_upper_left_coord AND y_coord[i] y_lower_right_coord regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: DIH template multivalued fields

2010-03-13 Thread Lukas Kahwe Smith
in the group into a string. regards, Lukas Kahwe Smith m...@pooteeweet.org

inconsistency between analysis.jsp and actual search

2010-02-12 Thread Lukas Kahwe Smith
Whats going on there? regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: inconsistency between analysis.jsp and actual search

2010-02-12 Thread Lukas Kahwe Smith
=solr.StopFilterFactory ignoreCase=true words=stopwords.txt enablePositionIncrements=true / /analyzer /fieldType It seems I should probably use the solr.StandardTokenizerFactory anyways, but for this case it wouldnt help either. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: delete via DIH

2010-02-11 Thread Lukas Kahwe Smith
On 10.02.2010, at 16:41, Lukas Kahwe Smith wrote: There is a solution to update via DIH, but is there also a way to define a query that fetches id's for documents that should be removed? Or to phrase the question a bit more open. I have a file with id's of documents to delete (one per line

delete via DIH

2010-02-10 Thread Lukas Kahwe Smith
Hi, There is a solution to update via DIH, but is there also a way to define a query that fetches id's for documents that should be removed? regards, Lukas Kahwe Smith m...@pooteeweet.org

wildcards in stopword list

2010-02-03 Thread Lukas Kahwe Smith
, but then i would need to read in the stopword.txt file the script, which i would like to avoid, then again it would probably be the more efficient approach. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: wildcards in stopword list

2010-02-03 Thread Lukas Kahwe Smith
. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: wildcards in stopword list

2010-02-03 Thread Lukas Kahwe Smith
. Furthermore a phrase entered can either match on one field or any combination of fields. Phrase in this context means separated by spaces or dash. For this I found the above approach the only feasible solution. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: wildcards in stopword list

2010-02-03 Thread Lukas Kahwe Smith
version and remove the solr.LowerCaseFilterFactory in both analyzer sections, then it should work however as long as terms usually start with an upper case letter if they do contain upper case letters. let me try this out .. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: wildcards in stopword list

2010-02-03 Thread Lukas Kahwe Smith
and requesting a field. q=suggest_field:difl=suggest_field Yeah, I just realized that while I was trying it out. :-) Still testing .. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: wildcards in stopword list

2010-02-03 Thread Lukas Kahwe Smith
tough. regards, Lukas Kahwe Smith m...@pooteeweet.org

autosuggest via solr.EdgeNGramFilterFactory (was: Re: wildcards in stopword list)

2010-02-03 Thread Lukas Kahwe Smith
Hi Ahmet, Well after some more testing I am now convinced that you rock :) I like the solution because its obviously way less hacky and more importantly I expect this to be a lot faster and less memory intensive, since instead of a facet prefix or terms search, I am doing an equality comparison

Re: multi term, multi field, auto suggest

2010-02-01 Thread Lukas Kahwe Smith
On 29.01.2010, at 15:40, Lukas Kahwe Smith wrote: I am still a bit unsure how to handle both the lowercased and the case preserved version: So here are some examples: UBS = ubs|UBS Kreuzstrasse = kreuzstrasse|Kreuzstrasse So when I type Kreu I would get a suggestion of Kreuzstrasse

Re: multi term, multi field, auto suggest

2010-02-01 Thread Lukas Kahwe Smith
On 01.02.2010, at 13:27, Lukas Kahwe Smith wrote: On 29.01.2010, at 15:40, Lukas Kahwe Smith wrote: I am still a bit unsure how to handle both the lowercased and the case preserved version: So here are some examples: UBS = ubs|UBS Kreuzstrasse = kreuzstrasse|Kreuzstrasse So when

multi term, multi field, auto suggest

2010-01-29 Thread Lukas Kahwe Smith
this? regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: Solr vs. Compass

2010-01-25 Thread Lukas Kahwe Smith
Lucene doesnt support 2PhaseCommits yet? regards, Lukas Kahwe Smith m...@pooteeweet.org

request handler defaults

2010-01-22 Thread Lukas Kahwe Smith
and I find no (or just very few) result, then I want to also include a field that has a doublemethaphone analyzer on the name. So I just want to append that field to the qf setting of the request handler defaults. regards, Lukas Kahwe Smith m...@pooteeweet.org

Re: TermsComponent, multiple fields, total count

2010-01-21 Thread Lukas Kahwe Smith
On 20.01.2010, at 15:50, Lukas Kahwe Smith wrote: On 19.01.2010, at 22:52, Lukas Kahwe Smith wrote: I also want to match multiple fields at once. Can you give an example? I enter Kreuz but this could either be part of a persons name or of a street name, which are separate fields

Re: TermsComponent, multiple fields, total count

2010-01-19 Thread Lukas Kahwe Smith
On 19.01.2010, at 21:55, Otis Gospodnetic wrote: Hi Lukas, - Original Message From: Lukas Kahwe Smith m...@pooteeweet.org I want to use TermsComponent for both auto complete suggestions but also showing Is TermsComponent really that good for AutoComplete? Have a look

TermsComponent, multiple fields, total count

2010-01-18 Thread Lukas Kahwe Smith
me this number more efficiently. regards, Lukas Kahwe Smith m...@pooteeweet.org

multi field search

2010-01-18 Thread Lukas Kahwe Smith
splitter etc.). I can of course also do an OR query as well. But it would be nice to be able to do: q=*:foo and that would simply search all fields against the query foo. regards, Lukas Kahwe Smith m...@pooteeweet.org