Re: Solr - Multi Term highlighting issue

2011-04-23 Thread Koji Sekiguchi
=solr.RemoveDuplicatesTokenFilterFactory/ /analyzer /fieldType fieldtype name=ignored stored=false indexed=false multiValued=true class=solr.StrField / /types thanks regards, Rajesh Ramana -Original Message- From: Koji Sekiguchi [mailto:k...@r.email.ne.jp] Sent: Saturday, April 23, 2011 9:51

Re: Solr - Multi Term highlighting issue

2011-04-22 Thread Koji Sekiguchi
How are your hl.fl fields defined in schema.xml? Koji -- http://www.rondhuit.com/en/ (11/04/23 1:23), Ramanathapuram, Rajesh wrote: Does anybody has other suggestions? thanks regards, Rajesh Ramana Enterprise Applications, Turner Broadcasting System, Inc. 404.878.7474 -Original

Re: Special Character Hightlighting issues after 3.1.0 update

2011-04-14 Thread Koji Sekiguchi
(11/04/14 23:49), Yonik Seeley wrote: Hmmm, looks like the highlighting code changed. Using the example doc, with 1.4 I get http://localhost:8983/solr/select?q=features:circumflexeshl=truehl.fl=featureswt=jsonindent=true highlighting:{ UTF8TEST:{ features:[eaiou

Re: Field Analyzers: which values are indexed?

2011-04-13 Thread Koji Sekiguchi
Or is the only the final value after completing the whole chain indexed? Yes. Koji -- http://www.rondhuit.com/en/

Re: what happens to docsPending if stop solr before commit

2011-04-06 Thread Koji Sekiguchi
(11/04/06 5:25), Robert Petersen wrote: I tried to find the answer to this simple question online, but failed. I was wondering about this, what happens to uncommitted docsPending if I stop solr and then restart solr? Are they lost? Are they still there but still uncommitted? Do they get

Re: Exporting to CSV

2011-03-29 Thread Koji Sekiguchi
(11/03/30 10:59), Charles Wardell wrote: Is there an easy way to get queried data exported from solr in a csv format? Hoping there is a handler or library for this. Charlie, Solr 3.1, will be released shortly, has csv response writer which is implicitly defined. Try wt=csv request parameter.

Re: hierarchical faceting, SOLR-792 - confused on config

2011-03-16 Thread Koji Sekiguchi
(11/03/17 3:53), Jonathan Rochkind wrote: Interesting, any documentation on the PathTokenizer anywhere? It is PathHierarchyTokenizer: https://hudson.apache.org/hudson/job/Solr-trunk/javadoc/org/apache/solr/analysis/PathHierarchyTokenizerFactory.html Koji -- http://www.rondhuit.com/en/

Re: Solr and Permissions

2011-03-11 Thread Koji Sekiguchi
(11/03/12 10:28), go canal wrote: Looking at the API doc, it seems that only floating value is currently supported, is it true? Right. And it is just for changing score by using float values in the file, so it cannot be used for filtering. Koji -- http://www.rondhuit.com/en/

Re: Location of Main Class in Solr?

2011-03-05 Thread Koji Sekiguchi
(11/03/04 3:30), Anurag wrote: I searched SolrIndexSearcher.java file but there is no main class. I wanted to know as to where this class resides. Can i call this main class (if it exists) using command line options in terminal , rather than through war file? Kumar, I think you may want to

Re: Problem adding new requesthandler to solr branch_3x

2011-03-04 Thread Koji Sekiguchi
bool hl=true/ If this amended to read: bool hl=truetrue/bool the solr-example starts fine. Paul, It should be bool name=hltrue/bool. Koji -- http://www.rondhuit.com/en/

Re: MLT with boost

2011-03-02 Thread Koji Sekiguchi
(11/03/02 0:23), Mark wrote: Is it possible to add function queries/boosts to the results that are by MLT? If not out of the box how would one go about achieving this functionality? Thanks Beside the point, why do you need such function? If you give us more information/background of your

Re: MLT with boost

2011-03-02 Thread Koji Sekiguchi
(11/03/03 2:54), Mark wrote: High level overview. We have items and we have sellers. The scoring of our documents is such that our boost functions outweight the pure lucene term/query scoring. Our boost functions basically take into account how good the seller is. Now for MLT searches we

Re: Make syntax highlighter caseinsensitive

2011-02-26 Thread Koji Sekiguchi
That is why I'm storing the non lowercased version of the field - with that I do not loose information. You do not loose information when you store lowercased version of the field. Koji -- http://www.rondhuit.com/en/

Re: DIH regex remove email + extract url

2011-02-25 Thread Koji Sekiguchi
Hi Rosa, Are you sure you have transformer=RegexTransformer in your entity/? My question was more about the solr DIH syntax. It doesn't work either with the new regex. Especially the syntax for this: field column=source xpath=/product/url regex=http:\/\/(.*?)\/(.*) / --- Is it correct?

Re: Make syntax highlighter caseinsensitive

2011-02-25 Thread Koji Sekiguchi
(11/02/25 18:30), Tarjei Huse wrote: Hi, On 02/25/2011 02:06 AM, Koji Sekiguchi wrote: (11/02/24 20:18), Tarjei Huse wrote: Hi, I got an index where I have two fields, body and caseInsensitiveBody. Body is indexed and stored while caseInsensitiveBody is just indexed. The idea

Re: DIH regex remove email + extract url

2011-02-25 Thread Koji Sekiguchi
(11/02/26 5:24), Rosa (Anuncios) wrote: Hi Koji, Yes of course i have RegexTransformer in my entity/. What i'm not sure is the syntax of this field column=source xpath=/product/url regex= / i don't need any other parameter here? Hi Rosa, So I've mentioned groupNames attribute for field

Re: Question Solr Index main in RAM

2011-02-24 Thread Koji Sekiguchi
(11/02/24 21:38), Andrés Ospina wrote: Hi, My name is Felipe and i want to use the index main of solr in RAM memory. How it's possible? I have solr 1.4 Thank you! Felipe Welcome Felipe! If I understand your question correctly, you can use

Re: Solr 4.0 DIH

2011-02-24 Thread Koji Sekiguchi
(11/02/22 6:58), Mark wrote: I download Solr 4.0 from trunk today and I tried using a custom Evaluator during my full/delta-importing. Within the evaluate method though, the Context is always null? When using this same class with Solr 1.4.1 the context always exists. Is this a bug or is this

Re: DIH regex remove email + extract url

2011-02-24 Thread Koji Sekiguchi
Hi Rosa, field column=description xpath=/product/content regex=[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[A-Z]{2,4} replaceWith= / Shouldn't it be regex=[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-z]{2,4}? field column=source xpath=/product/url regex=http://(.*?)\\/(.*) / Example:

Re: Make syntax highlighter caseinsensitive

2011-02-24 Thread Koji Sekiguchi
(11/02/24 20:18), Tarjei Huse wrote: Hi, I got an index where I have two fields, body and caseInsensitiveBody. Body is indexed and stored while caseInsensitiveBody is just indexed. The idea is that by not storing the caseInsensitiveBody I save some space and gain some performance. So I query

Re: Any plan to make Field Collapsing available for distributed search?

2011-02-22 Thread Koji Sekiguchi
(11/02/22 13:46), Andy wrote: Hello, I'm looking into Field Collapsing. According to the documentation one limitation is that distributed search support for result grouping has not yet been implemented. Just wondered if there's any plan to add distributed search support to field collapsing.

Re: hierarchical faceting, SOLR-792 - confused on config

2011-02-22 Thread Koji Sekiguchi
(11/02/23 8:26), kmf wrote: I'm using solr 4.0 and trying to implement a hierarchical faceting example. The example I'm trying to implement is taken from the webcast Mastering the Power of Faceted Search. (http://www.lucidimagination.com/solutions/webcasts/faceting) Around minute 30, Chris

Re: Where is NGramFilter?

2011-02-09 Thread Koji Sekiguchi
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.EdgeNGramFilterFactory There is only EdgeNGramFilterFactory listed (which I got working for prefix indexing), but no NGramFilterFactory. Is that filter not supported anymore, or is that list not up to date? It should be there.

Re: SOLR 1.4 and Lucene 3.0.3 index problem

2011-02-01 Thread Koji Sekiguchi
(11/02/01 23:58), Churchill Nanje Mambe wrote: am sorry I downloaded the solr released version as I dont know how to build solr myself but I wrote my crawler with lucene 3.x now I need solr to search this index so I tried used the solr 1.4 I downloaded from the site as the most recent

Re: UpdateHandler-Bug or intended feature?

2011-01-31 Thread Koji Sekiguchi
(11/01/31 22:20), Em wrote: Hi list, I am not sure whether this behaviour is intended or not. I am experimenting with the UpdateRequestProcessor-feature of Solr (V: 1.4) and there occured something I find strange. Well, when I send csv-data to the CSV-UpdateHandler with some fields specified

Re: UpdateHandler-Bug or intended feature?

2011-01-31 Thread Koji Sekiguchi
(11/01/31 23:33), Em wrote: Hi Koji, following is the solrconfig: requestHandler name=/update/csv class=solr.CSVRequestHandler lst name=defaults str name=update.processorthrowAway/str /lst /requestHandler updateRequestProcessorChain name=throwAway

Re: SolrJ (Trunk) Invalid version or the data in not in 'javabin' format

2011-01-30 Thread Koji Sekiguchi
(11/01/31 3:11), Em wrote: Hello list, I build an application that uses SolrJ to communicate with Solr. What did I do? Well, I deleted all the solrj-lib stuff from my application's Webcontent-directory and inserted the solrj-lib from the freshly compiled solr 4.0 - trunk. However, when trying

Re: [POLL] Where do you get Lucene/Solr from? Maven? ASF Mirrors?

2011-01-18 Thread Koji Sekiguchi
[X] ASF Mirrors (linked in our release announcements or via the Lucene website) [] Maven repository (whether you use Maven, Ant+Ivy, Buildr, etc.) [X] I/we build them from source via an SVN/Git checkout. [] Other (someone in your company mirrors them internally or via a downstream project)

Re: Strange query behaviour using splitOnCaseChange=1

2011-01-11 Thread Koji Sekiguchi
(11/01/11 20:49), Frederico Azeiteiro wrote: Hi all, I had indexed a text with the word InterContinental with fieldType text (with the default filters just removing the solr.SnowballPorterFilterFactory). As far as I understand, using the filter solr.WordDelimiterFilterFactory with

Re: strange SOLR behavior with required field attribute

2011-01-10 Thread Koji Sekiguchi
(11/01/10 23:26), Bernd Fehling wrote: Dear list, while trying different options with DIH and SciptTransformer I also tried using the required=true option for a field. I have 3 records: documents document titlefirst title/title ididentifier_01/id

Re: Print highlighting descriptions

2010-12-22 Thread Koji Sekiguchi
(10/12/23 11:56), Ruixiang Zhang wrote: I want to print the highlighting descriptions: {responseHeader:{status:0,

Re: Print highlighting descriptions

2010-12-22 Thread Koji Sekiguchi
(10/12/23 14:10), Ruixiang Zhang wrote: Thanks Koji. Actually my question is: We can use response.response.docs[i].description to print the description in docs. What expression should we use to print the description in highlighting? Ruixiang, I cannot understand your question. Is it Solr

Re: [Import Timeout] using /dataimport

2010-12-21 Thread Koji Sekiguchi
(10/12/22 9:35), Adam Estrada wrote: All, I've noticed that there are some RSS feeds that are slow to respond, especially during high usage times throughout the day. Is there a way to set the timeout to something really high or have it just wait until the feed is returned? The entire thing

Re: Is there a way to create multiple doc using DIH and access the data pertaining to a particular doc name ?

2010-12-18 Thread Koji Sekiguchi
(10/11/11 1:57), bbarani wrote: Hi, I have a peculiar situation where we are trying to use SOLR for indexing multiple tables (There is no relation between these tables). We are trying to use the SOLR index instead of using the source tables and hence we are trying to create the SOLR index as

Re: Separate Lines Like Google

2010-12-13 Thread Koji Sekiguchi
(10/12/13 23:00), Alejandro Delgadillo wrote: Hi everybody, I¹m having some troubles trying to figure out how to separate lines in a paragraph from a search result, I¹m indexing PDF¹s but when I search the highlight terms I can not know when the first line ends and the next one begins, Is

Re: Separate Lines Like Google

2010-12-13 Thread Koji Sekiguchi
(10/12/14 5:06), Alejandro Delgadillo wrote: Koji, Thank you for helping me with my questions, but I still don't get it how it's done, let's say I search for the term love and I get something like this: emLove/em is an intense feeling of affectionemLove/em may also refer to: Contents. 1 Film

Re: access to environment variables in solrconfig.xml and/or schema.xml?

2010-12-13 Thread Koji Sekiguchi
(10/12/14 4:28), Burton-West, Tom wrote: I see variables used to access java system properties in solrconfig.xml and schema.xml: http://wiki.apache.org/solr/SolrConfigXml#System_property_substitution dataDir${solr.data.dir:}/dataDir or ${solr.abortOnConfigurationError:true} Is there a way to

Re: [pubDate] is not converting correctly

2010-12-12 Thread Koji Sekiguchi
(10/12/13 8:49), Adam Estrada wrote: All, I am having some difficulties parsing the pubDate field that is part of the RSS spec (I believe). I get the warning that states, Dec 12, 2010 6:45:26 PM org.apache.solr.handler.dataimport.DateFormatTransformer transformRow WARNING: Could not parse a

Re: Highlighting Issue

2010-12-09 Thread Koji Sekiguchi
(10/12/09 21:22), Shaun Campbell wrote: I'm trying to highlight a field and I'm getting an exception thrown, only on certain search terms though. I am fairly certain that the cause of the problem is through having synonyms on the highlighted field as I have had highlighting working in the past

Re: Highlighting Issue

2010-12-09 Thread Koji Sekiguchi
(10/12/09 22:13), Shaun Campbell wrote: Koji Thanks a lot it's stopped crashing now. Can I ask one other question about synonym highlighting which looks a bit puzzling? I enter asset as my criteria and it returns through synonym matching other terms highlighted as well. My debug output is:

Re: Highlighting Issue

2010-12-09 Thread Koji Sekiguchi
(10/12/09 22:50), Shaun Campbell wrote: OK. I'd switch to FastVectorHighlighter which cured the exceptions and gives me highlighting so I assumed that you could use this instead of the standard highlighter on n-grammed fields. I guess my query was how does the highlighter now highlight synonym

Re: highlighting encoding issue

2010-12-07 Thread Koji Sekiguchi
(10/12/08 1:12), getagrip wrote: Hi, when I query solr (trunk) I get numeric character references instead of regular UTF-8 strings in case of special characters in the highlighting section, in the result section the characters are presented fine. e.g instead of the German Umlaut Ä I get #228;

Re: FastVectorHighlighter ignoring fragmenter parameter . . .

2010-12-06 Thread Koji Sekiguchi
(10/12/06 23:52), CRB wrote: Koji, Thank you for the reply. Being something of a novice with Solr, I would be grateful if you could clarify my next steps. I infer from your reply that there is no current implementation yet contributed for the FVH similar to the regex fragmenter. Thus I

Re: Dataimport: Could not load driver: com.mysql.jdbc.Driver

2010-12-05 Thread Koji Sekiguchi
(10/12/05 18:38), Ruixiang Zhang wrote: *I got the following error for dataimport:* *Full Import failed org.apache.solr.handler.dataimport.DataImportHandlerException: Could not load driver: com.mysql.jdbc.Driver* I have the following files: \example-DIH\solr\db\conf\ solrconfig.xml,

Re: Reading timestamp for DIH

2010-12-04 Thread Koji Sekiguchi
(10/11/24 6:05), Siddharth Powar wrote: Hey, Is it possible to read the timestamp that the DataImportHandler uses for a delta-import from a location other than conf/dataimport.properties. Thanks, Sid No. There is an open issue for this problem:

Re: Problem with DIH delta-import delete.

2010-12-04 Thread Koji Sekiguchi
(10/11/17 20:18), Matti Oinas wrote: Solr does not delete documents from index although delta-import says it has deleted n documents from index. I'm using version 1.4.1. The schema looks like fields field name=uuid type=string indexed=true stored=true required=true / field

Re: solr/admin/dataimport Not Found

2010-12-02 Thread Koji Sekiguchi
(10/12/03 8:58), Ruixiang Zhang wrote: I tried to import data from mysql. When I tried to run http://mydomain.com:8983/solr/admin/dataimport , I got these error message: HTTP ERROR: 404 NOT_FOUND RequestURI=/solr/admin/dataimport *Powered by Jetty://http://jetty.mortbay.org/ * Any help

Re: solr/admin/dataimport Not Found

2010-12-02 Thread Koji Sekiguchi
(10/12/03 9:29), Ruixiang Zhang wrote: Hi Koji Thanks for your reply. I pasted the wrong link. Actually I tried this fist http://mydomain.com:8983/solr/dataimport It didn't work. The page should be there after installation, right? Did I miss something? Thanks a lot! Richard To work that URL,

Re: best way to get maxDocs in java (i.e. as on stats.jsp page).

2010-12-01 Thread Koji Sekiguchi
(10/12/02 13:51), Will Milspec wrote: hi all, What's the best way to programmatically-in-java get the 'maxDoc' attribute (as seen on the stats.jsp page). I don't see any hooks on the solrj api. Currently I plan to use an http client to get stats.jsp (which returns xml) and parse it using

Re: Tuning Solr caches with high commit rates (NRT)

2010-11-15 Thread Koji Sekiguchi
(10/11/16 6:43), Dennis Gearon wrote: fc='field collapsing'? fc of facet.method=fc stands for Lucene's FieldCache. enum of facet.method=enum stands for Lucene's TermEnum. Usually, you do not need to set facet.method because Solr automatically uses most appropriate facet method for each field

Re: Tuning Solr caches with high commit rates (NRT)

2010-11-15 Thread Koji Sekiguchi
(10/11/16 8:36), Jonathan Rochkind wrote: In Solr 1.4, facet.method=enum DOES work on multi-valued fields, I'm pretty certain. Correct, and I didn't say that facet.method=enum doesn't work for multiValued/tokenized field in my previous mail. I think Koji's explanation is based on before

Re: using CJKTokenizerFactory for Japanese language

2010-11-11 Thread Koji Sekiguchi
(10/11/12 1:49), Kumar Pandey wrote: I am exploring support for Japanese language in solr. Solr seems to provide CJKTokenizerFactory. How useful is this module? Has anyone been using this in production for Japanese language? CJKTokenizer is used in a lot of places in Japan. One shortfall it

Re: Highlighting and maxBooleanClauses limit

2010-11-02 Thread Koji Sekiguchi
(10/11/02 23:14), Ken Stanley wrote: I've noticed in the stack trace that this exception occurs when trying to build the query for the highlighting; I've confirmed this by copying the params and changing hl=true to hl=false. Unfortunately, when using debugQuery=on, I do not see any details on

big terms in UnInvertedField

2010-11-01 Thread Koji Sekiguchi
Hello, With solr example, using facet.field=text creates UnInvertedField for the text field in fieldValueCache. After that, I saw stats page and I was surprised at counters in *filterCache* were up: lookups : 213 hits : 106 hitratio : 0.49 inserts : 107 evictions : 0 size : 107 warmupTime : 0

Re: big terms in UnInvertedField

2010-11-01 Thread Koji Sekiguchi
Yonik, Thank you for your reply. I just wanted to share my surprise. :) Koji -- http://www.rondhuit.com/en/ (10/11/01 23:17), Yonik Seeley wrote: 2010/11/1 Koji Sekiguchik...@r.email.ne.jp: With solr example, using facet.field=text creates UnInvertedField for the text field in

Re: documentCache clarification

2010-10-27 Thread Koji Sekiguchi
(10/10/28 6:32), Jonathan Rochkind wrote: Woah, I hadn't known about that. queryResultMaxDocsCached is actually a part of Solr 1.4? Is it documented anywhere at all? I guess it is included in the example solrconfig.xml, but is not in my own personal solrconfig.xml. The feature was added

Re: MoreLikeThis explanation?

2010-10-23 Thread Koji Sekiguchi
branch, but it didn't take completely. What branch does it work for? Darren On Thu, 2010-10-21 at 23:03 +0900, Koji Sekiguchi wrote: (10/10/21 20:33), dar...@ontrenet.com wrote: Hi, Does the latest Solr provide an explanation for results returned by MLT? No, but there is an open issue

Re: MoreLikeThis explanation?

2010-10-21 Thread Koji Sekiguchi
(10/10/21 20:33), dar...@ontrenet.com wrote: Hi, Does the latest Solr provide an explanation for results returned by MLT? No, but there is an open issue: https://issues.apache.org/jira/browse/SOLR-860 Koji -- http://www.rondhuit.com/en/

Re: configuring custom CharStream in solr

2010-10-11 Thread Koji Sekiguchi
(10/10/12 5:57), Michael Sokolov wrote: I would like to inject my CharStream (or possibly it could be a CharFilter; this is all in flux at the moment) into the analysis chain for a field. Can I do this in solr using the Analyzer configuration syntax in schema.xml, or would I need to define my

Re: Should Medical be highlighted when user search for medication?

2010-10-05 Thread Koji Sekiguchi
(10/10/06 4:41), Khai Doan wrote: I am still trying to learn Solr. My Solr configuration is based on the default example schema.xml (I haven't customize the field types). I am using text for the fields that I want highlighting on. I am searching for medication, but I see that Medical is

Re: matches in result grouping

2010-09-25 Thread Koji Sekiguchi
Correct. The matches is the doc count before any grouping (and for field.query that means before the restriction given by field.query is applied). It won't always be the same though - for example we might implement filter excludes like we do with faceting, etc. -Yonik

Re: Solr Highlighting Question

2010-09-24 Thread Koji Sekiguchi
/ /lst Any Ideas? On 09/08/2010 07:54 PM, Koji Sekiguchi wrote: (10/09/09 2:26), Jed Glazner wrote: Thanks for taking time to read through this. I'm using a checkout from the solr 3.x branch My problem is with the highlighter and wildcards I can get the highlighter to work

Re: matches in result grouping

2010-09-23 Thread Koji Sekiguchi
(10/09/23 18:14), Koji Sekiguchi wrote: I'm using recent committed field collapsing / result grouping feature in trunk. I'm confusing matches parameter in the result at the second sample output of Wiki: http://wiki.apache.org/solr/FieldCollapsing#Quick_Start I cannot understand why

Re: Solr 1.4 vs. 1.3

2010-09-21 Thread Koji Sekiguchi
And second question about 1.4 is are there any known issues with the external scoring field and 1.4 ? Julian, Do you mean ExternalFileField? If so, there is an open issue: https://issues.apache.org/jira/browse/SOLR-1607 -- http://www.rondhuit.com/en/

Re: Solr Highlighting Issue

2010-09-17 Thread Koji Sekiguchi
(10/09/17 16:36), Ahson Iqbal wrote: Hi All I have an issue in highlighting that if i query solr on more than one fields like +Contents:risk +Form:1 and even i specify the highlighting field is Contents it still highlights risk as well as 1, because it is specified in the query.. now if i

Re: Solr Highlighting Question

2010-09-08 Thread Koji Sekiguchi
(10/09/09 2:26), Jed Glazner wrote: Thanks for taking time to read through this. I'm using a checkout from the solr 3.x branch My problem is with the highlighter and wildcards I can get the highlighter to work with wild cards just fine, the problem is that solr is returning the term

Re: Throttling replication

2010-09-02 Thread Koji Sekiguchi
(10/09/03 5:42), Brandon Evans wrote: On 9/2/10 11:16 AM, Mark wrote: I am using the built in replication. Can you send me a link to the patch so I can give it a try? Thanks I see my email wasn't very clear. Sorry to get your hopes up. The patch I have is only for the rsync based

Re: lucene + solr: corrupt index

2010-08-23 Thread Koji Sekiguchi
(10/08/24 10:02), ANurag wrote: Hi, I am using lucene 3.0 jars and built a lucene index with 200 documents. The index files were then copied over to my solr 1.4.1 installation. I get the following error every time I start SOLR: What could I be doing wrong? Solr 1.4 can read Lucene 2.9 index

about readercycle script

2010-08-23 Thread Koji Sekiguchi
I'm working on SOLR-2046 and realized that readercycle script might be looking for old(?) Solr response format, therefore, today it always fails: https://issues.apache.org/jira/browse/SOLR-2046 Since I've looked for issues regarding readercycle in jira and maling list archives so far, nobody

Re: Confused about highlighting

2010-08-20 Thread Koji Sekiguchi
(10/08/21 9:04), Mark E. Haase wrote: I have highlighting working on my project (indexing content for a web app), but the idea of highlighting withem tags doesn't make sense to me. It seems that it opens up the system to XSS attacks if you echo search result data (with highlights) into a web

Re: tii RAM usage on startup

2010-08-18 Thread Koji Sekiguchi
I'm not sure how Solr exposes this configuration though. this one? !-- To set the setTermIndexInterval, do this: -- !--indexReaderFactory name=IndexReaderFactory class=org.apache.solr.core.StandardIndexReaderFactory int name=setTermIndexInterval12/int /indexReaderFactory -- Koji --

Re: Solr branches

2010-08-12 Thread Koji Sekiguchi
(10/08/12 21:06), Tomasz Wegrzanowski wrote: Hi, I'm having oome problems with solr. From random browsing I'm getting an impression that a lot of memory fixes happened recently in solr and lucene. Could you give me a quick summary how (un)stable are different lucene / solr branches and how

Re: get-colt

2010-08-05 Thread Koji Sekiguchi
(10/08/06 2:14), sai.thumul...@verizonwireless.com wrote: Hi - I am trying to compile Solr source and during ant dist step, the build times out on get-colt: [get] Getting: http://repo1.maven.org/maven2/colt/colt/1.2.0/colt-1.2.0.jar [get] To:

Re: Date faceting

2010-08-04 Thread Koji Sekiguchi
(10/08/04 19:42), Eric Grobler wrote: Hi Solr community, How do I facet on timestamp for example? I tried something like this - but I get no result. facet=true facet.date=timestamp f.facet.timestamp.date.start=2010-01-01T00:00:00Z f.facet.timestamp.date.end=2010-12-31T00:00:00Z

Re: Highlighting parameters wiki

2010-07-27 Thread Koji Sekiguchi
(10/07/27 23:16), Stephen Green wrote: The wiki entry for hl.highlightMultiTerm: http://wiki.apache.org/solr/HighlightingParameters#hl.highlightMultiTerm doesn't appear to be correct. It says: If the SpanScorer is also being used, enables highlighting for range/wildcard/fuzzy/prefix queries.

Re: why spellcheck and elevate search components can't work together?

2010-07-19 Thread Koji Sekiguchi
(10/07/19 19:14), Chamnap Chhorn wrote: In my solrconfig.xml, I setup this way, but it doesn't work at all. Any one can help? it works one without other one. searchComponent name=elevateListings class=org.apache.solr.handler.component.QueryElevationComponent str

Re: SOLR 1.4.1 - Issue with recognition of solr.solr.home system property

2010-07-18 Thread Koji Sekiguchi
(10/07/18 10:03), Tracy Flynn wrote: That's a little telling INFO: Opening new SolrCore at /Users/johndoe/example1/solr/, dataDir=./solr/data/ Since I'm running with ~/example2 as the current working directory, then that would explain it. Schema etc. is found in ~/example1/solr/conf, but

Re: HTTP ERROR: 500 - java.lang.ArrayIndexOutOfBoundsException

2010-07-17 Thread Koji Sekiguchi
(10/07/18 4:51), Girish wrote: Hi Lance, Thanks for the reply! I checked the settings and I don't think it has multivalue setting. Here is the current field configuration: *field name=id type=string indexed=true stored=true required=true / field name=first type=textgen indexed=true

range query on TrieLongField - strange result?

2010-07-15 Thread Koji Sekiguchi
I see different results between SortableLongField and TrieLongField when I try same range query. This is the test data: add doc field name=idZERO/field field name=f_tl0/field field name=f_sl0/field /doc doc field name=idLong.MAX_VALUE-1000/field field name=f_tl9223372036854774807/field field

Re: Function Query Sorting vs 'Sort' parameter?

2010-07-09 Thread Koji Sekiguchi
(10/07/10 0:54), Saïd Radhouani wrote: Hi, I'm making some basic sorting (date, price, etc.) using the sort parameter (sort=field+asc), and it's working fine. I'm wondering whether there's a significant argument to use function query sorting instead of the sort parameter? Thanks, -S I'm

Re: Function Query Sorting vs 'Sort' parameter?

2010-07-09 Thread Koji Sekiguchi
(10/07/10 7:15), Saïd Radhouani wrote: Yes, indeed, you understood my question. Looking forward to the next version then. To your reply, I'd add that _val_ is used for standard request handler, and bf is used for dismax, right? -S Right. Koji -- http://www.rondhuit.com/en/

Re: Using hl.regex.pattern to print complete lines

2010-07-08 Thread Koji Sekiguchi
(10/07/09 9:30), Peter Spam wrote: Thanks for the note, Koji. However, hl.fragsize=0 seems to return the entire document, rather than just one single line. Here's what I tried (what I previously had was commented out): regexv = ^.*$ thequery =

Re: Problem building Nightly Solr

2010-07-06 Thread Koji Sekiguchi
(10/07/07 6:25), darknovan...@gmail.com wrote: I'd like to try the new edismax feature in Solr, so I downloaded the latest nightly (apache-solr-4.0-2010-07-05_08-06-42) and tried running ant example. It fails with a missing package error. I've pasted in the output below. I tried a nightly from

Re: Unicode processing - Issue with CharStreamAwareWhitespaceTokenizerFactory

2010-07-05 Thread Koji Sekiguchi
In the same wiki, they say that CharStreamAwareWhitespaceTokenizerFactory must be used with MappingCharFilterFactory. But, when I use these tokenizer and filter together, I get a sever error saying that the filed type containing these filter and tokenizer is unknown. However, when I use this

Re: Unicode processing - Issue with CharStreamAwareWhitespaceTokenizerFactory

2010-07-05 Thread Koji Sekiguchi
No, all tokenizer can be used with mappingcharfilter Koji Sekiguchi from mobile On 2010/07/06, at 0:32, Saïd Radhouani r.steve@gmail.com wrote: Thanks Koji for the reply and for updating wiki. As it's written now in wiki, it sounds (at least to me) like MappingCharFilterFactory works

Re: ArrayIndexOutOfBoundsException heeeeeelp !?!?!?!!?! Sorting

2010-06-30 Thread Koji Sekiguchi
(10/06/30 20:27), stockii wrote: Hello. I get an SEVERE: java.lang.ArrayIndexOutOfBoundsException and i dont know the reason for this. I have 4 cores. and every core is running but. for few minutes i get these bad exception in one core. its absolutlety not acceptable ... When i search with

Re: Wiki Documentation of facet.sort

2010-06-30 Thread Koji Sekiguchi
(10/07/01 1:12), Chantal Ackermann wrote: Hi there, in the wiki, on http://wiki.apache.org/solr/SimpleFacetParameters it says: The default is true/count if facet.limit is greater than 0, false/index otherwise. I've just migrated to 1.4.1 (reindexed). I can't remember how it was with 1.4.0.

Re: Indexing a database

2010-06-29 Thread Koji Sekiguchi
(10/06/30 1:11), Lance Hill wrote: How do I know if solr is actually loading my database driver properly? I added the mysql connector to the solr/lib directory, I addedlib dir=./lib / to the solrconfig.xml just to be sure it would find the connector. When I start the application, I see it

Re: phrase highlighting

2010-06-26 Thread Koji Sekiguchi
(10/06/26 22:19), Lukas Kahwe Smith wrote: Hi, Form googling and looking at jira tickets it seems like phrase highlighting should work out of the box, but even enabling it manually didnt get me the desired result: http://resolutionfinder.org/search?q=%22security+council%22=tm=anys=Search

Re: Synonym configuration

2010-06-24 Thread Koji Sekiguchi
(10/06/25 11:33), xdzgor wrote: Hi, can someone please confirm the following statements about configuration for the synonym filter, or correct me where I'm wrong? a = b a search for a, is changed into a search for b a, b = c a search for a or a search for b, is changed into a search for c

Re: how to apply patch SOLR-1316

2010-06-17 Thread Koji Sekiguchi
As you can see both versions don't appear to be working. I tried building each but neither would compile. Which version/tag should be used when applying this patch? In general, a patch is written against the latest trunk branch as of then. For the SOLR-1316.patch, it was posted 2010-5-31,

Re: Optimize with waitFlush=false and waitSearcher=false takes a long time

2010-06-08 Thread Koji Sekiguchi
(10/06/09 7:36), Dragisa Krsmanovic wrote: When we sendoptimize waitFlush=false waitSearcher=false/ the HTTP response sometimes takes more than 60s and our client times out after that. Whole operation takes 200+ seconds. Isn't waitFlush=false and waitSearcher=false supposed to tell Solr to

Re: Highlighting a field with a certain value

2010-06-04 Thread Koji Sekiguchi
(10/05/25 0:31), n...@frameweld.com wrote: Hello, How am I able to highlight a field that contains a specific value? If I have a field called type, how am I able to highlight the rows whose values contain something like title? http://localhost:8983/solr/select?q=titlehl=onhl.fl=type

Re: Error loading class 'solr.HTMLStripStandardTokenizerFactory'

2010-06-03 Thread Koji Sekiguchi
(10/06/03 13:46), Terance Dias wrote: Hi, I'm trying to use the field collapsing feature. For that I need to take a checkout of the trunk and apply the patch available at https://issues.apache.org/jira/browse/SOLR-236 When I take a checkout and run the example-DIH, I get following error in

Re: Reading newSearcher and firstSearcher Queries from Separate File

2010-06-03 Thread Koji Sekiguchi
(10/06/04 0:16), Kris Jack wrote: Hi everyone, I am adding queries to the newSearcher and firstSearcher listeners for warming solr up. Unfortunately, I have hundreds of queries and don't particularly want my solrconfig.xml file to become so big. Is there a way that I can point them to another

Re: NPE error when extending DefaultSolrHighlighter

2010-05-31 Thread Koji Sekiguchi
(10/06/01 6:45), Gerald wrote: I was looking at solr-386 and thought I would try to create a custom highlighter for something I was doing. I created a class that looks something like this: public class CustomOutputHighlighter extends DefaultSolrHighlighter { @Override

Re: Help with PatternReplaceFilterFactory

2010-05-27 Thread Koji Sekiguchi
(10/05/27 16:11), kirsty wrote: Hi, I have a field that is a text field eg: R500,000-550,000 Per Annum, R350,000-550,000 Per Annum Cost To Company etc. I would like to facet on the salary range. I have created a new field type fieldType name=remuneration class=solr.TextField sortMissingLast=true

Re: Help with PatternReplaceFilterFactory

2010-05-27 Thread Koji Sekiguchi
Yes you are right, I get that type of result. I guess my wording was wrong. My field looks like this in the index: str name=RemunerationR500,000-550,000 Per Annum/str str name=Remuneration_stripR500,000-550,000 Per Annum/str How would I search for say salaries in the range of 500,000 -

Re: highlighting broken for multivalued text fields?

2010-05-27 Thread Koji Sekiguchi
(10/05/28 8:16), Darren Govoni wrote: Hi, I want to verify a bug if someone can help. I have a text field: field name=text_t type=textindexed=true stored=true multiValued=true termVectors=true termPositions=true termOffsets=true/ I use to store text that I highlight on. If the

Re: Please Help, how to Xinclude in schema.xml

2010-05-09 Thread Koji Sekiguchi
(10/05/06 4:31), Turner, Robbin J wrote: I followed through some of the previous post, there seems to be a general problem with trying to us XInclude in the solr schema.xml. I use several variation to include my fieldType declarations. I keep getting an error file not found. I put the

<    1   2   3   4   5   6   >