Re: Solr is indexing but not showing results

2012-04-09 Thread Jeevanandam Madanagopal
Srini -

This text field type ships as a sample configuration in the Solr distribution. 
Check it; it may suit your needs.

<fieldType name="text" class="solr.TextField" positionIncrementGap="100"
           autoGeneratePhraseQueries="true">
  <analyzer type="index">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.StopFilterFactory"
            ignoreCase="true"
            words="stopwords.txt"
            enablePositionIncrements="true"
            />
    <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
            generateNumberParts="1" catenateWords="1" catenateNumbers="1"
            catenateAll="0" splitOnCaseChange="1"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
    <filter class="solr.PorterStemFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
            ignoreCase="true" expand="true"/>
    <filter class="solr.StopFilterFactory"
            ignoreCase="true"
            words="stopwords.txt"
            enablePositionIncrements="true"
            />
    <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
            generateNumberParts="1" catenateWords="0" catenateNumbers="0"
            catenateAll="0" splitOnCaseChange="1"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
    <filter class="solr.PorterStemFilterFactory"/>
  </analyzer>
</fieldType>
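
Once a field type with this name is declared, a field can reference it. A minimal sketch, assuming the Solr 3.x schema.xml layout (types declared under the types element, fields under the fields element); the field name and attributes are taken from srini's message quoted below, with only the type changed:

```xml
<!-- schema.xml (Solr 3.x layout assumed). The "text" type must be declared
     under <types>, otherwise Solr reports: Unknown fieldtype 'text'. -->
<fields>
  <field name="XML" type="text" indexed="true" stored="true" required="true"/>
</fields>
```

Changing the type of an existing field normally means the existing documents have to be re-indexed before searches reflect the new analysis.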


-Jeevanandam
 
On Apr 10, 2012, at 12:08 AM, Walter Underwood wrote:

 You will need to define or customize a field type for text. 
 
 The example schema.xml file that is installed with Solr 3.5 has several 
 kinds of text fields; text_general and text_en are good places to start. 
 You can use one of those, then customize it.
 
 wunder
 
 On Apr 9, 2012, at 11:27 AM, srini wrote:
 
 Hi Thanks for your reply. As per your suggestion I changed XML field type to
 text. 
 
 <field name="XML" type="string" indexed="true" stored="true"
 required="true"/>
 
 but when I start solr it is throwing following exception.
 SEVERE: org.apache.solr.common.SolrException: Unknown fieldtype 'text'
 specified on field XML
 
 Any suggestions!!(Thanks for your reply)
 
 --
 View this message in context: 
 http://lucene.472066.n3.nabble.com/Solr-is-indexing-but-not-showing-results-tp3897176p3897626.html
 Sent from the Solr - User mailing list archive at Nabble.com.
 
 
 
 
 



Re: Solr is indexing but not showing results

2012-04-09 Thread Jeevanandam Madanagopal
I partially agree; it depends. For instance, some synonym mappings may not get 
expanded at index time (e.g. when the index is populated frequently from 
different sources). So it is good to apply synonyms at index time as well as 
query time to get complete coverage. Most of the time I have used similar 
settings to meet customer requirements.

For example, below is a sample text field type with synonyms at both index and 
query time. (The two analyzers have a similar tokenizer and filter structure, 
so a single common analyzer config could be used too.) 

<fieldType name="text" class="solr.TextField" positionIncrementGap="100"
           autoGeneratePhraseQueries="true">
  <analyzer type="index">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
            ignoreCase="true" expand="true"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true"
            words="stopwords.txt" enablePositionIncrements="true" />
    <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
            generateNumberParts="1" catenateWords="1" catenateNumbers="1"
            catenateAll="0" splitOnCaseChange="1"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
    <filter class="solr.PorterStemFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
            ignoreCase="true" expand="true"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true"
            words="stopwords.txt" enablePositionIncrements="true" />
    <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
            generateNumberParts="1" catenateWords="0" catenateNumbers="0"
            catenateAll="0" splitOnCaseChange="1"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
    <filter class="solr.PorterStemFilterFactory"/>
  </analyzer>
</fieldType>

-Jeevanandam


On Apr 10, 2012, at 12:18 AM, Walter Underwood wrote:

 That is not a good configuration. Synonyms should be expanded at index time, 
 not query time. --wunder
 



Re: Multi-words synonyms matching

2012-04-11 Thread Jeevanandam Madanagopal
Elisabeth -

As you described, the mapping below might suit your need:

mairie => hotel de ville, mairie

mairie gets expanded to hotel de ville and mairie at index time, so a document 
containing mairie becomes searchable by both mairie and hotel de ville.

However, the whitespace tokenizer splitting the query at query time will still 
be a problem, as described by Markus.

--Jeevanandam

On Apr 11, 2012, at 12:30 PM, elisabeth benoit wrote:

 Have you tried the '=>' mapping instead? Something
 like
 hotel de ville => mairie
 might work for you.
 
 Yes, thanks, I've tried it, but from what I understand it doesn't solve my
 problem, since this means hotel de ville will be replaced by mairie at
 index time (I use synonyms only at index time). So when a user asks for
 hôtel de ville, it won't match.
 
 In fact, at index time I have mairie in my data, but I want user to be able
 to request mairie or hôtel de ville and have mairie as answer, and not
 have mairie as an answer when requesting hôtel.
 
 
 To map `mairie` to `hotel de ville` as single token you must escape your
 white
 space.
 
 mairie, hotel\ de\ ville
 
 This results in  a problem if your tokenizer splits on white space at
 query
 time.
 
 Ok, I guess this means I have a problem. No simple solution, since at query
 time my tokenizer does split on white space.
 
 I guess my problem is more or less one of the problems discussed in
 
 http://lucene.472066.n3.nabble.com/Multi-word-synonyms-td3716292.html#a3717215
 
 
 Thanks a lot for your answers,
 Elisabeth
 
 
 
 
 
 2012/4/10 Erick Erickson erickerick...@gmail.com
 
 Have you tried the '=>' mapping instead? Something
 like
 hotel de ville => mairie
 might work for you.
 
 Best
 Erick
 
 On Tue, Apr 10, 2012 at 1:41 AM, elisabeth benoit
 elisaelisael...@gmail.com wrote:
 Hello,
 
 I've read several post on this issue, but can't find a real solution to
 my
 multi-words synonyms matching problem.
 
 I have in my synonyms.txt an entry like
 
 mairie, hotel de ville
 
 and my index time analyzer is configured as follows for synonyms.
 
 <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
 ignoreCase="true" expand="true"/>
 
 The problem I have is that now mairie matches with hotel and I would
 only want mairie to match with hotel de ville and mairie.
 
 When I look into the analyzer, I see that mairie is mapped into
 hotel,
 and words de ville are added in second and third position. To change
 that, I tried to do
 
 <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
 ignoreCase="true" expand="true"
 tokenizerFactory="solr.KeywordTokenizerFactory"/> (as I read in one post)
 
 and I can see now in the analyzer that mairie is mapped to hotel de
 ville, but now when I have query hotel de ville, it doesn't match at
 all
 with mairie.
 
 Anyone has a clue of what I'm doing wrong?
 
 I'm using Solr 3.4.
 
 Thanks,
 Elisabeth
 



Re: Can you suggest a method or pattern to consistently promote a document with any query?

2012-04-18 Thread Jeevanandam Madanagopal
Chris -

Take a look - QueryElevationComponent

http://wiki.apache.org/solr/QueryElevationComponent

-Jeevanandam

On Apr 18, 2012, at 10:46 PM, Chris Warner wrote:

 Hi, folks,
 
 Perhaps I'm overlooking an obvious solution to a common desire... I'd like to 
 return a specific document with every query, as the first result. As well, 
 I'd like to have that document be the first result in a *:* query.
 
 I'm looking into index time boosting using the boost attribute on the 
 appropriate doc. I haven't tested this yet, and I'm not sure this would do 
 anything for the *:* queries.
 
 Thanks for any suggested reading or patterns...
 
 Best,
 Chris
 
  
 --
 chris_war...@yahoo.com



Re: Can you suggest a method or pattern to consistently promote a document with any query?

2012-04-18 Thread Jeevanandam Madanagopal
Chris -

If you have defined 'last-components' in the search handler, forceElevation=true 
may not be required. The elevator component gets invoked in the search life 
cycle:

<arr name="last-components">
  <str>elevator</str>
</arr>
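
For context, here is a sketch of how the pieces usually fit together, modeled on the Solr 3.x example configuration. The handler name /elevate, the query text "some query" and the document id DOC_ID are placeholders, not values from this thread, and the sketch does not establish whether a *:* query can be elevated this way.

```xml
<!-- solrconfig.xml: declare the component and attach it to a handler. -->
<searchComponent name="elevator" class="solr.QueryElevationComponent">
  <!-- field type used to analyze the incoming query text before lookup -->
  <str name="queryFieldType">string</str>
  <str name="config-file">elevate.xml</str>
</searchComponent>

<requestHandler name="/elevate" class="solr.SearchHandler">
  <arr name="last-components">
    <str>elevator</str>
  </arr>
</requestHandler>

<!-- elevate.xml: one <query> entry per query text to elevate for. -->
<elevate>
  <query text="some query">
    <doc id="DOC_ID"/>
  </query>
</elevate>
```

Each query entry applies only to its own query text, so promoting one document for every possible query would need something beyond a single entry like this.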

-Jeevanandam


On Apr 18, 2012, at 11:37 PM, Chris Warner wrote:

 Thanks to those who responded. A more thorough reading of the wiki and I see 
 the need for forceElevation=true in the elevate query.
 
 Cheers,
 Chris
 
 
 - Original Message -
 From: Otis Gospodnetic otis_gospodne...@yahoo.com
 To: solr-user@lucene.apache.org solr-user@lucene.apache.org; Chris Warner 
 chris_war...@yahoo.com
 Cc: 
 Sent: Wednesday, April 18, 2012 10:23 AM
 Subject: Re: Can you suggest a method or pattern to consistently promote a 
 document with any query?
 
 Chris,
 
 I haven't checked if Elevate Component has an easy way to push a specific doc 
 for *all* queries, but have a look 
 http://wiki.apache.org/solr/QueryElevationComponent
 
 Otis 
 
 Performance Monitoring SaaS for Solr - 
 http://sematext.com/spm/solr-performance-monitoring/index.html
 
 
 



Re: Wrong categorization with DIH

2012-04-19 Thread Jeevanandam Madanagopal
Ramo -

Please share DIH configuration with us.

-Jeevanandam

On Apr 19, 2012, at 7:46 PM, Ramo Karahasan wrote:

 Does anyone have an idea what's going wrong here?
 
 Thanks,
 Ramo
 
 -Original Message-
 From: Gora Mohanty [mailto:g...@mimirtech.com] 
 Sent: Tuesday, 17 April 2012 11:34
 To: solr-user@lucene.apache.org
 Subject: Re: Wrong categorization with DIH
 
 On 17 April 2012 14:47, Ramo Karahasan ramo.karaha...@googlemail.com
 wrote:
 Hi,
 
 
 
 I currently face the following issue:
 
 Testing the following SQL statement, which is also used in Solr (DIH), 
 leads to a wrong categorization in Solr:
 
 select p.title as title, p.id, p.category_id, p.pic_thumb, c.name as 
 category, c.id as category_id from product p, category c WHERE 
 p.category_id = c.id AND p.id = 3091328
 
 
 
 This returns in my sql client:
 
 Apple MacBook Pro MD313D/A 33,8 cm (13,3 Zoll) Notebook (Intel Core 
 i5-2435M, 2,4GHz, 4GB RAM, 500GB HDD, Intel HD 3000, Mac OS), 3091328, 
 1003, http://m-d.ww.cdn.com/images/I/41teWbp-uAL._SL75_.jpg, Computer, 
 1003
 
 
 
 As you see, the categoryid 1003 points to Computer
 
 
 
 Via the Solr search admin I get the following result when searching for
 id:3091328
 
 <str name="category">Sport</str>
 
 <int name="category_id">1003</int>
 [...]
 
 Please share with us the rest of the DIH configuration file, i.e., the part
 where these data are saved to the Solr index.
 
 Regards,
 Gora
 



Re: PolySearcher in Solr

2012-04-19 Thread Jeevanandam Madanagopal
Please have a look 

http://wiki.apache.org/solr/DistributedSearch

-Jeevanandam

On Apr 19, 2012, at 9:14 PM, Ramprakash Ramamoorthy wrote:

 Dear all,
 
 
 I came across this while browsing through lucy
 
 http://lucy.apache.org/docs/perl/Lucy/Search/PolySearcher.html
 
 Does solr have an equivalent of this? My usecase is exactly the same
 (reading through multiple indices in a single shard and perform a
 distribution across shards).
 
 If not can someone give me a hint? I tried swapping readers for a single
 searcher, but didn't help.
 
 -- 
 With Thanks and Regards,
 Ramprakash Ramamoorthy,
 Project Trainee,
 Zoho Corporation.
 +91 9626975420



Re: Wrong categorization with DIH

2012-04-19 Thread Jeevanandam Madanagopal
Ramo -

Are you using all the selected columns from the query?

select p.title as title, p.id, p.category_id,
p.pic_thumb, c.name as category, c.id as category_id from product p,
category c ...

I see that the columns 'p.id', 'p.category_id' and 'p.pic_thumb' have no alias 
defined.

Pointers:

- Select only the required fields in the SQL query
- Ensure that each SQL alias matches the corresponding field name in schema.xml
  or
- Map every column explicitly in the DIH config, as follows:
  <field column="SQL-COLUMN-NAME-HERE" name="SOLR-SCHEMA-ATTRIBUTE-NAME-HERE" />

For detailed info, refer to: http://wiki.apache.org/solr/DataImportHandler
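
The pointers above could be sketched as the following entity. This is only an illustration: it assumes the Solr schema has fields named id, title, pic_thumb, category_id and category (the schema was not posted), and the entity name "product" is made up. Every selected column gets an alias, category_id is selected once, and each column is mapped explicitly.

```xml
<!-- DIH data-config.xml fragment; goes inside <document>. -->
<entity name="product"
        query="select p.id as id, p.title as title, p.pic_thumb as pic_thumb,
                      c.id as category_id, c.name as category
               from product p, category c
               where p.category_id = c.id">
  <!-- column = SQL alias, name = field name in schema.xml (assumed names) -->
  <field column="id"          name="id"/>
  <field column="title"       name="title"/>
  <field column="pic_thumb"   name="pic_thumb"/>
  <field column="category_id" name="category_id"/>
  <field column="category"    name="category"/>
</entity>
```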

-Jeevanandam


On Apr 19, 2012, at 9:37 PM, Ramo Karahasan wrote:

 Hi,
 
 my config is just the following:
 
 <dataConfig>
   <dataSource type="JdbcDataSource"
     driver="com.mysql.jdbc.Driver"
     url="jdbc:mysql://xx/asdx"
     user=""
     password=""/>
   <document>
     <entity name="id"
       query="select p.title as title, p.id, p.category_id,
 p.pic_thumb, c.name as category, c.id as category_id from product p,
 category c WHERE p.category_id = c.id AND  '${dataimporter.request.clean}'
 != 'false' OR updated_at &gt; '${dataimporter.last_index_time}'" >
     </entity>
   </document>
 </dataConfig>
 
 I'm doing it as described on:
 
 http://wiki.apache.org/solr/DataImportHandlerDeltaQueryViaFullImport
 
 Any ideas?
 
 Best regards,
 Ramo
 



Re: Auto suggest on indexed file content filtered based on user

2012-04-24 Thread Jeevanandam Madanagopal
On Apr 24, 2012, at 9:37 PM, prakash_ajp wrote:

 Right now, the query is a very simple one, something like q=text. Basically,
 it would return ['textview', 'textviewer', ..]
   hmm, so you're using default query field

 
 But the issue is, the 'textviewer' could be from a file that is out of
 bounds for this user. So, ultimately I would like to include the userName in
 the query. As mentioned earlier, userName is another field in the main
 index.
   and you like to filter the result set along with userName field value
 
 --
 View this message in context: 
 http://lucene.472066.n3.nabble.com/Auto-suggest-on-indexed-file-content-filtered-based-on-user-tp3934565p3935765.html
 Sent from the Solr - User mailing list archive at Nabble.com.

In this scenario, the 'fq' parameter will help you achieve the desired result.
Please refer to http://wiki.apache.org/solr/CommonQueryParameters#fq

Try this:   q=text&fq=userName:prakash

Let us know!

-Jeevanandam



Re: Auto suggest on indexed file content filtered based on user

2012-04-24 Thread Jeevanandam Madanagopal
Yes, only the field that the spellcheck index is built from is available to 
suggest queries. I believe we are discussing two separate things here: filtering 
documents in the search handler using the fq parameter, and spell suggestions.

Let's say you have a field for spellcheck, used to build the spell dictionary:

<field name="spell" type="textSpell" …. …  />

Use copyField to populate the spell field so that the dictionary gets created.

Then reference the spellcheck component in the default search handler, in the 
'last-components' section, like below:
 <arr name="last-components">
   <str>spellcheck</str>
 </arr>

You will then be able to apply both document filtering and spellcheck params 
to the search handler while querying. 

Detailed info: http://wiki.apache.org/solr/SpellCheckComponent [you have 
probably already gone through it :) ]
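
Put together, the steps above might look roughly like this. It is a sketch, not a drop-in config: the copyField source "text", the handler name /select and the index directory are assumptions, and the textSpell field type has to exist in your schema.

```xml
<!-- schema.xml: the spell field and the copyField that feeds it. -->
<field name="spell" type="textSpell" indexed="true" stored="false" multiValued="true"/>
<copyField source="text" dest="spell"/>

<!-- solrconfig.xml: the spellcheck component builds its dictionary from "spell". -->
<searchComponent name="spellcheck" class="solr.SpellCheckComponent">
  <str name="queryAnalyzerFieldType">textSpell</str>
  <lst name="spellchecker">
    <str name="name">default</str>
    <str name="field">spell</str>
    <str name="spellcheckIndexDir">./spellchecker</str>
    <str name="buildOnCommit">true</str>
  </lst>
</searchComponent>

<!-- solrconfig.xml: run the component after the normal query components. -->
<requestHandler name="/select" class="solr.SearchHandler">
  <arr name="last-components">
    <str>spellcheck</str>
  </arr>
</requestHandler>
```

With this in place, fq narrows the documents returned by the search itself, while the suggestions still come from the whole spell dictionary, which is the limitation prakash describes.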

-Jeevanandam


On Apr 25, 2012, at 12:01 AM, prakash_ajp wrote:

 I read on a couple of other web pages that fq is not supported for suggester.
 I even tried the query and it doesn't help. My understanding was, when the
 suggest (spellcheck) index is built, only the field chosen is considered for
 queries and the other fields from the main index are not available for
 filtering purposes once the index is created.
 
 --
 View this message in context: 
 http://lucene.472066.n3.nabble.com/Auto-suggest-on-indexed-file-content-filtered-based-on-user-tp3934565p3936144.html
 Sent from the Solr - User mailing list archive at Nabble.com.



Re: should slave replication be turned off / on during master clean and re-index?

2012-04-27 Thread Jeevanandam Madanagopal
I guess you're looking for how to disable replication polling on the slave.

Go to the 'Replication dashboard' [1]; there you have options like 
Enable/Disable Poll, Force replication and Abort replication.
Dashboard URL: http://slave_host:port/solr/corename/admin/replication/index.jsp

Poll Disabled = the slave will not poll the master for replication
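
For reference, polling is driven by the slave section of the replication handler in the slave's solrconfig.xml. A minimal sketch, where master_host, port, corename and the poll interval are placeholders:

```xml
<!-- solrconfig.xml on the slave -->
<requestHandler name="/replication" class="solr.ReplicationHandler">
  <lst name="slave">
    <!-- URL of the master's replication handler -->
    <str name="masterUrl">http://master_host:port/solr/corename/replication</str>
    <!-- how often the slave polls the master, as HH:mm:ss -->
    <str name="pollInterval">00:00:60</str>
  </lst>
</requestHandler>
```

The same toggle as the dashboard button is available over HTTP on the slave, e.g. http://slave_host:port/solr/corename/replication?command=disablepoll before the re-index and command=enablepoll afterwards.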

- Jeevanandam
[1] http://wiki.apache.org/solr/SolrReplication#Replication_Dashboard

On Apr 28, 2012, at 8:03 AM, geeky2 wrote:

 hello,
 
 thank you for the reply,
 
 
 Does a clean mean issuing a deletion query (e.g.
 <delete><id>*:*</id></delete>) prior to re-indexing all of your content?  I
 don't think the slaves will download any changes until you've committed at
 some point on the master.  
 
 
 well, in this case when i say, clean  (on the Master), i mean selecting
 the Full Import with Cleaning button from the DataImportHandler
 Development Console page in solr.  at the top of the page, i have the check
 boxes selected for verbose and clean (*but i don't have the commit checkbox
 selected*).
 
 by doing the above process - doesn't this issue a deletion query - then
 start the import?
 
 and as a follow-up - when actually is the commit being done?
 
 
 here is the relevant part of my solrconfig.xml file on the master
 
   <updateHandler class="solr.DirectUpdateHandler2">
     <autoCommit>
       <maxTime>6</maxTime>
       <maxDocs>1000</maxDocs>
     </autoCommit>
     <maxPendingDeletes>10</maxPendingDeletes>
   </updateHandler>
 
 
 
 
 
 
 --
 View this message in context: 
 http://lucene.472066.n3.nabble.com/should-slave-replication-be-turned-off-on-during-master-clean-and-re-index-tp3945531p3945954.html
 Sent from the Solr - User mailing list archive at Nabble.com.



Re: change index/store at indexing time

2012-04-27 Thread Jeevanandam Madanagopal
As Bill mentioned, you can use a script (the DIH ScriptTransformer). 
Please have a look: 
http://wiki.apache.org/solr/DataImportHandler#ScriptTransformer
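
A rough sketch of what such a script could look like, routing each value into one of the three fields Maria mentions. The column names geoid and geo_mode, the table name and the routing rule are hypothetical stand-ins for the real condition, and the dataSource element is omitted.

```xml
<dataConfig>
  <script><![CDATA[
    // Called once per row; 'row' is a java.util.Map of column name to value.
    // 'geoid' and 'geo_mode' are hypothetical column names.
    function routeGeo(row) {
      var geoId = row.get('geoid');
      var mode = String(row.get('geo_mode'));
      if (mode == 'store_index') {
        row.put('geoids_store_index', geoId);
      } else if (mode == 'nostore_index') {
        row.put('geoids_nostore_index', geoId);
      } else {
        row.put('geoids_store_noindex', geoId);
      }
      return row;
    }
  ]]></script>
  <document>
    <entity name="geo" transformer="script:routeGeo"
            query="select geoid, geo_mode from geo_table">
    </entity>
  </document>
</dataConfig>
```

The script only decides which field a value lands in; whether that field is stored or indexed is still fixed by its definition in schema.xml.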

- Jeevanandam

On Apr 28, 2012, at 7:07 AM, Bill Bell wrote:

 Yes you can. Just use a script that is called for each row.
 
 Bill Bell
 Sent from mobile
 
 
 On Apr 27, 2012, at 6:38 PM, Vazquez, Maria (STM) 
 maria.vazq...@dexone.com wrote:
 
 Hi,
 I'm migrating a project from Lucene 2.9 to Solr 3.4.
 There is a special case in the code that indexes the same field in two 
 different ways, which is completely legal in Lucene directly but I don't 
 know how to duplicate this same behavior in Solr:
 
 if (isFirstGeo) {
document.add(new Field(geoids, geoId, Field.Store.YES, 
 Field.Index.NOT_ANALYZED_NO_NORMS));
isFirstGeo = false;
 } else {
if (countProducts  100)
 document.add(new Field(geoids, geoId, Field.Store.NO, 
 Field.Index.NOT_ANALYZED_NO_NORMS));
else
 document.add(new Field(geoids, geoId, Field.Store.YES, 
 Field.Index.NO));
 }
 
 Is there any way to do this in Solr in a Tranformer? I'm using the DIH to 
 index and I can't see a way to do this other than having three fields in the 
 schema like geoids_store_index, geoids_nostore_index, and 
 geoids_store_noindex.
 
 Thanks a lot in advance.
 Maria
 
 
 



Re: change index/store at indexing time

2012-04-29 Thread Jeevanandam Madanagopal
Maria -

Thanks for the detailed explanation. 
In schema.xml, whether a field is stored or indexed has to be defined at schema 
design time; as far as I understand, deciding it at runtime is not feasible. 
BTW, you can set the multiValued="true" attribute on dynamic fields too.
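
As an illustration, the three variants could be declared once as dynamic field patterns, so any field name ending in one of the suffixes picks up the matching stored/indexed combination. The suffixes follow the field names Maria proposed; the string type and omitNorms setting are assumptions meant to mirror NOT_ANALYZED_NO_NORMS.

```xml
<!-- schema.xml, inside <fields> -->
<dynamicField name="*_store_index"   type="string" indexed="true"  stored="true"
              multiValued="true" omitNorms="true"/>
<dynamicField name="*_nostore_index" type="string" indexed="true"  stored="false"
              multiValued="true" omitNorms="true"/>
<dynamicField name="*_store_noindex" type="string" indexed="false" stored="true"
              multiValued="true"/>
```

This still results in three differently named fields (geoids_store_index and so on) rather than one field with per-value options.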

- Jeevanandam

On Apr 29, 2012, at 1:06 AM, Vazquez, Maria (STM) wrote:

 Thanks Jeevanandam.
 That still doesn't have the same behavior as Lucene since multiple fields 
 with different names have to be created.
 What I want is this exactly (multi-value field)
 
 document.add(new Field(geoids, geoId, Field.Store.YES, 
 Field.Index.NOT_ANALYZED_NO_NORMS));
 
 document.add(new Field(geoids, geoId, Field.Store.NO, 
 Field.Index.NOT_ANALYZED_NO_NORMS));
 
 In Lucene I can save geoids first as stored and in the next line as not 
 stored and it will do exactly that. I want to duplicate this behavior in Solr, 
 but I can't do it with only one field in the schema called geoids for which I 
 can decide at index time whether to store it or not, depending on a condition.
 
 Thanks again for the help, hope this explanation makes it more clear in what 
 I'm trying to do.
 
 Maria
 
 On Apr 28, 2012, at 11:49 AM, Jeevanandam je...@myjeeva.com wrote:
 
 Maria,
 
 For your need please define unique pattern using dynamic field in schema.xml
 
 Please have a look http://wiki.apache.org/solr/SchemaXml#Dynamic_fields
 
 Hope that helps!
 
 -Jeevanandam
 
 Technology keeps you connected!
 
 On Apr 28, 2012, at 10:33 PM, Vazquez, Maria (STM) 
 maria.vazq...@dexone.com wrote:
 
 I can call a script for the logic part but what I want to figure out is how 
 to save the same field sometimes as stored and indexed, sometimes as stored 
 not indexed, etc. From a transformer or a script I didn't see anything where 
 I can modify that at indexing time.
 Thanks a lot,
 Maria
 
 



Re: zkcli issues

2012-11-11 Thread Jeevanandam Madanagopal
Nick -

I believe you're having difficulties with the SolrCloud CLI commands for 
interacting with ZooKeeper.
Please have a look at the links below; they should point you in the right 
direction.
Handy SolrCloud ZkCLI Commands
Uploading Solr Configuration into ZooKeeper ensemble

Cheers,
Jeeva

On Nov 12, 2012, at 4:45 AM, Mark Miller markrmil...@gmail.com wrote:

 On 11/11/2012 04:47 PM, Yonik Seeley wrote:
 On Sun, Nov 11, 2012 at 10:39 PM, Nick Chase nch...@earthlink.net wrote:
 So I'm trying to use ZkCLI without success.  I DID start and stop Solr in
 non-cloud mode, so everything is extracted and it IS finding zookeeper*.jar.
 However, now it's NOT finding SolrJ.
 Not sure about your specific problem in this case, but I chatted with
 Mark about this while at ApacheCon... it seems like we should be able
 to explode the WAR ourselves if necessary, eliminating the need to
 start Solr first.  Just throwing it out there before I forgot about it
 ;-)
 
 -Yonik
 http://lucidworks.com
 
 I guess the tricky part might be knowing where to extract it. We know how to 
 do it for the default jetty setup, but that could be reconfigured or you 
 could be using another web container.
 
 Kind of annoying.
 
 - Mark
 



Re: zkcli issues

2012-11-11 Thread Jeevanandam Madanagopal
Nick - Sorry, the embedded links were not shown in my previous email. Here they 
are:

 Handy SolrCloud ZkCLI Commands 
 (http://www.myjeeva.com/2012/10/solrcloud-cluster-single-collection-deployment/#handy-solrcloud-cli-commands)

 Uploading Solr Configuration into ZooKeeper ensemble 
 (http://www.myjeeva.com/2012/10/solrcloud-cluster-single-collection-deployment/#uploading-solrconfig-to-zookeeper)


Cheers,
Jeeva


 



Re: Multivalued or not

2012-11-14 Thread Jeevanandam Madanagopal
Hello Peter -

In Solr 3.6 multiValued is false by default.

From schema version 1.1 onwards, the multiValued attribute is false by 
default (field …, dynamicField …, fieldType ...)

-Jeeva
Blog: http://www.myjeeva.com 


On Nov 14, 2012, at 2:04 PM, Peter Kirk p...@alpha-solutions.dk wrote:

 Hi
 
 In Solr 3.6, is multivalued for fields, default true or false?
 
 It appears that it is default false for normal fields, and default true for 
 dynamic fields - is that correct?
 
 Thanks,
 Peter
 
 
 



Re: Multivalued or not

2012-11-14 Thread Jeevanandam Madanagopal
Okay, I believe you're using Solr 3.6; there you can use schema version 1.5.

However, you're currently using version 1.0. It is safer to update your schema 
version to 1.1; then multiValued is false by default.

FYI. Schema version info (from schema.xml):
--
1.0: multiValued attribute did not exist, all fields are multiValued by nature
1.1: multiValued attribute introduced, false by default 
1.2: omitTermFreqAndPositions attribute introduced, true by default except for 
text fields
1.3: removed optional field compress feature
1.4: autoGeneratePhraseQueries attribute introduced to drive QueryParser 
behavior when a single string produces multiple tokens.  Defaults to off for 
version >= 1.4
1.5: omitNorms defaults to true for primitive field types (int, float, boolean, 
string...)
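
In schema.xml terms, the change would be a one-attribute edit on the root element. A sketch using the schema name and dynamic field from Peter's message, with everything else elided:

```xml
<!-- schema.xml: only the version attribute changes (1.0 to 1.1). -->
<schema name="naturereserve" version="1.1">
  <!-- ... types ... -->
  <fields>
    <!-- With version 1.1 or later this field is single-valued unless
         multiValued="true" is set explicitly (note the capital V). -->
    <dynamicField name="*_string" type="string" indexed="true" stored="true"/>
  </fields>
</schema>
```

Going beyond 1.1 also changes the other defaults listed above, so it is worth checking each of them before jumping straight to 1.5.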

- Jeeva
Blog: http://www.myjeeva.com


On Nov 14, 2012, at 2:54 PM, Peter Kirk p...@alpha-solutions.dk wrote:

 Hi, it says version 1.0
 
 <schema name="naturereserve" version="1.0">
 
 /Peter
 
 
 -Original Message-
 From: Erik Hatcher [mailto:erik.hatc...@gmail.com] 
 Sent: 14. november 2012 10:22
 To: solr-user@lucene.apache.org
 Subject: Re: Multivalued or not
 
 But what is your schema version?  See the top of schema.xml. 
 
 On Nov 14, 2012, at 4:17, Peter Kirk p...@alpha-solutions.dk wrote:
 
 Hi
 
 Thanks for the reply. It is strange, because when I index to a field defined 
 like:
 
   dynamicField indexed=true
 name=*_string
 stored=true
 type=string /
 
 Then the results I receive are like:
 
 <arr name="category_string">
 <str>Woodland</str>
 </arr>
 
 Which seems to indicate a multivalued field.
 
 If I change the field definition, so I explicitly say multivalued is false:
 
   <dynamicField indexed="true"
 name="*_string"
 stored="true"
 type="string" 
 multivalued="false" />
 
 Then the result is like:
 
 <str name="category_string">Woodland</str>
 
 
 /Peter
 
 
 
 -Original Message-
 From: Jeevanandam Madanagopal [mailto:je...@myjeeva.com] 
 Sent: 14. november 2012 10:11
 To: solr-user@lucene.apache.org
 Subject: Re: Multivalued or not
 
 Hello Peter -
 
 In Solr 3.6 multiValued is false by default.
 
 From schema version 1.1 onwards, the multiValued attribute is false by 
 default (field ..., dynamicField ..., fieldType ...).
 
 -Jeeva
 Blog: http://www.myjeeva.com 
 
 
 On Nov 14, 2012, at 2:04 PM, Peter Kirk p...@alpha-solutions.dk wrote:
 
 Hi
 
 In Solr 3.6, is multivalued for fields, default true or false?
 
 It appears that it is default false for normal fields, and default true for 
 dynamic fields - is that correct?
 
 Thanks,
 Peter
 
 
 
 
 
 
 
 



Re: zkcli issues

2012-11-16 Thread Jeevanandam Madanagopal
 guess 
 would be that the war is not exploded yet. You have to manually unzip or 
 start and stop solr.
 
 This logic could probably be added to those scripts (unzip it), but it's 
 annoying to have any logic in them since everything has to be mirrored in bat 
 and sh.
 
 - Mark
 
 On Nov 16, 2012, at 11:18 AM, Nick Chase nch...@earthlink.net wrote:
 
 I agree that it *shouldn't* be OS specific. :)  Anyway, thanks for the 
 suggestion, but that's not it.  I get the same error with the script right 
 out of the box:
 
 Error: Could not find or load main class 
 C:\sw\apache-solr-4.0.0\example\cloud-scripts\..\solr-webapp\webapp\WEB-INF\lib\apache-solr-solrj-4.0.0.jar
 
 And anyway, it's a weird error, referencing a jar as a class, isn't it?  
 Start up a JIRA?
 
 -  Nick
 
 On 11/16/2012 10:42 AM, Mark Miller wrote:
 I *think* I tested the script on windows once way back.
 
 Anyway, the code itself should not be OS specific.
 
 One thing you might want to check if you are copying unix cmd line
 stuff - I think windows separates classpath entries with ; rather than
 : - so you will likely need to change that. You'd think java could have
 been smart enough to accept either/or at worst, but meh.
 
 For example:
 .:/Users/jeeva/dc-1/solr-cli-lib/*
 should be
 .;/Users/jeeva/dc-1/solr-cli-lib/*
 
 - Mark
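
When a classpath is built from Java code rather than typed into a script, the separator does not have to be hard-coded. A minimal sketch, assuming the entries from the example above; File.pathSeparator is part of the JDK, while the class and method names are made up for illustration.

```java
import java.io.File;

// Sketch only: builds a classpath string that is valid on the current OS.
final class ClasspathJoin {

    // File.pathSeparator is ";" on Windows and ":" on Unix-like systems.
    static String join(String... entries) {
        return String.join(File.pathSeparator, entries);
    }

    public static void main(String[] args) {
        // Entries taken from the example above; adjust them for your machine.
        System.out.println(join(".", "/Users/jeeva/dc-1/solr-cli-lib/*"));
    }
}
```

In a .bat or .sh script the separator still has to be written out by hand, which is why the two script variants differ.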
 
 On Thu, Nov 15, 2012 at 8:53 PM, Nick Chase nch...@earthlink.net wrote:
 Unfortunately, this doesn't seem to solve the issue; now I'm beginning to
 wonder if maybe it's because I'm on Windows.  Has anyone successfully run
 ZkCLI on Windows?
 
   Nick
 
 
 On 11/12/2012 2:27 AM, Jeevanandam Madanagopal wrote:
 
 Nick - Sorry, the embedded links were not shown in my previous email. I'm
 listing them below.
 
 Handy SolrCloud ZkCLI Commands
 (http://www.myjeeva.com/2012/10/solrcloud-cluster-single-collection-deployment/#handy-solrcloud-cli-commands)
 
 
 Uploading Solr Configuration into ZooKeeper ensemble
 (http://www.myjeeva.com/2012/10/solrcloud-cluster-single-collection-deployment/#uploading-solrconfig-to-zookeeper)
 
 
 
 Cheers,
 Jeeva
 
 
 On Nov 12, 2012, at 12:48 PM, Jeevanandam Madanagopal je...@myjeeva.com
 wrote:
 
 Nick -
 
 I believe you're having difficulties with the SolrCloud CLI commands
 for interacting with ZooKeeper.
 Please have a look at the links below; they should point you in the right direction.
 Handy SolrCloud ZkCLI Commands
 Uploading Solr Configuration into ZooKeeper ensemble
 
 Cheers,
 Jeeva
 
 On Nov 12, 2012, at 4:45 AM, Mark Miller markrmil...@gmail.com wrote:
 
 On 11/11/2012 04:47 PM, Yonik Seeley wrote:
 
 On Sun, Nov 11, 2012 at 10:39 PM, Nick Chase nch...@earthlink.net
 wrote:
 
 So I'm trying to use ZkCLI without success.  I DID start and stop Solr in
 non-cloud mode, so everything is extracted and it IS finding zookeeper*.jar.
 However, now it's NOT finding SolrJ.
 
 
 
 
 



Re: maven artifact for solr-solrj-4.0.0

2012-10-18 Thread Jeevanandam Madanagopal
Grzegorz Sobczyk - It's already available in Maven central repo link 

<dependency>
    <groupId>org.apache.solr</groupId>
    <artifactId>solr-solrj</artifactId>
    <version>4.0.0</version>
</dependency>

PS: Use http://search.maven.org, the official website of the Maven Central 
repository, for artifact search and download.

Cheers, Jeeva
Blog: http://www.myjeeva.com

On Oct 18, 2012, at 12:30 PM, Amit Nithian anith...@gmail.com wrote:

 I am not sure if this repository
 https://repository.apache.org/content/repositories/releases/ works but
 the modification dates seem reasonable given the timing of the
 release. I suspect it'll be on maven central soon (hopefully)
 
 On Wed, Oct 17, 2012 at 11:13 PM, Grzegorz Sobczyk
 grzegorz.sobc...@contium.pl wrote:
 Hello
 Is there a Maven artifact for the solrj 4.0.0 release?
 When will it be available to download from http://mvnrepository.com/ ?
 
 Version 4.0.0-BETA isn't compatible with 4.0.0 (problems with zookeeper and
 clusterstate.json parsing)
 
 Best regards
 Grzegorz Sobczyk
 
 



Re: maven artifact for solr-solrj-4.0.0

2012-10-18 Thread Jeevanandam Madanagopal
Sorry, missed the maven central repo link -
http://search.maven.org/#artifactdetails|org.apache.solr|solr-solrj|4.0.0|jar

Cheers, Jeeva
Blog: http://www.myjeeva.com

On Oct 18, 2012, at 1:59 PM, Jeevanandam Madanagopal je...@myjeeva.com wrote:

 Grzegorz Sobczyk - It's already available in Maven central repo link 
 
 <dependency>
     <groupId>org.apache.solr</groupId>
     <artifactId>solr-solrj</artifactId>
     <version>4.0.0</version>
 </dependency>
 
 PS: Use http://search.maven.org, the official website of the Maven Central 
 repository, for artifact search and download.
 
 Cheers, Jeeva
 Blog: http://www.myjeeva.com
 
 On Oct 18, 2012, at 12:30 PM, Amit Nithian anith...@gmail.com wrote:
 
 I am not sure if this repository
 https://repository.apache.org/content/repositories/releases/ works but
 the modification dates seem reasonable given the timing of the
 release. I suspect it'll be on maven central soon (hopefully)
 
 On Wed, Oct 17, 2012 at 11:13 PM, Grzegorz Sobczyk
 grzegorz.sobc...@contium.pl wrote:
 Hello
 Is there a Maven artifact for the solrj 4.0.0 release?
 When will it be available to download from http://mvnrepository.com/ ?
 
 Version 4.0.0-BETA isn't compatible with 4.0.0 (problems with zookeeper and
 clusterstate.json parsing)
 
 Best regards
 Grzegorz Sobczyk
 
 
 



Re: KeeperException (NodeExists for /overseer): SolrCloud Multiple Collections - is it safe ignore these exceptions?

2012-10-19 Thread Jeevanandam Madanagopal
Thanks Mark! 

Cheers, Jeeva

On Oct 19, 2012, at 8:35 AM, Mark Miller markrmil...@gmail.com wrote:

 Yes, those exceptions are fine. These are cases where we try to delete the 
 node if it's there, but don't care if it's not there - things like that. In 
 some of these cases, ZooKeeper logs things we can't stop, even though it's 
 expected that sometimes we will try and remove nodes that are not there or 
 create nodes that are already there.
 
 - Mark
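
The pattern described above (create a node unless it is already there, delete it unless it is already gone) can be sketched as follows. This is an in-memory stand-in written for illustration, not the ZooKeeper client API and not Solr's code; every class and method name in it is hypothetical.

```java
import java.util.HashSet;
import java.util.Set;

// Sketch only: an in-memory stand-in for a ZooKeeper-like node store.
final class IdempotentNodes {

    // Hypothetical exceptions playing the role of NodeExists / NoNode errors.
    static final class NodeExistsException extends Exception {}
    static final class NoNodeException extends Exception {}

    private final Set<String> nodes = new HashSet<>();

    void create(String path) throws NodeExistsException {
        if (!nodes.add(path)) throw new NodeExistsException();
    }

    void delete(String path) throws NoNodeException {
        if (!nodes.remove(path)) throw new NoNodeException();
    }

    boolean exists(String path) {
        return nodes.contains(path);
    }

    // Returns true if this call created the node, false if it was already there.
    boolean ensureExists(String path) {
        try {
            create(path);
            return true;
        } catch (NodeExistsException expected) {
            return false; // the node is there either way, so nothing to do
        }
    }

    // Returns true if this call deleted the node, false if it was already gone.
    boolean ensureAbsent(String path) {
        try {
            delete(path);
            return true;
        } catch (NoNodeException expected) {
            return false; // the node is gone either way, so nothing to do
        }
    }

    public static void main(String[] args) {
        IdempotentNodes store = new IdempotentNodes();
        System.out.println("created /overseer: " + store.ensureExists("/overseer"));
        System.out.println("created again: " + store.ensureExists("/overseer"));
        System.out.println("deleted missing live node: "
                + store.ensureAbsent("/live_nodes/mac-book-pro.local:7500_solr"));
    }
}
```

The caller swallows the "already exists" and "no such node" cases, but a server that logs every failed request would still report them, which is the situation in the log excerpt below.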
 
 On Thu, Oct 18, 2012 at 9:01 AM, Jeevanandam Madanagopal je...@myjeeva.com 
 wrote:
 Hello -
 
 I'm prototyping SolrCloud with multiple collections.  Each collection 
 represents country-level data.
 - searching within a collection is a country-level (local) search
 - searching across collections is a global search
 
 Attached is a graph image of the SolrCloud structure.  For the prototype I'm 
 running an embedded ZooKeeper ensemble (5 replicated ZooKeeper servers).
 - Searching and Indexing in respective collection works well
 - Search across collection works well (for global search)
 
 
 
 
 While joining 'Collection2' to the ZooKeeper ensemble, I noticed the following 
 KeeperExceptions in the log.
 
 Question: is it safe to ignore these exceptions?
 
 Exception Log snippet:
 Oct 18, 2012 4:54:26 PM org.apache.zookeeper.server.NIOServerCnxn$Factory run
 INFO: Accepted socket connection from /fe80:0:0:0:0:0:0:1%1:62700
 Oct 18, 2012 4:54:26 PM org.apache.zookeeper.server.NIOServerCnxn 
 readConnectRequest
 INFO: Client attempting to establish new session at 
 /fe80:0:0:0:0:0:0:1%1:62700
 Oct 18, 2012 4:54:26 PM org.apache.zookeeper.server.NIOServerCnxn 
 finishSessionInit
 INFO: Established session 0x13a73521356000a with negotiated timeout 15000 for 
 client /fe80:0:0:0:0:0:0:1%1:62700
 Oct 18, 2012 4:54:26 PM org.apache.zookeeper.server.PrepRequestProcessor 
 pRequest
 INFO: Got user-level KeeperException when processing 
 sessionid:0x13a73521356000a type:create cxid:0x1 zxid:0xfffe 
 txntype:unknown reqpath:n/a Error Path:/overseer Error:KeeperErrorCode = 
 NodeExists for /overseer
 Oct 18, 2012 4:54:26 PM org.apache.zookeeper.server.PrepRequestProcessor 
 pRequest
 INFO: Got user-level KeeperException when processing 
 sessionid:0x13a73521356000a type:create cxid:0x2 zxid:0xfffe 
 txntype:unknown reqpath:n/a Error Path:/overseer Error:KeeperErrorCode = 
 NodeExists for /overseer
 Oct 18, 2012 4:54:26 PM org.apache.zookeeper.server.PrepRequestProcessor 
 pRequest
 INFO: Got user-level KeeperException when processing 
 sessionid:0x13a73521356000a type:delete cxid:0x4 zxid:0xfffe 
 txntype:unknown reqpath:n/a Error 
 Path:/live_nodes/mac-book-pro.local:7500_solr Error:KeeperErrorCode = NoNode 
 for /live_nodes/mac-book-pro.local:7500_solr
 Oct 18, 2012 4:54:26 PM org.apache.solr.common.cloud.ZkStateReader$3 process
 INFO: Updating live nodes
 
 Cheers, Jeeva
 
 
 
 
 -- 
 - Mark