Re: trouble running indexer with Solr spatial

2015-01-27 Thread Chandan Tamrakar
Po
On Jan 28, 2015 1:54 AM, vit bulgako...@yahoo.com wrote:

 I am using Solr 4.2

 I added
 fieldType name=location_rpt
 class=solr.SpatialRecursivePrefixTreeFieldType

 according to
 a href=http://;
 https://wiki.apache.org/solr/SolrAdaptersForLuceneSpatial4
  and have spatial4j-0.3.jar in my project.

 When running the indexer I started getting this error:

 java.lang.NoClassDefFoundError: com/google/common/cache/CacheBuilder
 at

 org.apache.solr.schema.AbstractSpatialFieldType.init(AbstractSpatialFieldType.java:82)
 at

 org.apache.solr.schema.AbstractSpatialPrefixTreeFieldType.init(AbstractSpatialPrefixTreeFieldType.java:32)
 at

 org.apache.solr.schema.SpatialRecursivePrefixTreeFieldType.init(SpatialRecursivePrefixTreeFieldType.java:28)
 at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native
 Method)
 at

 sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
 at

 sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
 at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
 at java.lang.Class.newInstance0(Class.java:357)
 at java.lang.Class.newInstance(Class.java:310)
 at

 org.apache.solr.core.SolrResourceLoader.newInstance(SolrResourceLoader.java:470)
 at

 org.apache.solr.core.SolrResourceLoader.newInstance(SolrResourceLoader.java:458)
 at

 org.apache.solr.schema.FieldTypePluginLoader.create(FieldTypePluginLoader.java:81)
 at

 org.apache.solr.schema.FieldTypePluginLoader.create(FieldTypePluginLoader.java:43)
 at

 org.apache.solr.util.plugin.AbstractPluginLoader.load(AbstractPluginLoader.java:151)
 at
 org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:374)
 at org.apache.solr.schema.IndexSchema.init(IndexSchema.java:122)
 at org.apache.solr.core.SolrCore.init(SolrCore.java:684)
 at org.apache.solr.core.SolrCore.init(SolrCore.java:618)



 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/trouble-running-indexer-with-Solr-spatial-tp4182359.html
 Sent from the Solr - User mailing list archive at Nabble.com.



Re: indexing unique keys

2014-09-08 Thread Chandan Tamrakar
There would be records in millions and field would only be md5 hash. Size
of field would be 32 chars

Can you please point out what would be the best approach ?

Thanks
On Sep 5, 2014 12:47 PM, Mikhail Khludnev mkhlud...@griddynamics.com
wrote:

 Hello,

 You are asking without giving a context. What's the size of sets, desired
 TPS, key length, and even values?
 It's hard to answer definitely. It's not primary usage for Lucene, it adds
 some unnecessary overhead. However, community collected a few workaround
 for such kind of problem. From the other side, as far as I know executing
 queries like WHERE x IN (1,,2324) is not a piece of cake for SQL
 servers, also.

 you can follow link at
 https://plus.google.com/u/0/+MichaelMcCandless/posts/8VNydNi3wvK to find a
 relevant benchmark. it might help you to get least estimates for the Lucene
 solution.



 On Thu, Sep 4, 2014 at 5:53 PM, Mark , N nipen.m...@gmail.com wrote:

  I have a use-case where we want to store unique keys ( Hashes)  which
 would
  be
  used to compare against another set of  keys ( Hashes)
 
  For example
 
   Index  set= { h1, h2 , h3 , h4 }
 
  comparision set = { h1 , h2 }
 
  result set = h1,h2
 
  Would it be an advantage to store index set in  Solr instead of storing
  in traditional databases?
 
  Thanks in advance
 
 
 
 
 
 
  *Nipen Mark *
 



 --
 Sincerely yours
 Mikhail Khludnev
 Principal Engineer,
 Grid Dynamics

 http://www.griddynamics.com
 mkhlud...@griddynamics.com



Indexing scanned PDFs

2014-05-05 Thread Chandan Tamrakar
​we are using SOLr to index pdf documents but there are cases where PDFs
are usually a scanned document  with no text to extract and index .

Is there a plugin or module in SOLR that we can integrate so that it would
actually extract a text / OCR and then index?


Thanks in advance

Chandan Tamrakar


Re: searching for an id

2013-01-31 Thread Chandan Tamrakar
which analyzer are you  using to index that field ,  you can verify that
from schema file .

thanks


On Thu, Jan 31, 2013 at 2:35 PM, b.riez...@pixel-ink.de 
b.riez...@pixel-ink.de wrote:

 Hi

 I have an id wich is a string like this.
 tx-20130130-4599

 i'm using a field without processing, wich i got confirmed via the
 analyser tool
 But when i search for that it got split up, so instead of finding that
 specific entry with that unique id,
 it finds all entries with tx in it.

 Any idea how to get rid of that behavior?

 Best
 Ben




-- 
Chandan Tamrakar
*
*


Re: search with spaces

2012-12-27 Thread Chandan Tamrakar
Which Analyzer is being used in the field that was indexed ?
May be you can use solradmin to analyze and see how is your index

thanks

On Thu, Dec 27, 2012 at 2:30 PM, Sangeetha sangeetha...@gmail.com wrote:

 Hi,

 I have a text field with value O O Jaane Jaane. When i search with *q=Jaane
 Jaane* it is giving the results. But if i give *q=O O Jaane Jaane* it is
 not
 working? What could be the reason?

 Thanks,
 Sangeetha



 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/search-with-spaces-tp4029265.html
 Sent from the Solr - User mailing list archive at Nabble.com.




-- 
Chandan Tamrakar
*
*


Re: Even after indexing a mysql table,in solr am not able to retreive data after querying

2012-10-19 Thread Chandan Tamrakar
status shows that all your 4 records were not indexed.

str name=Total Documents Failed4/str



On Fri, Oct 19, 2012 at 12:22 PM, Romita Saha
romita.s...@sg.panasonic.comwrote:

 Hi,

 Even after indexing a mysql table,in solr am not able to retrieve data
 after querying. Here is the status after i run
 http://localhost:8983/solr/db/dataimport

 str name=Indexing completed. Added/Updated: 0 documents. Deleted 0
 documents./str
 str name=Committed2012-10-19 14:31:28/str
 str name=Total Documents Processed0/str
 str name=Total Documents Failed4/str
 str name=Time taken0:0:0.524/str/lst
 str name=WARNINGThis response format is experimental.  It is likely to
 change in the future
 ./str
 /response

 My data-config file is :

 entity name=camera
 query=SELECT id FROM camera
 field column=id name=id/
 field column=data name=data/


 /entity

 The related schema.xml file is :

 field name=id type=integer indexed=true stored=true
 required=true/
 field name=data type=string indexed=true stored=true
 required=true/

 In my database, id is of Type int (11) and data is of Type varchar(100)
 I am new to solr. Could any one please help.

 Thanks and regards,
 Romita Saha




-- 
Chandan Tamrakar
*
*


Re: Solr Clustering

2012-09-04 Thread Chandan Tamrakar
yes there is a solr component if you want to cluster solr documents , check
the following link
http://wiki.apache.org/solr/ClusteringComponent

Carrot2 might be good if you want to cluster few thousands of documents ,
for example when user search solr , just cluster the  search results

Mahout is much more scalable and probably you need Hadoop for that


thanks
chandan

On Tue, Sep 4, 2012 at 2:10 PM, Denis Kuzmenok forward...@ukr.net wrote:



  Original Message 
 Subject: Solr Clustering
 From: Denis Kuzmenok forward...@ukr.net
 To: solr-user@lucene.apache.org
 CC:

 Hi, all.
 I know there is carrot2 and mahout for clustering. I want to implement
 such thing:
 I fetch documents and want to group them into clusters when they are added
 to index (i want to filter similar documents for example for 1 week). i
 need these documents quickly, so i cant rely on some postponed
 calculations. Each document should have assigned cluster id (like group
 similar documents into clusters and assign each document its cluster id.
 It's something similar to news aggregators like google news. I dont need
 to search for clusters with documents older than 1 week (for example). Each
 document will have its unique id and saved into DB. But solr will have
 cluster id field also.
 Is it possible to implement this with solr/carrot/mahout?




-- 
Chandan Tamrakar
*
*


Carrot2 using rawtext of field for clustering

2012-06-08 Thread Chandan Tamrakar
Is there any workaround in Solr/Carrot2 So that we could pass tokens that'd
been filtered with customer tokenizer/filters instead of rawtext that it
currently
uses for clustering ?

I read an issue in following link too .

https://issues.apache.org/jira/browse/SOLR-2917


Is writing our own parsers to filter text documents before indexing to SOLR
could be only the right approach currently ? please let me know if anyone
have come across this issue and have other better suggestions?

-- 
Chandan Tamrakar
*
*


Re: Can I update a specific field in solr?

2011-06-08 Thread Chandan Tamrakar
I think You can do that but you need to re-index a whole document again.

note that there is nothing like update  , its usually delete and then add.

thanks

On Wed, Jun 8, 2011 at 4:00 PM, ZiLi dangld...@163.com wrote:

 Hi, I try to update a specific field in solr , but I didn't find anyway to
 implement this .
 Anyone who knows how to ?
 Any suggestions will be appriciate : )


 2011-06-08



 ZiLi




-- 
Chandan Tamrakar
*
*


Re: different indexes for multitenant approach

2011-06-03 Thread Chandan Tamrakar
may be you need multi core feature of solr , you can have a single Solr
instance with separate configurations and indexes

http://wiki.apache.org/solr/CoreAdmin



On Fri, Jun 3, 2011 at 12:04 PM, Naveen Gupta nkgiit...@gmail.com wrote:

 Hi

 I want to implement different index strategy where we want to keep indexes
 with respect to each tennant and we want to maintain indexes separately ...

 first level of category -- company name

 second level of category - company name + fields to be indexed

 then further categories - group of different company name based on some
 heuristic (hashing) (if it grows furhter)

 i want to do in the same solr instance. can it be possible ?

 Thanks
 Naveen




-- 
Chandan Tamrakar
*
*


Re: DIH: Exception with Too many connections

2011-05-31 Thread Chandan Tamrakar
looks like you are not being able to connect to database , pls see if you
get similar exception when you try to connect from other clients



On Tue, May 31, 2011 at 3:01 PM, tiffany tiffany.c...@future.co.jp wrote:

 Hi all,

 I'm using DIH and getting the following error.
 My Solr version is Solr3.1.

 =
 ...
 Caused by:
 com.mysql.jdbc.exceptions.jdbc4.MySQLNonTransientConnectionException: Could
 not create connection to database server. Attempted reconnect 3 times.
 Giving up.
at sun.reflect.GeneratedConstructorAccessor98.newInstance(Unknown
 Source)
at

 sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
at com.mysql.jdbc.Util.handleNewInstance(Util.java:406)
at com.mysql.jdbc.Util.getInstance(Util.java:381)
at com.mysql.jdbc.SQLError.createSQLException(SQLError.java:985)
at com.mysql.jdbc.SQLError.createSQLException(SQLError.java:956)
at com.mysql.jdbc.SQLError.createSQLException(SQLError.java:926)
at
 com.mysql.jdbc.ConnectionImpl.createNewIO(ConnectionImpl.java:2364)
at com.mysql.jdbc.ConnectionImpl.init(ConnectionImpl.java:781)
at com.mysql.jdbc.JDBC4Connection.init(JDBC4Connection.java:46)
at sun.reflect.GeneratedConstructorAccessor94.newInstance(Unknown
 Source)
at

 sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
at com.mysql.jdbc.Util.handleNewInstance(Util.java:406)
at
 com.mysql.jdbc.ConnectionImpl.getInstance(ConnectionImpl.java:352)
at
 com.mysql.jdbc.NonRegisteringDriver.connect(NonRegisteringDriver.java:284)
at

 org.apache.solr.handler.dataimport.JdbcDataSource$1.call(JdbcDataSource.java:161)
at

 org.apache.solr.handler.dataimport.JdbcDataSource$1.call(JdbcDataSource.java:128)
at

 org.apache.solr.handler.dataimport.JdbcDataSource.getConnection(JdbcDataSource.java:363)
at

 org.apache.solr.handler.dataimport.JdbcDataSource.access$200(JdbcDataSource.java:39)
at

 org.apache.solr.handler.dataimport.JdbcDataSource$ResultSetIterator.init(JdbcDataSource.java:240)
... 11 more
 Caused by:
 com.mysql.jdbc.exceptions.jdbc4.MySQLNonTransientConnectionException: Data
 source rejected establishment of connection,  message from server: Too
 many
 connections
at sun.reflect.GeneratedConstructorAccessor98.newInstance(Unknown
 Source)
at

 sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
at com.mysql.jdbc.Util.handleNewInstance(Util.java:406)
at com.mysql.jdbc.Util.getInstance(Util.java:381)
at com.mysql.jdbc.SQLError.createSQLException(SQLError.java:985)
at com.mysql.jdbc.SQLError.createSQLException(SQLError.java:956)
at com.mysql.jdbc.MysqlIO.doHandshake(MysqlIO.java:1104)
at
 com.mysql.jdbc.ConnectionImpl.createNewIO(ConnectionImpl.java:2292)
... 24 more

 =

 My dataSource setting is something like this:

dataSource type=JdbcDataSource driver=com.mysql.jdbc.Driver
 url=jdbc:mysql://database01/test?autoReconnect=true user=xxx
 password=xxx batchSize=-1 /

 Any idea to solve this problem?

 Thank you!


 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/DIH-Exception-with-Too-many-connections-tp3005213p3005213.html
 Sent from the Solr - User mailing list archive at Nabble.com.




-- 
Chandan Tamrakar
*
*


Re: Facet Query

2011-05-27 Thread Chandan Tamrakar
which analyzer do you use for indexing ? You could exclude those stop words
during indexing

http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters




On Fri, May 27, 2011 at 1:36 PM, Jasneet Sabharwal 
jasneet.sabhar...@ngicorporation.com wrote:

 Hi

 When I do a facet query on my data, it shows me a list of all the words
 present in my database with their count. Is it possible to not get the
 results of common words like a, an, the, http and so one but only get the
 count of stuff we need like microsoft, ipad, solr, etc.

 --
 Thanx  Regards

 Jasneet Sabharwal




-- 
Chandan Tamrakar
*
*


Re: What is omitNorms

2011-05-26 Thread Chandan Tamrakar
  When you say omitnorms=true  for any fields it means SOLR will  not
store norms . AFAIK , if you do not store these norms then your index size
would be smaller and will take less memory  . You could safely omit these
norms for smaller fields .
i.e your indexing time is more.

 So if you  do not store norms you save the memory

Norms are used  to boosts and field length normalization during indexing
time so that short document has higher score

Turning the norms on/off may depend on your indexing size and
implementations

I hope this helps ..

thanks



On Thu, May 26, 2011 at 11:48 AM, Romi romijain3...@gmail.com wrote:

 thanks for the link, i got lot information from this document. Can u please
 tell me how can i verify omitNorms effect in my document indexing or
 searching.

 -
 Romi
 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/What-is-omitNorms-tp2987547p2987649.html
 Sent from the Solr - User mailing list archive at Nabble.com.




-- 
Chandan Tamrakar
*
*


Re: problem in setting field attribute in schema.xml

2011-05-26 Thread Chandan Tamrakar
I guess you are indexing with  property index=false ,  stored = true
if  it  is  , that means you are storing the value on index  , so whenever
you do *:*  you can see the stored value

for example

 if you have a field = ID, Customer_Name   and you would only like to index
customer_name because this is a field which users is going to search ..
then you can just store ID in index without indexing .

When  customer names matches your index you would also like to show ID to
users

I do not know what is the purpose on your case  . Store fields are usually
required when you don't want to index  but show on the search results.

I hope its clear  . You can try and experiment changing these values on a
unique fields too ..

thanks.






On Thu, May 26, 2011 at 4:37 PM, Romi romijain3...@gmail.com wrote:

 i have done it, i deleted old indexes and created new indexes but still
 able
 to search it through *:*, and no result when i search it as field:value.
 really surprising result. :-O

 -
 Romi
 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/problem-in-setting-field-attribute-in-schema-xml-tp2984126p2988256.html
 Sent from the Solr - User mailing list archive at Nabble.com.




-- 
Chandan Tamrakar
*
*


Re: problem in setting field attribute in schema.xml

2011-05-26 Thread Chandan Tamrakar
Yes as i said earlier

. If you want to store the value of  field as it is in index without
Tokenizing  . .for example customer_id  which is a unique fields and you
don't want to tokenize

when you index a field you could tokenize the field values to index based on
what tokenizer you use so that users can search ..



On Thu, May 26, 2011 at 5:55 PM, Romi romijain3...@gmail.com wrote:

 did u mean when i set indexed=false and store=true, solr does not index
 the field's value but store its value as it is???

 -
 Romi
 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/problem-in-setting-field-attribute-in-schema-xml-tp2984126p2988458.html
 Sent from the Solr - User mailing list archive at Nabble.com.




-- 
Chandan Tamrakar
*
*


Re: What is omitNorms

2011-05-25 Thread Chandan Tamrakar
This is an advance option. pls see the details on following link

http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Scaling-Lucene-and-Solr#d0e71



On Thu, May 26, 2011 at 11:12 AM, Romi romijain3...@gmail.com wrote:

 and i also wanted to know  what is difference if i set omitNorms in
 fieldType
 or if i set it in field.

 -
 Romi
 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/What-is-omitNorms-tp2987547p2987562.html
 Sent from the Solr - User mailing list archive at Nabble.com.




-- 
Chandan Tamrakar
*
*


Re: Indexing Mails

2011-05-12 Thread Chandan Tamrakar
what kind of emails you want to parse ?  MS emails ?

You could integrate apache tika  but it depends on what kind of emails Tika
parser would be able to parse

You can define the fields that could be parsed and define that in your xml
schema

thanks

On Tue, May 10, 2011 at 2:07 PM, Jörg Agatz joerg.ag...@googlemail.comwrote:

 will the E-Mail ID, and the recent E-Mail Ids, indext too?

 and witch fiels i have to create in schema.xml?




-- 
Chandan Tamrakar
*
*


Re: Need to create dyanamic indexies base on different document workspaces

2011-04-21 Thread Chandan Tamrakar
It depends on your application design how you want your index


There is a feature called solr core . http://wiki.apache.org/solr/CoreAdmin
You could still have a single index but a field  to differentiate the items
in index

thanks


On Thu, Apr 21, 2011 at 10:55 AM, Gaurav Shingala 
gaurav.shing...@hotmail.com wrote:





 Hi,

 Is there a way to create different solr indexes for different categories?
 We have different document workspaces and ideally want each workspace to
 have its own solr index.

 Thanks,
 Gaurav





-- 
Chandan Tamrakar
*
*


Re: Need to create dyanamic indexies base on different document workspaces

2011-04-21 Thread Chandan Tamrakar
Actually you need to put  a file named *solr.xml* in the solr.home directory
to create the solr core .
you can do that programatically if you want to make it dynamic based on your
logic

pls check the solr core admin document.



On Thu, Apr 21, 2011 at 2:52 PM, Gaurav Shingala 
gaurav.shing...@hotmail.com wrote:


 Is it possible to create solr core dyanamically?

 In our case we want each workspace to have its own solr index.



 Thanks

  From: chandan.tamra...@nepasoft.com
  Date: Thu, 21 Apr 2011 11:57:53 +0545
  Subject: Re: Need to create dyanamic indexies base on different document
 workspaces
  To: solr-user@lucene.apache.org
 
  It depends on your application design how you want your index
 
 
  There is a feature called solr core .
 http://wiki.apache.org/solr/CoreAdmin
  You could still have a single index but a field to differentiate the
 items
  in index
 
  thanks
 
 
  On Thu, Apr 21, 2011 at 10:55 AM, Gaurav Shingala 
  gaurav.shing...@hotmail.com wrote:
 
  
  
  
  
   Hi,
  
   Is there a way to create different solr indexes for different
 categories?
   We have different document workspaces and ideally want each workspace
 to
   have its own solr index.
  
   Thanks,
   Gaurav
  
 
 
 
 
  --
  Chandan Tamrakar
  *
  *





-- 
Chandan Tamrakar
*
*


Re: ComplexPhraseQueryParser and wildcards

2011-03-28 Thread Chandan Tamrakar
did you get any exceptions ?
usually wild card term you mentioned would be expanded before being actually
searched .

thanks.

On Mon, Mar 28, 2011 at 1:24 PM, jmr jmpala...@free.fr wrote:

 Hi,

 I'm using ComplexPhraseQueryParser and I'm quite happy with it.
 However, there are some queries using wildcards nor working.

 Exemple: I want to do a proximity search between the word compiler and the
 expression 'cross linker' or 'cross linking' or 'cross linked' ...

 (cross-linker compiler~50 OR cross-linking compiler~50) is working OK
 but (cross-link* compiler~50) is not working (returns nothing)

 Is there another syntax allowing to do sucj query ?

 Thanks
 JMR

 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/ComplexPhraseQueryParser-and-wildcards-tp2742244p2742244.html
 Sent from the Solr - User mailing list archive at Nabble.com.




-- 
Chandan Tamrakar
*
*