Download solr data(only one field) to csv

2018-02-23 Thread Selvam Raman
Hi,

I have 10 million of record in solr index. I want to download whole record
in csv format with one field.

I have 20+ fields, but i want to download data with (fl=title) only title
field.

http://localhost:8983/solr/containerMetadata/select?q=*=external_id_s,container_title_en=csv=true=100

the above command not seems to effective to download 10 million record.
Could you please suggest an idea?

-- 
Selvam Raman
"லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"


Spark-Solr -- unresolved dependencies

2018-02-23 Thread Selvam Raman
Hi,

spark version - EMR 2.0.0

spark-shell --packages com.lucidworks.spark:spark-solr:3.0.1

when i tired about command, am getting below error


::

::  UNRESOLVED DEPENDENCIES ::

::

:: org.restlet.jee#org.restlet;2.3.0: not found

:: org.restlet.jee#org.restlet.ext.servlet;2.3.0: not found

::



:: USE VERBOSE OR DEBUG MESSAGE LEVEL FOR MORE DETAILS
Exception in thread "main" java.lang.RuntimeException: [unresolved
dependency: org.restlet.jee#org.restlet;2.3.0: not found, unresolved
dependency: org.restlet.jee#org.restlet.ext.servlet;2.3.0: not found]
at
org.apache.spark.deploy.SparkSubmitUtils$.resolveMavenCoordinates(SparkSubmit.scala:1066)
at
org.apache.spark.deploy.SparkSubmit$.prepareSubmitEnvironment(SparkSubmit.scala:294)
at org.apache.spark.deploy.SparkSubmit$.submit(SparkSubmit.scala:158)
at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:124)
at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)

-- 
Selvam Raman
"லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"


Re: Highlighter is not working for wildcard query

2018-01-17 Thread Selvam Raman
Query Parser
  defType=edismax

On Wed, Jan 17, 2018 at 4:47 PM, Selvam Raman <sel...@gmail.com> wrote:

> Hi,
>
> solr version 6.4.2
>
> hl.method = unified, hl.bs.type=Word, this setting working fine for normal
> queries but failed in wildcard queries.(tried other hl.bs.type parmeter and
> without hl.bs.type as well, highlighting not working for wildcard queries)
>
> hl.method = original, this is working fine for both normal queries and
> wildcard queries.
>
> Why unified is not working and original/default is working fine for
> wildcard queries?
>
> any suggestion would be appreciated.
>
> --
> Selvam Raman
> "லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"
>



-- 
Selvam Raman
"லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"


Highlighter is not working for wildcard query

2018-01-17 Thread Selvam Raman
Hi,

solr version 6.4.2

hl.method = unified, hl.bs.type=Word, this setting working fine for normal
queries but failed in wildcard queries.(tried other hl.bs.type parmeter and
without hl.bs.type as well, highlighting not working for wildcard queries)

hl.method = original, this is working fine for both normal queries and
wildcard queries.

Why unified is not working and original/default is working fine for
wildcard queries?

any suggestion would be appreciated.

-- 
Selvam Raman
"லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"


Edismax leading wildcard search

2017-12-22 Thread Selvam Raman
Hi,

Solr version - 6.4

Parser - Edismax

Leading wildcard search is allowed in edismax.

1) how can i disable leading wildcard search
2) why leading wildcard search takes so much of time to give the response.

-- 
Selvam Raman
"லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"


Re: Wildcard searches with special character gives zero result

2017-12-15 Thread Selvam Raman
I am using edismax query parser.

On Fri, Dec 15, 2017 at 10:37 AM, Selvam Raman <sel...@gmail.com> wrote:

> Solr version - 6.4.0
>
> "title_en":["Chip-seq"]
>
> When i fired query like below
>
> 1) chip-seq
> 2) chi*
>
> it is giving expected result, for this case one result.
>
> But when i am searching with wildcard it produce zero result.
> 1) ch*p-seq
>
>
> if i use escape character in '-' it creates two terms rather than single
> term.
>
> --
> Selvam Raman
> "லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"
>



-- 
Selvam Raman
"லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"


Re: No Live SolrServer available to handle this request

2017-12-15 Thread Selvam Raman
Hi Steve,

i have raised the jira ticket SOLR-11764
<https://issues.apache.org/jira/browse/SOLR-11764>.
I am happy to work with you to solve this problem.

Thanks,
selvam R

On Thu, Dec 7, 2017 at 2:48 PM, Steve Rowe <sar...@gmail.com> wrote:

> Hi Selvam,
>
> This sounds like it may be a bug - could you please create a JIRA?  (See <
> https://wiki.apache.org/solr/HowToContribute#JIRA_tips_.
> 28our_issue.2Fbug_tracker.29> for more info.)
>
> Thanks,
>
> --
> Steve
> www.lucidworks.com
>
> > On Dec 6, 2017, at 9:56 PM, Selvam Raman <sel...@gmail.com> wrote:
> >
> > Yes. you are right. we are using preanalyzed field and that causing the
> > problem.
> > The actual problem is preanalyzed with highlight option. if i disable
> > highlight option it works fine. Please let me know if there is work
> around
> > to solve it.
> >
> > On Wed, Dec 6, 2017 at 10:19 PM, Erick Erickson <erickerick...@gmail.com
> >
> > wrote:
> >
> >> This looks like you're using "pre analyzed fields" which have a very
> >> specific format. PreAnalyzedFields are actually pretty rarely used,
> >> did you enable them by mistake?
> >>
> >> On Tue, Dec 5, 2017 at 11:37 PM, Selvam Raman <sel...@gmail.com> wrote:
> >>> When i look at the solr logs i find the below exception
> >>>
> >>> Caused by: java.io.IOException: Invalid JSON type java.lang.String,
> >>> expected Map
> >>> at
> >>> org.apache.solr.schema.JsonPreAnalyzedParser.parse(
> >> JsonPreAnalyzedParser.java:86)
> >>> at
> >>> org.apache.solr.schema.PreAnalyzedField$PreAnalyzedTokenizer.
> >> decodeInput(PreAnalyzedField.java:345)
> >>> at
> >>> org.apache.solr.schema.PreAnalyzedField$PreAnalyzedTokenizer.access$
> >> 000(PreAnalyzedField.java:280)
> >>> at
> >>> org.apache.solr.schema.PreAnalyzedField$PreAnalyzedAnalyzer$1.
> >> setReader(PreAnalyzedField.java:375)
> >>> at org.apache.lucene.analysis.Analyzer.tokenStream(Analyzer.java:202)
> >>> at
> >>> org.apache.lucene.search.uhighlight.AnalysisOffsetStrategy.
> tokenStream(
> >> AnalysisOffsetStrategy.java:58)
> >>> at
> >>> org.apache.lucene.search.uhighlight.MemoryIndexOffsetStrategy.
> >> getOffsetsEnums(MemoryIndexOffsetStrategy.java:106)
> >>> ... 37 more
> >>>
> >>>
> >>>
> >>> I am setting up lot of fields (fq, score, highlight,etc) then put it
> >> into
> >>> solrquery.
> >>>
> >>> On Wed, Dec 6, 2017 at 11:22 AM, Selvam Raman <sel...@gmail.com>
> wrote:
> >>>
> >>>> When i am firing query it returns the doc as expected. (Example:
> >>>> q=synthesis)
> >>>>
> >>>> I am facing the problem when i include wildcard character in the
> query.
> >>>> (Example: q=synthesi*)
> >>>>
> >>>>
> >>>> org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException:
> >>>> Error from server at http://localhost:8983/solr/Metadata2:
> >>>> org.apache.solr.client.solrj.SolrServerException:
> >>>>
> >>>> No live SolrServers available to handle this request:[/solr/Metadata2_
> >>>> shard1_replica1,
> >>>>  solr/Metadata2_shard2_replica2,
> >>>>  solr/Metadata2_shard1_replica2]
> >>>>
> >>>> --
> >>>> Selvam Raman
> >>>> "லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"
> >>>>
> >>>
> >>>
> >>>
> >>> --
> >>> Selvam Raman
> >>> "லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"
> >>
> >
> >
> >
> > --
> > Selvam Raman
> > "லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"
>
>


-- 
Selvam Raman
"லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"


Wildcard searches with special character gives zero result

2017-12-15 Thread Selvam Raman
Solr version - 6.4.0

"title_en":["Chip-seq"]

When i fired query like below

1) chip-seq
2) chi*

it is giving expected result, for this case one result.

But when i am searching with wildcard it produce zero result.
1) ch*p-seq


if i use escape character in '-' it creates two terms rather than single
term.

-- 
Selvam Raman
"லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"


Re: No Live SolrServer available to handle this request

2017-12-06 Thread Selvam Raman
Yes. you are right. we are using preanalyzed field and that causing the
problem.
The actual problem is preanalyzed with highlight option. if i disable
highlight option it works fine. Please let me know if there is work around
to solve it.

On Wed, Dec 6, 2017 at 10:19 PM, Erick Erickson <erickerick...@gmail.com>
wrote:

> This looks like you're using "pre analyzed fields" which have a very
> specific format. PreAnalyzedFields are actually pretty rarely used,
> did you enable them by mistake?
>
> On Tue, Dec 5, 2017 at 11:37 PM, Selvam Raman <sel...@gmail.com> wrote:
> > When i look at the solr logs i find the below exception
> >
> > Caused by: java.io.IOException: Invalid JSON type java.lang.String,
> > expected Map
> > at
> > org.apache.solr.schema.JsonPreAnalyzedParser.parse(
> JsonPreAnalyzedParser.java:86)
> > at
> > org.apache.solr.schema.PreAnalyzedField$PreAnalyzedTokenizer.
> decodeInput(PreAnalyzedField.java:345)
> > at
> > org.apache.solr.schema.PreAnalyzedField$PreAnalyzedTokenizer.access$
> 000(PreAnalyzedField.java:280)
> > at
> > org.apache.solr.schema.PreAnalyzedField$PreAnalyzedAnalyzer$1.
> setReader(PreAnalyzedField.java:375)
> > at org.apache.lucene.analysis.Analyzer.tokenStream(Analyzer.java:202)
> > at
> > org.apache.lucene.search.uhighlight.AnalysisOffsetStrategy.tokenStream(
> AnalysisOffsetStrategy.java:58)
> > at
> > org.apache.lucene.search.uhighlight.MemoryIndexOffsetStrategy.
> getOffsetsEnums(MemoryIndexOffsetStrategy.java:106)
> > ... 37 more
> >
> >
> >
> >  I am setting up lot of fields (fq, score, highlight,etc) then put it
> into
> > solrquery.
> >
> > On Wed, Dec 6, 2017 at 11:22 AM, Selvam Raman <sel...@gmail.com> wrote:
> >
> >> When i am firing query it returns the doc as expected. (Example:
> >> q=synthesis)
> >>
> >> I am facing the problem when i include wildcard character in the query.
> >> (Example: q=synthesi*)
> >>
> >>
> >> org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException:
> >> Error from server at http://localhost:8983/solr/Metadata2:
> >> org.apache.solr.client.solrj.SolrServerException:
> >>
> >> No live SolrServers available to handle this request:[/solr/Metadata2_
> >> shard1_replica1,
> >>   solr/Metadata2_shard2_replica2,
> >>   solr/Metadata2_shard1_replica2]
> >>
> >> --
> >> Selvam Raman
> >> "லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"
> >>
> >
> >
> >
> > --
> > Selvam Raman
> > "லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"
>



-- 
Selvam Raman
"லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"


Re: No Live SolrServer available to handle this request

2017-12-05 Thread Selvam Raman
When i look at the solr logs i find the below exception

Caused by: java.io.IOException: Invalid JSON type java.lang.String,
expected Map
at
org.apache.solr.schema.JsonPreAnalyzedParser.parse(JsonPreAnalyzedParser.java:86)
at
org.apache.solr.schema.PreAnalyzedField$PreAnalyzedTokenizer.decodeInput(PreAnalyzedField.java:345)
at
org.apache.solr.schema.PreAnalyzedField$PreAnalyzedTokenizer.access$000(PreAnalyzedField.java:280)
at
org.apache.solr.schema.PreAnalyzedField$PreAnalyzedAnalyzer$1.setReader(PreAnalyzedField.java:375)
at org.apache.lucene.analysis.Analyzer.tokenStream(Analyzer.java:202)
at
org.apache.lucene.search.uhighlight.AnalysisOffsetStrategy.tokenStream(AnalysisOffsetStrategy.java:58)
at
org.apache.lucene.search.uhighlight.MemoryIndexOffsetStrategy.getOffsetsEnums(MemoryIndexOffsetStrategy.java:106)
... 37 more



 I am setting up lot of fields (fq, score, highlight,etc) then put it into
solrquery.

On Wed, Dec 6, 2017 at 11:22 AM, Selvam Raman <sel...@gmail.com> wrote:

> When i am firing query it returns the doc as expected. (Example:
> q=synthesis)
>
> I am facing the problem when i include wildcard character in the query.
> (Example: q=synthesi*)
>
>
> org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException:
> Error from server at http://localhost:8983/solr/Metadata2:
> org.apache.solr.client.solrj.SolrServerException:
>
> No live SolrServers available to handle this request:[/solr/Metadata2_
> shard1_replica1,
>   solr/Metadata2_shard2_replica2,
>   solr/Metadata2_shard1_replica2]
>
> --
> Selvam Raman
> "லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"
>



-- 
Selvam Raman
"லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"


No Live SolrServer available to handle this request

2017-12-05 Thread Selvam Raman
When i am firing query it returns the doc as expected. (Example:
q=synthesis)

I am facing the problem when i include wildcard character in the query.
(Example: q=synthesi*)


org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException:
Error from server at http://localhost:8983/solr/Metadata2:
org.apache.solr.client.solrj.SolrServerException:

No live SolrServers available to handle this
request:[/solr/Metadata2_shard1_replica1,
  solr/Metadata2_shard2_replica2,
  solr/Metadata2_shard1_replica2]

-- 
Selvam Raman
"லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"


Performance Test

2017-09-04 Thread Selvam Raman
Hi All,

which is the best tool for solr perfomance test. I want to identify how
much load my solr could handle and how many concurrent users can query on
solr.

Please suggest.

-- 
Selvam Raman
"லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"


Getting solr source count without using search query every time

2017-08-10 Thread Selvam Raman
​Hi All,

​I am using solr cloud environment to search and index the data.

Example

Source_field_s:

A,

B,

C,

 etc


​Expected result:

A(100)

B(200)

C(50),

etc


​

Data stored in solr.  Every second or 10 seconds i need to get source ​
​
​facet( A,B,C)​ to produce statistics. I do not want to disturb production
solr for this facet as it is already serving end user request and indexing
data.


i read about CDCR approach, could use CDCR(Target) to get only statistics
where as it is waste of storage. I do not worry about the data and concern
about count of all sources.


is there any internal approach available to get Stat or facet about
source_field for every second or batch whenever there is update in
index(add, delete and update).


Like CDCR, is there a way to get only source count(facet query result)
update to target in sync with source solr.


Could you please provide a approach to handle this problem?

​Thanks,

selvam R​