n
> use the merge API or something to merge your index into the empty
> collection.
>
> - Mark
>
> On Sat, May 19, 2018 at 5:25 PM Ken Krugler
> wrote:
>
>> Hi all,
>>
>> Wondering if anyone has experience (this is with Solr 6.6) in setting up
>> Mi
he didn’t seem to mention it in his talk.
> If this is a huge importance to you, I might also suggest looking at vespa,
> which makes tensors a first-class citizen and makes matrix-math pretty
> seamless: http://vespa.ai
Interesting, though my client is pretty much locked into using
following the same pattern as geospatial support - so a new field type and
query/parser, plus plumbing to hook it into Solr.
Before I go much further, is there anything like this already done, or in the
works?
Thanks,
— Ken
--
Ken Krugler
+1 530-210-6378
http
rect.
Thanks,
— Ken
----------
Ken Krugler
+1 530-210-6378
http://www.scaleunlimited.com
Custom big data solutions & training
Flink, Solr, Hadoop, Cascading & Cassandra
case)?
Thanks!
— Ken
PS - yes, we’re aware of the routing issue with generating our own shards….
--
Ken Krugler
+1 530-210-6378
http://www.scaleunlimited.com
Custom big data solutions & training
Flink, Solr, Hadoop, Cascading & Cassandra
to poke around?
Thanks,
— Ken
--
Ken Krugler
+1 530-210-6378
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr
.
Or is there actually a way to make this work with Solr 5/6?
Thanks,
— Ken
--
Ken Krugler
+1 530-210-6378
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr
ields are quite big).
>
> Any suggestions ?
>
> -Mohsin
--
Ken Krugler
+1 530-210-6378
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr
capability than you
can get when you embed it in your application. Chances are that you can
easily replace EmbeddedSolrServer with one of the SolrClient classes and
use a separate Solr deployment from your application.
Thanks,
Shawn
--
Ken Krugler
+1 530-210-6378
http
,
-- Ken
--
Ken Krugler
+1 530-210-6378
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr
?
Or is this currently not possible?
Thanks,
-- Ken
On Fri, Mar 28, 2014 at 8:53 AM, Ken Krugler
kkrugler_li...@transpac.com wrote:
Hi all,
I've been using the SimpleTextCodec in the past, but I just noticed
something odd...
I'm running Solr 4.3, and enable the SimpleText posting format via
--
Ken Krugler
+1 530-210-6378
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr
files, e.g.
https://issues.apache.org/jira/browse/LUCENE-3074 is about adding a simple text
format for DocValues.
I can walk the code to figure out what's up, but I'm hoping I just need to
change some configuration setting.
Thanks!
-- Ken
--
Ken Krugler
+1 530-210-6378
it hard for me to write up a simple solution to a training
exercise, where students need to clean up incorrectly formatted dates :)
Thanks,
-- Ken
--
Ken Krugler
+1 530-210-6378
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra
into body.
But is there any way to get revisioncontributorusername, for example?
Thanks,
-- Ken
--
Ken Krugler
+1 530-210-6378
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr
- From: Ken Krugler
Sent: Wednesday, September 11, 2013 8:24 PM
To: solr-user@lucene.apache.org
Subject: Grouping by field substring?
Hi all,
Assuming I want to use the first N characters of a specific field for
grouping results, is such a thing possible out-of-the-box
Hi all,
Assuming I want to use the first N characters of a specific field for grouping
results, is such a thing possible out-of-the-box?
If not, then what would the next best option be? E.g. a custom function query?
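One possible workaround, sketched here as an assumption rather than a confirmed answer from this thread: compute the prefix on the client at index time, store it in its own field, and group on that field. The field names (`sku`, `sku_prefix`) and the helper `with_prefix_field` are hypothetical; `group` and `group.field` are the standard result-grouping parameters.

```python
def with_prefix_field(doc, source_field, n, target_field):
    """Return a copy of doc with target_field set to the first n characters of source_field."""
    enriched = dict(doc)
    value = doc.get(source_field)
    if value is not None:
        enriched[target_field] = str(value)[:n]
    return enriched


# Hypothetical documents; "sku" stands in for the field being grouped on.
docs = [
    {"id": "1", "sku": "ABC-1001"},
    {"id": "2", "sku": "ABC-2002"},
    {"id": "3", "sku": "XYZ-3003"},
]
indexed = [with_prefix_field(d, "sku", 3, "sku_prefix") for d in docs]

# Query parameters that group results on the derived prefix field.
group_params = {"q": "*:*", "group": "true", "group.field": "sku_prefix"}
```

This trades a little index size for not needing any custom query-time code; the cost is that N is fixed when the documents are indexed.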
Thanks,
-- Ken
--
Ken Krugler
+1 530-210-6378
http
-- http://sematext.com/
Performance Monitoring -- http://sematext.com/spm
On Tue, Jul 2, 2013 at 3:01 PM, Ken Krugler kkrugler_li...@transpac.com
wrote:
Hi all,
After upgrading from Solr 3.5 to 4.2.1, I noticed our filterCache hit ratio
had dropped significantly.
Previously
some things that are unclear or even incorrect, so please
comment :)
Regards,
-- Ken
--
Ken Krugler
+1 530-210-6378
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr
of the above is that even with a very big filterCache size of
2K, the hit ratio is still only 60%.
Thanks for any insights,
-- Ken
--
Ken Krugler
+1 530-210-6378
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr
issue? Is there any way to disable the parsing of field names
in a field list?
Thanks,
-- Ken
--
Ken Krugler
+1 530-210-6378
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr
/api-4_0_0-ALPHA/org/apache/solr/client/solrj/impl/StreamingUpdateSolrServer.html
-- Ken
--
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr
Hi Robert,
On May 1, 2012, at 7:07pm, Robert Muir wrote:
On Tue, May 1, 2012 at 6:48 PM, Ken Krugler kkrugler_li...@transpac.com
wrote:
Hi list,
Does anybody know if the Suggester component is designed to work with shards?
I'm not really sure it is? They would probably have to override
that it just
doesn't work with shards.
Thanks,
-- Ken
http://about.me/kkrugler
+1 530-210-6378
--
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr
="lookupImpl">org.apache.solr.spelling.suggest.fst.FSTLookup</str>
<str name="field">content</str> <!-- the indexed field to derive suggestions from -->
<float name="threshold">0.0</float>
<str name="buildOnCommit">true</str>
</lst>
</searchComponent>
Thanks,
-- Ken
On May 1, 2012, at 3:48pm, Ken Krugler wrote:
Hi list,
Does
XMLWriter object as a
parameter.
--
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr
On Jan 13, 2012, at 1:39pm, Yonik Seeley wrote:
-Yonik
http://www.lucidimagination.com
On Fri, Jan 13, 2012 at 4:22 PM, Yonik Seeley
yo...@lucidimagination.com wrote:
On Fri, Jan 13, 2012 at 4:04 PM, Ken Krugler
kkrugler_li...@transpac.com wrote:
I finally got around to looking
HTTP requests are
queued up & processed - under load, I see IOExceptions where it's always N-1
shards that succeed, and one shard request fails. But I don't have a good
reproducible case yet to debug.
-- Ken
--
Ken Krugler
http://www.scaleunlimited.com
custom big data
.
--
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr
,
but that's pretty painful.
Any other ideas/input?
Thanks,
-- Ken
--
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr
their hand, and
then walk it.
Thanks,
-- Ken
--
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr
as the primary key is being used as the reduce-phase key, it's
easy to ensure uniqueness in the index.
Thanks again,
-- Ken
--
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr
workflows, it's straightforward to ensure that the unique key
field is really unique, thus if the performance gain is significant, I might
look into figuring out some way (with a trigger lock) of re-enabling this
support in SolrJ.
Thanks,
-- Ken
--
Ken Krugler
http
/
</analyzer>
</fieldType>
-- Ken
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr
and where is
the start point.
Floyd
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr
there,
There are many English soundex implementations that can be referenced, but I
wonder how to do a Chinese soundex (sounds-like) filter (maybe).
any idea?
Floyd
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
custom big data solutions & training
Hadoop, Cascading, Mahout
632 536 S 0.0 0.0 0:04.69
init
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr
the response times of the individual
queries.
Also 1-2% of the queries take significantly longer: avg somewhere at 100ms
while 1-2% take 1.5s or longer.
Any ideas are greatly appreciated :)
Fred.
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
custom big data
with a
different conf dir, and in that separate conf/solrschema.xml you can set up a
request handler that just dispatches to the two real cores.
-- Ken
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr
? Thank you
If you turn on facets in your query (facet=true&facet.field=<field name>) then
you'll get back all of the distinct values, though might have to play with
other settings (e.g. facet.limit=-1) to get the results you need.
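A minimal sketch of such a request, assuming a Solr instance at http://localhost:8983/solr and a hypothetical field called `category`. The helper `distinct_values_url` is illustrative only; `rows=0` is an extra assumption here, used to suppress the document list so that only the facet counts come back.

```python
from urllib.parse import urlencode


def distinct_values_url(base_url, field):
    """Build a select URL that facets on `field` and asks for every distinct value."""
    params = [
        ("q", "*:*"),            # match all documents
        ("rows", "0"),           # assumption: only the facet counts are wanted
        ("facet", "true"),
        ("facet.field", field),
        ("facet.limit", "-1"),   # -1 removes the cap on returned facet values
    ]
    return base_url.rstrip("/") + "/select?" + urlencode(params)


url = distinct_values_url("http://localhost:8983/solr", "category")
print(url)
```

On a field with very many distinct values, facet.limit=-1 can produce a large response, so it may be worth paging with facet.offset instead.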
-- Ken
--
Ken Krugler
+1 530-210-6378
http
archive at Nabble.com.
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr
at Nabble.com.
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
custom data mining solutions
, could not locate a jetty instance.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-monitoring-Newrelic-tp3042889p3043080.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
custom
(assuming it contains the field names).
http://wiki.apache.org/solr/UpdateCSV#fieldnames
-Yonik
http://www.lucidimagination.com
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
custom data mining solutions
On Jun 9, 2011, at 2:21pm, Helmut Hoffer von Ankershoffen wrote:
Hi,
btw: there seems to be somewhat of a non-match regarding efforts to enhance DIH
regarding the CSV format (James Dyer) and the effort to maintain the
CSVLoader (Ken Krugler). How about merging your efforts and migrating
at Nabble.com.
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
custom data mining solutions
On Apr 3, 2011, at 6:56am, yehosef wrote:
How can they require payment for something that was developed under the
apache license?
It's the difference between free speech and free beer :)
See http://en.wikipedia.org/wiki/Gratis_versus_libre
-- Ken
--
Ken Krugler
+1
BoilerpipeContentHandler(new ContentHandlerDecorator(
Though from a quick look at that code, I'm curious why it doesn't use
BodyContentHandler, versus the current ContentHandlerDecorator.
-- Ken
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i
their negative side-effects, while still letting them crawl you?
Thanks,
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i
a lot of value.
Any other ideas?
Thanks,
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
- Original Message
From: Ken Krugler kkrugler_li...@transpac.com
To: solr-user@lucene.apache.org
Sent: Mon, January 10, 2011 9
a really big chunk used for a sorted field.
See http://wiki.apache.org/solr/SolrCaching and http://wiki.apache.org/solr/SolrPerformanceFactors
for more details.
-- Ken
-Original Message-
From: Ken Krugler [mailto:kkrugler_li...@transpac.com]
Sent: Tuesday, November 30, 2010 3:12 PM
are configured to use a VIP to talk to the master, so that
it's easy to dynamically change which master they use, via updates to
the load balancer config.
-- Ken
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
Platform: RHEL with Sun JRE 1.6.0_18 on dual quad xeon machines with
64GB memory etc etc
http://ken-blog.krugler.org
+1 530-265-2225
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i
is the preferred approach in such an environment? I am not a programmer
and would appreciate some hand-holding here :o)
Thanks in advance,
Sesh
--
Lance Norskog
goks...@gmail.com
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t
. All I can do is something in the server itself.
What can I do to get it done? Write some RequestHandler?
RequestProcessor? Any other option?
Thanks, nizan
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
for type=query. Please advise on how to group or cluster
document terms so that they can be used as facets.
Many thanks in advance,
Adam Estrada
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
. Obviously, we'd like to be able to reproduce the performance of the 3rd
slave, and avoid the poor performance of the first two slaves the next time we
decide it's time to optimize our index.
thanks in advance,
Mason
--
Ken Krugler
+1 530-210-6378
http
.472066.n3.nabble.com/using-HTTPClient-sending-solr-ping-request-wont-timeout-as-specified-tp1691292p1691355.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
: Depending on the input and the pattern used and the input
TokenStream, this TokenFilter may produce Tokens whose text is the
empty string.
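To make the warning concrete, here is a rough Python emulation of a pattern-replacing token filter. It is not Lucene code; the function name, the tokens and the pattern are all made up for illustration.

```python
import re


def pattern_replace(tokens, pattern, replacement=""):
    """Apply a regex replacement to every token, the way a pattern-replace filter would."""
    return [re.sub(pattern, replacement, token) for token in tokens]


# Strip everything that is not a lowercase letter.
replaced = pattern_replace(["foo-1", "---", "bar"], r"[^a-z]")

# The middle token is now the empty string; drop such tokens explicitly.
non_empty = [token for token in replaced if token]
```

In an analyzer chain the equivalent cleanup would presumably be a length filter placed after the pattern filter, though that is an assumption about the setup being discussed.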
-- Ken
http://ken-blog.krugler.org
+1 530-265-2225
--
Ken Krugler
+1 530-210-6378
free collections of data around, though none that I
know of which target top-ranked pages.
-- Ken
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
found in associated text?
-- Ken
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
about handling this
type of escaping for you.
Dismax is not (yet) an option because we need the full lucene syntax
within the query.
OK - in that case sounds like you're stuck with escaping.
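A minimal sketch of doing that escaping on the client, assuming only the user-supplied text is escaped and the surrounding Lucene syntax is added afterwards. The character set below is an assumption based on the Lucene query syntax documentation, and `escape_query_text` is a hypothetical helper, not a Solr API.

```python
# Characters treated as syntax by the Lucene query parser (assumed set).
SPECIAL_CHARS = set('\\+-!():^[]"{}~*?|&/')


def escape_query_text(text):
    """Backslash-escape every query-syntax character in user-supplied text."""
    return "".join("\\" + ch if ch in SPECIAL_CHARS else ch for ch in text)


# Escape only the user's input, then wrap it in real query syntax.
user_input = "C++ (draft)"
query = "title:" + escape_query_text(user_input)
print(query)
```

Whitespace is deliberately left alone here, so a multi-word input still parses as several terms; quote or escape it as well if a single term is required.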
-- Ken
--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e
to SOLR and wondering if the following is possible: in addition to normal
full text search, my users want to have the option to search only HTML
heading innertext, i.e. content inside of H1, H2, or H3 tags.
Ken Krugler
+1 530-210-6378
http
, at 7:22pm, Lance Norskog wrote:
This assumes that the HTML is good quality. I don't know exactly what
your use case is. If you're crawling the web you will find some very
screwed-up HTML.
On Wed, Aug 25, 2010 at 6:45 AM, Ken Krugler
kkrugler_li...@transpac.com wrote:
On Aug 24, 2010, at 10
doesn't work for me.
2. Please include the full stack trace from the RuntimeException.
3. What version of Tika are you using?
Thanks,
-- Ken
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
this type of custom
request handler?
Thanks,
-- Ken
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
-grouping-field-collapsing-with-solr/
Yup, that's the one -
http://blog.jteam.nl/2009/10/20/result-grouping-field-collapsing-with-solr/comment-page-1/#comment-1249
So with some modifications to that patch, it could work...thanks for
the info!
-- Ken
2010/8/4 Ken Krugler kkrugler_li
solution to create my own request handler?
3. And in that case, any input/tips on developing this type of custom
request handler?
Thanks,
-- Ken
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
access to a SolrIndexSearcher is documented very clearly on how to release it
when you are done with it so the ref count can be decremented.
-Hoss
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
.
This then becomes a fairly well-bounded list of terms for an OR
query against the acl-groups field in each file/project document.
Just don't forget to set the boost to 0 for that portion of the query :)
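As a rough sketch of what that query construction could look like, assuming the user's group names are already known on the client. The helper names and the example groups are hypothetical; the field name acl-groups and the zero boost come from the description above.

```python
def acl_filter_clause(groups, field="acl-groups"):
    """Build an OR clause over the user's groups with a boost of 0."""
    if not groups:
        raise ValueError("at least one group is required")
    quoted = []
    for group in groups:
        # Quote each group name and escape embedded backslashes and quotes.
        escaped = group.replace("\\", "\\\\").replace('"', '\\"')
        quoted.append('"' + escaped + '"')
    return field + ":(" + " OR ".join(quoted) + ")^0"


def restrict_query(user_query, groups):
    """Require both the user's query and membership in one of the groups."""
    return "+(" + user_query + ") +" + acl_filter_clause(groups)


restricted = restrict_query("title:solr", ["eng", "ops"])
print(restricted)
```

The ^0 keeps the ACL clause from influencing relevance scores, so it only restricts which documents match.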
-- Ken
Ken Krugler
+1 530-210-6378
http
directory first, before trying any of the solr-specific builds from
inside of the /solr sub-dir. Or at least that's what I ran into when
trying to build a solr dist recently.
-- Ken
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w
:
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
On Jul 1, 2010, at 1:03pm, Ken Krugler wrote:
I've got a version 2.3 index that appears to be valid - I can open
it with Luke 1.0.1, and CheckIndex reports no problem.
[snip]
and Luke overview says:
This time as text:
Index version: 12984d2211c
Index format: -4 (Lucene 2.3)
Index
, 2010 at 7:55 PM, Ken Krugler
kkrugler_li...@transpac.com wrote:
I've got a situation where my data directory (a) needs to live elsewhere
besides inside of Solr home, (b) moves to a different location when updating
indexes, and (c) setting up a symlink from solr_home/data isn't a great option
?
This, of course, assumes that I'm able to programmatically change the
location of the dataDir, which is another issue.
Thanks,
-- Ken
On Fri, May 21, 2010 at 7:55 PM, Ken Krugler
kkrugler_li...@transpac.com wrote:
I've got a situation where my data directory (a) needs to live elsewhere besides
...@gmail.com
wrote:
Check:
http://wiki.apache.org/solr/CoreAdmin
Unless I'm missing something, I think you should be able to sort what you need
On Fri, May 21, 2010 at 7:55 PM, Ken Krugler
kkrugler_li...@transpac.com wrote:
I've got a situation where my data directory (a) needs to live
coordinates.
Regards,
Stevo.
-- Ken
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
to add to noise on the list.
Which approach is preferred?
Thanks,
-- Ken
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
, 2010, at 9:12 PM, Ken Krugler wrote:
I noticed that my prices weren't showing up, even though I've got a
price field.
I think the issue is with this line from hit.vm:
<p><b>#field('name')</b> $!number.currency($doc.getFieldValue('price'))</p>
The number.currency() function needs to get passed
optional field compress feature
--
-Hoss
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
.
Erik
On Jun 18, 2010, at 7:54 PM, Ken Krugler wrote:
Hi Erik,
On Jun 17, 2010, at 8:34pm, Erik Hatcher wrote:
Your wish is my command. Check out trunk, fire up Solr (ant run-example),
index example data, hit http://localhost:8983/solr/browse - type in search box.
Just used
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
a single value without brackets.
-- Ken
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
! :)
Erik
On Jun 17, 2010, at 8:03 PM, Ken Krugler wrote:
I don't believe Solritas supports autocompletion out of the box.
So I'm wondering if anybody has experience using the LucidWorks
distro Solritas, plus the AJAX Solr auto-complete widget.
I realize that AJAX Solr's autocomplete support
?
<response>
<lst name="responseHeader"><int name="status">0</int><int name="QTime">0</int></lst>
<lst name="terms"><lst name="product_name"><int name="rug">7</int></lst></lst>
</response>
Any ideas what I'm doing wrong?
Thanks,
-- Ken
On Jun 17, 2010, at 8:03 PM, Ken Krugler wrote:
I don't believe Solritas supports
plugin, and hooking it up to Solr
facets, but I was curious if there were any tricks or traps in getting
it all to work.
Thanks,
-- Ken
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
terms on.
Cool?! I think so! :)
Erik
On Jun 17, 2010, at 8:03 PM, Ken Krugler wrote:
I don't believe Solritas supports autocompletion out of the box.
So I'm wondering if anybody has experience using the LucidWorks
distro Solritas, plus the AJAX Solr auto-complete widget.
I
... and even if this file needs to be redefined if magic is not used.
When I run my update/extract, I have an error that biosequence/document
does not match any known parser.
Thanks
Olivier
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s
- that's a good option
for cleaning up busted HTML.
-- Ken
http://ken-blog.krugler.org
+1 530-265-2225
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
with low-level SolrCore
instantiation.
Any other approaches?
Thanks,
-- Ken
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
search results. I'm waiting for Hoss to jump in here on
how best to handle that :)
-- Ken
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
to use SolrJ?
Thanks,
-- Ken
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i n g
about Fuzzy searches in logic. But can I really build a
request such as
mydoc.title:wordexample~ AND mydoc.content:( all the content words)~0.9 ?
Thank you for your help
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c w e b m i n i
file and point the stream.url param to file:///parsedfile.txt it works great.
-Jay
--
Ken Krugler
TransPac Software, Inc.
http://www.transpac.com
+1 530-210-6378
terms with following terms. This works well, but is
a bit more complex and can significantly grow the size of your index.
Either of the above requires the type of data analysis you're doing,
to generate the target set of common terms.
-- Ken
--
Ken Krugler
Krugle, Inc.
+1 530-210-6378
If you
source files. Without this
optimization, searches could take several seconds. With it, we got
down to 100ms with lots of breathing room.
-- Ken
--
Ken Krugler
Krugle, Inc.
+1 530-210-6378
If you can't find it, you can't fix it