[OFFER] Consulting job with search specialists based in Cambridge UK

2013-01-09 Thread Charlie Hull
work on Lucene/Solr projects would be useful. Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Solr consultant recommendation

2013-04-24 Thread Charlie Hull
and notify the dispatcher by return e-mail or at +45 36 99 00 00 P Please consider the environment before printing this mail note. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Percolate feature?

2013-08-05 Thread Charlie Hull
Boolean strings representing their clients' interests and may monitor hundreds of thousands of news stories every day). It also records the positions of every match. We suspect it's a lot faster and more flexible than Elasticsearch's Percolate feature. Cheers Charlie -- Charlie Hull Flax - Open

Re: Percolate feature?

2013-10-01 Thread Charlie Hull
/ Performance Monitoring -- http://sematext.com/spm On Mon, Aug 5, 2013 at 6:34 AM, Charlie Hull char...@flax.co.uk wrote: On 03/08/2013 00:50, Mark wrote: We have a set number of known terms we want to match against. In Index: term one term two term three I know how to match all terms of a user

Re: Revolution writeup

2013-11-26 Thread Charlie Hull
. Sorry if I missed your talk -- I'm hoping to catch up when the videos are posted... http://blog.safariflow.com/2013/11/25/this-revolution-will-be-televised/ -Mike Sokolov -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web

Introducing Luwak for high-performance stored Lucene queries

2013-12-06 Thread Charlie Hull
monitoring applications but it could equally be useful for categorisation, classification etc. It's currently based on a fork of Lucene (details supplied) but hopefully it'll work with release versions soon. Feedback is very welcome! Cheers Charlie -- Charlie Hull Flax - Open Source

Re: Introducing Luwak for high-performance stored Lucene queries

2013-12-06 Thread Charlie Hull
-- Performance Monitoring * Log Analytics * Search Analytics Solr Elasticsearch Support * http://sematext.com/ On Fri, Dec 6, 2013 at 9:29 AM, Charlie Hull char...@flax.co.uk wrote: Hi all, We've now released the library we mentioned in our presentation at Lucene Revolution: https://github.com/flaxsearch

Re: Solr hanging when extracting a some broken .doc files

2013-12-18 Thread Charlie Hull
there are such horrors as 3000 page PDFs!). We usually run it in an external process so it can be watched and killed if necessary. Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Solr hanging when extracting a some broken .doc files

2013-12-19 Thread Charlie Hull
/in/alexandrerafalovitch - Time is the quality of nature that keeps events from happening all at once. Lately, it doesn't seem to be working. (Anonymous - via GTD book) On Wed, Dec 18, 2013 at 3:47 PM, Charlie Hull char...@flax.co.uk wrote: On 17/12/2013 15:29, Augusto Camarotti wrote: Hi guys, I'm

Re: Zookeeper as Service

2014-01-09 Thread Charlie Hull
error messages end up in a black hole, with you simply getting something unhelpful 'service failed to start' error messages from Windows itself if something goes wrong. The 'working directory' is another thing that needs careful setting up. Cheers Charlie -- Charlie Hull Flax - Open Source

Re: Alternatives to GATE?

2014-01-16 Thread Charlie Hull
://annomarket.com/ HTH Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Implementing an alerting feature

2014-01-27 Thread Charlie Hull
Charlie -- View this message in context: http://lucene.472066.n3.nabble.com/Implementing-an-alerting-feature-tp4113666.html Sent from the Solr - User mailing list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767

Re: Solr is NoSQL database or not?

2014-03-03 Thread Charlie Hull
in their definitions :) C -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: OCR - Saving multi-term position

2014-07-03 Thread Charlie Hull
from Lucene Revolution is about this kind of thing: http://www.youtube.com/watch?v=rmRCsrJp2A8 Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Any Solr consultants available??

2014-07-25 Thread Charlie Hull
: http://www.solr-start.com/ and @solrstart Solr popularizers community: https://www.linkedin.com/groups?gid=6713853 -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Solr vs ElasticSearch

2014-08-01 Thread Charlie Hull
On 01/08/2014 06:43, Alexandre Rafalovitch wrote: Maybe Charlie Hull can answer that: https://twitter.com/FlaxSearch/status/494859596117602304 . He seems to think that - at least in some cases - Solr is faster. I'll try to expand on the tweet. Firstly, this is a totally unscientific

Re: Solr vs ElasticSearch

2014-08-01 Thread Charlie Hull
://www.outerthoughts.com/ and @arafalov Solr resources and newsletter: http://www.solr-start.com/ and @solrstart Solr popularizers community: https://www.linkedin.com/groups?gid=6713853 On Fri, Aug 1, 2014 at 3:44 PM, Charlie Hull char...@flax.co.uk wrote: On 01/08/2014 06:43, Alexandre Rafalovitch wrote

Re: Filter cache pollution during sharded edismax queries

2014-09-30 Thread Charlie Hull
Hi, We've just found a very similar issue at a client installation. They have around 27 million documents and are faceting on fields with high cardinality, and are unhappy with query performance and the server hardware necessary to make this performance acceptable. Last night we noticed the

Re: Filter cache pollution during sharded edismax queries

2014-10-01 Thread Charlie Hull
to stream all the return values back again. Alan -- Sincerely yours Mikhail Khludnev Principal Engineer, Grid Dynamics http://www.griddynamics.com mkhlud...@griddynamics.com -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web

Re: Filter cache pollution during sharded edismax queries

2014-10-08 Thread Charlie Hull
the filter cache to handle a lot of refinements. I'm happy to report that in our case setting facet.limit=-1 has a significant impact on performance, cache hit ratios and reduced CPU load. Thanks to all who replied! Cheers Charlie Flax Jim 2014-10-01 10:24 GMT+02:00 Charlie Hull char

Re: Solr Cloud has lower performance with more servers

2014-10-09 Thread Charlie Hull
-- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

New Meetup in London - Lucene/Solr User Group

2014-10-27 Thread Charlie Hull
. Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: A bad idea to store core data directory over NAS?

2014-11-05 Thread Charlie Hull
In our experience yes, it's a bad idea. Charlie On 5 November 2014 10:27, Walter Underwood wun...@wunderwood.org wrote: My experience was with Solr 1.2 and regular old NFS, so that was probably worst case. I was very surprised that it was that bad, though. So benchmark it before you assume

Re: New Meetup in London - Lucene/Solr User Group

2014-11-18 Thread Charlie Hull
On 27/10/2014 14:25, Charlie Hull wrote: Hi all, We noticed that there isn't a Lucene/Solr user group in London (although there is an Elasticsearch user group) - so we decided to start one! http://www.meetup.com/Apache-Lucene-Solr-London-User-Group Please join if you're interested and do pass

Comparing Solr Elasticsearch performance

2014-12-09 Thread Charlie Hull
Hi all, We've been working on a study of any performance differences between Solr and Elasticsearch and we've also published the code we used - here's the background with links to Github http://www.flax.co.uk/blog/2014/12/09/comparing-solr-and-elasticsearch-heres-the-code-we-used/ Cheers

Re: Comparing Solr Elasticsearch performance

2014-12-09 Thread Charlie Hull
-- Charlie Hull www.flax.co.uk On Dec 9, 2014 5:22 PM, Alexandre Rafalovitch arafa...@gmail.com wrote: I guess when you said you did not tune instances, you really really meant it. The Solr one looks like an example one with all the config files and Carrot enabled, etc. I was hoping for a bit more

Using SolrCloud to implement a kind of federated search

2015-01-20 Thread Charlie Hull
Hi all, We've been discussing a way of implementing a federated search by leveraging the distributed query parts of SolrCloud. I've written this up at http://www.flax.co.uk/blog/2015/01/20/solr-superclusters-for-improved-federated-search/ and would welcome any comments or feedback. So far, two

Re: OutOfMemoryError for PDF document upload into Solr

2015-01-16 Thread Charlie Hull
(AprEndpoin t.java:2451) Thanks Ganesh -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: FOSDEM Open source search devroom

2015-01-06 Thread Charlie Hull
place meet up and talk shop. I'll be there, and I hope some of you will as well. Sadly I won't, but my colleague (and committer) Alan Woodward will, talking about text search for stream processing. C - Bram [1] https://fosdem.org/2015/schedule/track/open_source_search/ -- Charlie Hull

Re: Geo Aggregations and Search Alerts in Solr

2015-02-24 Thread Charlie Hull
soon! There are a couple of videos on that page that will explain further. We suspect our approach is considerably faster than the Percolator, and it's on the list to benchmark the two. Cheers Charlie Thank you. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700

Re: Creating facets based on the content field

2015-03-23 Thread Charlie Hull
-occurring words in the PDFs, after reading them.) Many thanks. Philippe -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Solr + RDF = SolRDF

2015-04-28 Thread Charlie Hull
://andreagazzarini.blogspot.it/2014/12/a-solr-rdf-store-and-sparql-endpoint-in.html [2] http://andreagazzarini.blogspot.it/2015/04/rdf-faceting-with-apache-solr-solrdf.html [3] https://github.com/agazzarini/SolRDF/wiki/Faceted%20search -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0

Re: Indexing PDF and MS Office files

2015-04-16 Thread Charlie Hull
your system immediately and notify us either by e-mail or telephone. You should not copy, forward or otherwise disclose the content of the e-mail. The views expressed in this communication may not necessarily be the view held by WHISHWORKS. -- Charlie Hull Flax - Open Source Enterprise Search

Re: Merging Sets of Data from Two Different Sources

2015-06-11 Thread Charlie Hull
-Sets-of-Data-from-Two-Different-Sources-tp4211166p4211172.html Sent from the Solr - User mailing list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Merging Sets of Data from Two Different Sources

2015-06-11 Thread Charlie Hull
-from-Two-Different-Sources-tp4211166p4211169.html Sent from the Solr - User mailing list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Merging Sets of Data from Two Different Sources

2015-06-11 Thread Charlie Hull
.n3.nabble.com/Merging-Sets-of-Data-from-Two-Different-Sources-tp4211166.html Sent from the Solr - User mailing list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Deleting Fields

2015-06-01 Thread Charlie Hull
/blog/2011/06/24/how-to-remove-a-stored-field-in-lucene/ Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Use faceted search to drill down in hierarchical structure and omit node data outside current selection

2015-07-29 Thread Charlie Hull
-tp4219384p4219517.html Sent from the Solr - User mailing list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Solr 5 options

2015-07-15 Thread Charlie Hull
to guess on the schema, I would want to explicitly define Solr's behavior ... but not everyone does things the same way that I do. Thanks, Shawn -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Solr node removed from zookeeper

2015-10-28 Thread Charlie Hull
for the long post. Thank you, Andrei -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-node-removed-from-zookeeper-tp4236931.html Sent from the Solr - User mailing list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334

Re: Closing Windows CMD kills Solr

2015-10-29 Thread Charlie Hull
Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Bioinformatics search event in Cambridge UK Feb 3rd & 4th 2016

2015-10-14 Thread Charlie Hull
linked to our project BioSolr which is developing Solr features for bioinformaticians such as ontology indexers, JOINs with external data and faceting improvements (although we're hoping they're also of general use). Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax

Re: DIH parallel processing

2015-10-15 Thread Charlie Hull
. There are lots of great examples of high-performance indexing code available e.g.: http://bryanbende.com/development/2014/08/16/indexing-wikipedia-with-apache-solr/ Best Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web

Re: Instant Page Previews

2015-10-08 Thread Charlie Hull
course, it requires a community approach. Yes...and in an enterprise situation, this will depend on users spending time working on enhancing content, which is a battle seldom won :) Charlie Maybe both are needed if there's an infinite budget... Paul Charlie Hull <mailto:char...@flax.co.uk>

Re: Instant Page Previews

2015-10-08 Thread Charlie Hull
/previewgen It uses a headless version of Open Office under the hood to generate thumbbnail previews for various common file types, plus some ImageMagick for PDF, all wrapped up in Python. Bear in mind this is 6 years old so some updating might be required! Cheers Charlie -- Charlie Hull Flax - Open

Re: Instant Page Previews

2015-10-08 Thread Charlie Hull
/previewgen It uses a headless version of Open Office under the hood to generate thumbbnail previews for various common file types, plus some ImageMagick for PDF, all wrapped up in Python. Bear in mind this is 6 years old so some updating might be required! Cheers Charlie -- Charlie Hull Flax - Open

Re: Can I instruct the Tika Entity Processor to skip the first page using the DIH?

2015-07-09 Thread Charlie Hull
.nabble.com/Can-I-instruct-the-Tika-Entity-Processor-to-skip-the-first-page-using-the-DIH-tp4216373.html Sent from the Solr - User mailing list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Please answer my question on StackOverflow ... Best approach to guarantee commits in SOLR

2015-08-26 Thread Charlie Hull
. Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Charlie Hull
n when you decide how to shard. If most of your queries are for recent articles, then some shards will be loaded far more than others. Here's a rather old blog post we wrote on the subject (actually based on Xapian, another open source search engine, but the concepts are the same for Solr): http://www

Re: Can StandardTokenizerFactory works well for Chinese and English (Bilingual)?

2015-09-30 Thread Charlie Hull
been a little painful. Have you tried to use HMMChineseTokenizer and JiebaTokenizer as well? I don't think so. Charlie Regards, Edwin On 25 September 2015 at 18:46, Charlie Hull <char...@flax.co.uk> wrote: On 25/09/2015 11:43, Zheng Lin Edwin Yeo wrote: Hi Charlie, Thanks fo

Re: Can StandardTokenizerFactory works well for Chinese and English (Bilingual)?

2015-09-30 Thread Charlie Hull
September 2015 at 16:20, Charlie Hull <char...@flax.co.uk> wrote: On 30/09/2015 04:09, Zheng Lin Edwin Yeo wrote: Hi Charlie, Hi, I've checked that Paoding's code is written for Solr 3 and Solr 4 versions. It is not written for Solr 5, thus I was unable to use it in my Solr 5.x v

Re: Facet queries blow out the filterCache

2015-10-02 Thread Charlie Hull
; -- Sincerely yours Mikhail Khludnev Principal Engineer, Grid Dynamics <http://www.griddynamics.com> <mkhlud...@griddynamics.com> -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Can StandardTokenizerFactory works well for Chinese and English (Bilingual)?

2015-09-25 Thread Charlie Hull
check, will StandardTokenizerFactory works well for indexing both English and Chinese (Bilingual) documents, or do we need tokenizers that are customised for chinese (Eg: HMMChineseTokenizerFactory)? Regards, Edwin -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile:

Re: Can StandardTokenizerFactory works well for Chinese and English (Bilingual)?

2015-09-25 Thread Charlie Hull
on the Paoding? Solr v4.6 I believe. Charlie Regards, Edwin On 25 September 2015 at 16:43, Charlie Hull <char...@flax.co.uk> wrote: On 23/09/2015 16:23, Alexandre Rafalovitch wrote: You may find the following articles interesting: http://discovery-grindstone.blogspot.ca/2014/01/searching-i

Re: NRT vs Redis for Dynamic Data in SOLR (like counts, viewcounts, etc) -

2015-12-15 Thread Charlie Hull
t. -- Sincerely yours Mikhail Khludnev Principal Engineer, Grid Dynamics <http://www.griddynamics.com> <mkhlud...@griddynamics.com> -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Many patterns against many sentences, storing all results

2016-01-06 Thread Charlie Hull
-scale prototype was done with postgres full text searching, but that can't do exact phrase matching or other more sophisticated searches, so it's out. Thanks very much Will -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web

Re: Issues when indexing PDF files

2015-12-17 Thread Charlie Hull
es of "??" or an empty content. I'm using the post.jar that comes together with Solr. What could be the reason that causes this? Regards, Edwin -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Solr Search: Access Control / Role based security

2015-11-18 Thread Charlie Hull
Tyger, tyger burning bright In the forests of the night, What immortal hand or eye Could frame thy fearful symmetry?" William Blake - Songs of Experience -1794 England -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Bypassing ExtractingRequestHandler

2016-06-10 Thread Charlie Hull
pulse of the project? Thanks, Justin -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Add a new field dynamically to each of the result docs and sort on it

2016-06-01 Thread Charlie Hull
oesn't involve "Y" at all? See Also: http://www.perlmonks.org/index.pl?node_id=542341 -Hoss http://www.lucidworks.com/ -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: How is Tika used with Solr

2016-02-10 Thread Charlie Hull
cess application or does it link with Tika JARs directly? If it links in directly, are there known issues with Solr integrated with Tika because of Tika issues? Thanks Steve -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

XJoin, a way to use external data sources with Solr

2016-01-29 Thread Charlie Hull
: http://www.flax.co.uk/blog/2016/01/25/xjoin-solr-part-1-filtering-using-price-discount-data/ http://www.flax.co.uk/blog/2016/01/29/xjoin-solr-part-2-click-example/ We're very interested in other use cases - one that occurs to us is security filtering. Cheers Charlie -- Charlie Hull Flax

Re: Tutorial or Code Samples to explain how to Write Solr Plugins

2016-02-03 Thread Charlie Hull
Here's one we wrote recently for indexing ontologies with Solr as part of the BioSolr project: https://github.com/flaxsearch/BioSolr/tree/master/ontology/solr and a presentation on how it works (explained in the second half of the talk) https://www.youtube.com/watch?v=v1qKNX_axdI - hope this

Re: What search metrics are useful?

2016-02-25 Thread Charlie Hull
a video or presentation on search metrics that would be useful? -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Reverse Eningeer Query For a Given Result Set?

2016-02-18 Thread Charlie Hull
. HTH, Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: using data from external service in Solr: value source or auxiliary core?

2016-03-14 Thread Charlie Hull
-value-source-or-auxiliary-core-tp4263334.html Sent from the Solr - User mailing list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Relevancy for "tablet"

2016-03-09 Thread Charlie Hull
Berryman's new book 'Relevant Search' (available on MEAP at Manning Publications) which is an excellent take on this. In short, you need a sensible methodology for tuning relevance, otherwise it can easily become a game of whack-a-mole! Cheers Charlie -- Charlie Hull Flax - Open Source Enterpr

Re: What is the best way to index 15 million documents of total size 425 GB?

2016-03-04 Thread Charlie Hull
, State and University Library, Denmark -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Indexing Twitter - Hypothetical

2016-03-04 Thread Charlie Hull
shards if there is not enough resent results is an example. I highly doubt that a single SolrCloud is the best answer here. Maybe one cloud for each month and a lot of external logic? - Toke Eskildsen -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 8258

Re: Hierarchial Support - Solr

2016-05-19 Thread Charlie Hull
| 13 | 131 | | 1 | 13 | 132 | | 1 | 13 | 133 | -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: [scottchu] What kind of configuration to use for this size ofnews data?

2016-05-11 Thread Charlie Hull
/11 (週三) - Original Message - From: Charlie Hull To: solr-user@lucene.apache.org CC: Date: 2016/5/11 (週三) 16:21 Subject: Re: [scottchu] What kind of configuration to use for this size ofnews data? On 11/05/2016 04:27, scott.chu wrote: Fix some typos, add some words and resend same question

Re: dtSearch parser & Introduction

2016-05-13 Thread Charlie Hull
think it'd be great if I could get a bare-bones example of a parser so that I can modify it--perhaps even keeping it in a separate Java project. Don't feel like you have to answer all of my questions--an answer to any of them would be quite helpful. Thank you guys and God bless! -- Charlie Hull F

Re: dtSearch parser & Introduction

2016-05-13 Thread Charlie Hull
On 13/05/2016 10:41, Charlie Hull wrote: On 12/05/2016 23:50, Brandon Miller wrote: Hello, all! I'm a BloombergBNA employee and need to obtain/write a dtSearch parser for solr (and probably a bunch of other things a little later). I've looked at the available parsers and thought

Re: [scottchu] What kind of configuration to use for this size of news data?

2016-05-11 Thread Charlie Hull
as this possibility of needs.) Yes, I guess so, but why copy it when you could just search it with a filter for the paper types? I'd like to hear and use some well suggestion and experiences. Thanks in advance and best regards. Scott Chu @ 2016/5/11 11:26 GMT+8 Hope this helps! Cheers C

Re: Verifying - SOLR Cloud replaces load balancer?

2016-04-19 Thread Charlie Hull
y, this can increase performance if you are returning large amounts of data - many or large fields or many documents. Cheers Tom -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Running out of disk space for Solr, a proposed solution

2016-04-21 Thread Charlie Hull
://github.com/flaxsearch/harahachibu There's a blog post explaining how and why we built it at http://www.flax.co.uk/blog/2016/04/21/running-disk-space-elasticsearch-solr/ Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web

Re: Solr and Drupal

2016-08-10 Thread Charlie Hull
eing set in the Solr configuration files. This is a generic issue when Solr or another search engine is embedded in another product - the people doing the embedding may not know enough about search to do it right. In any case, you'll probably be fine, but do be aware. Cheers Charlie -- Cha

Re: Solr more like this

2016-07-06 Thread Charlie Hull
file and get mlt result.can I do this?? If Solr hasn't indexed a PDF file, it can't work out it's 'like this'. So I'd say, no, you can't. Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: How to combine third party search data as top results ?

2017-02-01 Thread Charlie Hull
from an external system with Solr. Here are two blog posts about it: http://www.flax.co.uk/blog/2016/01/25/xjoin-solr-part-1-filtering-using-price-discount-data/ http://www.flax.co.uk/blog/2016/01/29/xjoin-solr-part-2-click-example/ Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise

Re: Upserting doc fields from a SearchComponent

2017-02-01 Thread Charlie Hull
more broadly, has some experience in personalizing a search response in the Solr guts. Best Ugo -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: A tool to quickly browse Solr documents ?

2017-01-24 Thread Charlie Hull
://github.com/flaxsearch/marple Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Announcing Marple, a RESTful API & GUI for inspecting Lucene indexes

2017-02-24 Thread Charlie Hull
a work in progress (we started it at the Lucene hackday we ran in London last autumn) so contributions, bug reports & feature requests very welcome! We'll also be talking about it at the next London Lucene/Solr Meetup on March 23rd. Best Charlie -- Charlie Hull Flax - Open Source Enterp

Re: Announcing Marple, a RESTful API & GUI for inspecting Lucene indexes

2017-02-24 Thread Charlie Hull
On 24/02/2017 17:24, Charlie Hull wrote: Hi all, Very pleased to announce the first release of Marple, an open source tool for inspecting Lucene indexes. We've blogged about it here: http://www.flax.co.uk/blog/2017/02/24/release-1-0-marple-lucene-index-detective/ which contains links

Re: minimal solrconfig example

2017-03-02 Thread Charlie Hull
of hacking large chunks of it out and seeing what breaks what. Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Apache SOLR Search Errors ?

2016-09-06 Thread Charlie Hull
list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Writing Solr Custom Components

2016-10-05 Thread Charlie Hull
-indexing/ Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: (ANNOUNCEMENT) Solr Examples reading group

2016-10-05 Thread Charlie Hull
rectly and share them via email or in person. The opinions do not have to be positive, though having them constructive would be an nice. :-) Newsletter and resources for Solr beginners and intermediates: http://www.solr-start.com/ -- Charlie Hull Flax - Open Source Enterprise Search tel/

Re: London Lucene Hackday is now running

2016-10-07 Thread Charlie Hull
ing reports out > of Jira exports. > > On 7 Oct 2016 4:52 PM, "Charlie Hull" <char...@flax.co.uk> wrote: > > > Hi all, > > > > We're running a Lucene hackday in London - you can follow along with > > Twitter using hashtag #LuceneSolrLondon and see

London Lucene Hackday is now running

2016-10-07 Thread Charlie Hull
Hi all, We're running a Lucene hackday in London - you can follow along with Twitter using hashtag #LuceneSolrLondon and see what we're doing on Github at https://github.com/flaxsearch/london-hackday-2016 - as the README shows we're currently looking at: 1. A Browser-driven explorer for

Hackday next month

2016-09-21 Thread Charlie Hull
://www.meetup.com/New-England-Search-Technologies-NEST-Group/events/233492535/ Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Hackday next month

2016-09-22 Thread Charlie Hull
day before ? Not sure about others but it certainly would work much better for me. -Anshum On Wed, Sep 21, 2016 at 2:18 PM Charlie Hull <char...@flax.co.uk> wrote: Hi all, If you're coming to Lucene Revolution next month in Boston, we're running a Lucene-focused hackday (Lucene, Sol

Three Lucene hackdays coming soon

2016-08-24 Thread Charlie Hull
to achieve! Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Solr/lucene "planet" + recommendations for blogs to follow

2016-11-23 Thread Charlie Hull
Hi all, We also blog about various Solr topics at www.flax.co.uk/blog and also run the London Lucene/Solr Meetup. I'd encourage you to attend a Meetup if you can find one locally, they're great places to hear about Solr projects and meet others working in the field. Alex & others efforts in

Re: London Lucene Hackday is now running

2016-10-21 Thread Charlie Hull
On 07/10/2016 10:52, Charlie Hull wrote: Hi all, We're running a Lucene hackday in London - you can follow along with Twitter using hashtag #LuceneSolrLondon and see what we're doing on Github at https://github.com/flaxsearch/london-hackday-2016 - as the README shows we're currently looking

Re: London Lucene Hackday is now running

2016-10-12 Thread Charlie Hull
th streaming. > > > > On Friday, October 7, 2016 5:24 PM, Charlie Hull <char...@flax.co.uk> > wrote: > > > Yes I'll blog about it and we'll try and get as much as possible captured > in the Github folder. If you've got ideas for Tuesday please could you add > them to that even

Re: The state of Solr 5. Is it in maintenance mode only?

2016-11-29 Thread Charlie Hull
committed for SOLR-2242. The changes for SOLR-6348 were committed to 5.2 and 6.0. I have updated the fix versions in the older issue to match. The versions should probably all be removed, but I am not sure what our general rule is for duplicates. Thanks, Shawn -- Charlie Hull Flax - Open Source

Re: [ANN] InvisibleQueriesRequestHandler

2016-12-05 Thread Charlie Hull
ing Solr is doing (e.g. Hybris, Drupal...) so to be able to run multiple searches in Solr itself is very useful. Nice one! Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: SolrCloud different score for same document on different replicas.

2017-01-05 Thread Charlie Hull
hub.com/flaxsearch/london-hackday-2016 I'm not sure there is a way to get a homogenous score - this patch tries to keep you connected to the same replica during a session so you don't see results jumping over pagination. Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search

Re: Stop Solr Node (in distress)?

2016-12-20 Thread Charlie Hull
-- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Partial Match with DF

2017-03-16 Thread Charlie Hull
Hi Mark, Open Source Connection's excellent www.splainer.io might also be useful to help you break down exactly what your query is doing. Cheers Charlie P.S. planning a blog soon listing 'useful Solr tools' On 16 March 2017 at 14:39, Mark Johnson wrote: >

  1   2   3   >