Re: Mongo DB Users

2014-09-15 Thread Ed Smiley
Remove

On 9/15/14, 8:35 AM, Aaron Susan aaronsus...@gmail.com wrote:

Hi,

I am here to inform you that we are having a contact list of *Mongo DB
Users *would you be interested in it?

Data Field¹s Consist Of: Name, Job Title, Verified Phone Number, Verified
Email Address, Company Name  Address Employee Size, Revenue size, SIC
Code, Industry Type etc.,

We also provide other technology users as well depends on your
requirement.

For Example:


*Red Hat *

*Terra data *

*Net-app *

*NuoDB*

*MongoHQ ** and many more*


We also provide IT Decision Makers, Sales and Marketing Decision Makers,
C-level Titles and other titles as per your requirement.

Please review and let me know your interest if you are looking for above
mentioned users list or other contacts list for your campaigns.

Waiting for a positive response!

Thanks

*Aaron Susan*
Data Specialist

If you are not the right person, feel free to forward this email to the
right person in your organization. To opt out response Remove



TB scale

2014-04-25 Thread Ed Smiley
Anyone with experience, suggestions or lessons learned in the 10 -100 TB scale 
they'd like to share?
Researching optimum design for a Solr Cloud with, say, about 20TB index.
-
Thanks

Ed Smiley, Senior Software Architect, Ebooks
ProQuest | 161 Evelyn Ave. | Mountain View, CA 94041 USA | +1 640 475 8700 ext. 
3772
ed.smi...@proquest.commailto:ed.smi...@proquest.com
www.proquest.comhttp://www.proquest.com/ | 
www.ebrary.comhttp://www.ebrary.com/ | www.eblib.comhttp://www.eblib.com/
ebrary and EBL, ProQuest businesses



Re: TB scale

2014-04-25 Thread Ed Smiley
Not looking for a cookbook.
Just curious to hear some war stories since this is relatively rare.

‹Ed :)
-- 

Ed Smiley, Senior Software Architect, Ebooks
ProQuest | 161 Evelyn Ave. | Mountain View, CA 94041 USA | +1 640 475 8700
ext. 3772
ed.smi...@proquest.com
www.proquest.com http://www.proquest.com/ | www.ebrary.com
http://www.ebrary.com/ | www.eblib.com http://www.eblib.com/
ebrary and EBL, ProQuest businesses
 





On 4/25/14, 2:01 PM, Otis Gospodnetic otis.gospodne...@gmail.com wrote:

Hi Ed,

Unfortunately, there is no good *general* advice, so you'd need to provide
a lot more detail to get useful help.

Otis
--
Performance Monitoring * Log Analytics * Search Analytics
Solr  Elasticsearch Support * http://sematext.com/


On Fri, Apr 25, 2014 at 3:48 PM, Ed Smiley esmi...@ebrary.com wrote:

 Anyone with experience, suggestions or lessons learned in the 10 -100 TB
 scale they'd like to share?
 Researching optimum design for a Solr Cloud with, say, about 20TB index.
 -
 Thanks

 Ed Smiley, Senior Software Architect, Ebooks
 ProQuest | 161 Evelyn Ave. | Mountain View, CA 94041 USA | +1 640 475
8700
 ext. 3772
 ed.smi...@proquest.commailto:ed.smi...@proquest.com
 www.proquest.comhttp://www.proquest.com/ | www.ebrary.com
 http://www.ebrary.com/ | www.eblib.comhttp://www.eblib.com/
 ebrary and EBL, ProQuest businesses





Spurious spellcheck results

2014-04-21 Thread Ed Smiley
Entered https://issues.apache.org/jira/browse/SOLR-5995.

--

Ed Smiley, Senior Software Architect, eBooks
ProQuest | 161 E Evelyn Ave|
Mountain View, CA 94041 | USA |
+1 650 475 8700 extension 3772
ed.smi...@proquest.com
www.proquest.comhttp://www.proquest.com/ | 
www.ebrary.comhttp://www.ebrary.com/ | www.eblib.comhttp://www.eblib.com/
ebrary and EBL, ProQuest businesses.

From: Smiley, Ed esmi...@ebrary.commailto:esmi...@ebrary.com
Date: Thursday, April 17, 2014 at 4:50 PM
To: solr-user@lucene.apache.orgmailto:solr-user@lucene.apache.org 
solr-user@lucene.apache.orgmailto:solr-user@lucene.apache.org
Subject: Spelling suggestions--any ideas?

Correctly spelled words are returning as not spelled correctly, with the 
original, correctly spelled word with a single oddball character appended as 
multiple suggestions...
--

Ed Smiley, Senior Software Architect, eBooks
ProQuest | 161 E Evelyn Ave|
Mountain View, CA 94041 | USA |
+1 650 475 8700 extension 3772
ed.smi...@proquest.commailto:ed.smi...@proquest.com
www.proquest.comhttp://www.proquest.com/ | 
www.ebrary.comhttp://www.ebrary.com/ | www.eblib.comhttp://www.eblib.com/
ebrary and EBL, ProQuest businesses.


Spelling suggestions--any ideas?

2014-04-17 Thread Ed Smiley
Correctly spelled words are returning as not spelled correctly, with the 
original, correctly spelled word with a single oddball character appended as 
multiple suggestions...
--

Ed Smiley, Senior Software Architect, eBooks
ProQuest | 161 E Evelyn Ave|
Mountain View, CA 94041 | USA |
+1 650 475 8700 extension 3772
ed.smi...@proquest.com
www.proquest.comhttp://www.proquest.com/ | 
www.ebrary.comhttp://www.ebrary.com/ | www.eblib.comhttp://www.eblib.com/
ebrary and EBL, ProQuest businesses.


Odd extra character duplicates in spell checking

2014-04-15 Thread Ed Smiley
Hi,
I am going to make this question pretty short, so I don’t overwhelm with 
technical details until  the end.
I suspect that some folks may be seeing this issue without the particular 
configuration we are using.

What our problem is:

  1.  Correctly spelled words are returning as not spelled correctly, with the 
original, correctly spelled word with a single oddball character appended as 
multiple suggestions.
  2.  Incorrectly spelled words are returning correct spelling suggestions with 
a single oddball character appended as multiple suggestions.
  3.  We’re seeing this in Solr 4.5x and 4.7x.

Example:

The return values are all a single character (unicode shown in square brackets).

correction=attitude[2d]
correction=attitude[2f]
correction=attitude[2026]

Spurious characters:

  *   Unicode Character 'HYPHEN-MINUS' (U+002D)
  *   Unicode Character 'SOLIDUS' (U+002F)
  *   Unicode Character 'HORIZONTAL ELLIPSIS' (U+2026)

Anybody see anything like this?  Anybody fix something like this?

Thanks!
—Ed


OK, here’s the gory details:


What we are doing:
We have developed an application that returns  did you mean” spelling 
alternatives against a specific (presumably misspelled word).
We’re using the vocabulary of indexed pages of a specified book as the source 
of the alternatives, so this is not a general dictionary spell check, we are 
returning only matching alternatives.
So when I say “correctly spelled” I mean they are words found on at least one 
page.  We are using the collations, so that we restrict ourselves to those 
pages in one book.
We are having to check for and “fix up” these faulty results.  That’s not a 
robust or desirable solution.

We are using SolrJ to get the collations,
  private static final String DID_YOU_MEAN_REQUEST_HANDLER = 
/spell”;
….
SolrQuery query = new SolrQuery(q);
query.set(spellcheck, true);
query.set(SpellingParams.SPELLCHECK_COUNT, 10);
query.set(SpellingParams.SPELLCHECK_COLLATE, true);
query.set(SpellingParams.SPELLCHECK_COLLATE_EXTENDED_RESULTS, true);
query.set(wt, json);
query.setRequestHandler(DID_YOU_MEAN_REQUEST_HANDLER);
query.set(shards.qt, DID_YOU_MEAN_REQUEST_HANDLER);
query.set(shards.tolerant, true);
etc……

but we can duplicate the behavior without SolrJ with the collations/ 
misspellingsAndCorrections below:, e.g.:
solr/pg1/spell?q=+doc-id:(810500)+AND+attitudexspellcheck=truespellcheck.count=10spellcheck.collate=truespellcheck.collateExtendedResults=truewt=jsonqt=%2Fspellshards.qt=%2Fspellshards.tolerant=true.out.print


{responseHeader:{status:0,QTime:60},response:{numFound:0,start:0,maxScore:0.0,docs:[]},spellcheck:{suggestions:[attitudex,{numFound:6,startOffset:21,endOffset:30,origFreq:0,suggestion:[{word:attitudes,freq:362486},{word:attitu
 dex,freq:4819},{word:atti tudex,freq:3254},{word:attit 
udex,freq:159},{word:attitude-,freq:1080},{word:attituden,freq:261}]},correctlySpelled,false,collation,[collationQuery,
 doc-id:(810500) AND 
attitude-,hits,2,misspellingsAndCorrections,[attitudex,attitude-]],collation,[collationQuery,
 doc-id:(810500) AND 
attitude/,hits,2,misspellingsAndCorrections,[attitudex,attitude/]],collation,[collationQuery,
 doc-id:(810500) AND 
attitude…,hits,2,misspellingsAndCorrections,[attitudex,attitude…]]]}}

The configuration is:

requestHandler name=/spell class=solr.SearchHandler startup=lazy

lst name=defaults

  str name=dftext/str

  str name=spellcheck.dictionarydefault/str

  str name=spellcheck.dictionarywordbreak/str

  str name=spellcheckon/str

  str name=spellcheck.extendedResultstrue/str

  str name=spellcheck.count10/str

  str name=spellcheck.alternativeTermCount5/str

  str name=spellcheck.maxResultsForSuggest5/str

  str name=spellcheck.collatetrue/str

  str name=spellcheck.collateExtendedResultstrue/str

  str name=spellcheck.maxCollationTries10/str

  str name=spellcheck.maxCollations5/str

name=last-components

  strspellcheck/str

/arr

  /requestHandler


lst name=spellchecker

  str name=namewordbreak/str

  str name=classnamesolr.WordBreakSolrSpellChecker/str

  str name=fieldtext/str

  str name=combineWordstrue/str

  str name=breakWordstrue/str

  int name=maxChanges25/int

  int name=minBreakLength3/int

/lst


lst name=spellchecker

  str name=namedefault/str

  str name=fieldtext/str

  str name=classnamesolr.DirectSolrSpellChecker/str

  str name=distanceMeasureinternal/str

  float name=accuracy0.2/float

  int name=maxEdits2/int

  int name=minPrefix1/int

  int name=maxInspections25/int

  int name=minQueryLength4/int

  float name=maxQueryFrequency1/float

/lst

--

Ed Smiley, Senior Software Architect, eBooks
ProQuest | 161 E Evelyn Ave|
Mountain View, CA 94041 | USA |
+1 650 475 8700 extension 3772
ed.smi