Huh, I didn't spot the where clause the first time around - I usually group all my indexes, has, set_property, where (etc) methods together in roughly that order.
Let's see if we can get the right value from Sphinx in a search: ThinkingSphinx.search_for_ids( :index => 'asset_core', :max_matches => 100_000 ).total_entries # vs Asset.search_count # vs Asset.search_count(:max_matches => 100_000) -- Pat On 03/03/2011, at 5:30 AM, Jake wrote: > For a moment just now, I thought it was the where statement in the > define_index block, but I checked the db and all Assets have the > 'approved' status. Here is the output from rake ts:index. It says it > collected 1827 docs for asset_core, which is the right amount. > > Generating Configuration to /Users/jakemack/Work/Repositories/git/ > RightsGenie/config/development.sphinx.conf > Sphinx 0.9.9-release (r2117) > Copyright (c) 2001-2009, Andrew Aksyonoff > > using config file '/Users/jakemack/Work/Repositories/git/RightsGenie/ > config/development.sphinx.conf'... > indexing index 'asset_core'... > collected 1827 docs, 0.2 MB > collected 3703 attr values > sorted 0.0 Mvalues, 100.0% done > sorted 0.7 Mhits, 98.2% done > total 1827 docs, 151014 bytes > total 0.894 sec, 168857 bytes/sec, 2042.87 docs/sec > distributed index 'asset' can not be directly indexed; skipping. > indexing index 'company_core'... > collected 4 docs, 0.0 MB > sorted 0.0 Mhits, 100.0% done > total 4 docs, 81 bytes > total 0.044 sec, 1840 bytes/sec, 90.87 docs/sec > distributed index 'company' can not be directly indexed; skipping. > indexing index 'person_core'... > collected 6 docs, 0.0 MB > sorted 0.0 Mhits, 100.0% done > total 6 docs, 74 bytes > total 0.017 sec, 4197 bytes/sec, 340.32 docs/sec > distributed index 'person' can not be directly indexed; skipping. > total 16 reads, 0.009 sec, 267.9 kb/call avg, 0.5 msec/call avg > total 40 writes, 0.011 sec, 264.9 kb/call avg, 0.2 msec/call avg > rotating indices: succesfully sent SIGHUP to searchd (pid=28312). > > > On Mar 2, 2:00 am, Pat Allan <[email protected]> wrote: >> Hi Jake >> >> What's the output when you index the data? How many records does Sphinx say >> it's processed? >> >> -- >> Pat >> >> On 02/03/2011, at 8:09 AM, Jake wrote: >> >>> I'll post my new numbers so you can see the latest. I'm using MySQL and in >>> my sphinx.yml file I've set max_matches: 100000 and a stopwords file for my >>> local development. I've also added pid_file and searchd_file_path for >>> staging and production, but I'm assuming that's irrelevant. My smallest id >>> for the Asset model is 30 and the largest is 5479. Here are the set of ids >>> that are not being returned through Asset.search: >> >>> [5318, 5319, 5320, 5321, 5322, 5323, 5324, 5325, 5326, 5327, 5328, 5329, >>> 5330, 5331, 5332, 5333, 5334, 5335, 5336, 5337, 5338, 5339, 5340, 5341, >>> 5342, 5343, 5344, 5345, 5346, 5347, 5348, 5349, 5350, 5351, 5352, 5353, >>> 5354, 5355, 5356, 5357, 5358, 5359, 5360, 5361, 5392, 5393, 5394] >> >>> It seems that there was a time when something about creating the assets >>> caused them to not be indexed, but I don't know what would have caused >>> that. I've tried comparing the various columns for these un-indexed Assets, >>> but haven't been able to notice a pattern. I'm really just at a loss of >>> where to look next. Let me know if any other information would be helpful >>> and thanks for your help. >> >>> -- >>> You received this message because you are subscribed to the Google Groups >>> "Thinking Sphinx" group. >>> To post to this group, send email to [email protected]. >>> To unsubscribe from this group, send email to >>> [email protected]. >>> For more options, visit this group >>> athttp://groups.google.com/group/thinking-sphinx?hl=en. > > -- > You received this message because you are subscribed to the Google Groups > "Thinking Sphinx" group. > To post to this group, send email to [email protected]. > To unsubscribe from this group, send email to > [email protected]. > For more options, visit this group at > http://groups.google.com/group/thinking-sphinx?hl=en. > -- You received this message because you are subscribed to the Google Groups "Thinking Sphinx" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/thinking-sphinx?hl=en.
