Hi,
I'm adding a new index to my application. It looks like this:
    class Account < ActiveRecord::Base
      define_index do
        indexes account_name.first_name
        indexes account_name.last_name
        indexes email_addresses.email_address
        has created_at
        set_property :delta => :datetime, :threshold => 2.minutes
      end
    end
I'm testing how long the full index takes to generate on a staging
server where we typically have only sanitized data from production.
But for this task, I'm working with our entire accounts,
account_names, and email_addresses tables from production.
When I generate the index, I get the following warning during the
accounts index phase:
WARNING: duplicate document ids found
In the Rails console, I observe the following:
>> Account.search.total_entries
=> 260793
>> Account.count
=> 602083
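For scale, the gap between those two counts can be worked out in plain Ruby. This is only arithmetic on the numbers above; the constant names and the index_gap helper are illustrative and not part of Thinking Sphinx.

```ruby
# Counts copied from the console session above.
INDEXED_ENTRIES = 260_793   # Account.search.total_entries
TABLE_ROWS      = 602_083   # Account.count

# Hypothetical helper (not a Thinking Sphinx API): reports how many rows
# are absent from the index and what share of the table is covered.
def index_gap(indexed, total)
  {
    :missing          => total - indexed,
    :coverage_percent => (indexed * 100.0 / total).round(1)
  }
end

gap = index_gap(INDEXED_ENTRIES, TABLE_ROWS)
puts "missing from index: #{gap[:missing]}"
puts "share indexed: #{gap[:coverage_percent]}%"
```

So well over half of the accounts rows are not reflected in the search total.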
Locally, with a much smaller subset of the data, I also get a
different count from each data source, but I don't receive the
"duplicate document ids" warning when generating the index.
My research so far suggests that this warning usually comes from merging
indexes. But here I'm generating a full index, not generating a
delta index and then merging it into a full index.
My questions are:
1. Are the warning and the discrepancy in counts related?
2. What does the warning mean?
3. Is all of my data accessible via searching, despite the different
counts?
4. How can I fix this?
Thanks in advance for any assistance,
Alex Kahn
P.S. I'm using Rails 2.3.14, Sphinx 0.9.9, thinking-sphinx 1.4.7, and
ts-datetime-delta 1.0.2.