Hi,

I'm adding a new index to my application. It looks like this:

class Account < ActiveRecord::Base
  define_index do
    indexes account_name.first_name
    indexes account_name.last_name
    indexes email_addresses.email_address

    has created_at

    set_property :delta => :datetime, :threshold => 2.minutes
  end
end

I'm testing how long the full index takes to generate on a staging
server where we typically have only sanitized data from production.
But for this task, I'm working with our entire accounts,
account_names, and email_addresses tables from production.

When I generate the index, I get the following warning during the
accounts index phase:

  WARNING: duplicate document ids found

In the Rails console, I observe the following:
>> Account.search.total_entries
 => 260793
>> Account.count
 => 602083
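
To put a number on the gap, here is a quick sketch in plain Ruby (no Rails or Sphinx needed). The two counts are copied from the console output above; the variable names are just mine for illustration:

```ruby
# Counts copied from the console session above.
indexed_count  = 260_793  # Account.search.total_entries
database_count = 602_083  # Account.count

# Illustrative names only; these are not part of any library API.
missing_count   = database_count - indexed_count
indexed_percent = (indexed_count * 100.0 / database_count).round(1)

puts "Accounts missing from the index: #{missing_count}"
puts "Share of accounts indexed: #{indexed_percent}%"
```

So well over half of the accounts rows do not appear in the search totals.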

Locally, with a much smaller subset of the data, the two counts also
differ, but I don't receive the "duplicate document ids" warning when
generating the index.

My research so far has indicated that this is an issue with merging
indexes. But here I'm generating a full index, not generating a
delta index and then merging it into a full index.

My questions are:

1. Are the warning and the discrepancy in counts related?
2. What does the warning mean?
3. Is all of my data accessible via searching, despite the different
counts?
4. How can I fix this?

Thanks in advance for any assistance,
Alex Kahn

P.S. I'm using Rails 2.3.14, Sphinx 0.9.9, thinking-sphinx 1.4.7, and
ts-datetime-delta 1.0.2.
