Hi everyone,

I'm the founder of EDU 2.0 (http://www.edu20.org) and due to the rapid
growth of the site, much of the text that used to be stored in MySQL
is now being stored in Amazon S3.

For example, if a lesson is created that has 100K of HTML in it, we
store the text itself in S3 and just hold its name + URL in MySQL.
This approach has allowed us to shrink our MySQL memory requirements
dramatically, which is important for the long-term.

We already use Sphinx to index things like messages, forum postings,
etc. but not lessons. Now we'd like to start indexing the lessons as
well. So my question is - what is the best way to use Thinking Sphinx
if text is stored in S3? Is there a way to use Sphinx to update its
indexes as files are uploaded to S3? That way, when I search for a
term, I could get back a list of URLs where the terms were found.

One last thing; we keep a MySQL table that keeps track of every single
S3 file + its URL, so its easy to correlate the S3 files with their
URLs.

I would be willing to fund extensions to Thinking Sphinx to make this
possible!

Cheers,
Graham

-- 
You received this message because you are subscribed to the Google Groups 
"Thinking Sphinx" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/thinking-sphinx?hl=en.

Reply via email to