Hi everyone, I'm the founder of EDU 2.0 (http://www.edu20.org) and due to the rapid growth of the site, much of the text that used to be stored in MySQL is now being stored in Amazon S3.
For example, if a lesson is created that has 100K of HTML in it, we store the text itself in S3 and just hold its name + URL in MySQL. This approach has allowed us to shrink our MySQL memory requirements dramatically, which is important for the long term.

We already use Sphinx to index things like messages, forum postings, etc., but not lessons. Now we'd like to start indexing the lessons as well.

So my question is: what is the best way to use Thinking Sphinx if the text is stored in S3? Is there a way to have Sphinx update its indexes as files are uploaded to S3? That way, when I search for a term, I could get back a list of URLs where the term was found.

One last thing: we keep a MySQL table that tracks every single S3 file + its URL, so it's easy to correlate the S3 files with their URLs.

I would be willing to fund extensions to Thinking Sphinx to make this possible!

Cheers,
Graham
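
P.S. In case it helps to make the setup concrete, here is a rough Ruby sketch of the behaviour I'm after. Everything in it is illustrative: the `S3File` struct, the `s3_key` and `url` field names, the example.com URLs, and the `urls_matching` helper are made up for this message and are not our real schema or any Thinking Sphinx API. The in-memory hash stands in for the text held in S3, and the naive substring scan stands in for the search I would like Sphinx to perform.

```ruby
# Hypothetical stand-in for the MySQL table that tracks every S3 file.
# The field names (id, s3_key, url) are illustrative only.
S3File = Struct.new(:id, :s3_key, :url)

S3_FILES = [
  S3File.new(1, "lessons/1.html", "http://example.com/lessons/1"),
  S3File.new(2, "lessons/2.html", "http://example.com/lessons/2")
].freeze

# Stand-in for the lesson text that actually lives in S3, keyed by s3_key.
S3_CONTENTS = {
  "lessons/1.html" => "<p>Photosynthesis converts light into energy.</p>",
  "lessons/2.html" => "<p>Fractions describe parts of a whole.</p>"
}.freeze

# Naive case-insensitive substring scan, standing in for a real Sphinx query:
# given a term, return the URLs of the tracked files whose text contains it.
def urls_matching(term, files: S3_FILES, contents: S3_CONTENTS)
  needle = term.downcase
  files.select { |f| contents.fetch(f.s3_key, "").downcase.include?(needle) }
       .map(&:url)
end
```

The point is only the shape of the result: a search term goes in, and the URLs recorded in the tracking table come out, without the lesson text ever being stored in MySQL.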
