Wow! Someone else dealing with the exact same thing as me!
Matt: your suggestion to use the "-m utf-8" flag for antiword was exactly the right solution. Conceptually it makes the most sense, too. I.e.: "Convert this Word doc to UTF-8 and parse it into text" as the first step. Much much nicer! It's good to know that Iconv could probably do the same thing later in the process, but it's nice to just handle it up-front and the resulting String object is already UTF-8. Whee! Thank you! (my solution was much less than 38 hours, primarily thanks to this thread) -Danimal --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "Ruby on Rails: Talk" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/rubyonrails-talk?hl=en -~----------~----~----~----~------~----~------~--~---

