I dont know how to re-create your Hproicot doc from the modified string please help me.
On Thu, Sep 10, 2009 at 10:20 AM, prashanth hiremath < [email protected]> wrote: > > Thank u i have done what u told using gsub operator i replaces the tags to > the form as u told,but problem is that > > if > doc = Hpricot(open('Delhi.txt')) > x=doc.to_s > doc1=x.gsub(/<(\/?)li><span>/,'</li></see><see><span>') > > puts doc1 > doc1.search('span').each do |y| > puts y.inner_text > end > > > its giving error > > undefined method `search' for #<String:0xb7d0bc74> (NoMethodError) > because doc1 is string how can i conevrt so that i can read the file again > by hpricot > > Regards > Prashanth Hiremath > > > On Wed, Sep 9, 2009 at 10:38 PM, Ar Chron < > [email protected]> wrote: > >> >> Your html is still flat, so you have to work with the patterns that you >> see. >> You have: >> span >> li >> li >> li >> span >> li >> li >> li >> etc... >> >> An ugly, brute force, one case solution is to: >> >> read the page with Hpricot >> remove the header >> convert it to a simple string representation >> stick your opening tag '<see>' at the head >> stick your closing tag and a div end '</div></see>' at the tail >> change all '<span>' to '</div><div><span>' >> doctor up the new head from '<see></div><div>' to just '<see><div>' >> re-create your Hproicot doc from the modified string >> >> which takes about 8 lines of code. >> >> YMMV >> -- >> Posted via http://www.ruby-forum.com/. >> >> >> >> > --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "Ruby on Rails: Talk" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/rubyonrails-talk?hl=en -~----------~----~----~----~------~----~------~--~---

