Ya I am scraping someone html so that i cant change the format right. can u help me .
Regards Prashant On Tue, Sep 8, 2009 at 10:40 PM, Ar Chron <[email protected]>wrote: > > Is this your html, or are you scraping someone else's html? > > If it's yours, organize your html differently... if you know you want to > be processing a section at a time, wrap those sections with an > identifiable container, then scope your searches by the container. > > <div> > <h3>blah</h3> > <li>a</li> > <li>b</li> > </div> > <div> > <h3>blah2</h3> > <li>c</li> > <li>d</li> > </div> > > (doc/"div").each do |dv| > this_h3 = (dv/"h3") > if this_h3.inner_html == "blah2" > (dv/"li").each do |li| > puts li.inner_html > end > end > end > > emits just c, and d > > If its someone else's html in that format, you'll probably have to go > elem by elem for the whole doc with state machine-ish code to track what > you've seen previously since there doesn't seem to be any real 'path' to > the li's per h3. > -- > Posted via http://www.ruby-forum.com/. > > > > --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "Ruby on Rails: Talk" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/rubyonrails-talk?hl=en -~----------~----~----~----~------~----~------~--~---

