You probably want this: http://smalltalkhub.com/#!/~PharoExtras/Soup
abergel wrote > Hi! > > Together with Nicolas we are trying to get all the > from html files. > We have tried to use XMLDOMParser, but many webpages are actually not well > formed, therefore the parser is complaining. > > Anyone has tried to get some particular tags from HTML files? This looks > like a classical thing to do. Maybe some of you have done it. > Is there a way to configure the parser to accept a broken XML/HTML > content? > > Cheers, > Alexandre > -- > _,.;:~^~:;._,.;:~^~:;._,.;:~^~:;._,.;:~^~:;._,.;: > Alexandre Bergel http://www.bergel.eu > ^~:;._,.;:~^~:;._,.;:~^~:;._,.;:~^~:;._,.;:~^~:;. -- View this message in context: http://forum.world.st/Getting-some-tag-in-an-HTML-file-tp4842650p4842660.html Sent from the Pharo Smalltalk Developers mailing list archive at Nabble.com.
