Re: Scraping news feeds?

2001-03-09 Thread Dave Cross
At Fri, 9 Mar 2001 12:00:19 +, Michael Stevens <[EMAIL PROTECTED]> wrote: > On Fri, Mar 09, 2001 at 12:50:17PM -0500, mallum wrote: > > For aggregating RSS feeds I use RSSLite [1] rather than XML::RSS. > > RSSLite avoids using expat and is a little naughty in parsing XML > > that would make

Re: Scraping news feeds?

2001-03-09 Thread DJ Adams
On Fri, Mar 09, 2001 at 12:02:52PM +, DJ Adams wrote: > > I didn't know you were on this list - then again, I am crap at keeping > up anyway... Wow, 3, nay, 4 posts to the list in one day. Must get a grip on myself... dj

Re: Scraping news feeds?

2001-03-09 Thread DJ Adams
On Fri, Mar 09, 2001 at 12:50:17PM -0500, mallum wrote: > > > I run http://10.am and do this on a largish scale. Mallum - greetings! I didn't know you were on this list - then again, I am crap at keeping up anyway... dj

Re: Scraping news feeds?

2001-03-09 Thread DJ Adams
On Fri, Mar 09, 2001 at 12:00:19PM +, Michael Stevens wrote: > > avoids using expat and is a little naughty in parsing XML that would make > > expat barf ( Alot of RSS feeds unfortunatly contain bad XML ). > > That way lies madness. Which is exactly what separates theory from reality ;-) dj

Re: Scraping news feeds?

2001-03-09 Thread Dave Cross
At Fri, 9 Mar 2001 12:50:17 -0500, mallum <[EMAIL PROTECTED]> wrote: [RSS feeds] > I run http://10.am and do this on a largish scale. > > For aggregating RSS feeds I use RSSLite [1] rather than XML::RSS. > RSSLite avoids using expat and is a little naughty in parsing XML > that would make exp

Re: Scraping news feeds?

2001-03-09 Thread mallum
I run http://10.am and do this on a largish scale. For aggregating RSS feeds I use RSSLite [1] rather than XML::RSS. RSSLite avoids using expat and is a little naughty in parsing XML that would make expat barf ( Alot of RSS feeds unfortunatly contain bad XML ). For actual scaping of sites I ba

Scraping news feeds?

2001-03-08 Thread Adam Worrall
> "DH" == Dave Hodgkinson <[EMAIL PROTECTED]> writes: DH> What's the best way to scrape a variety of news headlines from DH> various sites? Sort of a moreover for the intranet... Find their AvantGo equivalents, and scrape those instead. - Adam

Re: Scraping news feeds?

2001-03-07 Thread Simon Wilcox
At 12:54 07/03/2001 -0500, Dave Cross wrote: [snip] > > > > Chapter 10 isn't it Dave ? > >Section 10.4 to be precise. "Specialized parsers - XML::RSS" :) > >You've got a bit further since last Thursday then! Yep, but not quite that far ! Also been reading Rebel Code which has a nice bit about

Re: Scraping news feeds?

2001-03-07 Thread Dave Cross
At Wed, 07 Mar 2001 17:52:33 +, Simon Wilcox <[EMAIL PROTECTED]> wrote: > At 12:43 07/03/2001 -0500, Dave Cross wrote: > >At Wed, 7 Mar 2001 17:37:00 +, Michael Stevens <[EMAIL PROTECTED]> wrote: > > > On Wed, Mar 07, 2001 at 04:36:56PM +, Dave Hodgkinson wrote: > > > > What's the best

Re: Scraping news feeds?

2001-03-07 Thread Simon Wilcox
At 12:43 07/03/2001 -0500, Dave Cross wrote: >At Wed, 7 Mar 2001 17:37:00 +, Michael Stevens <[EMAIL PROTECTED]> wrote: > > On Wed, Mar 07, 2001 at 04:36:56PM +, Dave Hodgkinson wrote: > > > What's the best way to scrape a variety of news headlines from > > > various sites? Sort of a moreo

Re: Scraping news feeds?

2001-03-07 Thread Jon Eyre
On Wed, 7 Mar 2001, Michael Stevens wrote: > On Wed, Mar 07, 2001 at 04:36:56PM +, Dave Hodgkinson wrote: > > What's the best way to scrape a variety of news headlines from various > > sites? Sort of a moreover for the intranet... > > RSS. > > You want RSS. > > Use RSS. > > (although this

Re: Scraping news feeds?

2001-03-07 Thread Dave Cross
At Wed, 7 Mar 2001 17:37:00 +, Michael Stevens <[EMAIL PROTECTED]> wrote: > On Wed, Mar 07, 2001 at 04:36:56PM +, Dave Hodgkinson wrote: > > What's the best way to scrape a variety of news headlines from > > various sites? Sort of a moreover for the intranet... > > RSS. > > You want RSS

Re: Scraping news feeds?

2001-03-07 Thread Dominic Mitchell
On Wed, Mar 07, 2001 at 04:36:56PM +, Dave Hodgkinson wrote: > What's the best way to scrape a variety of news headlines from various > sites? Sort of a moreover for the intranet... Probably using RSS (XML file format) and XML::RSS (which includes a nice scraper tool). -Dom

Scraping news feeds?

2001-03-07 Thread Dave Hodgkinson
What's the best way to scrape a variety of news headlines from various sites? Sort of a moreover for the intranet... -- Dave Hodgkinson, http://www.hodgkinson.org Editor-in-chief, The Highway Star http://www.deep-purple.com Apache, mod_perl, MySQL, S