At Fri, 9 Mar 2001 12:00:19 +, Michael Stevens <[EMAIL PROTECTED]> wrote:
> On Fri, Mar 09, 2001 at 12:50:17PM -0500, mallum wrote:
> > For aggregating RSS feeds I use RSSLite [1] rather than XML::RSS.
> > RSSLite avoids using expat and is a little naughty in parsing XML
> > that would make
On Fri, Mar 09, 2001 at 12:02:52PM +, DJ Adams wrote:
>
> I didn't know you were on this list - then again, I am crap at keeping
> up anyway...
Wow, 3, nay, 4 posts to the list in one day. Must get a grip on myself...
dj
On Fri, Mar 09, 2001 at 12:50:17PM -0500, mallum wrote:
>
>
> I run http://10.am and do this on a largish scale.
Mallum - greetings!
I didn't know you were on this list - then again, I am crap at keeping
up anyway...
dj
On Fri, Mar 09, 2001 at 12:00:19PM +, Michael Stevens wrote:
> > avoids using expat and is a little naughty in parsing XML that would make
> > expat barf (a lot of RSS feeds unfortunately contain bad XML).
>
> That way lies madness.
Which is exactly what separates theory from reality ;-)
dj
At Fri, 9 Mar 2001 12:50:17 -0500, mallum <[EMAIL PROTECTED]> wrote:
[RSS feeds]
> I run http://10.am and do this on a largish scale.
>
> For aggregating RSS feeds I use RSSLite [1] rather than XML::RSS.
> RSSLite avoids using expat and is a little naughty in parsing XML
> that would make exp
I run http://10.am and do this on a largish scale.
For aggregating RSS feeds I use RSSLite [1] rather than XML::RSS. RSSLite
avoids using expat and is a little naughty in parsing XML that would make
expat barf (a lot of RSS feeds unfortunately contain bad XML).
For actual scraping of sites I ba
> "DH" == Dave Hodgkinson <[EMAIL PROTECTED]> writes:
DH> What's the best way to scrape a variety of news headlines from
DH> various sites? Sort of a moreover for the intranet...
Find their AvantGo equivalents, and scrape those instead.
- Adam
At 12:54 07/03/2001 -0500, Dave Cross wrote:
[snip]
> >
> > Chapter 10 isn't it Dave ?
>
>Section 10.4 to be precise. "Specialized parsers - XML::RSS" :)
>
>You've got a bit further since last Thursday then!
Yep, but not quite that far !
Also been reading Rebel Code which has a nice bit about
At Wed, 07 Mar 2001 17:52:33 +, Simon Wilcox <[EMAIL PROTECTED]> wrote:
> At 12:43 07/03/2001 -0500, Dave Cross wrote:
> >At Wed, 7 Mar 2001 17:37:00 +, Michael Stevens <[EMAIL PROTECTED]> wrote:
> > > On Wed, Mar 07, 2001 at 04:36:56PM +, Dave Hodgkinson wrote:
> > > > What's the best
At 12:43 07/03/2001 -0500, Dave Cross wrote:
>At Wed, 7 Mar 2001 17:37:00 +, Michael Stevens <[EMAIL PROTECTED]> wrote:
> > On Wed, Mar 07, 2001 at 04:36:56PM +, Dave Hodgkinson wrote:
> > > What's the best way to scrape a variety of news headlines from
> > > various sites? Sort of a moreo
On Wed, 7 Mar 2001, Michael Stevens wrote:
> On Wed, Mar 07, 2001 at 04:36:56PM +, Dave Hodgkinson wrote:
> > What's the best way to scrape a variety of news headlines from various
> > sites? Sort of a moreover for the intranet...
>
> RSS.
>
> You want RSS.
>
> Use RSS.
>
> (although this
At Wed, 7 Mar 2001 17:37:00 +, Michael Stevens <[EMAIL PROTECTED]> wrote:
> On Wed, Mar 07, 2001 at 04:36:56PM +, Dave Hodgkinson wrote:
> > What's the best way to scrape a variety of news headlines from
> > various sites? Sort of a moreover for the intranet...
>
> RSS.
>
> You want RSS
On Wed, Mar 07, 2001 at 04:36:56PM +, Dave Hodgkinson wrote:
> What's the best way to scrape a variety of news headlines from various
> sites? Sort of a moreover for the intranet...
Probably using RSS (XML file format) and XML::RSS (which includes a nice
scraper tool).
-Dom
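
As a rough illustration of the RSS route suggested here, the sketch below reads headlines from a feed with a strict XML parser. It uses Python's standard library rather than XML::RSS, and it assumes a simple RSS 0.91-style feed with no namespaces. The sample feed, the example.com URLs, and the function name `headlines` are made up for the example.

```python
import xml.etree.ElementTree as ET

# Hypothetical feed, standing in for whatever a news site publishes.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="0.91">
  <channel>
    <title>Example News</title>
    <link>http://example.com/</link>
    <item>
      <title>First headline</title>
      <link>http://example.com/1</link>
    </item>
    <item>
      <title>Second headline</title>
      <link>http://example.com/2</link>
    </item>
  </channel>
</rss>"""


def headlines(feed_text):
    """Return (title, link) pairs for every <item> in the feed.

    Raises xml.etree.ElementTree.ParseError if the feed is not
    well-formed XML.
    """
    root = ET.fromstring(feed_text)
    result = []
    for item in root.iter("item"):
        title = (item.findtext("title") or "").strip()
        link = (item.findtext("link") or "").strip()
        result.append((title, link))
    return result


if __name__ == "__main__":
    for title, link in headlines(SAMPLE_FEED):
        print(title, link)
```

A namespaced RSS 1.0 (RDF) feed would need namespace-qualified tag names in place of the bare `item`, `title`, and `link` used above.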
What's the best way to scrape a variety of news headlines from various
sites? Sort of a moreover for the intranet...
--
Dave Hodgkinson, http://www.hodgkinson.org
Editor-in-chief, The Highway Star http://www.deep-purple.com
Apache, mod_perl, MySQL, S