Apologies if you already saw this over on xml4lib...but it's relevant
here given the use of Perl. //Ed

===

Nature Publishing Group (NPG) are pleased to announce the lastest
release of their open-source RSS aggregator 'Urchin' to SourceForge
(<http://urchin.sf.net>). Initially funded by the UK Joint Information
Systems Committee (JISC, <http://www.jisc.ac.uk/>) as one of the
Publisher and Library/Learning Systems (PALS,
<http://www.jisc.ac.uk/index.cfm?name=programme_pals>) Metadata and
Interoperability Group projects, Urchin has been substantially
improved. Version 0.92, the current stable release, introduces the
following changes:

* A mod_perl front end for performance and persistence
* Using XML::LibXSLT to improve performance 
* A new option for using HTTP status codes for error reporting 
* Old items can be now be expunged after an update via an rator-defined query
* Access, admin and error logs 
* Web-triggerable remote refresh 
* Configurable RDF output by adminstrator-defined inclusion or n of namespaces 
* The ability to combine several simple RDF query conditions using AND and OR 
* Numerous bug fixes 
 
Urchin is a Web based, customisable, RSS aggregator and filter. Its primary
purpose is to allow the generation of new RSS feeds by running queries
against the collection of items in the Urchin database. However, other
arbitrary output formats can be defined and generated using XSL
transformations or HTML::Template templates. In other words, the collection
of Urchin Perl modules form a foundation for building an RSS aggregation or
portal service. 

Urchin is a classic LAMP implementation written in Perl using, wherever
possible, pre-existing Perl modules. It uses MySQL for its database
functionality and can run using either Apache with a mod_perl handler or any
CGI-enabled web server using the CGI script that is included with the
distribution. While Urchin is developed on a Red Hat Linux system, it has
been ported to Mac OS X, and earlier versions have run successfully under
Windows 2000, XP and CygWin.

This code has been tested on Red Hat Linux 8.0 running Apache version
2.0.40, MySQL version 4.0.13 and Perl version 5.8.0, and on Mac OS X 10.3.5
running Apache version 2.0.49, MySQL version 4.0.16 and Perl version 5.8.1.

Urchin's feature set includes:

* Reads RSS 0.9*, 1.0 and 2.0
* Stores all incoming data in RSS 0.9x and 1.0 feeds
* Queryable on arbitrary data fields
* Supports boolean, simple RDF, full RDF querying
* Arbitrary output formats

Alongside version 0.92, a development code snapshot has been released that
includes support for importing Atom feeds, new administrative commands for
defining feed aggregates, and the ability to use RSS and Atom auto-discovery
links.

Urchin is Free Software. Portions of the code are licensed under the GNU
General Public License, the rest under the GNU Lesser General Public
License.

Reply via email to