jwebunit, and httpunit for that matter, are by nature intended for running unit tests.

You can try to use them for other types of software, and at least use the HTTP driver to your advantage. They are open source, so you can use them in any way you want.

That said, I don't know if your journey with jwebunit will be a great one in this respect. I haven't used it this way, and the current API isn't really fit for data mining. You'll have to write your own driver for that. There is really no point in asserting that some data is present; you want to get that data.
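To illustrate the difference between asserting and extracting, here is a minimal sketch that uses only the Java standard library, not the jwebunit or httpunit API. The class name, the regular expression, and the sample HTML are all made up for the example, and a regex is a fragile way to read real-world HTML, so treat this as an illustration of the idea rather than a recommended driver.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class ExtractSketch {
    // Hypothetical pattern: captures the text of every <td class="price"> cell.
    private static final Pattern PRICE_CELL = Pattern.compile(
            "<td\\s+class=\"price\">\\s*([^<]*?)\\s*</td>",
            Pattern.CASE_INSENSITIVE);

    /** Test-style check: only tells you whether the expected text is present. */
    public static boolean containsText(String html, String expected) {
        return html.contains(expected);
    }

    /** Mining-style extraction: returns the values themselves, in page order. */
    public static List<String> extractPrices(String html) {
        List<String> prices = new ArrayList<>();
        Matcher m = PRICE_CELL.matcher(html);
        while (m.find()) {
            prices.add(m.group(1)); // cell text with surrounding whitespace trimmed
        }
        return prices;
    }

    public static void main(String[] args) {
        // Made-up page fragment standing in for a fetched response body.
        String html = "<table><tr><td class=\"price\">12.50</td></tr>"
                    + "<tr><td class=\"price\"> 7.99 </td></tr></table>";
        System.out.println(containsText(html, "12.50"));
        System.out.println(extractPrices(html));
    }
}
```

The first method is what a testing framework gives you: a yes or no. The second is what a mining app needs: the values, ready to be filed away in a DB.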

It can be done, but I don't know whether there are better frameworks out there that provide this functionality out of the box.

Also consider the fact that jwebunit et al. don't really enjoy working with JavaScript.

Martijn


On 3/13/06, Alasdair <[EMAIL PROTECTED]> wrote:
I'm writing a quick personal app that will mine a few websites on an hourly
basis to grab various bits of data and file them away in a DB for later
analysis.

The two simplest technologies I can use to emulate a browser and grab some
basic html from a website are jwebunit/httpunit and Curl. Does anyone have
any advice or recommendations when it comes to writing a simple data-mining
application for the web using either of these?

(I am particularly interested in a recommendation between jwebunit (which I
like because it's java based, even if it's more a testing framework than a
browser emulator) and curl (which is just a simple command line tool that
can grab data from a site for later parsing).)

Has anyone done anything like this using jwebunit or another technology
before? (Doubtless they have :) I could really use some advice before I
continue though.)

Thanks,

- alasdair

Alasdair Young




_______________________________________________
Jwebunit-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/jwebunit-users



