Re: Solvent and Piggy Bank Question

Ryan Lee Wed, 13 Feb 2008 12:06:32 -0800

Teresa Havel wrote:
> Ryan Lee wrote:
>> Teresa Havel wrote:
>>  
>>> I'm not sure if this is the correct forum for questions on these 
>>> applications, and if it isn't, please let me know how I can get such 
>>> questions answered.
>>> I am wondering if Solvent can be used to scrape a graphic from a web 
>>> page. If it can, I'll follow up with the technical details of what I 
>>> have done, because so far, I could not get it to work. But since this 
>>> is the first time I've tried using it, I don't know if I am not using 
>>> it properly, or if it isn't capable of grabbing an entire graphic 
>>> from a page and saving it. Thanks.
>>> Teresa
>>>     
>>
>> Piggy Bank doesn't store binary data, but you could scrape the URL of 
>> a graphic with Solvent and have your scraper save that for later 
>> reference.
>>
>>   
> The URL stays the same but the graphic that is stored there changes one 
> to three times a day. I would like to save off  the graphics and the 
> accompanying text below the graphic, and build a database locally so 
> they could be accessed by date and time. Can Solvent do this and maybe 
> not use Piggy Bank and just save the graphic and text directly to a 
> local directory?


So Solvent assists scraper generation.  The scraping system is generally 
geared towards generating an RDF model for Piggy Bank.  You could use 
the mechanisms involved to instead generate text, but what you do after 
that is up to your own invention.  You might use Crowbar for that.

http://simile.mit.edu/wiki/Fresno sounds like it would match your use 
case much more closely, though, particularly if you have a machine that 
runs Firefox and cron.

-- 
Ryan Lee                  [EMAIL PROTECTED]
MIT CSAIL Research Staff  http://simile.mit.edu/
http://people.csail.mit.edu/ryanlee/
_______________________________________________
General mailing list
[email protected]
http://simile.mit.edu/mailman/listinfo/general

Re: Solvent and Piggy Bank Question

Reply via email to