Teresa Havel wrote: > Ryan Lee wrote: >> Teresa Havel wrote: >> >>> I'm not sure if this is the correct forum for questions on these >>> applications, and if it isn't, please let me know how I can get such >>> questions answered. >>> I am wondering if Solvent can be used to scrape a graphic from a web >>> page. If it can, I'll follow up with the technical details of what I >>> have done, because so far, I could not get it to work. But since this >>> is the first time I've tried using it, I don't know if I am not using >>> it properly, or if it isn't capable of grabbing an entire graphic >>> from a page and saving it. Thanks. >>> Teresa >>> >> >> Piggy Bank doesn't store binary data, but you could scrape the URL of >> a graphic with Solvent and have your scraper save that for later >> reference. >> >> > The URL stays the same but the graphic that is stored there changes one > to three times a day. I would like to save off the graphics and the > accompanying text below the graphic, and build a database locally so > they could be accessed by date and time. Can Solvent do this and maybe > not use Piggy Bank and just save the graphic and text directly to a > local directory?
So Solvent assists scraper generation. The scraping system is generally geared towards generating an RDF model for Piggy Bank. You could use the mechanisms involved to instead generate text, but what you do after that is up to your own invention. You might use Crowbar for that. http://simile.mit.edu/wiki/Fresno sounds like it would match your use case much more closely, though, particularly if you have a machine that runs Firefox and cron. -- Ryan Lee [EMAIL PROTECTED] MIT CSAIL Research Staff http://simile.mit.edu/ http://people.csail.mit.edu/ryanlee/ _______________________________________________ General mailing list [email protected] http://simile.mit.edu/mailman/listinfo/general
