Thanks Peter! I am thinking that I may just use Nutch to do the crawl and index 
off of these sites. I need to check out the APIs for each to make sure I'm not 
missing anything related to the geospatial data for each image. Obviously both 
do the extraction when the images are uploaded so I'm guessing that it's also 
stored somewhere too ;-)

Adam 

Sent from my iPhone

On Apr 12, 2011, at 4:00 PM, Péter Király <kirun...@gmail.com> wrote:

> Hi,
> 
> I did Flickr into Lucene about 3 years ago. There is a Flickr API,
> which covers almost everything you need (as I remember, not always
> Flickr feature was implemented at that time in the API, like the
> "collection" was not searchable). You can harvest by user ID or
> searching for a topic. You can use a language library (PHP, Java etc.)
> to wrap the details of communication. It is possible, that you would
> like to merge information into one entity before send to Solr (like
> merging the user, collection and set info into each pictures). The
> last step is to transform this information into a Solr document (again
> either directly or with a language library). I am not sure if it helps
> you, but if you ask more specific question, I try to answer.
> 
> regards,
> Péter
> 
> 2011/4/12 Estrada Groups <estrada.adam.gro...@gmail.com>:
>> Has anyone tried doing this? Got any tips for someone getting started?
>> 
>> Thanks,
>> Adam
>> 
>> Sent from my iPhone
>> 

Reply via email to