Hi Tizy,

Actually the modifications should be very similar. First you'll need a way of 
extracting the video URLs from the HTML, if the video es embedded using HTML5 
it should be very straightforward, otherwise it gets a little more difficult, 
because you'll need to convert the URL that you put in your HTML page to an 
actual video URL that can be downloaded by Nutch. This should be developed for 
any popular video sharing platform that you're targeting. 

Depending on how much data you want to extract about the video and how much 
metadata the video have, then you'll need more or less tweaks. The overall 
conclusion is that is possible you'll need to do some work, but its 
definitively posible. 

Regards,

----- Original Message -----
From: "Tizy Ninan" <[email protected]>
To: [email protected], [email protected]
Sent: Friday, May 8, 2015 4:57:07 AM
Subject: Crawl sites containing videos

Hi,

Is it possible to crawl the videos (ex. YouTube videos) embedded in a
website? If so, what changes need to be made to enable video crawling. Will
it be similar to crawling images?

Kindly provide insights on this. Thanks in advance.

Thanks and Regards,
Tizy

Reply via email to