Hello!

I am working on a research project in which I need the dataset of all 
android application webpages from the Google Play store. I want to crawl 
the Google Play store to download the web pages of all the android 
application (All the webpages with the following base url: 
https://play.google.com/store/apps/). I checked the robots.txt file of the 
play store and it disallows crawling these URLs.

Also, when I browse the Google Play store I can only see top applications 
up to 3 pages for each of the categories. How can I get the other 
application pages?

If anyone has tried crawling the Google Play please let me know the 
following things: a) Were you successful in crawling the play store. If 
yes, please let me know how you did that. b) How to crawl the hidden 
application pages not visible in top apps for each of the categories? c) Is 
there a techniques to download the applications also and not just the 
webpages?

I already searched around and found the following links:

a) https://code.google.com/p/android-market-api/ 
b) https://code.google.com/p/android-marketplace-crawler/source/checkout 
c) 
http://mohsin-junaid.blogspot.co.uk/2012/12/how-to-install-android-marketplace.html
 
d) 
http://mohsin-junaid.blogspot.in/2012/12/how-to-download-multiple-android-apks.html

-- 
-- 
You received this message because you are subscribed to the Google
Groups "Android Developers" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/android-developers?hl=en
--- 
You received this message because you are subscribed to the Google Groups 
"Android Developers" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.


Reply via email to