On 28Jul2016 19:28, Gordon Levi <[email protected]> wrote:
Arshpreet Singh <[email protected]> wrote:
I am writing Imdb scrapper, and getting available list of titles from IMDB
website which provide txt file in very raw format, Here is the one part of
file(http://pastebin.com/fpMgBAjc) as the file provides tags like
Distribution Votes,Rank,Title I want to parse title names, I tried with
readlines() method but it returns only list which is quite heterogeneous, is
it possible that I can parse each value comes under title section?
Beautiful Soup will make your task much easier
<https://www.crummy.com/software/BeautifulSoup/>.
Did you look at his sample data? Plain text, not HTML or XML. Beautiful Soup is
not what he needs here.
Cheers,
Cameron Simpson <[email protected]>
--
https://mail.python.org/mailman/listinfo/python-list