I used urllib to fetch the HTML, then used gluon.html's TAG to filter
it, jQuery style.
Here are some of the problems I ran into while using TAG,
though:
1. I needed to re-encode the string to UTF-8 because TAG was throwing
errors when the input wasn't proper UTF-8 - maybe this should be automated?
2. It also throws an error when there is something like '<scri'+'pt>',
which is found in the Google Analytics snippet, so I just stripped out all
script tags.
3. Some pages have malformed HTML, with missing closing tags and so on - this
one was difficult. I wasn't sure whether TAG can repair that internally,
so I googled for something ready-made to fix it
and found BeautifulSoup.prettify(), which does the job - maybe this
should also be built into TAG(), because TAG is so awesome: I can do
everything in the DOM very easily.
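
For anyone curious, steps 1 and 2 can be sketched roughly like this. It
is not the actual app code, just a minimal Python 3 illustration using
only the standard library; the helper names (to_utf8, strip_scripts,
clean_html) and the source_encoding parameter are made up for this
example. Step 3 (repairing malformed HTML) and the hand-off to TAG are
left out.

```python
import re

# Matches a whole <script>...</script> element, case-insensitively
# and across newlines. A regex is a blunt tool for HTML; this is a sketch.
SCRIPT_RE = re.compile(r"<script\b[^>]*>.*?</script\s*>",
                       re.IGNORECASE | re.DOTALL)


def to_utf8(raw, source_encoding="utf-8"):
    """Return valid UTF-8 bytes for raw page bytes.

    Bytes that cannot be decoded with source_encoding are replaced
    with U+FFFD instead of raising an error.
    """
    text = raw.decode(source_encoding, errors="replace")
    return text.encode("utf-8")


def strip_scripts(html):
    """Remove every <script> element, including its contents."""
    return SCRIPT_RE.sub("", html)


def clean_html(raw, source_encoding="utf-8"):
    """Re-encode the page (step 1), then drop script tags (step 2)."""
    html = to_utf8(raw, source_encoding).decode("utf-8")
    return strip_scripts(html)


if __name__ == "__main__":
    page = b"<p>caf\xe9</p><script>document.write('<scri'+'pt>');</script>"
    print(clean_html(page, "latin-1"))
```

The cleaned string is what would then go through the malformed-HTML
fix and on to TAG.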

On Aug 6, 3:28 pm, mdipierro <[email protected]> wrote:
> +1
>
> How does it work?
>
> On Aug 6, 2:46 pm, Lasiaf <[email protected]> wrote:
>
>
>
> > Hi All,
>
> > This is my first web2py application: http://feedcreator.appspot.com
>
> > What it does is create rss feeds for sites that don't have rss feeds.
>
> > I did this for a whole day, web2py is so easy to learn I did this
> > without knowing much about web2py and python.
> > Since I finished this project, I decided to do all my own projects
> > using web2py.
>
> > - Lasiaf
