Hello,

I can't find how to tell lxml/BS to preserve carriage returns in an HTML 
snippet when calling soup.body.text: After removing </br>'s, it also removes 
the CRLF that follows.

==========
builder = LXMLTreeBuilderForXML(preserve_whitespace_tags=["body"])
rows = cur.fetchall()
for row in rows:
        #BAD soup = BeautifulSoup(row["introtext"], 
builder=builder,features='lxml')
        soup = BeautifulSoup(row["intro"],features='lxml')
        print(soup.body.text)
        break
==========

Is there an option?

Thank you.
_______________________________________________
lxml - The Python XML Toolkit mailing list -- lxml@python.org
To unsubscribe send an email to lxml-le...@python.org
https://mail.python.org/mailman3/lists/lxml.python.org/
Member address: arch...@mail-archive.com

Reply via email to