Unicode HTML Conversion to ASCII in Python -
possible duplicate:
unescaping characters in string python
i have string of unicode html in python begins with: \u003ctable>\u003ctr
need convert ascii can parse beautifulsoup. however, python's encode , decode functions seem have no effect; original string no matter try. i'm new python , unicode in general, appreciated.
use
s.decode("unicode-escape")
to decode html data first (no idea how character crap from).
Comments
Post a Comment