HTML/XML parser for quick-turnaround applications like screen-scraping
Description
Beautiful Soup is a Python HTML/XML parser designed for quick
turnaround projects like screen-scraping. Three features make it
powerful:
Beautiful Soup wont choke if you give it bad markup.
Beautiful Soup provides a few simple methods and Pythonic idioms for
navigating, searching, and modifying a parse tree.
Beautiful Soup automatically converts incoming documents to Unicode
and outgoing documents to UTF-8.
Beautiful Soup parses anything you give it.
Valuable data that was once locked up in poorly-designed websites is
now within your reach. Projects that would have taken hours take only
minutes with Beautiful Soup.