Python Beautiful Soup

Beautiful Soup is a Python library for extracting data from HTML and XML documents, and it is widely used for web scraping.

It was developed to make it easier to pull data out of complex, often messy real-world HTML: deeply nested tags, inconsistent attributes, and markup produced by dynamically generated pages.
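As a rough illustration of what this looks like in practice, here is a minimal sketch that parses a small piece of imperfect HTML and collects values from nested tags and attributes. The markup, class names, and the data-price attribute are invented for this example, and it assumes the bs4 package is installed and uses Python's built-in "html.parser" backend.

```python
from bs4 import BeautifulSoup

# Hypothetical snippet of messy HTML: the <li> tags are never closed.
# How unclosed tags are repaired depends on the parser that is chosen.
messy_html = """
<div class="products">
  <ul>
    <li class="item" data-price="9.99"><a href="/widget">Widget</a>
    <li class="item" data-price="24.50"><a href="/gadget">Gadget</a>
  </ul>
</div>
"""

# Build a parse tree using the parser from the standard library.
soup = BeautifulSoup(messy_html, "html.parser")

items = []
for li in soup.find_all("li", class_="item"):
    link = li.find("a")  # first <a> nested inside this <li>
    items.append({
        "name": link.get_text(strip=True),   # text inside the tag
        "url": link["href"],                 # attribute on the nested tag
        "price": float(li["data-price"]),    # attribute on the outer tag
    })

for item in items:
    print(item)
```

Other parser backends (for example lxml or html5lib, if installed) can be passed as the second argument and may repair broken markup differently.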

It also works well alongside other Python libraries and tools, so scraped data can be passed directly into data analysis and automation workflows.
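One way to picture this hand-off is the sketch below, which reads a small HTML table with Beautiful Soup and passes the rows to the standard library's csv and statistics modules. The table contents and the "scores" id are made up for illustration; in a real project the same rows could just as well be handed to a data analysis library of your choice.

```python
import csv
import io
import statistics

from bs4 import BeautifulSoup

# Hypothetical HTML table used only for this example.
html = """
<table id="scores">
  <tr><th>Name</th><th>Score</th></tr>
  <tr><td>Ada</td><td>90</td></tr>
  <tr><td>Linus</td><td>80</td></tr>
  <tr><td>Grace</td><td>100</td></tr>
</table>
"""

soup = BeautifulSoup(html, "html.parser")
table = soup.find("table", id="scores")

# Turn each table row into a list of cell strings.
rows = []
for tr in table.find_all("tr"):
    cells = [cell.get_text(strip=True) for cell in tr.find_all(["th", "td"])]
    rows.append(cells)

header, data = rows[0], rows[1:]

# Hand the rows to the csv module (written to memory here, not to disk).
buffer = io.StringIO()
csv.writer(buffer).writerows(rows)
csv_text = buffer.getvalue()

# Hand the numeric column to the statistics module.
mean_score = statistics.mean(int(row[1]) for row in data)

print(csv_text)
print("Mean score:", mean_score)
```

Keeping the extraction step (Beautiful Soup) separate from the analysis step (csv, statistics) makes it straightforward to swap either side for another tool later.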