A while ago, we discussed how to scrape information from websites that don't offer information in a structured format like XML or JSON. We noted that urllib and lxml are indispensable tools in web scraping. While urllib enables us to connect to websites and retrieve information, lxml helps convert HTML, broken or not, to valid XML and parse it. In this post, I will demonstrate how to retrieve information from web pages that require a login session.
Micro How To
Very small how to article
shutdown -h 60
Is scrolling vertically on web pages in your Firefox horribly slow?
I encountered this issue recently on Fedora 10. Initially, I suspected the binary NVIDIA driver. But I was wrong. I found a simple solution.
Disable smooth scrolling in the Firefox preferences.
- On the Firefox window click Edit
- Click Preferences
- Click Advanced tab
- Click Use Smooth Scrolling to uncheck the checkbox
- Click Close