Webscrapping in Python for Beginners

Max Schallwig, Power through programming

Play Speed
  • 0.5x
  • 1x (Normal)
  • 1.25x
  • 1.5x
  • 2x
28 Lessons (3h 58m)
    • 1. Introduction

      1:02
    • 2. Prerequisit libraries

      3:00
    • 3. Introduction to The Modulus Operation

      5:01
    • 4. Introduction to Simple Error Handling

      4:25
    • 5. Introduction to Pandas

      6:41
    • 6. Response Status Codes From a HTTP Request

      7:18
    • 7. Reading The Response Text From Our Request

      11:40
    • 8. First Approach at Parsing The Data

      13:18
    • 9. Understanding the Exception Cases

      6:39
    • 10. Parsing Out All Data for One Company

      9:33
    • 11. Determining Where We Can Get More Ticker Symbols

      15:46
    • 12. Extracting Company Ticker Symbols Part 1

      16:32
    • 13. Extracting Company Ticker Symbols Part 2

      10:41
    • 14. Getting Data For All Parsed Companies

      8:11
    • 15. Final Data For All Parsed Companies

      5:13
    • 16. Final Result Static Websites

      1:40
    • 17. Prerequisite Libraries for Dynamic Web Scrapping

      5:02
    • 18. Short review: Recursive Functions

      7:43
    • 19. Getting started with Selenium

      8:47
    • 20. View The Page Source

      9:14
    • 21. Website Elements and XPath

      8:11
    • 22. Navigating Deeper Into The Page Source

      14:37
    • 23. Identifying The Path To Our Data

      19:28
    • 24. Using The XPath To Our Data

      9:50
    • 25. Parsing Out Our Data

      8:42
    • 26. Getting Our Final Data

      14:56
    • 27. Final Results Dynamic Websites

      4:13
    • 28. WebscrapingPythonOutro

      0:52

Project Description

The website http://webscraper.io/test-sites/tables has test tables that you can use to try some scraping techniques. Try to reduce the website so that you can extract all the #'s, First Names, Last Names, and Usernames and put them into a neat format (such as a pandas DataFrame). You can run everything in your local coding environment. The solution is written in Python 3.5. 

You can find the file that gets your started as well as the solution file attached.

Good Luck!

Resources(2)

Student Projects