Unit 6 - Lesson 11

67.1.1. Introduction to web scrapping

(a) Using socket programming we can retrieve web pages by posting HTTP requests to the remote web server, This is one way of web scraping.
(b) Regex in python can be used to extract information from the web as per our requirement
(c) Its little complex but for small set of data we can manually scrap the data from the web using copy paste technique, and we can store this data in text files or spreadsheets for future analysis.
(d) We can use some semi-structured data query languages like XQuery and the HTQL to parse HTML pages to retrieve and transform web pages information.

Post a Comment

Cookie Consent
We serve cookies on this site to analyze traffic, remember your preferences, and optimize your experience.
Oops!
It seems there is something wrong with your internet connection. Please connect to the internet and start browsing again.
AdBlock Detected!
We have detected that you are using adblocking plugin in your browser.
The revenue we earn by the advertisements is used to manage this website, we request you to whitelist our website in your adblocking plugin.