Data Scraping Crawling - taoualiw/My-Knowledge-Base GitHub Wiki

Data Scraping vs Crawling

Web crawling: also known as "indexing", uses bots (known as crawlers) to index the information on different web sites.

  • is the process of locating information on the World Wide Web (WWW), indexing all the words in a document, adding them to a database, then following every hyperlink in that document and indexing the linked documents into the database in the same way.
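The crawl loop described above can be sketched in a few lines. This is a minimal illustration, not a production crawler: the `crawl` function, the `LinkAndTextParser` class, the `PAGES` dictionary, and the `example.test` URLs are all hypothetical names invented for this example, and an in-memory dictionary stands in for real HTTP requests so the sketch runs offline. A plain Python dict plays the role of the database.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin
import re


class LinkAndTextParser(HTMLParser):
    """Collects link targets and text content from one HTML document."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

    def handle_data(self, data):
        self.text.append(data)


def crawl(start_url, fetch, max_pages=100):
    """Breadth-first crawl that builds a word -> set-of-URLs index.

    `fetch` is any callable returning the HTML for a URL, or None
    if the page is unavailable (an assumption of this sketch).
    """
    index = {}
    seen = {start_url}
    queue = deque([start_url])
    visited = 0
    while queue and visited < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        visited += 1
        parser = LinkAndTextParser()
        parser.feed(html)
        # Index every word found in the document.
        for word in re.findall(r"\w+", " ".join(parser.text).lower()):
            index.setdefault(word, set()).add(url)
        # Follow every hyperlink that has not been queued yet.
        for href in parser.links:
            target = urljoin(url, href)
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return index


# Hypothetical two-page "web" used instead of real network access.
PAGES = {
    "http://example.test/": '<a href="/about">About</a> Welcome home',
    "http://example.test/about": '<a href="/">Home</a> About crawlers',
}

index = crawl("http://example.test/", PAGES.get)
```

Passing `PAGES.get` as `fetch` keeps the example self-contained; a real crawler would swap in an HTTP client and would also need to respect `robots.txt` and rate limits, which this sketch leaves out.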

Web scraping: also known as web data extraction, is an automated software technique for extracting information from a fetched web site or page.

  • is the process of automatically requesting a web document and collecting information from it. Strictly speaking, web scraping involves some degree of web crawling, because the scraper has to move around the web sites to reach the pages it extracts from.
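As a rough sketch of the extraction step, the example below pulls the page title and link targets out of an HTML document using only the Python standard library. The names `TitleAndLinksParser`, `scrape`, `fetch`, and `SAMPLE` are hypothetical and chosen for this illustration; the choice of title and links as the fields to extract is also an assumption. The `fetch` helper shows how the document could be requested, but it is not called here so the example runs without network access.

```python
from html.parser import HTMLParser
from urllib.request import urlopen


class TitleAndLinksParser(HTMLParser):
    """Extracts the <title> text and all <a href> values from a page."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.links = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def scrape(html):
    """Collect the chosen fields from an already fetched document."""
    parser = TitleAndLinksParser()
    parser.feed(html)
    return {"title": parser.title.strip(), "links": parser.links}


def fetch(url):
    """Request a web document over HTTP (defined but not called below)."""
    with urlopen(url, timeout=10) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


# Hypothetical sample document standing in for a fetched page.
SAMPLE = (
    "<html><head><title> Demo page </title></head>"
    "<body><a href='/next'>Next</a></body></html>"
)

record = scrape(SAMPLE)
```

The extracted `links` list is where scraping meets crawling: feeding those targets back into `fetch` is the "moving around" that the definition above refers to.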