Data Scraping Crawling - taoualiw/My-Knowledge-Base GitHub Wiki
Web crawling : - often called "indexing" in the search-engine context, uses bots, also known as crawlers or spiders, to index the information on different web sites.
- is the process of locating information on the World Wide Web (WWW): indexing all the words in a document, adding them to a database, then following all the hyperlinks in that document and indexing the linked pages into the database as well.
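The crawl loop described above (index the words, store them, follow the links, repeat) can be sketched with the Python standard library alone. This is a minimal illustration, not a production crawler: the `crawl` function, the `PageParser` class, the `fetch` callback, and the two-page `SITE` dictionary are all hypothetical names introduced here, and the "database" is just an in-memory inverted index. A real crawler would also respect `robots.txt`, rate-limit its requests, and skip script and style content.

```python
import re
from collections import defaultdict, deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class PageParser(HTMLParser):
    """Collects the text and the href targets found in one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

    def handle_data(self, data):
        self.text.append(data)


def crawl(start_url, fetch, max_pages=100):
    """Breadth-first crawl. Returns an inverted index: word -> set of URLs.

    `fetch` is any callable that maps a URL to an HTML string, or to None
    when the page cannot be retrieved (an assumption of this sketch).
    """
    index = defaultdict(set)
    seen = {start_url}
    queue = deque([start_url])
    pages = 0
    while queue and pages < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        pages += 1
        parser = PageParser()
        parser.feed(html)
        parser.close()
        # Index every word of the document under its URL.
        for word in re.findall(r"\w+", " ".join(parser.text).lower()):
            index[word].add(url)
        # Follow every hyperlink that has not been queued before.
        for href in parser.links:
            target = urljoin(url, href)
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return index


# Hypothetical two-page site, held in memory so the sketch needs no network.
SITE = {
    "http://example.test/": '<p>Hello crawler</p><a href="/about">About</a>',
    "http://example.test/about": '<p>About the crawler</p><a href="/">Home</a>',
}
index = crawl("http://example.test/", SITE.get)
```

Passing the fetcher in as a parameter keeps the traversal logic separate from the network code, so the same loop can be pointed at a real HTTP client later.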
Web scraping : - aka web data extraction, is an automated software technique for extracting information from a fetched web site or page.
- is the process of automatically requesting a web document and collecting information from it. Strictly speaking, web scraping usually involves some degree of web crawling to move around a website.
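To make the "request a document, then collect information from it" idea concrete, here is a small sketch using only the Python standard library. The names `HeadingScraper`, `scrape_headings`, and `fetch`, and the choice of `<h2>` headings as the target data, are assumptions made for illustration; real projects often use a dedicated parsing library instead.

```python
from html.parser import HTMLParser
from urllib.request import urlopen


class HeadingScraper(HTMLParser):
    """Collects the text content of every element with the given tag."""

    def __init__(self, tag="h2"):
        super().__init__()
        self.tag = tag
        self.inside = False
        self.headings = []

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            self.inside = True
            self.headings.append("")

    def handle_endtag(self, tag):
        if tag == self.tag:
            self.inside = False

    def handle_data(self, data):
        # Text inside nested tags (e.g. <em>) is appended to the same heading.
        if self.inside:
            self.headings[-1] += data


def scrape_headings(html, tag="h2"):
    """Extract step: pull the heading texts out of an HTML string."""
    scraper = HeadingScraper(tag)
    scraper.feed(html)
    scraper.close()
    return [heading.strip() for heading in scraper.headings]


def fetch(url):
    """Request step: download a page and decode it to text."""
    with urlopen(url) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


# Offline sample so the extraction step can be tried without a network call.
sample = "<h1>News</h1><h2>First <em>story</em></h2><p>body</p><h2>Second story</h2>"
headings = scrape_headings(sample)
```

With network access, `scrape_headings(fetch(url))` would chain the two steps for a URL of your choice.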