A19 Test data as‐a‐code configuration ‐ Linkedin Scraping - dcasota/m365-scripts GitHub Wiki
On the internet you will find a ton of tools to scrape data e.g. from Linkedin.
A easy-to-use tool is Axiom.ai.
Prerequisites are:
- Windows
- Google Chrome
- Axiom.ai login account (start with a free user account)
- Axiom.ai Google Chrome Extension
- Axiom.ai Desktop agent
- A Google Chrome Spreadsheet shared to contribute publicly.
Once all prerequisites are installed, in Chrome you find the extension window.
Proceed with a login.
The sample project 'Scrape Page to Google Sheet' is a jumpstart project. This template easily scrapes the data from Linkedin and saves it to a Google spreadsheet.
The template consists of two steps.
This steps consists of several input fields.
Specifiy the Linkedin url which is capable to list all employees as pageable list-
https://www.linkedin.com/search/results/people/?heroEntityKey=urn%3Ali%3Aorganization<identification>&keywords=<company>%20group&origin=CLUSTER_EXPANSION&sid=%3B8K
In 'Click 'Select' to choose the data you wish to scrape.', specify e.g. name, title, location and picture of Linkedin list objects. In the Axiom.ai extension you have to select click on two object so that the column is specified. In case of a different type of data e.g. image, it has to be specified on the data field.
In 'Find pager (if any)', specify the pager button on the bottom of the Linkedin list.
For max results, start with a small sample amount e.g. of 20 entries.
In the output preview, analyze if the data scraped matches with your needs.
In Step2, specify the shared Google spreadsheet, sheet name, DATA, write options and write method.
Save the project. Press 'run w/ desktop app'.
After the run, you should find the data in your Google spreadsheet.
In run reports, you can see the amount of time it took per project and the remaining time accordingly to your subscription.
In addition, you easily can modify an existing project.
Be aware, do not delete projects in the free account subscription. It might delete a cascade of projects as well.