HTML Elements and Web Requests - Snowboundport37/champlain GitHub Wiki
HTML Elements and Web Requests Lab
Setup
- Started XAMPP, ran Apache and MySQL.
- Found my IPv4 address with ipconfig: 10.0.17.11.
- Placed ToBeScraped.html in C:\xampp\htdocs.
- Verified in browser: http://10.0.17.11/ToBeScraped.html.
- Screenshot: Web page in browser
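The browser check above can also be repeated from PowerShell before any scraping starts. The following is a minimal sketch, not part of the lab itself: the function name Test-PageReachable is hypothetical, and it assumes the page is still served at the address found during setup.

```powershell
# Hypothetical helper: returns $true when the URI answers with HTTP 200.
function Test-PageReachable {
    param(
        [Parameter(Mandatory)] [string] $Uri
    )
    try {
        # -UseBasicParsing skips DOM parsing; only the status code is needed here.
        $response = Invoke-WebRequest -Uri $Uri -UseBasicParsing -TimeoutSec 5 -ErrorAction Stop
        return ($response.StatusCode -eq 200)
    }
    catch {
        # Connection refused, timeout, or a non-success status all end up here.
        Write-Warning "Request to $Uri failed: $($_.Exception.Message)"
        return $false
    }
}

# Assumption: same host and file name as in the Setup steps.
Test-PageReachable -Uri "http://10.0.17.11/ToBeScraped.html"
```

If this prints False, Apache is probably not running or the file is not in htdocs.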
PowerShell Work
# Point to the hosted page
$uri = "http://10.0.17.11/ToBeScraped.html"
$scraped_page = Invoke-WebRequest -Uri $uri
# 9. Count links
$scraped_page.Links.Count
# 10. Show links as HTML elements
$scraped_page.Links
# 11. Show only URL and text
$scraped_page.Links | Select-Object outerText, href
# 12. Get outer text of all h2 tags (ParsedHtml requires Windows PowerShell 5.1)
$h2s = $scraped_page.ParsedHtml.body.getElementsByTagName("h2")
$h2s | ForEach-Object { $_.outerText }
# 13. Get innerText of divs with class "div-1"
$divs = $scraped_page.ParsedHtml.body.getElementsByTagName("div")
$divs | Where-Object { $_.className -like "*div-1*" } | Select-Object innerText
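Steps 12 and 13 depend on ParsedHtml, which is not available in PowerShell 7. As a rough alternative, the same two queries can be sketched with regular expressions over the raw markup. The sample HTML below is hypothetical and stands in for $scraped_page.Content; it is not the real ToBeScraped.html. The patterns assume simple, non-nested tags and double-quoted class attributes.

```powershell
# Hypothetical sample markup; in the lab this would be $scraped_page.Content.
$content = @'
<html><body>
<h2>First heading</h2>
<div class="div-1">Alpha</div>
<div class="div-2">Beta</div>
<h2>Second heading</h2>
<div class="div-1">Gamma</div>
</body></html>
'@

# Text of every h2 element (case-insensitive, dot matches newlines).
$h2Pattern = '(?is)<h2[^>]*>(.*?)</h2>'
$h2Text = [regex]::Matches($content, $h2Pattern) |
    ForEach-Object { $_.Groups[1].Value.Trim() }

# Text of every div whose class list contains the exact token "div-1".
$divPattern = '(?is)<div[^>]*\bclass\s*=\s*"([^"]*)"[^>]*>(.*?)</div>'
$div1Text = [regex]::Matches($content, $divPattern) |
    Where-Object { ($_.Groups[1].Value -split '\s+') -contains 'div-1' } |
    ForEach-Object { $_.Groups[2].Value.Trim() }

$h2Text     # First heading, Second heading
$div1Text   # Alpha, Gamma
```

Splitting the class attribute into tokens matches only div-1 itself, whereas a wildcard such as "*div-1*" would also match a class like div-10. Nested div elements would break the pattern, so the ParsedHtml version is likely the more reliable choice on Windows PowerShell 5.1.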
Output