HTML Elements and Web Requests - Snowboundport37/champlain GitHub Wiki

HTML Elements and Web Requests Lab

Setup

  • Started XAMPP, ran Apache and MySQL.
  • Found my IPv4 with ipconfig: 10.0.17.11.
  • Placed ToBeScraped.html in C:\xampp\htdocs.
  • Verified in browser: http://10.0.17.11/ToBeScraped.html.
  • Screenshot: Web page in browser

PowerShell Work

# Point to the hosted page
$uri = "http://10.0.17.11"
$scraped_page = Invoke-WebRequest -Uri $uri

# 9. Count links
$scraped_page.Links.Count

# 10. Show links as HTML elements
$scraped_page.Links

# 11. Show only URL and text
$scraped_page.Links | Select-Object outerText, href

# 12. Get outer text of all h2 tags
$h2s = $scraped_page.ParsedHtml.body.getElementsByTagName("h2")
$h2s | ForEach-Object { $_.outerText }

# 13. Get innerText of divs with class "div-1"
$divs = $scraped_page.ParsedHtml.body.getElementsByTagName("div")
$divs | Where-Object { $_.getAttributeNode("class").Value -like "*div-1*" } | Select-Object innerText 

Output