Scraping - mwicat/personal GitHub Wiki

Extract a list of URLs from HTML

# Install html-xml-utils, which provides hxwls (Debian/Ubuntu)
sudo apt-get install html-xml-utils

# List every link found in a page
hxwls http://www.o2.pl
# Download every linked file whose URL ends in .oz
hxwls 'http://www.info.ucl.ac.be/~pvr/bookfigures/' | grep '\.oz$' | xargs wget
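
Where html-xml-utils cannot be installed, a rough fallback can be sketched with grep and sed alone. This is only an approximation, not a replacement for hxwls: the function name `extract_links` is made up for illustration, it relies on `grep -o` (available in GNU and BSD grep), it only sees double-quoted `href` attributes, and it prints relative URLs as they appear instead of resolving them.

```shell
# Hypothetical helper: print the value of every double-quoted href
# attribute found in the HTML read from stdin, one per line.
# Regex-based, so unusual markup (single quotes, unquoted values,
# attributes split across lines) will be missed.
extract_links() {
    grep -oiE 'href="[^"]*"' | sed -e 's/^[^"]*"//' -e 's/"$//'
}

# Usage with a locally saved page (page.html is a placeholder name):
#   extract_links < page.html | grep '\.oz$'
```

Because relative links are not made absolute here, the output is only safe to pipe into wget when the page uses full URLs.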