Webrexp tips - Twinside/Webrexp GitHub Wiki
In this page some tips about the webrexp are presented
Reading some RSS headlines
webrexp '"http://someurl.com" >> item title [.]'
Each item title in the RSS feed should be printed in the command line. In the same way, you can display title for atom feeds :
webrexp '"http://someurl.com" >> entry title [.]'
And we can even combine the two commands in one :
webrexp '"http://someurl.com" >> (item | entry) title [.]'
Dumping some webcomics
Here some real-world examples used to dump some... comics. Don't do that, it's bad bad bad.
webrexp '"http://www.someComic.com/chapterone/" >> (div.comicpane img [.]; div.nav-next a >>)*'
webrexp '"http://www.someOtherComic.com/" >> div.nav-first a >> (div#comic img [.]; div.nav-next a >>)*'
Listing all linked pages
For some reason you might want to list all linked page of a webpage
webrexp '"http://www.somepage.com" >> a [href]'