Webrexp tips - Twinside/Webrexp GitHub Wiki

Home

In this page some tips about the webrexp are presented

Reading some RSS headlines

    webrexp '"http://someurl.com" >> item title [.]'

Each item title in the RSS feed should be printed in the command line. In the same way, you can display title for atom feeds :

    webrexp '"http://someurl.com" >> entry title [.]'

And we can even combine the two commands in one :

    webrexp '"http://someurl.com" >> (item | entry) title [.]'

Dumping some webcomics

Here some real-world examples used to dump some... comics. Don't do that, it's bad bad bad.

    webrexp '"http://www.someComic.com/chapterone/" >> (div.comicpane img [.]; div.nav-next a >>)*'

    webrexp '"http://www.someOtherComic.com/" >> div.nav-first a >> (div#comic img [.]; div.nav-next a >>)*'

Listing all linked pages

For some reason you might want to list all linked page of a webpage

    webrexp '"http://www.somepage.com" >> a [href]'