Privacy - danny0838/webscrapbook GitHub Wiki

Privacy Policy

WebScrapBook does not collect your personal information. However, the web pages you capture may contain personal information, which is saved as-is, and you are responsible for ensuring its appropriateness.

WebScrapBook only sends web requests when necessary to access the web resources specified for capture by the user and to communicate with the backend server configured by the user. These requests simulate normal browsing behavior, with no additional information being transferred.

The backend server can be any application that implements a conforming API. It can usually be set up using PyWebScrapBook, the open-source sibling of WebScrapBook, which does not collect personal information or initiate any web request.

Permissions Requested and Their Use

  • Access any web page (http://*, https://*, file:///*): To capture web resources and communicate with the backend server.
  • tabs: To read the content of the tabs specified for capture by the user.
  • webNavigation: To access the frames in a webpage for a faithful capture.
  • storage: To store customization settings and cache intermediate data of the captured webpages or the viewed archive files.
  • unlimitedStorage: To allow caching large intermediate data.
  • downloads: To save a captured webpage with its associated resources as a folder.
  • webRequest, webRequestBlocking: To rewrite the referrer of a fetch request to simulate normal browsing behavior, and to redirect requests for archive files to the viewer.
  • contextMenus: To display a context menu for shortcut features.
  • system.display: To retrieve the dimensions of all display devices for proper sidebar window placement.
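
The referrer rewriting described under webRequest above can be illustrated with a small sketch. This is not WebScrapBook's actual code: the helper name withReferrer and the example URLs are hypothetical, and the HttpHeader shape is assumed to mirror the name/value entries that the webRequest API passes in requestHeaders.

```typescript
// Assumed shape of one entry in a webRequest requestHeaders array.
interface HttpHeader {
  name: string;
  value?: string;
}

// Hypothetical helper: return a copy of `headers` whose Referer is `referrer`.
function withReferrer(headers: HttpHeader[], referrer: string): HttpHeader[] {
  // Header names are case-insensitive, so drop any existing Referer first.
  const rest = headers.filter((h) => h.name.toLowerCase() !== "referer");
  // Append the replacement header; the input array is left untouched.
  return [...rest, { name: "Referer", value: referrer }];
}
```

In an extension, a helper like this would typically be called from a blocking webRequest.onBeforeSendHeaders listener, which returns the modified requestHeaders. Only the Referer header changes, which is consistent with the statement that no additional information is transferred.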

Caveats

  • When WebScrapBook is installed, websites can fetch some of its resources (see web_accessible_resources in manifest.json), so a website may detect whether WebScrapBook is installed. This is especially true in Chromium-based browsers, where these resources are served from a fixed domain matching the extension ID.
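
To make the caveat concrete, the following is a minimal sketch of how a page could probe for a web-accessible resource in a Chromium-based browser. The function names, the injected fetchFn parameter, and any extension ID or resource path passed in are hypothetical placeholders; the real values depend on the installed extension and its manifest.json.

```typescript
// Hypothetical helper: build the URL of a web-accessible resource.
// Chromium serves extension resources from chrome-extension://<extension ID>/.
function probeUrl(extensionId: string, resourcePath: string): string {
  // Strip leading slashes so the path joins cleanly.
  const path = resourcePath.replace(/^\/+/, "");
  return `chrome-extension://${extensionId}/${path}`;
}

// Hypothetical probe: resolves to true if fetching the resource succeeds.
// `fetchFn` is injected so the sketch does not depend on a browser environment.
async function looksInstalled(
  url: string,
  fetchFn: (url: string) => Promise<{ ok: boolean }>
): Promise<boolean> {
  try {
    const res = await fetchFn(url);
    return res.ok;
  } catch {
    // A rejected fetch is treated as "not detected".
    return false;
  }
}
```

Because the extension ID is the same for every installation in Chromium-based browsers, a single probe URL works for all users. Firefox instead assigns a random internal UUID per installation, which makes this kind of probing harder.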