Project Ideas Improve package license detection - aboutcode-org/aboutcode GitHub Wiki

Improve ScnnCode package license detection

The goal of this project is to improve package license detection across the board. ScanCode toolkit package license detection works primarily by:

  1. extracting license statements found in package manifests
  2. applying license detection and normalization on these extracted statements

In some case things are super simple: for instance the npm package.json license tag "license": "MIT" is pretty straight forward and obvious. The difficulty is in the details and getting these right at a large scale on most or all the packages.

This project idea can take multiple shapes and it should focus on one or more package types. The end results should be a significant improvement to the license detection quality for the selected package types.

These are details on sub-projects focused each on a different package type. Some may be completed or quite advanced. The long term goal is to work closely with each package ecosystem to contribute the improved license data upstream.