OP Should Not Have Unused Input(English) - PaddlePaddle/Paddle GitHub Wiki
Specification summary:
- Section 1, Background
- Section 2, Check method
- Section 3, Suggested fix of failed PR
Additional instructions:
- During the implementation process, the specifications may find aspects that are not considered by the existing specifications, and need to be supplemented and improved in the implementation process. Please also give positive feedback.
- Further analysis will be performed on the white list and the Operators will be refined if they can.
Currently, the inputs of an Operator are registered by v in PaddlePaddle. However, there are several Operators which contains some inputs that is not used during the computation. It causes some problems,
- Waste of memory. PaddlePaddle's memory optimization strategy relies on precise life cycle analysis of Variable by reference count. If a Variable is registered as input of an Operator, its reference count increases by 1, which will extend the Variable's life and leads to waste of memory, especially GPU memory.
- Inconsistent of code. It may confuse readers if a Variable is registered as input but never used.
In order to avoid unused inputs of Operators, we proposed this OP Specification and add a corresponding check in CI, called UnusedVarCheck.
- Before running the Operator,initialize a
ThreadLocal set
to record the inputs used during computation. - After running,compare the inputs of the Operator and recorded set, find the unused inputs.
- Skip Operators that use
mkldnn
, since they adopt special memory optimization strategy. - Skip inputs that registered
NoNeedBufferVarsInference
, since they hold no memory buffer. - Skip inputs that is not initialized, since they hold no memory buffer.
- Skip Operators that use
- Use
FLAGS_enable_unused_var_check
to control the check. UnusedVarCheck is enabled inPR_CI_Coverage
only. - Use a white list to record the Operators which are failed in UnusedVarCheck, see details in unused_var_check.cc.
If an Operator is failed in UnusedVarCheck,build_log
should contains message like:
Error Message Summary:
----------------------
PermissionDeniedError: Unused input variables check failed: Operator op_with_unused_var has input(s) not uesed: X, please make sure it(them) is(are) needed.
If not, remove it(them) from inputs; if yes, register NoNeedBufferVars or add the operator to white list in unused_var_check.h.
[Hint: Expected unsed_input_var_names.size() == 0, but received unsed_input_var_names.size():1 != 0:0.] at (/Paddle/Paddle/paddle/fluid/framework/unused_var_check.cc:82)
The three potential reasons of the failure and corresponding suggested fixes are,
- The dims and tensor of the input is never used during computation.
- Fix: remove the input from
OpMaker/GradOpMaker
. - Example: abs_grad in PR24107.
- Fix: remove the input from
- The dims the input is used during computation but the tensor is not.
- Fix:register NoNeedBufferVarsInference for the input.
- Example: flatten_grad in PR21169.
- The tensor of the input is used during computation in some cases(in
if
condition or in CUDA kernel but not in CPU kernel).- Fix:add the Operator to the white list in unused_var_check.cc。
- Example: fake_quantize_range_abs_max in PR21169, since
FakeQuantizeRangeAbsMaxKernel
uses different inputs in differentif
conditions. - Note: changing white list requires approval which will be described in the CI message.
If you have any problem, please feel free to contact @zhiqiu。