Fix None Type error while using MultiHeadAttention #191

GCS-ZHN · 2022-10-26T06:08:44Z

This PR is modified based on previous PR #165 by @cainmagi ,

Main change features:

automatical detect and filter not array like elements in forward output list/tuple/dict. For example, MultiHeadAttention module return a tuple which contain a NoneType value as a placeholder of attention weight.
If filtered output contain no element, raise a ValueError to notify user instead of original NoneType AtrributeError.
Replace -1 to batch_size in dict/list/tuple output shape because I believe it will be more properly.

1. Fix the bug of parameter number calculation when there are more than one output variables, including both sequence case and dict case. 2. Make multuple output variables split into multiple lines. 3. Remove the last line break of summary_string() 4. Enable argument "device" to accept both str and torch.device. 5. Fix a bug when the model requires "batch_size" to be a specific number. 6. Fix a bug caused by multiple input case when "dtypes=None". 7. Add text auto wrap when the layer name is too long. 8. Add docstring.

Support counting all parameters instead of `weight` and `bias`.

Using numpy sum/prod to calculate the total size may cause overflow problem. This modification would drop the numpy and use the python built-in method to calculate the size.

Fix the bug caused by layers with dict input values.

Fix the data type of the output params_info from torch.tensor to int.

cainmagi and others added 6 commits February 27, 2021 01:21

Fix parameter counting problem.

18bf210

Support counting all parameters instead of `weight` and `bias`.

Fix the long int overflow problem.

c8836d5

Using numpy sum/prod to calculate the total size may cause overflow problem. This modification would drop the numpy and use the python built-in method to calculate the size.

Fix dict input problem.

37f8e5a

Fix the bug caused by layers with dict input values.

Fix the output params_info type.

37ab4ad

Fix the data type of the output params_info from torch.tensor to int.

Fix NoneType Error for multi head attention module

2179d8e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix None Type error while using MultiHeadAttention #191

Fix None Type error while using MultiHeadAttention #191

Uh oh!

GCS-ZHN commented Oct 26, 2022 •

edited

Loading

Uh oh!

Uh oh!

Fix None Type error while using MultiHeadAttention #191

Are you sure you want to change the base?

Fix None Type error while using MultiHeadAttention #191

Uh oh!

Conversation

GCS-ZHN commented Oct 26, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Main change features:

Uh oh!

Uh oh!

GCS-ZHN commented Oct 26, 2022 •

edited

Loading