Open
Description
BertClassifier
supports one output for the entire input sequence, but span labeling is an important research benchmark. This will require some scoping and design:
- Do we need a separate
BertSpanLabelingPreprocessor
or just a more complicated label to pass through? - Do we want to offer a script that can turn a standard dataset like squadv2 into something our preprocessor can use? The "answer text" must be turned into indicies in the tokenized input.
- Create a
BertSpanLabeler
task model with the correct task-specific layers and preprocessing.