I want to provide a KerasHub implementation of RoFormer. #2118

Open
@pass-lin

https://github.com/ZhuiyiTechnology/roformer
RoFormer is a BERT-like model that adds rotary position embedding (RoPE), which is now very widely used, on top of BERT. In fact, RoFormer was the first practical application of RoPE.
KerasHub currently lacks a strong Chinese BERT-like model. RoFormer is a native Chinese BERT-style model, and its architecture is very similar to that of ModernBERT, so an implementation would also help future work on ModernBERT.
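
For readers less familiar with rotary position embedding, here is a minimal pure-Python sketch of the idea, assuming the adjacent-pair rotation and a frequency base of 10000. It is not the KerasHub or ZhuiyiTechnology code; `rope_rotate` and `dot` are hypothetical helper names used only for illustration.

```python
import math


def rope_rotate(vec, position, base=10000.0):
    """Rotate one query/key vector by its token position.

    Each adjacent pair (vec[2i], vec[2i+1]) is rotated by the angle
    position * base ** (-2i / dim). `base=10000.0` is an assumed default.
    """
    dim = len(vec)
    if dim % 2 != 0:
        raise ValueError("the vector dimension must be even")
    out = []
    for i in range(dim // 2):
        theta = position * base ** (-2.0 * i / dim)
        x, y = vec[2 * i], vec[2 * i + 1]
        # Standard 2D rotation of the pair (x, y) by theta.
        out.append(x * math.cos(theta) - y * math.sin(theta))
        out.append(x * math.sin(theta) + y * math.cos(theta))
    return out


def dot(a, b):
    """Plain dot product, standing in for an attention score."""
    return sum(x * y for x, y in zip(a, b))


q = [0.3, -1.2, 0.7, 0.5]
k = [1.0, 0.4, -0.6, 0.9]

# Both position pairs have the same relative offset (3), so the
# scores agree up to floating-point error.
score_a = dot(rope_rotate(q, 5), rope_rotate(k, 2))
score_b = dot(rope_rotate(q, 13), rope_rotate(k, 10))
```

The point of the sketch is that the score between a rotated query and a rotated key depends only on how far apart the two positions are, which is how RoPE injects relative position information without a separate position-embedding table.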
