Skip to content

Add from_huggingface method to KerasNLP models #1294

Open
@kanpuriyanawab

Description

@kanpuriyanawab

Add support for loading huggingface model checkpoints in KerasNLP backbones

Is your feature request related to a problem? Please describe.
As of now KerasNLP backbones load pretrained weights of standard checkpoints. However there are lots of fine-tuned checkpoints on huggingface hub which most of the time solve a lot of problems. If we add this functionality of supporting HF checkpoints, we can truly fulfil them with Keras's Multi-backend-promise with KerasNLP's modular design for most of the NLP community.

Describe the solution you'd like
Implemnting from_huggingface method passing checkpoint name from huggingface
All it will require is mapping layer names and implementing checkpoint-conversion scripts as methods.

Alternative solution
Instead of implementing a seperate method, we could modify from_preset method to use huggingface checkpoints

I'm up for contributing this feature.

cc: @abheesht17

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions