Skip to content

Conversation

Riccardo231
Copy link
Collaborator

Starting to work on the code; still a draft, maybe have a look so to tell me if somebody is visibly wrong.

(I thought i was working on my fork but instead i created a branch on the main project, i hope this is not a problem. Thank you)

Copy link
Member

@milancurcic milancurcic left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, thank you! It's exactly what I had in mind. And the same approach with set_params.

@Riccardo231
Copy link
Collaborator Author

Riccardo231 commented Aug 6, 2025

Hello, I've also removed set_params()

What I wanted to discuss with you is that in the layer embedding_layer there are only weights and not biases, so I cannot properly use the get_params_ptr, I think you already noticed because you didn't implement it.

What should I do? I think that adding biases to the layer wouldn't break anything.

Plus, in self_attention_layer I cannot find the submodule and the get_set procedures. Where are they?

When I'll be done with these two layers I can finally remove every instance of the getset procedures. Thanks, I'll wait for an answer @milancurcic

Copy link
Collaborator

@jvdp1 jvdp1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Thank you @Riccardo231

@Riccardo231
Copy link
Collaborator Author

Thanks. Could you provide a feedback on the question above?

@milancurcic
Copy link
Member

Thank you, @Riccardo231 for the work on this, and sorry for the delay. Thanks @jvdp1 for the review.

I think we can already merge this as is because the functionality remains the same and a decent chunk of the code is removed, so it's already a great net positive.

For the embedding and attention layer, we can just skip those for now and address later, with @OneAdder's guidance.

So, I'd say, as soon as you're ready to mark this as ready for review, you can also merge it as is and we take it from there.

@Riccardo231 Riccardo231 marked this pull request as ready for review August 24, 2025 09:09
@Riccardo231 Riccardo231 merged commit 3658b8a into main Aug 24, 2025
8 checks passed
@Riccardo231 Riccardo231 deleted the getset_removal branch August 24, 2025 09:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants