-
Notifications
You must be signed in to change notification settings - Fork 80
header checksum and Schema Extension frame added separately #341
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
modules/rntuple.mjs
Outdated
@@ -148,7 +149,7 @@ deserializeFooter(footer_blob) { | |||
// Feature flag(32 bits) | |||
this._readFeatureFlags(reader); | |||
// Header checksum (64-bit xxhash3) | |||
this.headerChecksum = reader.readU64(); | |||
this.headerChecksumFromFooter = reader.readU64(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The check on headerChecksumFromFooter === headerEnvelopeChecksum
should happen here rather than in the test, as if it fails the RNTuple is corrupted and the reader should stop and report error.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(also, this can be a local variable)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok
modules/rntuple.mjs
Outdated
@@ -180,6 +178,14 @@ _readEnvelopeMetadata(reader) { | |||
return { envelopeType, envelopeLength }; | |||
} | |||
|
|||
_readSchemaExtension(reader) { | |||
this._readFieldDescriptors(reader); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Be careful: since in _readFieldDescriptors
and _readColumnDescriptors
you are doing this.fieldDescriptors = fieldDescriptors
and similar, every time you call this function you will overwrite the previous ones.
You should either append to the existing array or return that array and handle it on the caller side.
Same goes for _readAliasColumns
and _readExtraTypeInformation
.
The Footer's schema description should be merged with the Header's to form the complete schema description (most of the time the Footer's schema extension will be empty btw).
modules/rntuple.mjs
Outdated
@@ -180,6 +178,14 @@ _readEnvelopeMetadata(reader) { | |||
return { envelopeType, envelopeLength }; | |||
} | |||
|
|||
_readSchemaExtension(reader) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
_readSchemaExtension(reader) { | |
_readSchemaDescription(reader) { |
@silverweed please again check the PR #341 I have done the suggested changes |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks! Now you should be able to start deserializing the Page List inside each Cluster Group (you should only have 1 cluster group for most RNTuples)
For that I have started implementation in a separate PR #342 ,kindly please check that pr and please provide help with its entry point |
@silverweed please merge this pr |
No description provided.