Spark 3.4: Enhance FunctionRegistry to support more hash functions #268

Merged
merged 5 commits into from
Aug 25, 2023

pan3793 (Collaborator) commented on Aug 25, 2023

This is part of #253, with some enhancements.

Lead-authored-by: Xinyuan Yang
Co-authored-by: Cheng Pan

do {
  inputStrings(i) = input.getUTF8String(i).getBytes
  i += 1
} while (i < input.numFields)
Collaborator Author

According to the Databricks Scala style guide (https://github.com/databricks/scala-style-guide#traversal-and-zipwithindex), we should use a while loop for performance-sensitive code.
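
As a rough illustration of that guideline, here is a minimal, self-contained sketch. It is not the code from this PR: plain `String` values stand in for the Spark row fields, and the names `WhileLoopSketch`, `toBytesWhile`, and `toBytesMap` are hypothetical. It uses `while` rather than `do ... while` so that an empty input is handled too; whether the PR's loop can ever see zero fields is not stated here.

```scala
import java.nio.charset.StandardCharsets

object WhileLoopSketch {
  // Hypothetical stand-in for the loop under review: plain strings
  // replace the fields read from Spark's row object.
  def toBytesWhile(fields: Array[String]): Array[Array[Byte]] = {
    val out = new Array[Array[Byte]](fields.length)
    var i = 0
    // An index-based while loop avoids the closure and intermediate
    // collections that higher-order traversals can introduce.
    while (i < fields.length) {
      out(i) = fields(i).getBytes(StandardCharsets.UTF_8)
      i += 1
    }
    out
  }

  // Functional equivalent; usually fine outside hot paths.
  def toBytesMap(fields: Array[String]): Array[Array[Byte]] =
    fields.map(_.getBytes(StandardCharsets.UTF_8))
}
```

Both methods should return the same bytes for the same input; the difference the style guide is concerned with is per-element overhead on hot paths, which would need to be measured for any particular workload.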

pan3793 changed the title from "Spark 3.4: Enhance FunctionRegistry to support some hash functions" to "Spark 3.4: Enhance FunctionRegistry to support more hash functions" on Aug 25, 2023
pan3793 merged commit 5468585 into master on Aug 25, 2023
pan3793 pushed a commit that referenced this pull request on Aug 25, 2023
pan3793 deleted the 253-1 branch on August 25, 2023 18:25