-
Notifications
You must be signed in to change notification settings - Fork 888
Update blastn module #8765
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Update blastn module #8765
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you add a test for the new input?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There are multiple taxon filters:
-taxidsand-taxidlist-negative_taxidsand-negative_taxidlist
Can we find a way of supporting all four cases ?
modules/nf-core/blast/blastn/main.nf
Outdated
|
|
||
| """ | ||
| if [ "${is_compressed}" == "true" ]; then | ||
| gzip -c -d ${fasta} > ${fasta_name} | ||
| fi | ||
|
|
||
| export BLASTDB=${db} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What is this addition for ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I see now how it is unnecessary in this context.
|
@muffato @SPPearce A problem I've had is that the nf-test relies on the makeblastdb module which does not create the database with a taxid map (vital for taxid filtering to work) and if I were to update that module as well one problem still stands, which is the addition of another two vital files from ncbi ( I'm not sure how to proceed. Do you have any inputs of the possibility of this? |
|
Please don't add extra |
|
@SPPearce Okay, I see. That would mean four inputs instead of two. So as for the minimum size of the database, created with only one reference genome and taxid mapping and including those vital ncbi files it constitutes 189MB (locally from what I can see). In order to actually run blastn with taxid flags the database needs to be created through taxid mapping and include those files so I don't think there is a way around that. I will update the pr with these latest changes including nf-test with empty taxid inputs. |
|
I’m not sure about including an input channel for each of the arguments: |
That can be handled with To limit the number of input channels of the pipeline, an idea is to have 3 arguments:
The only thing to check is that Another idea would be to not have an input channel named |
There's an example of that in the vcftools module: https://github.com/nf-core/modules/blob/master/modules/nf-core/vcftools/main.nf#L103-L108 |
This pr adds taxid input (to decrease run time) and updates the test+snapshot
PR checklist
Closes #XXX
versions.ymlfile.labelnf-core modules test <MODULE> --profile dockernf-core modules test <MODULE> --profile singularitynf-core modules test <MODULE> --profile condanf-core subworkflows test <SUBWORKFLOW> --profile dockernf-core subworkflows test <SUBWORKFLOW> --profile singularitynf-core subworkflows test <SUBWORKFLOW> --profile conda