Questions tagged [blast]

BLAST (Basic Local Alignment Search Tool) is a tool for comparing biological sequence information.

Given a query sequence, it finds similar (though not necessarily identical) biological sequences in a large set of candidates. Searches tolerate sequence mismatches, deletions, and insertions. BLAST was first released in 1990; the current NCBI BLAST+ suite is written in C++, and its source code is freely available.
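To make "similar although not necessarily identical" concrete, here is a minimal sketch of local alignment scoring. It uses the Smith-Waterman dynamic-programming algorithm, which is not what BLAST runs (BLAST relies on a faster seed-and-extend heuristic), and the scoring values (`match=2`, `mismatch=-1`, `gap=-2`) are arbitrary assumptions chosen for illustration.

```python
def local_alignment_score(query, subject, match=2, mismatch=-1, gap=-2):
    """Return the best Smith-Waterman local alignment score of two strings.

    Mismatches are penalised by `mismatch`; insertions and deletions by `gap`.
    Only the score is computed, not the alignment itself.
    """
    prev = [0] * (len(subject) + 1)  # previous row of the scoring matrix
    best = 0
    for q in query:
        curr = [0]  # column 0 is always 0 for local alignment
        for j, s in enumerate(subject, start=1):
            diag = prev[j - 1] + (match if q == s else mismatch)
            up = prev[j] + gap        # gap in the subject
            left = curr[j - 1] + gap  # gap in the query
            cell = max(0, diag, up, left)  # 0 restarts the local alignment
            curr.append(cell)
            best = max(best, cell)
        prev = curr
    return best


def rank_candidates(query, candidates):
    """Sort candidate sequences from most to least similar to the query."""
    return sorted(candidates,
                  key=lambda c: local_alignment_score(query, c),
                  reverse=True)
```

Scoring every candidate this way costs time proportional to the product of the two sequence lengths, which is why a database-scale search needs a heuristic like BLAST's.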


214 questions
7
votes
2 answers

Python implementation of BLAST alignment algorithm?

Is anyone aware of a pure Python implementation of BLAST alignment? I am trying to study this algorithm...
234523458
  • 141
  • 1
  • 3
4
votes
2 answers

Is it possible to pass a string variable to a BLAST search instead of a file?

I'm writing a Python script and want to pass the query sequence information into blastn as a string variable rather than a FASTA format file if possible. I used Biopython's SeqIO to store several transcript names as keys and their sequences as the…
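One way to approach the question above is to send the FASTA text to blastn on standard input. This is a sketch, not a tested recipe: it assumes that `blastn` is on the PATH and that `-query -` reads from standard input, which is how BLAST+ documents the default for `-query`. The function names and the `db` argument are hypothetical.

```python
import subprocess


def to_fasta(records):
    """Format a {name: sequence} mapping as FASTA text."""
    return "".join(f">{name}\n{seq}\n" for name, seq in records.items())


def blastn_from_string(fasta_text, db, blastn="blastn"):
    """Run blastn with the query supplied on stdin; return tabular output.

    Assumes the BLAST+ blastn executable is installed and that `db`
    names an existing nucleotide database.
    """
    cmd = [blastn, "-query", "-", "-db", db, "-outfmt", "6"]
    result = subprocess.run(cmd, input=fasta_text, capture_output=True,
                            text=True, check=True)
    return result.stdout
```

A dictionary built with SeqIO would first need its values converted to plain strings, for example with `str(record.seq)`, before calling `to_fasta`.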
4
votes
2 answers

BLAST Database error: No alias or index file found for nucleotide database

I am trying to run blastn, and then also SIFT standalone. However, I am having database configuration issues, as I am getting the following: arron@arron-Ideapad-Z570 ~/Phd/programs/sift4.0.3b $ blastn -query test/lacI.fasta -db db/swissprot/ BLAST…
brucezepplin
  • 7,402
  • 18
  • 68
  • 115
3
votes
1 answer

Making Blast database from FASTA in Python

How can I do this? I use Biopython and have already read the manual. Of course I can make a BLAST database from FASTA using "makeblastdb" in standalone NCBI BLAST+, but I want to do the whole process in one program. It seems there are two possible solutions. Find a function…
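A common way to keep the whole process in one program is to call `makeblastdb` from Python. The sketch below assumes the BLAST+ `makeblastdb` executable is installed; the `-in`, `-dbtype` and `-out` options are the ones that appear in the question excerpts on this page, while the function names and paths are hypothetical.

```python
import shutil
import subprocess


def makeblastdb_command(fasta_path, out_prefix, dbtype="nucl"):
    """Build the argument list for makeblastdb without running it."""
    if dbtype not in ("nucl", "prot"):
        raise ValueError("dbtype must be 'nucl' or 'prot'")
    return ["makeblastdb", "-in", str(fasta_path),
            "-dbtype", dbtype, "-out", str(out_prefix)]


def build_blast_db(fasta_path, out_prefix, dbtype="nucl"):
    """Run makeblastdb; raises if the tool is missing or exits non-zero."""
    cmd = makeblastdb_command(fasta_path, out_prefix, dbtype)
    if shutil.which(cmd[0]) is None:
        raise FileNotFoundError("makeblastdb was not found on the PATH")
    subprocess.run(cmd, check=True)
    return cmd
```

Passing the arguments as a list avoids shell quoting problems that can arise with `os.system` and a single command string.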
3
votes
2 answers

Filtering a dataframe of BLAST sequences to get within each cluster the maximum pident_x

I have a problem, I need to parse the following dataframe: cluster_name qseqid sseqid pident_x qstart qend sstar send 2 1 seq1_0035_0035 seq13_0042_0035 0.73 42 133 46 189 3 1 seq1_0035_0035 seq13_0042_0035 0.73 …
Grendel
  • 501
  • 3
  • 10
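For the dataframe question above, the core idea is to keep, for each cluster, the row with the highest `pident_x`. The sketch below shows that logic in plain Python on rows represented as dictionaries; the column names come from the excerpt, but the function name and any rows passed to it are illustrative assumptions, not the asker's data.

```python
def best_hit_per_cluster(rows, group_key="cluster_name", score_key="pident_x"):
    """Keep, for each cluster, the row with the highest score.

    On ties, the first row encountered is kept.
    """
    best = {}
    for row in rows:
        key = row[group_key]
        if key not in best or row[score_key] > best[key][score_key]:
            best[key] = row
    return list(best.values())
```

With pandas, the same selection is usually written as a group-by on `cluster_name` followed by picking the index of the maximum `pident_x` in each group.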
3
votes
3 answers

Iterate through files in a directory, create output files, linux

I am trying to iterate through every file in a specific directory (called sequences), and perform two functions on each file. I know that the functions (the 'blastp' and 'cat' lines) work, since I can run them on individual files. Ordinarily I would…
lynkyra
  • 59
  • 5
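The loop described in the question above can also be planned from Python. This sketch only builds one blastp command and one output path per input file; it assumes BLAST+ `blastp` with its `-query`, `-db` and `-out` options, and the function name, output suffix and database name are hypothetical.

```python
from pathlib import Path


def plan_blastp_jobs(seq_dir, out_dir, db):
    """Return a (command, output_path) pair for every file in seq_dir."""
    jobs = []
    for fasta in sorted(Path(seq_dir).iterdir()):
        if not fasta.is_file():
            continue  # skip subdirectories
        out_path = Path(out_dir) / f"{fasta.stem}.blastp.txt"
        cmd = ["blastp", "-query", str(fasta), "-db", db,
               "-out", str(out_path)]
        jobs.append((cmd, out_path))
    return jobs
```

Each command could then be executed with `subprocess.run(cmd, check=True)` once the output directory exists.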
3
votes
2 answers

blast could not create a unit counts container

I built a local BLAST database. However, when I run the blastn command I get this error message: T0…
Hamid_UMB
  • 297
  • 4
  • 14
3
votes
0 answers

Python script skips writing trimmed DNA sequences to files

Edit 2/18: I figured out the issue. It's not the code directly, although someone has pointed out this sample I have put up is not the way I should have put it up. I apologize! The issue is the blastx results. They were not meeting the threshold set…
BrianW
  • 31
  • 2
3
votes
2 answers

Nested Quotes in Perl System()

I'm trying to modify a perl script. Here is the part I am trying to modify: Original: system ("tblastn -db $BLASTDB -query $TMP/prot$$.fa \\ -word_size 6 -max_target_seqs 5 -seg yes -num_threads $THREADS -lcase_masking \\ …
Blaze
  • 31
  • 3
3
votes
2 answers

Python: Running Multidimensional Scaling with Incomplete Pairwise Dissimilarity Matrix in HDF5 format

I am working with large datasets of protein-protein similarities generated with NCBI BLAST. I have stored the results in large pairwise matrices (25,000 x 25,000) and I am using multidimensional scaling (MDS) to visualize the data. These matrices…
3
votes
1 answer

custom blast db with NcbiblastxCommandline

It's the first time that I have used BLAST inside Biopython, and I'm having a problem. I created a custom BLAST database from a FASTA file which contains 20 sequences using: os.system('makeblastdb -in newtest.fasta -dbtype nucl -out newtest.db') and…
ifreak
  • 1,495
  • 3
  • 21
  • 44
3
votes
2 answers

Biopython local BLAST database error

I am trying to run blastx locally with the "nr" database using Biopython's NcbiblastxCommandline tool but I always get the following error regarding the protein database search path: >>> from Bio.Blast.Applications import NcbiblastxCommandline >>>…
priyasshah
  • 63
  • 1
  • 6
2
votes
1 answer

Can I use BLAST to compare 2 non-biological strings?

I know what BLAST (The Basic Local Alignment Search Tool) was designed for. But I am interested in using such advanced text comparison and its final effects.
rzur2004
  • 131
  • 5
2
votes
1 answer

BLAST Database allocation error

I asked this question on the bioinformatics version of stackexchange, but since I think it is a computer problem I thought I should try my luck here. When running local BLAST (v2.2.24+) on a big database (all human proteins) I get the following…
Niek de Klein
  • 7,839
  • 18
  • 67
  • 129
2
votes
0 answers

wsimport import fails with NCBI/Blast

I'm trying to generate the files handling the SOAP services documented here: http://www.ncbi.nlm.nih.gov/books/NBK55699/ But wsimport fails: $ wsimport -d generated "http://www.ncbi.nlm.nih.gov/blast/netblast/blastws.cgi?WSDL" parsing…
Pierre
  • 31,741
  • 29
  • 101
  • 180