- Timestamp:
- Dec 22, 2009, 7:52:22 AM (15 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
liacs/dbdm/dbdm_4/parse-fasta.py
r54 r55 1 1 #!/usr/bin/env python 2 # 3 # Parse 2 FASTA files and print statistics 4 # BSDLicence 5 # Rick van der Zwet - 0433373 - <info@rickvaderzwet.nl> 2 6 from Bio import SeqIO,Seq 3 7 from Bio import Alphabet … … 70 74 71 75 76 # Get data 72 77 file1 = parse_file("data/AE005174v2-1.fas") 73 78 file2 = parse_file("data/AE005174v2-2.fas") 79 74 80 file1 = fasta.replace(file1) 75 81 file2 = fasta.replace(file2) 82 83 # Find overlap 76 84 (retval, common, result) = concat(file2,file1) 77 85 print retval, common 86 87 # Strictly speaking there is a gap of about 4 kbs (4000 bs) between file1 and 88 # file2, so lets' put that into the the statistics as well. Due to circular 89 # nature, does not matter wether we add it in the beginning or in the end 90 result = result + "n" * 4000; 78 91 stats(result) 79 92 93 # Write to file for later further processing 80 94 out = open("full_contig.raw","w") 81 95 out.write(result)
Note:
See TracChangeset
for help on using the changeset viewer.