Spo08785.1 (mRNA)

Overview
NameSpo08785.1
TypemRNA
OrganismSpinacia oleracea (Spinach)
DescriptionCathepsin B-like cysteine proteinase
Locationchr1 : 43266689 .. 43271161 (+)
Sequence length1940
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTAGGTAACCACTTGTGTAGAATTATCCAAGTCAGTCCAAAAATCTGACCACACCCATAACCAATTTCTTCTGGGGATCAAATTTTCAATTGGTTCCAACTAAAAATACAAAAATATAAATTCAATTGGGCCAACTCAATCTACCATAATAAAGATTCCTGTCAAACATTAAAGAATCTACATATTTTCACATTTCCAACACCTTATCCTCTGCAAAAAAAAATTAAAAAAATTCTTGAATTAATTCACACCCCCAAAATATCTTTCAAAAACTGTATAAATTGAGAGGGAGAACCCCAGAATTTGTGGGCATCATCTTTCTTCATCCCCCTAATAAAGTGGAACATTGCTTTTGATTGAAGGGGGTTGAAGATGGCACCTTTATTTCTGGGTTTATCTCTTCTATTTCTGCTTGCACCTTCGGTAAGTCATCTAATCCACCTTAATTTTATTGGTTGATCTTTTTAGTTTCCAAAATTTTCAATTATTAATGATCAATCCTTTTATATTATGGTTGATTATACTTTTTTTTTGATGAAATTTTGTTATGTTTGCCTTATATTGATTGACCTGATCCACTTATCTTAAAGTTATGATCACTAGTTATAAGTTTAATTGGTCAAATATAATTATTACCTATTTCAAGGATCTATTTAAGCATCTGATATTGTTATTCGGGTTGTGGATGCAAGCTAGCTTAGTTTGTAAAGTTTTGGAGTGTGTATCTAAGACATGGGATTAATCCGGTCTATGATCTGGATCTGTTGAACTTTTTTTTGAGGGAAGGATCTGTTGAACTTTGATAGTTAAGCTGATGTATTTATGAATTTCAGTTGTTTAATGTCTTGGATTTGTGGGAAGAACTGCTGAACTGGGAATGAATTAATATTTGGATTATATGAATATTTTAAGTGAGAAGATTTAAGAAAAGTTCTTAATTTGTTTTTCAAATTCAATTTTTGATTTGAGCTTTTTGTATAGGTGGGGTGGGGCAGGATAACTGGGAGGTTTTGGTGTATTCAATAGTGGTGTTTTAAGAGTCAGTTAGCAAATTTGTCTTTTGGCAAACCCTTGTTTTATAAAGAATCTGGACTTATCATAGTTTGTCTTGTGTTATTTTTTTTGGTGTATATGGATCCAGACAATGGTGTAGACGGCATTTGATTCTAAACTTTAACTAACTAACTAACTGACTAAGCTAGTTGACATCCACACTGTTTTGCCGTGTAGATGGAGGGCTTCACTAAATTTTGTTGAGAATGTTATGGGTTTGATCTTTTGAGTTTAATTTCACTTTTGACGTATGGTTTAGTCATTTGGAATTTGTAAGAAATATTTTGCTAGTGTTCAAAGGTGCAGATGTTTTGAGGCTGATTTTCGTGCTCCCATTTTTTTAACTGTTGTCTACTTTTATGAAGAGGGAAGTGTGGTACTAGATTCGTGTAGTCCCATAGTCCTACAACTATTTGAAGTTTTGAACTAAGTACTCATTAGCACTGAACATTGTGCTTTTTGAATCTAATAAAAGAGTAGGGTAATGGTAAACATCTAACATAGACATAAAGACTAATACTTAACATGCATTTAATGGCACGAAACAACTAAATACCATATTTGGAATAACATTTAGTGAAACCAAAGGAGTCATGTAGACTGCACAGACAACCCTATGATTTTTCTGATGGTAATTGAAACTGCTGAATGTTGAAATCAAATTTTACTACCTTTATTTTCTTAAGCATGCACAATTACATCCACGAGTTCATGAATGTTCCTAGCCACAAAACTACTTATGCAAGCATAATAGGATTCTTCTTATTTTCTTTAGTGTGGCTGTTATTTAGTTCCCTCCTTTCTCTGCCACAACCTGAGTCATAACTCCTGAATGTATGGAAGTTTTGTTCTTTAATAGTCATGAAGTGGTAGAAAGTAGTATTTGATTTCTTACATCCAGAAGTTGCGTGTTATGCTTCTCAGTATATCAGCTTTTTTGTTATCTATTCCAATTTTTTACTCTGTGGATACATAATTGAAGTTATTTTCATTGTAGGTTTTAGTAGCAGAATCAGTTCCTGAGCTGAAGTTGAACTCCAACCTTCTCCAGGTATACTTCTTGCTAGCTTGAGTTATGTATAACTAATGAGCCATCACTATACTGATCACATTTCACCTCCAAATTATTATTTCAGGACTCAATAGTTAATTCGATAAATGCAAATCCAAAAGCTGGATGGAAGGCTGGCATGAACCAGCAATTATCCGACTATACAGTGAGTTTTTAGCCGAATGTTCTCCTCCAGTTTTCTGATGAAAGCGAGACTTAATCTTGCATCTTTAATTTACTAGAATGTCACAATTTCACTGATATTTTTTATATTCTATTGTTAAGCTTGGTCAATTCAAGTACATCTTAGGTGCTAGACCTACCCCAGAAAGTGAGAAGAGGAGCATCCCTCTCGTGCATCATGACAGGTCGTTGAACTTGCCAAAGGAGTTCGATGCCAGAAAAGCCTGGCCTCAGTGTACCTCTATTGGCAGAATTCTTGGTAAGCCAAATGCTATGTCCTTCTTCCTCATGTTCTCAGTTCTTACCAATTATGCAATGGATAATCTAACTTTTTTTGTCTCCCTTTTCCTCTATCTTACATGGGGCCATTGCTAAATCTCATATTAAAGACCAGGTACTTGGCACTTTATCACATTCATTTGTTAAAATGTCGACTTTTCAAGCTCACTTAATTAATTGATCTTCTGCAATCTGTCAAATAGGGTCATTGTGGTTCTTGTTGGGCATTTGCTGCTGTTGAAGCCCTTTCAGACCGTTTCTGCATCAATTATAGCATGGTAGTTCCCTCTCTACTTTTCGAATACTTCCATTGTATTTCCATTTTCCACCCTTATTGAAATAAATCATAACACCCAGTTCATCCAAATACTTGTAGAACATCACGTTGTCAGTTAATGATCTCTTGGCATGCTGTGGATTCTTGTGTGGAGCTGGATGTGATGGCGGGACGCCCATCTATGCTTGGCGCTATTTCGTTCACCATGGTGTTGTCACGGAAGAAGTAAAAGATCTTTCTTTACCCCCAAGGACATATTTTTTTCAACCTTTTGTACTATACTCTCTTCATTTCTAAAAGTTGATTTGTGTTATTGATTTCTCTGTGACAGTGTGATCCATACTTCGACACCACTGGTTGCTCCCACCCAGGTTGTGAGCCTGGTTACCCTACTCCAAAGTGTGTAAGGAAATGTGTGAGCGGAAACCAACTTTGGAGACAATCAAAGCATTATGGTTCAAATGCCTACAGAGTCAAATCGGACCAATATCAGATCATGGCTGAACTCTACAAGAACGGACCTGTTGAGGTGGCTTTCGATGTTTATGAGGTAAAAGGACGGATTTCTGAATCATTCATATGCGTCATTTTGATGGATAGTAGTTTAATTCAGTAGCAGATCTAGAATTGAATCCTATAAATGAGCTGCATTGGTGCCATGTATGTTTCTTTCGCGGTTTACACAATTCCAACGTTTTGGTATTCTTTTTTCAGGATTTTGCTCATTACAAGTCGGGGGTTTACAAACATGTAACTGGGCAATACCTTGGAGGACACGCTGTGAAGCTGATTGGATGGGGAACTTCAGCTGACGGAGAGGATTATTGGGTACTTTTTTGTTACCTTTTTAGTATTTTCCCCTTTAGATTTATGAATCATATCTCAATTTTGGTGCCTGATTGCTAATTCACTATCATGTTTCACAGCTTTTAGCGAATCAGTGGAACAGAAGCTGGGGTGATGTATGTTTCTTCTCTCTAGTAACCTTATACAAGTATTAGATTATATGTTTATATGTTTGGGACTCAGAAATGTTTCGTTGATTATAACAGGACGGATACTTCATGATCAGCCGGGGAACCAACGAATGTGGCATTGAAGAAGATGTGGTGGCTGGCTTGCCTTCACCCAAGAACATGTTCAAGGTGGTAACTGAATCAGATGATGCTGGTTCTGCCTCCATATAAACCATATGATTATTTCTCACTGATTGGCTTCAATAACGTCGTTTCAAAATCTGCATTGTTGCTGGAAATGTACACTATTTGTCATATTTGATATCCAGTGCTCATTTACTGTTATCAATATTGTATTTTGTGATTCAGAACATCAATGAAGATTGTAATATTCCCGTTATACAGGATTACAGGGACCATGAATTCTCAGAATCAGTCTCTGGTAGTAGTTTATTACATGTAAATATGCCTTTGAAAGAGAAAACTCTGTAATTAGTTGTCTAATTCTTTTTCTTCCCTCCCCCTTTGGATCCTGCTTGAATGTGTGAAGTTCTCAAGTGCCATGCTTAACCTGCTAATTGCTTCTTAGATCCATGCTTAGCGCGCATTTGACTAGATAACCTCTCGTTATCCCTTGCCAATTATGTCTTATGTCATTCTTAACATAAACCGAG

mRNA sequence

TTAGGTAACCACTTGTGTAGAATTATCCAAGTCAGTCCAAAAATCTGACCACACCCATAACCAATTTCTTCTGGGGATCAAATTTTCAATTGGTTCCAACTAAAAATACAAAAATATAAATTCAATTGGGCCAACTCAATCTACCATAATAAAGATTCCTGTCAAACATTAAAGAATCTACATATTTTCACATTTCCAACACCTTATCCTCTGCAAAAAAAAATTAAAAAAATTCTTGAATTAATTCACACCCCCAAAATATCTTTCAAAAACTGTATAAATTGAGAGGGAGAACCCCAGAATTTGTGGGCATCATCTTTCTTCATCCCCCTAATAAAGTGGAACATTGCTTTTGATTGAAGGGGGTTGAAGATGGCACCTTTATTTCTGGGTTTATCTCTTCTATTTCTGCTTGCACCTTCGGTTTTAGTAGCAGAATCAGTTCCTGAGCTGAAGTTGAACTCCAACCTTCTCCAGGACTCAATAGTTAATTCGATAAATGCAAATCCAAAAGCTGGATGGAAGGCTGGCATGAACCAGCAATTATCCGACTATACACTTGGTCAATTCAAGTACATCTTAGGTGCTAGACCTACCCCAGAAAGTGAGAAGAGGAGCATCCCTCTCGTGCATCATGACAGGTCGTTGAACTTGCCAAAGGAGTTCGATGCCAGAAAAGCCTGGCCTCAGTGTACCTCTATTGGCAGAATTCTTGGTAAGCCAAATGCTATGTCCTTCTTCCTCATGTTCTCAGTTCTTACCAATTATGCAATGGATAATCTAACTTTTTTTGGTCATTGTGGTTCTTGTTGGGCATTTGCTGCTGTTGAAGCCCTTTCAGACCGTTTCTGCATCAATTATAGCATGAACATCACGTTGTCAGTTAATGATCTCTTGGCATGCTGTGGATTCTTGTGTGGAGCTGGATGTGATGGCGGGACGCCCATCTATGCTTGGCGCTATTTCGTTCACCATGGTGTTGTCACGGAAGAATGTGATCCATACTTCGACACCACTGGTTGCTCCCACCCAGGTTGTGAGCCTGGTTACCCTACTCCAAAGTGTGTAAGGAAATGTGTGAGCGGAAACCAACTTTGGAGACAATCAAAGCATTATGGTTCAAATGCCTACAGAGTCAAATCGGACCAATATCAGATCATGGCTGAACTCTACAAGAACGGACCTGTTGAGGTGGCTTTCGATGTTTATGAGGATTTTGCTCATTACAAGTCGGGGGTTTACAAACATGTAACTGGGCAATACCTTGGAGGACACGCTGTGAAGCTGATTGGATGGGGAACTTCAGCTGACGGAGAGGATTATTGGCTTTTAGCGAATCAGTGGAACAGAAGCTGGGGTGATGACGGATACTTCATGATCAGCCGGGGAACCAACGAATGTGGCATTGAAGAAGATGTGGTGGCTGGCTTGCCTTCACCCAAGAACATGTTCAAGGTGGTAACTGAATCAGATGATGCTGGTTCTGCCTCCATATAAACCATATGATTATTTCTCACTGATTGGCTTCAATAACGTCGTTTCAAAATCTGCATTGTTGCTGGAAATGTACACTATTTGTCATATTTGATATCCAGTGCTCATTTACTGTTATCAATATTGTATTTTGTGATTCAGAACATCAATGAAGATTGTAATATTCCCGTTATACAGGATTACAGGGACCATGAATTCTCAGAATCAGTCTCTGGTAGTAGTTTATTACATGTAAATATGCCTTTGAAAGAGAAAACTCTGTAATTAGTTGTCTAATTCTTTTTCTTCCCTCCCCCTTTGGATCCTGCTTGAATGTGTGAAGTTCTCAAGTGCCATGCTTAACCTGCTAATTGCTTCTTAGATCCATGCTTAGCGCGCATTTGACTAGATAACCTCTCGTTATCCCTTGCCAATTATGTCTTATGTCATTCTTAACATAAACCGAG

Coding sequence (CDS)

ATGGCACCTTTATTTCTGGGTTTATCTCTTCTATTTCTGCTTGCACCTTCGGTTTTAGTAGCAGAATCAGTTCCTGAGCTGAAGTTGAACTCCAACCTTCTCCAGGACTCAATAGTTAATTCGATAAATGCAAATCCAAAAGCTGGATGGAAGGCTGGCATGAACCAGCAATTATCCGACTATACACTTGGTCAATTCAAGTACATCTTAGGTGCTAGACCTACCCCAGAAAGTGAGAAGAGGAGCATCCCTCTCGTGCATCATGACAGGTCGTTGAACTTGCCAAAGGAGTTCGATGCCAGAAAAGCCTGGCCTCAGTGTACCTCTATTGGCAGAATTCTTGGTAAGCCAAATGCTATGTCCTTCTTCCTCATGTTCTCAGTTCTTACCAATTATGCAATGGATAATCTAACTTTTTTTGGTCATTGTGGTTCTTGTTGGGCATTTGCTGCTGTTGAAGCCCTTTCAGACCGTTTCTGCATCAATTATAGCATGAACATCACGTTGTCAGTTAATGATCTCTTGGCATGCTGTGGATTCTTGTGTGGAGCTGGATGTGATGGCGGGACGCCCATCTATGCTTGGCGCTATTTCGTTCACCATGGTGTTGTCACGGAAGAATGTGATCCATACTTCGACACCACTGGTTGCTCCCACCCAGGTTGTGAGCCTGGTTACCCTACTCCAAAGTGTGTAAGGAAATGTGTGAGCGGAAACCAACTTTGGAGACAATCAAAGCATTATGGTTCAAATGCCTACAGAGTCAAATCGGACCAATATCAGATCATGGCTGAACTCTACAAGAACGGACCTGTTGAGGTGGCTTTCGATGTTTATGAGGATTTTGCTCATTACAAGTCGGGGGTTTACAAACATGTAACTGGGCAATACCTTGGAGGACACGCTGTGAAGCTGATTGGATGGGGAACTTCAGCTGACGGAGAGGATTATTGGCTTTTAGCGAATCAGTGGAACAGAAGCTGGGGTGATGACGGATACTTCATGATCAGCCGGGGAACCAACGAATGTGGCATTGAAGAAGATGTGGTGGCTGGCTTGCCTTCACCCAAGAACATGTTCAAGGTGGTAACTGAATCAGATGATGCTGGTTCTGCCTCCATATAA

Protein sequence

MAPLFLGLSLLFLLAPSVLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSDYTLGQFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAMSFFLMFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGFLCGAGCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQLWRQSKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGGHAVKLIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKNMFKVVTESDDAGSASI
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Spo08785Spo08785gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Spo08785.1Spo08785.1-proteinpolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo08785.1.exon.1Spo08785.1.exon.1exon
Spo08785.1.exon.2Spo08785.1.exon.2exon
Spo08785.1.exon.3Spo08785.1.exon.3exon
Spo08785.1.exon.4Spo08785.1.exon.4exon
Spo08785.1.exon.5Spo08785.1.exon.5exon
Spo08785.1.exon.6Spo08785.1.exon.6exon
Spo08785.1.exon.7Spo08785.1.exon.7exon
Spo08785.1.exon.8Spo08785.1.exon.8exon
Spo08785.1.exon.9Spo08785.1.exon.9exon
Spo08785.1.exon.10Spo08785.1.exon.10exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo08785.1.utr5p.1Spo08785.1.utr5p.1five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo08785.1.CDS.1Spo08785.1.CDS.1CDS
Spo08785.1.CDS.2Spo08785.1.CDS.2CDS
Spo08785.1.CDS.3Spo08785.1.CDS.3CDS
Spo08785.1.CDS.4Spo08785.1.CDS.4CDS
Spo08785.1.CDS.5Spo08785.1.CDS.5CDS
Spo08785.1.CDS.6Spo08785.1.CDS.6CDS
Spo08785.1.CDS.7Spo08785.1.CDS.7CDS
Spo08785.1.CDS.8Spo08785.1.CDS.8CDS
Spo08785.1.CDS.9Spo08785.1.CDS.9CDS
Spo08785.1.CDS.10Spo08785.1.CDS.10CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo08785.1.utr3p.1Spo08785.1.utr3p.1three_prime_UTR


Homology
BLAST of Spo08785.1 vs. NCBI nr
Match: gi|902170206|gb|KNA07799.1| (hypothetical protein SOVF_168600 [Spinacia oleracea])

HSP 1 Score: 788.9 bits (2036), Expect = 3.900e-225
Identity = 374/374 (100.00%), Postives = 374/374 (100.00%), Query Frame = 1

		  

Query: 1   MAPLFLGLSLLFLLAPSVLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSD 60
           MAPLFLGLSLLFLLAPSVLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSD
Sbjct: 1   MAPLFLGLSLLFLLAPSVLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSD 60

Query: 61  YTLGQFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAM 120
           YTLGQFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAM
Sbjct: 61  YTLGQFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAM 120

Query: 121 SFFLMFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGF 180
           SFFLMFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGF
Sbjct: 121 SFFLMFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGF 180

Query: 181 LCGAGCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQ 240
           LCGAGCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQ
Sbjct: 181 LCGAGCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQ 240

Query: 241 LWRQSKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGG 300
           LWRQSKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGG
Sbjct: 241 LWRQSKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGG 300

Query: 301 HAVKLIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKNMF 360
           HAVKLIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKNMF
Sbjct: 301 HAVKLIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKNMF 360

Query: 361 KVVTESDDAGSASI 375
           KVVTESDDAGSASI
Sbjct: 361 KVVTESDDAGSASI 374

BLAST of Spo08785.1 vs. NCBI nr
Match: gi|731368155|ref|XP_010695986.1| (PREDICTED: cathepsin B-like [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 642.5 bits (1656), Expect = 4.600e-181
Identity = 300/376 (79.79%), Postives = 329/376 (87.50%), Query Frame = 1

		  

Query: 1   MAPLFLGLS--LLFLLAPSVLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQL 60
           MA  FLGLS  LL LLAPS+ VAESVPELKLNSNLLQDS++NSIN NPKAGWKAGMNQQL
Sbjct: 1   MASSFLGLSIFLLLLLAPSIFVAESVPELKLNSNLLQDSMINSINGNPKAGWKAGMNQQL 60

Query: 61  SDYTLGQFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPN 120
           SDYT+GQFK ILGAR TP  EKR+IP+VHHDRSL LPKEFDARKAWPQC +IGRI  +  
Sbjct: 61  SDYTVGQFKRILGARKTPPGEKRNIPVVHHDRSLKLPKEFDARKAWPQCRTIGRIYDQ-- 120

Query: 121 AMSFFLMFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACC 180
                                 GHCGSCWA+AAVEAL DRFCI+Y MNI+LSVNDLLACC
Sbjct: 121 ----------------------GHCGSCWAYAAVEALQDRFCIHYGMNISLSVNDLLACC 180

Query: 181 GFLCGAGCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSG 240
           GF+CG+GC+GGTPI+AWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCV+G
Sbjct: 181 GFMCGSGCNGGTPIFAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVAG 240

Query: 241 NQLWRQSKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYL 300
           NQLW+QSKHYG+NAYR+K D  QIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTG+YL
Sbjct: 241 NQLWKQSKHYGANAYRIKQDPQQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGEYL 300

Query: 301 GGHAVKLIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKN 360
           GGHAVKLIGWGT+ADGEDYWL+ANQWN+SWGDDGYFMISRGTNEC +EEDVV G+PSPKN
Sbjct: 301 GGHAVKLIGWGTTADGEDYWLVANQWNKSWGDDGYFMISRGTNECCVEEDVVGGMPSPKN 352

Query: 361 MFKVVTESDDAGSASI 375
           +F+VVT SDDAG+AS+
Sbjct: 361 LFEVVTGSDDAGAASL 352

BLAST of Spo08785.1 vs. NCBI nr
Match: gi|223545619|gb|EEF47123.1| (cathepsin B, putative [Ricinus communis])

HSP 1 Score: 544.3 bits (1401), Expect = 1.700e-151
Identity = 245/355 (69.01%), Postives = 293/355 (82.54%), Query Frame = 1

		  

Query: 10  LLFLLAPS-----VLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSDYTLG 69
           LLFL+A S     V+  E   +LKLNS +LQ+SI+  +N NP AGW+A MN QLS++T+G
Sbjct: 12  LLFLVALSSFHSRVISTELDSKLKLNSRILQESIIKKVNENPDAGWEAAMNPQLSNFTVG 71

Query: 70  QFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAMSFFL 129
           QFKY+LGA+PTP+ E   +P++ H ++L LPKEFDAR AWP C++IG+ILG+   +SF+ 
Sbjct: 72  QFKYLLGAKPTPKKELMGVPMISHPKTLKLPKEFDARTAWPHCSTIGKILGQ--LLSFYN 131

Query: 130 MFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGFLCGA 189
           +FS+     ++     GHCGSCWAF AVE+LSDRFCI++ MNI+LSVNDLLACCGFLCG 
Sbjct: 132 IFSIFFFLFLE-----GHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFLCGD 191

Query: 190 GCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQLWRQ 249
           GCDGG P+YAWRYFVHHGVVTEECDPYFD  GCSHPGCEPG+PTPKCVRKC+  NQLWRQ
Sbjct: 192 GCDGGYPMYAWRYFVHHGVVTEECDPYFDNIGCSHPGCEPGFPTPKCVRKCIDKNQLWRQ 251

Query: 250 SKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGGHAVK 309
           SKHY  NAYR+ SD + +MAE+YKNGPVEV+F VYEDFAHYKSGVYKH+TG+ +GGHAVK
Sbjct: 252 SKHYSVNAYRISSDPHDVMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGEVMGGHAVK 311

Query: 310 LIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKNM 360
           LIGWGTS +GEDYWLLANQWNR WGDDGYF I RGTNECGIE+D VAGLPS +N+
Sbjct: 312 LIGWGTSDNGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEDDAVAGLPSARNL 359

BLAST of Spo08785.1 vs. NCBI nr
Match: gi|802694007|ref|XP_012083054.1| (PREDICTED: cathepsin B [Jatropha curcas])

HSP 1 Score: 541.6 bits (1394), Expect = 1.100e-150
Identity = 257/375 (68.53%), Postives = 288/375 (76.80%), Query Frame = 1

		  

Query: 4   LFLGLSLLFLLAPSVL-VAESVPE--LKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSD 63
           LF  L  L  +APS L V    P+  LKL+S +LQDSI+  IN NP AGW+A MN + S+
Sbjct: 7   LFTLLLFLGTIAPSHLQVIAEAPDSKLKLSSRVLQDSIIRKINENPNAGWEAAMNPRFSN 66

Query: 64  YTLGQFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAM 123
           YT+G+FKY+LG +PTP+ E R +PLV H +SL LPKEFDAR AWPQC++IGRIL +    
Sbjct: 67  YTVGEFKYLLGVKPTPKKELRGVPLVSHPKSLKLPKEFDARSAWPQCSTIGRILDQ---- 126

Query: 124 SFFLMFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGF 183
                               GHCGSCWAF AVE+LSDRFCIN+ MNI+LSVNDLLACCGF
Sbjct: 127 --------------------GHCGSCWAFGAVESLSDRFCINFGMNISLSVNDLLACCGF 186

Query: 184 LCGAGCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQ 243
           LCG GCDGG P+YAWRY VHHGVVTEECDPYFD  GCSHPGCEPG+PTP+CVRKCV  NQ
Sbjct: 187 LCGNGCDGGYPLYAWRYLVHHGVVTEECDPYFDDIGCSHPGCEPGFPTPRCVRKCVDKNQ 246

Query: 244 LWRQSKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGG 303
            WRQSKHY  NAYR++SD Y IMAELYKNGPVEVAF VYEDFAHYKSGVYKH+TG  LGG
Sbjct: 247 FWRQSKHYSVNAYRIRSDPYDIMAELYKNGPVEVAFTVYEDFAHYKSGVYKHITGDQLGG 306

Query: 304 HAVKLIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPK--N 363
           HAVKLIGWGTS DGEDYWLLANQWNR WGDDGYF I RG NECGIEEDVVAGLPS +  N
Sbjct: 307 HAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKIKRGVNECGIEEDVVAGLPSSRNLN 357

Query: 364 MFKVVTESDDAGSAS 374
           + + V  +D  G AS
Sbjct: 367 LVREVAGTDIVGDAS 357

BLAST of Spo08785.1 vs. NCBI nr
Match: gi|702240634|ref|XP_010045446.1| (PREDICTED: cathepsin B-like [Eucalyptus grandis])

HSP 1 Score: 541.2 bits (1393), Expect = 1.400e-150
Identity = 247/369 (66.94%), Postives = 296/369 (80.22%), Query Frame = 1

		  

Query: 8   LSLLFLLAPSVLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSDYTLGQFK 67
           L+ L +L   V   +S+ +LKLNS++LQ+SI+  IN NP AGW+A MN + S++T+GQFK
Sbjct: 15  LATLSILLAQVNAEKSLSQLKLNSHILQNSIIKEINENPNAGWQAAMNPRFSNFTVGQFK 74

Query: 68  YILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAMSFFLMFS 127
           ++LG +PTP  E   +P+  H +SL LP++FDAR AW QC++IGRILG+   + F L++S
Sbjct: 75  HLLGVKPTPHGELTQVPIKTHPKSLKLPEKFDARTAWSQCSTIGRILGQFVCLVFHLIYS 134

Query: 128 VLTNYAMDNLTFF--GHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGFLCGAG 187
               Y  D  +    GHCGSCWAF AVE+LSDRFCI++ MNI+LSVNDLLACCGF+CGAG
Sbjct: 135 ---RYYADAFSLLCDGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFMCGAG 194

Query: 188 CDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQLWRQS 247
           C+GG P++AWRYF+HHGVVTEEC PYFD  GCSHPGCEP YPTPKCVRKCV+GNQ+WR S
Sbjct: 195 CNGGYPMFAWRYFMHHGVVTEECYPYFDDIGCSHPGCEPEYPTPKCVRKCVNGNQMWRSS 254

Query: 248 KHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGGHAVKL 307
           KHY  +AYRV SD Y IMAE+YKNGPVEV+F VYEDFAHYKSGVYKHVTG  LGGHAVKL
Sbjct: 255 KHYSVSAYRVDSDPYNIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVLGGHAVKL 314

Query: 308 IGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKNMFKVVTE 367
           IGWGT+ DGEDYWL+ANQWNRSWGDDGYF I RGTNECGIE DVV GLPS KN+ + V  
Sbjct: 315 IGWGTTDDGEDYWLIANQWNRSWGDDGYFKIRRGTNECGIEGDVVTGLPSTKNLVRKVVS 374

Query: 368 SDDAGSASI 375
            DD+G AS+
Sbjct: 375 VDDSGDASL 380

BLAST of Spo08785.1 vs. UniProtKB/TrEMBL
Match: A0A0K9QKM1_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_168600 PE=3 SV=1)

HSP 1 Score: 788.9 bits (2036), Expect = 2.700e-225
Identity = 374/374 (100.00%), Postives = 374/374 (100.00%), Query Frame = 1

		  

Query: 1   MAPLFLGLSLLFLLAPSVLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSD 60
           MAPLFLGLSLLFLLAPSVLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSD
Sbjct: 1   MAPLFLGLSLLFLLAPSVLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSD 60

Query: 61  YTLGQFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAM 120
           YTLGQFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAM
Sbjct: 61  YTLGQFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAM 120

Query: 121 SFFLMFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGF 180
           SFFLMFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGF
Sbjct: 121 SFFLMFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGF 180

Query: 181 LCGAGCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQ 240
           LCGAGCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQ
Sbjct: 181 LCGAGCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQ 240

Query: 241 LWRQSKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGG 300
           LWRQSKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGG
Sbjct: 241 LWRQSKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGG 300

Query: 301 HAVKLIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKNMF 360
           HAVKLIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKNMF
Sbjct: 301 HAVKLIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKNMF 360

Query: 361 KVVTESDDAGSASI 375
           KVVTESDDAGSASI
Sbjct: 361 KVVTESDDAGSASI 374

BLAST of Spo08785.1 vs. UniProtKB/TrEMBL
Match: B9RN00_RICCO (Cathepsin B, putative OS=Ricinus communis GN=RCOM_1341910 PE=3 SV=1)

HSP 1 Score: 544.3 bits (1401), Expect = 1.200e-151
Identity = 245/355 (69.01%), Postives = 293/355 (82.54%), Query Frame = 1

		  

Query: 10  LLFLLAPS-----VLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSDYTLG 69
           LLFL+A S     V+  E   +LKLNS +LQ+SI+  +N NP AGW+A MN QLS++T+G
Sbjct: 12  LLFLVALSSFHSRVISTELDSKLKLNSRILQESIIKKVNENPDAGWEAAMNPQLSNFTVG 71

Query: 70  QFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAMSFFL 129
           QFKY+LGA+PTP+ E   +P++ H ++L LPKEFDAR AWP C++IG+ILG+   +SF+ 
Sbjct: 72  QFKYLLGAKPTPKKELMGVPMISHPKTLKLPKEFDARTAWPHCSTIGKILGQ--LLSFYN 131

Query: 130 MFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGFLCGA 189
           +FS+     ++     GHCGSCWAF AVE+LSDRFCI++ MNI+LSVNDLLACCGFLCG 
Sbjct: 132 IFSIFFFLFLE-----GHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFLCGD 191

Query: 190 GCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQLWRQ 249
           GCDGG P+YAWRYFVHHGVVTEECDPYFD  GCSHPGCEPG+PTPKCVRKC+  NQLWRQ
Sbjct: 192 GCDGGYPMYAWRYFVHHGVVTEECDPYFDNIGCSHPGCEPGFPTPKCVRKCIDKNQLWRQ 251

Query: 250 SKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGGHAVK 309
           SKHY  NAYR+ SD + +MAE+YKNGPVEV+F VYEDFAHYKSGVYKH+TG+ +GGHAVK
Sbjct: 252 SKHYSVNAYRISSDPHDVMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGEVMGGHAVK 311

Query: 310 LIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKNM 360
           LIGWGTS +GEDYWLLANQWNR WGDDGYF I RGTNECGIE+D VAGLPS +N+
Sbjct: 312 LIGWGTSDNGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEDDAVAGLPSARNL 359

BLAST of Spo08785.1 vs. UniProtKB/TrEMBL
Match: A0A067K985_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14145 PE=3 SV=1)

HSP 1 Score: 541.6 bits (1394), Expect = 7.600e-151
Identity = 257/375 (68.53%), Postives = 288/375 (76.80%), Query Frame = 1

		  

Query: 4   LFLGLSLLFLLAPSVL-VAESVPE--LKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSD 63
           LF  L  L  +APS L V    P+  LKL+S +LQDSI+  IN NP AGW+A MN + S+
Sbjct: 7   LFTLLLFLGTIAPSHLQVIAEAPDSKLKLSSRVLQDSIIRKINENPNAGWEAAMNPRFSN 66

Query: 64  YTLGQFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAM 123
           YT+G+FKY+LG +PTP+ E R +PLV H +SL LPKEFDAR AWPQC++IGRIL +    
Sbjct: 67  YTVGEFKYLLGVKPTPKKELRGVPLVSHPKSLKLPKEFDARSAWPQCSTIGRILDQ---- 126

Query: 124 SFFLMFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGF 183
                               GHCGSCWAF AVE+LSDRFCIN+ MNI+LSVNDLLACCGF
Sbjct: 127 --------------------GHCGSCWAFGAVESLSDRFCINFGMNISLSVNDLLACCGF 186

Query: 184 LCGAGCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQ 243
           LCG GCDGG P+YAWRY VHHGVVTEECDPYFD  GCSHPGCEPG+PTP+CVRKCV  NQ
Sbjct: 187 LCGNGCDGGYPLYAWRYLVHHGVVTEECDPYFDDIGCSHPGCEPGFPTPRCVRKCVDKNQ 246

Query: 244 LWRQSKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGG 303
            WRQSKHY  NAYR++SD Y IMAELYKNGPVEVAF VYEDFAHYKSGVYKH+TG  LGG
Sbjct: 247 FWRQSKHYSVNAYRIRSDPYDIMAELYKNGPVEVAFTVYEDFAHYKSGVYKHITGDQLGG 306

Query: 304 HAVKLIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPK--N 363
           HAVKLIGWGTS DGEDYWLLANQWNR WGDDGYF I RG NECGIEEDVVAGLPS +  N
Sbjct: 307 HAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKIKRGVNECGIEEDVVAGLPSSRNLN 357

Query: 364 MFKVVTESDDAGSAS 374
           + + V  +D  G AS
Sbjct: 367 LVREVAGTDIVGDAS 357

BLAST of Spo08785.1 vs. UniProtKB/TrEMBL
Match: A0A061DRR6_THECC (Cysteine proteinases superfamily protein OS=Theobroma cacao GN=TCM_004987 PE=3 SV=1)

HSP 1 Score: 533.9 bits (1374), Expect = 1.600e-148
Identity = 247/368 (67.12%), Postives = 286/368 (77.72%), Query Frame = 1

		  

Query: 3   PLFLGLSLLFLLA---PSVLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLS 62
           PL    S L LL+   P V+  E + E+KLNS +LQDSIV  +N NPKAGWKA +N +LS
Sbjct: 7   PLLFLASFLLLLSTVHPKVIAVEQLSEVKLNSQILQDSIVKQVNENPKAGWKAALNPRLS 66

Query: 63  DYTLGQFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNA 122
           +YT+G+FK++LG +PTP+ E   IP++ H +SL +P +FDAR AWPQC++IGRIL +   
Sbjct: 67  NYTVGEFKHLLGVKPTPKKELLGIPVITHGKSLKVPTKFDARTAWPQCSTIGRILDQ--- 126

Query: 123 MSFFLMFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCG 182
                                GHCGSCWAF AVE+LSDRFCI++SMNI+LSVNDLLACCG
Sbjct: 127 ---------------------GHCGSCWAFGAVESLSDRFCIHFSMNISLSVNDLLACCG 186

Query: 183 FLCGAGCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGN 242
           FLCG+GCDGG PI AWRYFV  GVVTEECDPYFD TGCSHPGCEP YPTP+CV+KCV GN
Sbjct: 187 FLCGSGCDGGYPISAWRYFVRRGVVTEECDPYFDDTGCSHPGCEPAYPTPRCVKKCVKGN 246

Query: 243 QLWRQSKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLG 302
           QLWR+SKHY   AYR+ SD   IMAE+Y NGPVEV+F VYEDFAHYKSGVYKHVTG  +G
Sbjct: 247 QLWRESKHYSVGAYRINSDPADIMAEVYTNGPVEVSFTVYEDFAHYKSGVYKHVTGGVMG 306

Query: 303 GHAVKLIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKNM 362
           GHAVKLIGWGTS DGEDYWLLANQWNR WGDDGYF ISRGTNECGIE+DVVAGLPS KN+
Sbjct: 307 GHAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKISRGTNECGIEDDVVAGLPSTKNL 350

Query: 363 FKVVTESD 368
            + V + D
Sbjct: 367 VREVGDMD 350

BLAST of Spo08785.1 vs. UniProtKB/TrEMBL
Match: B9GRU7_POPTR (Putative cathepsin B-like protease family protein OS=Populus trichocarpa GN=POPTR_0002s18510g PE=3 SV=2)

HSP 1 Score: 531.6 bits (1368), Expect = 7.900e-148
Identity = 245/370 (66.22%), Postives = 281/370 (75.95%), Query Frame = 1

		  

Query: 4   LFLGLSLLFLLAPSVLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSDYTL 63
           L L +  +F     V+  E V +LKLNS +LQDSI+  +N NPKAGWKA MN   S+YT+
Sbjct: 11  LLLLIGAIFTFQSQVIAVEPVSDLKLNSRILQDSILKKVNGNPKAGWKATMNHHFSNYTV 70

Query: 64  GQFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAMSFF 123
            QFKY+LG +PTP+ E R IP++ H +SL LP+EFDAR AWPQC++IG+IL +       
Sbjct: 71  AQFKYLLGVKPTPKEELRGIPVISHPKSLRLPEEFDARTAWPQCSTIGKILDQ------- 130

Query: 124 LMFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGFLCG 183
                            GHCGSCWAF AVE+LSDRFCI+Y MNI+LSVNDLLACCGFLCG
Sbjct: 131 -----------------GHCGSCWAFGAVESLSDRFCIHYGMNISLSVNDLLACCGFLCG 190

Query: 184 AGCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQLWR 243
           +GC+GG PI AWRYFVHHGVVTEECDPYFD  GCSHPGCEPGYPTPKC RKCV+ NQLW+
Sbjct: 191 SGCNGGYPISAWRYFVHHGVVTEECDPYFDDIGCSHPGCEPGYPTPKCARKCVNKNQLWK 250

Query: 244 QSKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGGHAV 303
           +SKHYG   YR+ SD   IMAE+YKNGPVEVAF VYEDFAHYKSGVYKH+TG  +GGHAV
Sbjct: 251 KSKHYGVKPYRIDSDPDSIMAEIYKNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAV 310

Query: 304 KLIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKNMFKVV 363
           KLIGWGTS DGE YWLLANQWNR WGDDG+F I RGTNECGIE DVVAGLPS +N+ + V
Sbjct: 311 KLIGWGTSEDGEAYWLLANQWNRGWGDDGFFKIRRGTNECGIEGDVVAGLPSTRNLVREV 356

Query: 364 TESDDAGSAS 374
              D    AS
Sbjct: 371 VSIDAREDAS 356

BLAST of Spo08785.1 vs. ExPASy Swiss-Prot
Match: CATB_MACFA (Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1)

HSP 1 Score: 264.6 bits (675), Expect = 1.600e-69
Identity = 144/339 (42.48%), Postives = 185/339 (54.57%), Query Frame = 1

		  

Query: 34  LQDSIVNSINANPKAGWKAGMNQQLSD--YTLGQFKYILGARPTPESEKRSIPLVHHDRS 93
           L D +VN +N      W+AG N    D  Y        LG    P+        V     
Sbjct: 26  LSDELVNYVNKQ-NTTWQAGHNFYNVDVSYLKRLCGTFLGGPKPPQR-------VMFTED 85

Query: 94  LNLPKEFDARKAWPQCTSIGRILGKPNAMSFFLMFSVLTNYAMDNLTFFGHCGSCWAFAA 153
           L LP+ FDAR+ WPQC +I  I  +                        G CGSCWAF A
Sbjct: 86  LKLPESFDAREQWPQCPTIKEIRDQ------------------------GSCGSCWAFGA 145

Query: 154 VEALSDRFCI--NYSMNITLSVNDLLACCGFLCGAGCDGGTPIYAWRYFVHHGVVTEE-- 213
           VEA+SDR CI  N  +++ +S  DLL CCG +CG GC+GG P  AW ++   G+V+    
Sbjct: 146 VEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLY 205

Query: 214 -----CDPYFDTTGCSH------PGCEPGYPTPKCVRKCVSG-NQLWRQSKHYGSNAYRV 273
                C PY     C H      P C     TPKC + C  G +  ++Q KHYG N+Y V
Sbjct: 206 DSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSV 265

Query: 274 KSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGGHAVKLIGWGTSADGE 333
            + +  IMAE+YKNGPVE AF VY DF  YKSGVY+HVTG+ +GGHA++++GWG   +G 
Sbjct: 266 SNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE-NGT 325

Query: 334 DYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLP 355
            YWL+AN WN  WGD+G+F I RG + CGIE +VVAG+P
Sbjct: 326 PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIP 330

BLAST of Spo08785.1 vs. ExPASy Swiss-Prot
Match: CATB_PONAB (Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1)

HSP 1 Score: 264.2 bits (674), Expect = 2.100e-69
Identity = 144/339 (42.48%), Postives = 185/339 (54.57%), Query Frame = 1

		  

Query: 34  LQDSIVNSINANPKAGWKAGMNQQLSD--YTLGQFKYILGARPTPESEKRSIPLVHHDRS 93
           L D +VN +N      W+AG N    D  Y        LG    P+        V     
Sbjct: 26  LSDELVNYVNKR-NTTWQAGHNFYNVDVSYLKKLCGTFLGGPKPPQR-------VMFTED 85

Query: 94  LNLPKEFDARKAWPQCTSIGRILGKPNAMSFFLMFSVLTNYAMDNLTFFGHCGSCWAFAA 153
           L LP+ FDAR+ WPQC +I  I  +                        G CGSCWAF A
Sbjct: 86  LKLPESFDAREQWPQCPTIKEIRDQ------------------------GSCGSCWAFGA 145

Query: 154 VEALSDRFCI--NYSMNITLSVNDLLACCGFLCGAGCDGGTPIYAWRYFVHHGVVTEE-- 213
           VEA+SDR CI  N  +++ +S  DLL CCG +CG GC+GG P  AW ++   G+V+    
Sbjct: 146 VEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLY 205

Query: 214 -----CDPYFDTTGCSH------PGCEPGYPTPKCVRKCVSG-NQLWRQSKHYGSNAYRV 273
                C PY     C H      P C     TPKC + C  G +  ++Q KHYG N+Y V
Sbjct: 206 ESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSV 265

Query: 274 KSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGGHAVKLIGWGTSADGE 333
            + +  IMAE+YKNGPVE AF VY DF  YKSGVY+HVTG+ +GGHA++++GWG   +G 
Sbjct: 266 SNSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE-NGT 325

Query: 334 DYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLP 355
            YWL+AN WN  WGD+G+F I RG + CGIE +VVAG+P
Sbjct: 326 PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIP 330

BLAST of Spo08785.1 vs. ExPASy Swiss-Prot
Match: CATB_HUMAN (Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3)

HSP 1 Score: 263.5 bits (672), Expect = 3.600e-69
Identity = 144/339 (42.48%), Postives = 184/339 (54.28%), Query Frame = 1

		  

Query: 34  LQDSIVNSINANPKAGWKAGMNQQLSD--YTLGQFKYILGARPTPESEKRSIPLVHHDRS 93
           L D +VN +N      W+AG N    D  Y        LG    P+        V     
Sbjct: 26  LSDELVNYVNKR-NTTWQAGHNFYNVDMSYLKRLCGTFLGGPKPPQR-------VMFTED 85

Query: 94  LNLPKEFDARKAWPQCTSIGRILGKPNAMSFFLMFSVLTNYAMDNLTFFGHCGSCWAFAA 153
           L LP  FDAR+ WPQC +I  I  +                        G CGSCWAF A
Sbjct: 86  LKLPASFDAREQWPQCPTIKEIRDQ------------------------GSCGSCWAFGA 145

Query: 154 VEALSDRFCI--NYSMNITLSVNDLLACCGFLCGAGCDGGTPIYAWRYFVHHGVVTEE-- 213
           VEA+SDR CI  N  +++ +S  DLL CCG +CG GC+GG P  AW ++   G+V+    
Sbjct: 146 VEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLY 205

Query: 214 -----CDPYFDTTGCSH------PGCEPGYPTPKCVRKCVSG-NQLWRQSKHYGSNAYRV 273
                C PY     C H      P C     TPKC + C  G +  ++Q KHYG N+Y V
Sbjct: 206 ESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSV 265

Query: 274 KSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGGHAVKLIGWGTSADGE 333
            + +  IMAE+YKNGPVE AF VY DF  YKSGVY+HVTG+ +GGHA++++GWG   +G 
Sbjct: 266 SNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE-NGT 325

Query: 334 DYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLP 355
            YWL+AN WN  WGD+G+F I RG + CGIE +VVAG+P
Sbjct: 326 PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIP 330

BLAST of Spo08785.1 vs. ExPASy Swiss-Prot
Match: CATB_RAT (Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2)

HSP 1 Score: 258.5 bits (659), Expect = 1.200e-67
Identity = 146/360 (40.56%), Postives = 193/360 (53.61%), Query Frame = 1

		  

Query: 13  LLAPSVLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSD--YTLGQFKYIL 72
           L+  S L+A +    K +S+ L D ++N IN      W+AG N    D  Y       +L
Sbjct: 5   LIPLSCLLALTSAHDKPSSHPLSDDMINYINKQ-NTTWQAGRNFYNVDISYLKKLCGTVL 64

Query: 73  GARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAMSFFLMFSVLT 132
           G    PE        V     +NLP+ FDAR+ W  C +I +I  +              
Sbjct: 65  GGPNLPER-------VGFSEDINLPESFDAREQWSNCPTIAQIRDQ-------------- 124

Query: 133 NYAMDNLTFFGHCGSCWAFAAVEALSDRFCI--NYSMNITLSVNDLLACCGFLCGAGCDG 192
                     G CGSCWAF AVEA+SDR CI  N  +N+ +S  DLL CCG  CG GC+G
Sbjct: 125 ----------GSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDGCNG 184

Query: 193 GTPIYAWRYFVHHGVVTEE-------CDPYFDTTGCSH------PGCEPGYPTPKCVRKC 252
           G P  AW ++   G+V+         C PY     C H      P C     TPKC + C
Sbjct: 185 GYPSGAWNFWTRKGLVSGGVYNSHIGCLPY-TIPPCEHHVNGSRPPCTGEGDTPKCNKMC 244

Query: 253 VSG-NQLWRQSKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVT 312
            +G +  +++ KHYG  +Y V   + +IMAE+YKNGPVE AF V+ DF  YKSGVYKH  
Sbjct: 245 EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEA 304

Query: 313 GQYLGGHAVKLIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLP 355
           G  +GGHA++++GWG   +G  YWL+AN WN  WGD+G+F I RG N CGIE ++VAG+P
Sbjct: 305 GDVMGGHAIRILGWGIE-NGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIP 330

BLAST of Spo08785.1 vs. ExPASy Swiss-Prot
Match: CYSP_SCHJA (Cathepsin B-like cysteine proteinase OS=Schistosoma japonicum GN=CATB PE=2 SV=1)

HSP 1 Score: 257.7 bits (657), Expect = 2.000e-67
Identity = 140/339 (41.30%), Postives = 188/339 (55.46%), Query Frame = 1

		  

Query: 34  LQDSIVNSINANPKAGWKAGMNQQLSDYTLGQFKYILGARPTPESEKRSI-PLV-HHDRS 93
           L D +++ IN +P AGWKA  + +   ++L   + ++GAR      KR+  P V HHD +
Sbjct: 30  LSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMGARKEDAEMKRNRRPTVDHHDLN 89

Query: 94  LNLPKEFDARKAWPQCTSIGRILGKPNAMSFFLMFSVLTNYAMDNLTFFGHCGSCWAFAA 153
           + +P +FD+RK WP C SI +I  +                          CGSCWAF A
Sbjct: 90  VEIPSQFDSRKKWPHCKSISQIRDQ------------------------SRCGSCWAFGA 149

Query: 154 VEALSDRFCINYS--MNITLSVNDLLACCGFLCGAGCDGGTPIYAWRYFVHHGVVT---- 213
           VEA++DR CI      +  LS  DL++CC   CG GC GG P  AW Y+V  G+VT    
Sbjct: 150 VEAMTDRICIQSGGGQSAELSALDLISCCKD-CGDGCQGGFPGVAWDYWVKRGIVTGGSK 209

Query: 214 ---EECDPYFDTTGCSH------PGCEPG-YPTPKCVRKCVSGNQL-WRQSKHYGSNAYR 273
                C PY     C H      P C    Y TP+C + C  G +  + Q KHYG  +Y 
Sbjct: 210 ENHTGCQPY-PFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYN 269

Query: 274 VKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGGHAVKLIGWGTSADG 333
           V++++  I  ++   GPVE AFDVYEDF +YKSG+Y+HVTG  +GGHA+++IGWG     
Sbjct: 270 VQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVE-KR 329

Query: 334 EDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGL 354
             YWL+AN WN  WG+ G F + RG +EC IE DVVAGL
Sbjct: 330 TPYWLIANSWNEDWGEKGLFRMVRGRDECSIESDVVAGL 339

BLAST of Spo08785.1 vs. TAIR (Arabidopsis)
Match: AT1G02305.1 (Cysteine proteinases superfamily protein)

HSP 1 Score: 521.5 bits (1342), Expect = 4.100e-148
Identity = 239/350 (68.29%), Postives = 273/350 (78.00%), Query Frame = 1

		  

Query: 19  LVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSDYTLGQFKYILGARPTPES 78
           + AE++ + KL S +LQ+ IV  +N NP AGWKA  N + ++ T+ +FK +LG +PTP++
Sbjct: 31  IAAENLSKQKLTSWILQNEIVKEVNENPNAGWKASFNDRFANATVAEFKRLLGVKPTPKT 90

Query: 79  EKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAMSFFLMFSVLTNYAMDNLT 138
           E   +P+V HD SL LPKEFDAR AW QCTSIGRIL +                      
Sbjct: 91  EFLGVPIVSHDISLKLPKEFDARTAWSQCTSIGRILDQ---------------------- 150

Query: 139 FFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGFLCGAGCDGGTPIYAWRYF 198
             GHCGSCWAF AVE+LSDRFCI Y+MN++LSVNDLLACCGFLCG GC+GG PI AWRYF
Sbjct: 151 --GHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYF 210

Query: 199 VHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQLWRQSKHYGSNAYRVKSD 258
            HHGVVTEECDPYFD TGCSHPGCEP YPTPKC RKCVSGNQLWR+SKHYG +AY+V+S 
Sbjct: 211 KHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSH 270

Query: 259 QYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGGHAVKLIGWGTSADGEDYW 318
              IMAE+YKNGPVEVAF VYEDFAHYKSGVYKH+TG  +GGHAVKLIGWGTS DGEDYW
Sbjct: 271 PDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYW 330

Query: 319 LLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKNMFKVVTESDD 369
           LLANQWNRSWGDDGYF I RGTNECGIE  VVAGLPS +N+ K +T SDD
Sbjct: 331 LLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVVKGITTSDD 356

BLAST of Spo08785.1 vs. TAIR (Arabidopsis)
Match: AT4G01610.1 (Cysteine proteinases superfamily protein)

HSP 1 Score: 508.4 bits (1308), Expect = 3.600e-144
Identity = 238/374 (63.64%), Postives = 279/374 (74.60%), Query Frame = 1

		  

Query: 1   MAPLFLGLSLLFLLAPSVLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSD 60
           +A +FL L LL       + AES+ + KL+S +LQD IV  +N NP AGWKA +N + S+
Sbjct: 10  LASVFLLLGLLLAFDLKGIEAESLTKQKLDSKILQDEIVKKVNENPNAGWKAAINDRFSN 69

Query: 61  YTLGQFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAM 120
            T+ +FK +LG +PTP+     +P+V HD SL LPK FDAR AWPQCTSIG IL +    
Sbjct: 70  ATVAEFKRLLGVKPTPKKHFLGVPIVSHDPSLKLPKAFDARTAWPQCTSIGNILDQ---- 129

Query: 121 SFFLMFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGF 180
                               GHCGSCWAF AVE+LSDRFCI + MNI+LSVNDLLACCGF
Sbjct: 130 --------------------GHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGF 189

Query: 181 LCGAGCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQ 240
            CG GCDGG PI AW+YF + GVVTEECDPYFD TGCSHPGCEP YPTPKC RKCVS N+
Sbjct: 190 RCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAYPTPKCSRKCVSDNK 249

Query: 241 LWRQSKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGG 300
           LW +SKHY  + Y VKS+   IMAE+YKNGPVEV+F VYEDFAHYKSGVYKH+TG  +GG
Sbjct: 250 LWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGG 309

Query: 301 HAVKLIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKNMF 360
           HAVKLIGWGTS++GEDYWL+ANQWNR WGDDGYFMI RGTNECGIE++ VAGLPS KN+F
Sbjct: 310 HAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSSKNVF 359

Query: 361 KVVTESDDAGSASI 375
           +V T S+D   AS+
Sbjct: 370 RVDTGSNDLPVASV 359

BLAST of Spo08785.1 vs. TAIR (Arabidopsis)
Match: AT1G02300.1 (Cysteine proteinases superfamily protein)

HSP 1 Score: 494.2 bits (1271), Expect = 7.100e-140
Identity = 236/374 (63.10%), Postives = 283/374 (75.67%), Query Frame = 1

		  

Query: 1   MAPLFLGLSLLFLLAPSVLVAESVPELKLNSNLLQDSIVNSINANPKAGWKAGMNQQLSD 60
           +A +FL L   F L    + AE++ + KL S +LQ+ IV  +N NP AGWKA  N + ++
Sbjct: 12  LASVFLLLFSSFNLQG--IAAENLSKQKLTSLILQNEIVKEVNENPNAGWKAAFNDRFAN 71

Query: 61  YTLGQFKYILGARPTPESEKRSIPLVHHDRSLNLPKEFDARKAWPQCTSIGRILGKPNAM 120
            T+ +FK +LG   TP++    +P+V HD SL LPKEFDAR AW  CTSI RIL     +
Sbjct: 72  ATVAEFKRLLGVIQTPKTAYLGVPIVRHDLSLKLPKEFDARTAWSHCTSIRRIL-VGYIL 131

Query: 121 SFFLMFSVLTNYAMDNLTFFGHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGF 180
           +  L++S +T +    L   GHCGSCWAF AVE+LSDRFCI Y++N++LS ND++ACCG 
Sbjct: 132 NNVLLWSTITLWFWFLL---GHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGL 191

Query: 181 LCGAGCDGGTPIYAWRYFVHHGVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQ 240
           LCG GC+GG P+ AW YF +HGVVT+ECDPYFD TGCSHPGCEP YPTPKC RKCVS NQ
Sbjct: 192 LCGFGCNGGFPMGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQ 251

Query: 241 LWRQSKHYGSNAYRVKSDQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGG 300
           LW +SKHYG  AYR+  D   IMAE+YKNGPVEVAF VYEDFAHYKSGVYK++TG  +GG
Sbjct: 252 LWGESKHYGVGAYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGG 311

Query: 301 HAVKLIGWGTSADGEDYWLLANQWNRSWGDDGYFMISRGTNECGIEEDVVAGLPSPKNMF 360
           HAVKLIGWGTS DGEDYWLLANQWNRSWGDDGYF I RGTNECGIE+ VVAGLPS KN+F
Sbjct: 312 HAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLPSEKNVF 371

Query: 361 KVVTESDDAGSASI 375
           K +T SDD   +S+
Sbjct: 372 KGITTSDDLLVSSV 379

BLAST of Spo08785.1 vs. TAIR (Arabidopsis)
Match: AT3G45310.1 (Cysteine proteinases superfamily protein)

HSP 1 Score: 106.3 bits (264), Expect = 4.200e-23
Identity = 71/212 (33.49%), Postives = 95/212 (44.81%), Query Frame = 1

		  

Query: 141 GHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGFLCGAGCDGGTPIYAWRYFVH 200
           GHCGSCW F+   AL   +   +   I+LS   L+ C G     GC GG P  A+ Y  +
Sbjct: 160 GHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKY 219

Query: 201 H-GVVTEECDPYFDTTGCSHPGCEPGYPTPKCVRKCVSGNQLWRQSKHYGSNAYRVKSDQ 260
           + G+ TEE  PY    G    GC+             S   +  Q +    N      D+
Sbjct: 220 NGGLDTEEAYPYTGKDG----GCK------------FSAKNIGVQVRD-SVNITLGAEDE 279

Query: 261 YQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLG------GHAVKLIGWGTSAD 320
            +    L +  PV VAF+V  +F  YK GV+   T    G       HAV  +G+G   D
Sbjct: 280 LKHAVGLVR--PVSVAFEVVHEFRFYKKGVF---TSNTCGNTPMDVNHAVLAVGYGVE-D 339

Query: 321 GEDYWLLANQWNRSWGDDGYFMISRGTNECGI 346
              YWL+ N W   WGD+GYF +  G N CG+
Sbjct: 340 DVPYWLIKNSWGGEWGDNGYFKMEMGKNMCGV 348

BLAST of Spo08785.1 vs. TAIR (Arabidopsis)
Match: AT5G60360.3 (aleurain-like protease)

HSP 1 Score: 99.8 bits (247), Expect = 3.900e-21
Identity = 70/213 (32.86%), Postives = 91/213 (42.72%), Query Frame = 1

		  

Query: 141 GHCGSCWAFAAVEALSDRFCINYSMNITLSVNDLLACCGFLCGAGCDGGTPIYAWRYFVH 200
           G CGSCW F+   AL   +   +   I+LS   L+ C G     GC+GG P  A+ Y   
Sbjct: 160 GGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKS 219

Query: 201 HGVV-TEECDPYF--DTTGCSHPGCEPGYPTPKCVRKCVSGNQLWRQSKHYGSNAYRVKS 260
           +G + TE+  PY   D T C       G      V   +                     
Sbjct: 220 NGGLDTEKAYPYTGKDET-CKFSAENVGVQVLNSVNITLGAE------------------ 279

Query: 261 DQYQIMAELYKNGPVEVAFDVYEDFAHYKSGVYKHVTGQYLGG------HAVKLIGWGTS 320
           D+ +    L +  PV +AF+V   F  YKSGVY   T  + G       HAV  +G+G  
Sbjct: 280 DELKHAVGLVR--PVSIAFEVIHSFRLYKSGVY---TDSHCGSTPMDVNHAVLAVGYGVE 339

Query: 321 ADGEDYWLLANQWNRSWGDDGYFMISRGTNECG 345
            DG  YWL+ N W   WGD GYF +  G N CG
Sbjct: 340 -DGVPYWLIKNSWGADWGDKGYFKMEMGKNMCG 347

The following BLAST results are available for this feature:
BLAST of Spo08785.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match NameE-valueIdentityDescription
gi|902170206|gb|KNA07799.1|3.9e-225100.hypothetical protein SOVF_1686... [more]
gi|731368155|ref|XP_010695986.1|4.6e-18179.7PREDICTED: cathepsin B-like [B... [more]
gi|223545619|gb|EEF47123.1|1.7e-15169.0cathepsin B, putative [Ricinus... [more]
gi|802694007|ref|XP_012083054.1|1.1e-15068.5PREDICTED: cathepsin B [Jatrop... [more]
gi|702240634|ref|XP_010045446.1|1.4e-15066.9PREDICTED: cathepsin B-like [E... [more]
back to top
BLAST of Spo08785.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniprotKB/TrEMBL)
Total hits: 5
Match NameE-valueIdentityDescription
A0A0K9QKM1_SPIOL2.7e-225100.Uncharacterized protein OS=Spi... [more]
B9RN00_RICCO1.2e-15169.0Cathepsin B, putative OS=Ricin... [more]
A0A067K985_JATCU7.6e-15168.5Uncharacterized protein OS=Jat... [more]
A0A061DRR6_THECC1.6e-14867.1Cysteine proteinases superfami... [more]
B9GRU7_POPTR7.9e-14866.2Putative cathepsin B-like prot... [more]
back to top
BLAST of Spo08785.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy SwissProt)
Total hits: 5
Match NameE-valueIdentityDescription
CATB_MACFA1.6e-6942.4Cathepsin B OS=Macaca fascicul... [more]
CATB_PONAB2.1e-6942.4Cathepsin B OS=Pongo abelii GN... [more]
CATB_HUMAN3.6e-6942.4Cathepsin B OS=Homo sapiens GN... [more]
CATB_RAT1.2e-6740.5Cathepsin B OS=Rattus norvegic... [more]
CYSP_SCHJA2.0e-6741.3Cathepsin B-like cysteine prot... [more]
back to top
BLAST of Spo08785.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 5
Match NameE-valueIdentityDescription
AT1G02305.14.1e-14868.2Cysteine proteinases superfami... [more]
AT4G01610.13.6e-14463.6Cysteine proteinases superfami... [more]
AT1G02300.17.1e-14063.1Cysteine proteinases superfami... [more]
AT3G45310.14.2e-2333.4Cysteine proteinases superfami... [more]
AT5G60360.33.9e-2132.8aleurain-like protease[more]
back to top
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000668Peptidase C1A, papain C-terminalPRINTSPR00705PAPAINcoord: 140..155
score: 3.5E-7coord: 301..311
score: 3.5E-7coord: 317..323
score: 3.
IPR000668Peptidase C1A, papain C-terminalPFAMPF00112Peptidase_C1coord: 141..352
score: 7.2
IPR000668Peptidase C1A, papain C-terminalSMARTSM00645pept_c1coord: 94..353
score: 7.2
IPR012599Peptidase C1A, propeptidePFAMPF08127Propeptide_C1coord: 34..76
score: 6.4
IPR013128Peptidase C1APANTHERPTHR12411CYSTEINE PROTEASE FAMILY C1-RELATEDcoord: 1..113
score: 6.3E-200coord: 138..374
score: 6.3E
IPR025660Cysteine peptidase, histidine active sitePROSITEPS00639THIOL_PROTEASE_HIScoord: 299..309
scor
NoneNo IPR availableGENE3D3.90.70.10coord: 50..358
score: 2.2
NoneNo IPR availablePANTHERPTHR12411:SF367SUBFAMILY NOT NAMEDcoord: 1..113
score: 6.3E-200coord: 138..374
score: 6.3E
NoneNo IPR availableunknownSSF54001Cysteine proteinasescoord: 39..355
score: 3.05

GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0050790 regulation of catalytic activity
molecular_function GO:0004197 cysteine-type endopeptidase activity
molecular_function GO:0008234 cysteine-type peptidase activity