Spo00502.1 (mRNA)

Overview
NameSpo00502.1
TypemRNA
OrganismSpinacia oleracea (Spinach)
Description(Xaa-pro dipeptidase, putative) (3.4.13.9)
LocationSuper_scaffold_143 : 84144 .. 117734 (-)
Sequence length3654
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAACGAGGCTAGAAAGTTTGCAAAACACACTACCCCCCTTAAACCGTCTTGCTAATGGCGGAAGCTAAAGAGGCCATTCCAATGGCGACTGCAGAAGTACCACGCTCCTCTCTCACTCCTCTTGAAGTTCCAATGCAACTTCACGAAATCAATCGCCAAAAGCTCGTCAAATATCTTCAACAACACCTCACTGATTCTGGTCGCCCAATTCAAGGCATCGTTCTCCTTCAGGTAAATCAATCAATCATCTATTCAATTTAATTCCTCTATTTTTATGTTAATTTGTGTTTTTTTGTATCGAATTACTCAAATATTTCATGCAATTCTTTCGATTGTGTTTGTTATTAGGGTGGAGAAGAGAAGAATCGTTATTGCACCGATCATACCGAACTTTTCCGGTTAGTTTTCTGATTTTGATCACAGATTGAATCATGTTAATTTGAAATTGTGTTTAATGTACAGTATAAACTATGAACCTATGAAGATAAACTTTTTGATTTATGTTACATACTTGCAATCAGTTCCATGTTACCTGGACTCGGGTACCCATGTCGGACACGGATACGTGGCCAAGTGTCCGACACGGCTAATTTGTGAAAAGATTGCATGATTTTGGTCTAAATTGAAATGTCCAAGTGTTCATACCCTCGTACGAGTGTCGTGGATTCAACACGGGTGTTTGAGGTAAAATGAAGAGGCTGGGTAACGGATTGCGTGTGGTGCTGCAACTTCTCCTAGAGGATAAATTAAATCTAAAAATCTTGTCTATATGAAATTATGAATAGGAAATATTGGAAATTAATGTCACAAAGTTTTCGTTTTGAGATCAATACTGTCAATGTGTAGCCTGTAGGTTGTCATAGAGTTTTTGGTTGATTAGTATTGGAGTATTGGTTTTTATTTTGTTTTGTATGATGCATATCTGAATCTGCTGTATTTTAGAAACTTGTAGGCGTAGACCAGAAACATGTCAATCTTACAAGCTGTTGGCTTTCTACCTATTAGCGTAGTCGAAGGGAAAATTAGGAACTCACTGTACAATTATAGTTGTGCATTTTGATAATTAGCTTCTTGGCAAAAAACCCATCTAGGTGATGTGGCTCAAGGTCGTCTTGAGGGAGATATAATGCGAACAAAGAATTTACAGCTTAAGTTTGTACTTCTCATACCAAATGCCATAAAAAAAAAAGACCTAAAGAAGCACTTACTAGTTATGATGAAAATTTGATCAATAATGTCGTATCCATACTTCCATTGCAAGAGTCTGTAACTCTATATTAACTCAAGAGTAATTTTCTGCTCCTGAAGATGAACTTACCAGTCGGTAACTTTAACTTGAATAATGTGGCCCTTATTGTAAAGGGAGCGAGCCAACAAAATACTAAGTAAAAATCAAAGACGGTGCAAACGAAGGACCTTATTGTATGTTAACAAGTATACCTACAGTAGACAGGCACATAAAAGAAAAATAGATGTCTCCACTCTTGAAGTATGGTGGAAAGACTGATTTATAATCTTTTTCAGTTTCAATCCAACTTGGGAATATGTTACTTCTGCATTTTTTGACTTTTACAGTTATACCAATAACCGTAATTAAGTTTTTCCCGCCTGCTGCTCAATGTTTAAATGTTTATTTCACTTGATGAAATCTTGGGTTCTCTTAAAGGGTAAGATATTGAGTATTTGTTATAGCGCCTCAATTGCGAAAGGCCGAAAGCAAAACTTTATCACCTGCTGGTTTTTCTCTTAAAAATAGCAACTTATTTATGTGTTTTTTATGGGCCGTTTGGATCATATACATGTATATGGTTCTGTTTTCAATACATCTATATTAACAATTTGATTGTGCTGAGCTTAGGTTATGTGCTATGGCCCTTAATATTCAAGAGAGATGCTCACTTAGAGTATAAAACTTGATTCATTTAGAGTATACAATGTTCCTTTGCATCCTTGTATTTTATACATCCAAATATATTAGAAGTTTTGTTCAGATTCTAGATGCTAAATTGGTGTTGTTTACTTGTGTTCAATAGGCAGGAGAGTTATTTTGCTTACTTATTTGGAGTGAGGGAGCCAAGTTTCTACGGAGCTGTTGTAAGTTTTTCTCTCTATTCTCTCCTTCCTATTCCTTCTTCCGTACCCCCTCTTACTTCGTAAAGGTAACAACAAAACCTAATCAAATGTCTCGTCAAGTATTAAAGAAAGATCGACAGTGCAAACAACCAAGAACAAGTGCGTCAATCCTACTTAGGTGTAGGACTCATTAGGCTAGATTTTTTATGTTTAGACTGTGGAAGTCCGATAGTTTAGGACTCTAGGAATACAATAGTTTTATGCTTCTAGGAGTTTGTAAAATGTTTGAATAAATAAATGGCAATAGCTAGATTTATATTCCTAGCAAGTGTTTTGACCCACGCATGTTCGTTACAATTATATCCAATTTTATGATAATGATCTCTTGAATTAATTAAAATTAACTAGATCTTACTTTGCTTTAATATAATTTTTAACACGAAGCTAATGATGTAAAGCTTATAAGATGTTTATTGGAAAATATAATTTATTTTATGCATACATAGTAGTTTATTGAAACTAAAACTCTAAATACACATATTTATATAGATACAAAATCAATAAAATTCACACATCTTATTTGCACGATTGCTTTTGTTGTAGTCATATGAGGCACGAAACTTACAAAAATTTGGAATGCCAAGTGTCAATATCTCGGTAGTATTACAAGTTTTAATTACAGGGGATTACTAAGTTGGTTTTTATAAGAATCAAATTCTTTTATTAGTACATATTTGAGTTGAAGTAGGAGGATTATTAAGTAGTGTTTTAATTAGAAGCGAATTCTTTTGTTAGTTGGAATAAATGTTTTTGTCTGATACTCGATCTTACTAAATAGGCGTGTAGCAAAGTTGTAGTTGTCGACTTCATTTCTTAGTGTCGTAGTGCATGGAAATTCACTTGAACAATGCTATTATTATGAGCTAAGAATTACGATGTACTAAACCTATCATTAATTGGTGTATGTCCTCAGGATTCTCATCCTTGGACAAGGCAAAGCAAGTCAACGTATGCTCGTCCCAATTACAAATTAGCTCCTAATTAGGACAAATTATGCCATTAGTAAATATCACACTAGACACAAAGTTTCCATTCGCTACAAGAATAGGATGTTATGCGTGGCATATGGTTGAACACATCAAAATAGTTTGATTAAATCCATGGTTAACTTTGGTAATTATGGGATGAAAGAACAAACATGATACAATATCATGTAAATAAACTGAATCATAGAAGAATTTCTTTAGTGAGCGAAATCAGGGTGGGGAGAAATAGAGAGTTTGGGAGAGAGAATGATGCGGAAGCATTTGTAGTTTTAGTTTAATTTTTATTTTAGAGTTGGATTACATGTTTTTGGCATTAGGAGTTGTTTAATTACGTATTATTAGTTTACTTTCCAAGTTATATTAGGGTTAAGAATTCTAGATTGTTCTATTAGGGTTTGTTAGGTTGTTTTCCTTCTCCTATAAAAGCACACAACTACATATGATAAAAAAAAAAGTTTTTTTGATTAATTTGATGAGCGTATGCTATCTAGGTTTAGGAAGTTAAACCAAACACACAACTGCGACTTGTGCCTCCCTCAAAAGTGCGAATTTTGAGTTACCTATTCACGCTCGTAGACTAAATCATATTACATATGCAAAATCATATTACTTTGCGATTTTCATATGCTATACTTAAACTATCAAAGTTTGCATCTATGTGGTTTGAACGGACGCAAACCGAGAGAGCTCGATTAAACAAAGCAGAGATTAGTACTTGGACTGCCCTTAAGACCAAGATGCGTAAACGGTTTGTCCCAAGAACTTACAAACAAGACCTGTACATGAAGCTCAACTCTTTGGAGCAAAACCATCTTTCAGTCGAACAATACATCAAGGAGTTTGTCATTGGCTTGTGAGTGCAAGGATGAGGATGAGCAAAAGGCAGCAAAATTCTTGATTGGATTAAAAAACGCTATTGCAAATCAGGTGGAACTGCAACATTTTTACTCGTTTGACGAGGCTTGTCAGTTGGCTATTAAGGTAGAAAGACAACTCGCAGAAGCTAAAGCTAAGACCCCTCGATCTCCTTTTACCCCTATAGCTGAAAATTCAAGGCCTTGGGATACTCCAGTTTCACTCGGGAAAGGTTCGAATTCTACTCCACCACCACCTACGAGTGAACCGAGTTCAAAAAAACATATGAACTTGACTCCCGGACAAGCAACAATGAGAGAAAAGCAATGTTTCAAGTGTTAGGGGCATGGACATATTGCTAGGCAATGTCCAAACATGATGTTAGACTGTTAGTTACTTTAGAAGATCACCAAGCATTGTTAATGGCGCAACAAGAGGAGAAAGAAGGAGAAGCTCGTGTTCTTGCCTCGAATACTAATTCACGTTTTGAGTTCAAGGAGATAGAAAATAAATTATTTTCGACCAATGTGGTGCAACCTATGGATGAGGAAGTTAAAAGTTTGGGTATTTTGAGAAAAATCTTACATATGGGAGCTTCACTTGTTGAGGAGAATGGGCAACGTGAGAACTTATTCCACACTCGTTGTTTGGTCAATGGTAAGGTATGCTCGTTGATTGTGGATAGTGGTTCGTGTACTAATGCCGTAGCTAGCAAGGTGACAAAGGAGTTAGGGCTACAAACTCGTGAGCATGTTCGTCCTTACAAGCTGAATTGGTTAAATGAAGATGGTGGGATTCGGGTCACTAAGCAGGCTTTGGTTAGTTTCAAGTTGGGCGAGTACATAGACGAGATATGGTGCGATGTGCTTCCCATGACAACTTGTAATATTCTGTTGGGTCGTCCGTGGCAATACGATCGAGAAGTACTTCACCATGGAAAGGATAATATCTACTCGGTCAAGATCGGTGATAGTCGATTTAGCTTGGCTCCATTACCTCCAAAAATGGCACCCCTTGCTACAGATGATCGTATGAATGTGTGCGTCACAGATTTGCGTGAAGCCATTGTGCACAACAAAAAGGAAAGACGAGCTAATGTGTTGAACGTTTGGAAGACACCACCTTTTGATGTGGGCGACTATGTTTTCTTAAGTTTGAACGCGCAACAAGTGTTCAATAATTCCAAAGGGAAAACTGAAGCTATTGAGAATGGTCCTTTTAAGGTGGTGCACCGTGAGAACTACGACACTTTTCAGATTCTATTGGGCCAAGGTGTATGTGCTTCTATGAAGGCTTGTGATCTAGTTCCGTGCAACTTGTAAGGAACTTGAGGGCAAGTTCGTTTTCAAGACGGGGAGTATGATACGGAAGCATTCGTAGTTTTAGTTTAATTTTTTTTTTAGAGTTGGATTACATGTTTTTGGCGTATAGGAGTTGTTTTAATTACGTATTATTAGTTTATTTTCCAAGTTATATTAGGGTTAAGAATTCTAGATTGTTCTATTAGGGTTTGTTAGGTTGTTTTCCTTCTCCTATAAAAGCACACAACTACATATAATTAAAAAAAAAAATGTTTTTTTGATTAATTTGAGGAGCGTATGCTATCTAGGTTTAGGAAGTTAAACCAAACACACAACTCCGACTTGTGCGTCCCTCAAAACTCCGACTTTTGAGTTACCTATTCACGCTCGTAGTCTAAATCATATTACATACTCAAAATTCATGTTTAGATCCACATCAGAGAAAGTATTAGTGATATATCGAATTAATTATTATCATTACAAGTTAATCCTTTTATATGGCAGTAAATAATTAAAATTAAATGAATATCATGTAAAGAAACTGAATCATAGAAGAATTTCTTCCATTAGTGAGCGAAATCAGGGTATAAAGGGGAGAGATAGAGAGTTTGGGAAAGAGAAAGTATTTGTGATATATCGAGTTAATTATTATCATTACAAGCTAATCCTTTTATATGGCAGTAAATAATTTAAATTAAATGCTGAAATACAGGAAGGTTGTTATGACAACTTGATCAGCATGTTAAACTGGGATGAGGGAAGTTAATGTTGTGTAGGTAAATATGTCAGCTCAATTTACCCTTCTCCCTTGTGAAATATGGAAGGCTAACAATTATCACAATGTAGCATCACGAGCTTCAAATTCAAAACTCTAATTTCATTCAAAAAAATTCTGACCATCCAAATAGCTTCAGATGCTGCAGGAGCCATATCTCTCTATTCTGCCTCAGAGTTGTTTCTTGCTTTTCCAACTAATCGGAGAGTTCCCCAATAAAAGAACATAGCCTGATATGAACCTGACATGAACCTCATATGAATTGGCTGAGTGTCTTGTCTACACCGTATAAGAGAGATCGGATCGAGTGTTTGTTAAGAAATTATGTTTTCCTACCAAAAATGTGTTACCAAACATGTGCAGTAAGTAGCATTGCATTTGCTGAAGACCAGTGTATAAAAAGGCGCGCCTAGGCTCTGAGGCGCAAGGCGATTGGAGCCAGCGCCTTTTATTGTCTGGACGAGGCGATTTCGATGGGGCGCACCTAAGGCGCAAAAAGGCGCACTTTGAGGCGCAAAAAGGCGCAAAAAAGGCGCACCTACTAAGGCACACACCAATTTTCACTTTTTTTTTAAACTTTTTATTAGGGTGCTTCACTTCAATGAGGCGCACCGAGGCACACCGAGGCGCGCCTAAGGCGCTAAGAACTCCAAATCGCCTTGTTGCGCCTTGAGCCTTTTTATATATTGCTGAAGACCATGGTGCATCTCTACGTAATTGTGGAAACACATATATTGTATAGGTTGCTAGAGATATGTAATAGACTTGATATCCTACAATGTCCATCAACAACCTTGATTTTAAAGGAACTACAATGTCAACCAACAACCTTGATTTTAAAGGAAATTGTGAATCTACGTGTAAATATATGTATGGTATACATATTTTTAATACTTATTGTATTCATTATTTAACCAATAGAATCCAAATGATAACATGTAATTTAATTTTTGTTACGTCCGAGAAACAGCTGGGAAATTACGATTTTTGATAACCCAAATAATATATATTATATAGTACTCGTTTTCTAAATGGTGCTAAGAAATTCCATTTTGGTACATTTTCAAAAGTGATTAACAAGTTTGATTTATTCTACTTTTGGTATATCTCTTTCTTACTCAAAACACCTTATCAACCATCTTTGTACACTCACCCAATGCTCTCTCCTCCTTTCTCTTATTTTATTCATCTTTTTCTTATATTCACTCACCACACCAACACTCTGTCTCATTTGTCTTAATATTTGTGCAAATAGTAAACATTAACGTTTAATGGAAACGGAGCTAGTAGAATATTGTTATGTGTTGGCCTCTTGACGCTAGTCGAACACGCAAGAAGAGAATTTTGGGGCAAATGAAGAAGTGGAGTGCTTAAGCAGAGTACGTGTATTGTTTGGAAAAATGCTTACAAATGTTTGATGTTATATACAAATGAACCTCTAAGCCTATTTATTGGAAGGGCTTTAATTCTAGAACCTCCTCCCTAACTAGTGGCTAGGAAAATATACACATGGCACTCACATGTATGCATCCTACATTCTTTTAAATAATTACAACTCACAAGATATTTACAAGGGTATAGTAGAACTCTTCAAACAAATGGAACTCACATGTATGCATCCTATATTCTTTGACATAATTACAAATCACAAGATATTTACAAGGGTATACTAGAATCTTCACAGTTCACACTAGAAGGTTATCCTGGTCCAATAGAGCCTAGAAGCTTCTCCTTAATGCCCCATGCGTCCTTGCGTTTGGCCCATCAGATAGGGGCCCTTGTATCCCTCTCGGCTTATCTAGTCGCATCTTTATATTTATGTCATTTTTATTTATCTCGGGCATGAAAATTTTAAGAAGGGTTTTTATGGGCATCATTAGATTCGCACACTTAACGATGCTCGGTAGGCTAAGAAATGTATCACGATATAAAGGTACACTAAGCACATAAAGAGGCCTAGGCCAATGGCTTGCGGGCCCTAAGTAGGGATGGTGTAAATTTATCATGATATAAAGGCATCATACTGAATTGTGGACTATTAGTCTGAATTGCTATTCATCTAAGGGACAATTTTCTTGACATTACTTCATCGTTAAGGCCATGAGGTGTGGTGGTGGTTTAGTCGGCAGATGATGTTGTGCATGGTTATGGTCAATTAGATGAAGCTGCTGGCAGTTCTGTAGTAGCAGTAATTCAATTAGTAATGCATTCACGATATGACTGCAAAGTAGTCCCAGAAAACTTAGCTAAATTGGCAGAAGAGCAAACTGGTTCACTAATAGATGGAGAAAGATGCCTCCACATCAATTTCATTTTTATATTTATTATTTTGAGACACTTATGTTTGTACTTTCTATATGAAATAATAAACTCTTGGATGGTATCCTAAATAAAGATAGTAAACCACTCCCATAGTTACCTGTTAGTTCGTTGAGCTTTGAAATTCAATTTGACCAATTTGTTGTGTGCACACTCTTTTCTGGATCAAATTCCAGTCAACATGATTCTTCTTTTGTTTTTGGCCTTCATTTACTTTTATTGGTAGACAATGTGACAGGATGTCACTTTTGTTTTTTTGTTTTTTTTTTCTTGAACTAGCATCCTTCGCTTTTCACAATGTTTATTATCTTGACTATTTTAATCATATACATTTGTTCAGATTTGCTTGTGAATGCTTTTTTTTTCTTTGTTTCGTTTTCCTTTGTAACTCATAACGGGATAGTTCAGTTAGCTTTTGTGTTGATGCTAATACTTGGACCATAATTTACTGTCAAGGATATTGCCACAGGGAAATCTATCCTGTTTGCTCCACATCTCCCAGCAGAATACGCTGTTTGGATGGGGCAGATAAAACCACTGTCACATTATACGGTCAGAGCAGATTATTTTACCATTCGATTCCACTGTTAGTATTTATTATACTTTTCTTTGCTCATATCTTTTCGGGGTTCGCACTTTGCAGGAAAGGTATATGGTTAACATGGTCTGTTATGTGGATGAGATGGAAGAAGTTTTGCTCAATCTATACAAAGGAGAAGATAAACCTTTGCTGTTTCTATTGCATGGGCTTAATACAGACAGTGGCAATTTCTCAAAACCAGCAGAATTTAAGGTGTCTACTAGTATTTGTTCTTGTTAATGATTATTGTCGTTGTTTTACAATACCAACTACGGTCAAAGCTACATATATAGACCTGGTCAAGAATTATACATGAATAACCCAGATGTTTTGAAATATTCTACAGTGGTTGACTTAGAGTTTCACCACCAATATTATCAGTACACTCTTGGTGATAGCCCTTAAATAAATGAAATTTAATAGTTCAAGTCATGGTTGATTCAGTAAAAGTAGACCAAACTCTCATGTCACTGAAATAATCACATGCGGAGAGTTTAAAGAAAGACATAGTTGCAAATTTCAAGTGACTAACAACCACACCAACAAAAGAACAAGAAATGTTTATTTGGAGGAATTTTCGGACTAAACAAGGATTTTGGATGATTTCATCATTTTTCTTTTAGAATTTGACAACACCAAATAGCTCTGATACAAGTAAATACAAACATTAGACGTATGCATTACATTATGATACATAAAACGGGTATGGGAAATCACTACCTAGTTTTGTTAAACAAGTCCACAATCGTACCATTTTACCATGTACCACATACCAAATCTCGTGTTCCAGATTATGATTTTCTTTATTTTTTGGTTGGGTGGGGTGGTGGGTGTCCTCCGGACAACTGACACAGGGTTAGGATCAGCCCCTTTTAAAGTTTGCATTATCACAAATGTCTTTCAAATCATCATTGTTATTTGGTCTTGTATGATGCTGAAACTGATCTGTAATTTTTTTGTAAATCAGAATTTTATATTTTGACTAGTTCATATGCTAGCAAATCGTTGACAGATACATTTTTTTACTTGTCAGGGTATTGAGAATTTCAAAACAGATTTGAGTACATTGCATCCTATACTGACCGAGTGTCGTGTTATAAAGTCAAAGTTGGAGCTCGACGTTATCCAATATGCAACTGATATAAGCTCAGAGGCTCATGTTGAGGTATTATTTAATGTTTATCATAAATATACATCTTTTGTCTCTAAAAGCTACCTCATTTTTGTTAACTCTCTGGATGGAACCTTTGAACCTTCGAAAAAAATAAATTTCATTGAATATAATCTAAGATAAGAACATATGAAGCCTTAGATGTGTGAACAGATTGGCAGATTTTTTGTCTCAAGGAGCATAACGAGAACTTGATGCTCTAAGCAATTAGTCTTTTTTTGGTATCTTCAGTCGGTTGTTATACTTCAAACATTGGGATAATGGGGCATGTACCTTCACCTACTCAGACAACGGAAGGATGTACTGTGGATGTCAACTTTTGGGACATTGAAGACAAGTTGGGAGGTTATATACATTCAAGTTAGCATTGCAAAGCTCTGTTTTTTTTTTGTTTTTTTATATCATAGATGGGTTTGTTACATGATAATAATTGAGAGTAGGTGCTGATCAAAGTCTGATTACGGCGTTGGATAGATTTTTCTTCATGAGTTGATGCTCCTCTTTTTAAATTGATTGGTTTGAGTGCAAAGGTTTAGTACAAAATGGCTACGATGGACTGGTGGTGCTGTATGGGTAAAGGAAATAGTGTTTAAGTCAAGATATCGAGAAAAGCAGAAGAAACAGGTTGGATAGTGTAATAGACTTATATTTGGTGATAGAAGCTAAGAGAAAAAGATTGACATGTAGTTGATAACCTAGAAAATAGATGTACTGGACAGAAGATTAATATTTGGAAGACAGAAAATTCGTGGAAATGGATATTTGATTGTAATGTGCTGCAATTAAATACTCAACAACAGTGTTACAGTCTCAGATCTTCGAGTGGCTGGATACCCCAACTACCAAAAGACAATAAAATAAAGCGAGTCAATTCTCTGATTACTTTATTTCTAATCTCCGCTTTATATAACATTTCTGACTTCCTTTCCTCTCTCACTATATTCTTACCAGCAAGTAGGACTATATAGAAATCAGGAGGAAACTCAAGCAAGCGAGGACTTTGTGCATCACTAGCTTTACTGGACATGGTAGTAGTTTTAAGTAAAAGTTAGTTATGGTGCAATTGAAGTTGAATATTATGACTTTTCCAGACCATATCTGACAGTACACATCAAGACTGCTAGCCGTTTCTTCTCCTCTTTTTCTTGGAGATTCTGTTCGGCGCGTTTATTTGTCAGTGATTGTTTGCCTTTGTTGAGATTTTGAAACATTTCTGTTACTTGAAATTATTGGTTGTGTTTTCTGTGATGGGGCATGCGCAATTCTCTGTTACATACATATTTCAATTTCAAACGCAAACATGTTGCTTTAAAATATTTCCTTGCCCATCCCCAATCCATATGCAATCACACAATGGAACTGTAACATCCATTTTAACCAATGCCTGGGTTATTTTTCTTTTCTTTTCCATTGCTTTTTCTTGCTTTAGATCAATGTAATTTGCTTGTCAACTAATAGGCTTGAACCCCAGGAATTCAGGCAAACTTATCTATTATGCTTTTGTCATTTTACTGGATAGATTATATCTTCTCATTATTCATCTACGGAGCAAATATCCTGTACTTTTATCGGTAGTAATATTTATACGAGAATGTTTGGCTGACAGGTAATGAGAAATACAAAAGTGGGCATGAAAGAATATCAATTGGAGAGCATGTTTCTCCATCACACCTATATGTATGGTGGATGTAGGCATTGCTCATACACATGTATTTGTGCCACAGGTGATAATAGGTATGGTTTTCGTTTGTCTTTACTTTATTTTTGCTAGGGATAAAACATTGAGTTATTTAACTGTTTTCCTATTACGGGGTTCATACTTTTTCCCGCATGAGTTTGGGAGTTGGCACTGTGTTGCACCATTGTTATCGTAATTTACTTTGCTGCTGTAATGCACCCCATTTCCCTCAACACTCAGCCCAAAGAAATTGCATTAAAAAAAATGAAAATAGATGTTGAGTTTATTTCCTAATGAAGGCAGAAGTGTTTCAAAAATGTACTTTTTGTTAAAAAAAATCACTCAAGTGTGGTATATCTTCCTGCAGTTCTGTTCTCCACTATGGGCATGCAGCAGCTCCCAATGAACGGGTATGTTGGTGTTGGCAGCTCCTCTCTCCTAAGATCCCAAATTTGTCAGTTTTTCGCGATAGTTGCACCAAAAAGAAAGCTCCTTAGTGAAAGATATATCATATGAACACAGGATGGGTTGATTGAATGAATTTTTTGATATTACAAATATGGAGACATTTAATTGTGAAGAGAAGTTGTATGTTATAACTAATAAAATAGTGAAGCTACAGGAGAAATAAAACTATCTTCTTAGTTGTTACGGAGCTCTTTAAATGTATATTTTTTCCTTTTATTCCCCCTGTTTCTGATAGATCTATCTATTCGCATTTACCTCTGCAGACTTTTATGGATGGAAATATGGCATTGCTGGATATGGGAGCTGAATATCATTTTTATGGTTCTGACATAACCTGTTCGTTCCCTGTATGTAGCTCCATCTTGATTCTTGAATTTTATCCTAAGTACTAGTTTACTAGCATATGTAAATAAGTAAATGTACTGAAAGATGTAAGAGAAGTTGCTTTGTGGATTATATTCTTGACGTCTATTCTTTTGTCTGTATATTTGTATATGGGCACAAAATCAACACTTGTGAGAAGTTCCTCTGGAAATTGGATGACGTATTATTGGCAATAGTCACCTTTAAACCCTTAGAAAATAAGTTTTTTGTTTTTTTGTGGTATCAGTTATTCATGCGATGAAGCTATTTGTCATGTGACTCTTGAGTACTGATGTATTATTTTGCTTCAGGTGAATGGAAAATTTACTGTTGATCAGCGTCTTATATACAATGTGAGTTTTTACTAAGATGTATGCTATGGTCTTTTGAGTTATAGTCGCTATTTGACTTTTAAGGCTGGCAACTTTGTTGCTGCAAGCTAAAAATAAAACTCCTCTTTTTGCAGGCTGTCCTTGGTGCTCACAATTCTGTCATAGCCGCAATGAAACCTGGAGTAAGTTGGATAAATATGCACAAGTAAGGCTGTCTTAGTCCTTTCGTACTTATTTTTTTCTACTTTCTGTCTTTAGATGTTTGCATACTTTCTGTCATTCCAATTAGAGTTTACACATCTATGTTTAACTTTCTTCTCTTTTCTTTTTTTAAAAGAAGGATATCATAAATCAGGTACATGAAATTCTTTGCGTATCAAAATGCCTAATTATGGAAACCTATCATGATCTTTCTGGGAATTGTACTTGTGTTTTGAATAGTTAGATTTTTTTTTGACATGTCTAGTTGTCTACTTCCGTTTGAAAAGACATTCATCTAAATGCCTGTGTTTTGCATGTGGTCTGTTTCATGATTTTTGCACTTTCTATCTTTGTGTCAGACTTGCAGAGAAAGTCATACTCGAGGCTTTAAAGAAAGGGAACATCATCACTGGGTAGGCAGTGTCTATTGCTCTTCTCTTGTCATACCTTATGGAAAGTAAATTCTTATTCTCATGGTATTTCTTTTCCCTGATTTTCTTAATTATTAGTGATATCGAGGATATGATGACTCAAAGGCTGGGAGCTGTTTTTATGCCTCATGGTCTTGGCCACTTGCTAGGGATCGACACCCATGATCCAGGAGGCTATCCCAAGGTTCTGATTACTTATTCTCTTGATCTTCAATATTTCTTCATTCTTGGTGTTTCCAATTCATACCATAGAAAAAAAAGTCGCGGTTGCCGTTGCGTTAACAGTTGCGTTATCGCGGGAACCGCTTGTAATGGGATAAATAGAACGGATCGTGGCTAACGCGGTGTGAATTTTCTCAAAAAATTCATATTAATAGGTTTAAGCTTTGTTTATACCTTTTTAAGTTTAAAAGGCATTATTTCAACACAACCTAACAAACCCTATCACAATTTGAGAAAGGGACATATTTTGTGAACAAATGATCATACAAAATATCTTGTTTTTAAAATACATAACTAATTGCTAAAATATTTTTGTCTAATTTGTGCAAAAGAACGGTGTAACGGGAAAAAAAAAAGCGTTCTAACGCGGCGAGTCACCGTTATGTAACGGGCGACTCGGGGAGAAAGATGTGATTTTTGTCACACATTTATGTAACGGGGTTTTGCGGAACGGTAAATCCCAAATCCGTTACGTAACGGAACGACCGAGTTTTAATTCCATGATTCATACCAACTGGTTGTCGAATTTGCAGGGACTGGAGCGACCAAAAGAACCTGGGTTAAGTTCTCTACGCACAGCCCGAGAACTGAAGGAGGACATGGTTAGCATTTCTGTTTTGTCTCGTGCTTTACTTCCCACTGTGCATGTTATATCTAGATTCTTCCAGAAGAGGCTACTTCTGGAAATCAGGTACAGACTAGTGTTATACTCCATTATAATTGGCCACTGGTTGGAGGTTGTCCCCAGGCCCGTCCATTTGCCGGCCTAAATGGGACACAGCACAGGGCCTCCGGCCCTCCAACAAAAAAGGGCCCCAAAATTGTATGAAAAATTAAATAAAAGCCCAAAAAGAAAATGTATACACACATGCCCACACATGCCCAGGCGCACAGCTGGAGAGGGACGATACATCGCAGTTCACCAGCGATGTATCTGCCCCATACCCACAAAAAAAAAAGTGAAAAGAAAATAAATTGCAATGATGGCCCACCCCTTAAATAAATGGTAAAAATGTTGCGAACTTGTTGTATACTGTCAAAGGTTAGATAAATACTGTCATTATTGTGCATATACTGTCATTGTTGTATTAAAATTTGTCATAATCATCCAAAAAGAAAAAAACGAAATGAAACAAAAATAGTAAAAGGACCACTTAAATGACTGGATAAACGTGTTACATAAGCTCTCTGTCATAATCATCCAAAAAGAAGAAAACGAAATGAAACAAAAATAGTAAAGGGACCACTTAAATGACTGGATAAACGTGTTACATATACTCTGTCATTGTTTGTATAAATACTGTCATTGTTGTATTGAATACTGTCATTTATTTCCGATACTTAGTACTTTGTTTGACTTTTCAACTAATTAAGTGAAAAGACGATTTTACCCCTGGGTATGGGCAAAAAAGCCGCTGGTGAACTGCGATGTATCCCGCCTCCACAACTGGATGCTAGGGGATCTTTTCTACACATGCTAGGGGATCTTTTCTACTTTCTTCTCTTTCTTTTCTCACTTCCCAAACACAAAGGCAGAAAGGTCAGAGACTCAGAGTACTACTAAAGTAATTCTTCATCATTTTTTTTTAACATTTGTTTTCTAAAAGGAGAATATACTTTGTAGCTAATACTAGCTTTTACAGGAAAAAAAATCTTCCTTATTATGCAATTAAAAAAAATTTGAAGAAAAAAATTTAGGGAAATTCTCTATGGTAACATTATACTTTTGCAAATTCTCATTGGTAACAAAACTTTTGACAATTTCTCTATAATAGCATTAGTAATCTAAATTCTCTCAAATATAACATTATTTCACTAATTTCGACTTAATTAATATAATTAGAATTAACCATAGTTAACTGACTTCCCCTTTTATCTTCTTCTTCGTGTATTTCCTCTCACCCCCTTCATCTCCTTCCTCAGATTCATCTAATTTCCCCCCGAGATTCTTTTATATATTCAGTTTCGTTCAATTCAAAACCCCTTTCATCGGTTATGTACAAATCCAATTTCTATATTGAGCAATATGAAATACAATTTGAGATTGAATAAATATTCGTCACCACTCTTCCTCTTATTTTCGCATTCATCTCCGGCGAGAGGTTTGCGTCGTCGTCTTATGTTTGGTCTCTACGCGTTGTCGCCATCCCTGACCGTTAGACCTATCCTCAGTGTTCCATGATTTGCAACAGAAAGGGACGCACATTGATCTGAAGATTGAAGAGCATCATTGCGCCACCACTTCTATATGTCCCAAAATTTGTCACAGTGGCAACTTCTTCTAGTACAATCATTCAAACAAAAGTTTTAAAGTAGTTTTGATGAACATAGTCTCTTATAGTTGATGTAACAGAAGGTTGCTGATGTAAAATATTTTTTCTTTTCGGCATTATTAGGTCCATTTGTTTTAAGGAATAAGGAGTGACAAATTATGGAAGGTTGATGGTTTTTCAAAGGAATGAGTGATCAGTGAAGGTTAATTAATTTGTTAGGACTTGGGAGATCGAATTTACTATAGTAAACAAAGGTTAATTTAGTACTTTTACTAATACTTAACCATAGTTAGTAGAAGATTGGTTAATTAAGTTAAAATTAGGGGAATAATGTTATATTAGAGAAAATTTAGATTAGTAATGCTATTTGAGAGAAAGTGACAAAGATTTTGTTACCAATGAGAATTTGCAAAAGTATAATGCTACCATAGAGAATTTCCCAAAAAATTATAGGGGCCTATTTTTTTATAGTAGCACAGGGCCTCAAAGTCATTTGGACCGGGCCTGGTTATCCCCCTGAAGTTCAGACCAAGCTCTTGCATTTTCTTTCTTAAAGAGTAAGGGGGTTTATAGTCCATGTTTTTGGTTGCCGTGTCTTGCTGTTGTGCAGCTCCCGTTTTTTCAATTCTGTTAACGTTTTCGTCTGTGTACTATAGATAGTTTATCGTTGTTGATGTGTGTTAGACTCTGTTGTCGTATACGCACAAAATCAGTAAATATAAACAAAACAGATAAATAGCTGAACTTATCAACTTGTTTCAGGTAATCACAGTGGAGCCTGGGTGCTACTTCATCAGTGCGTTACTGATTCCAGCAATGGAAAGTTCAGAAACGTCAAAATTCTTCAATCCTGATGCTATAAGGAGATTTATGGTCTTTGGCGGTGTTCGAATTGAAAGTGATGTGGTATTGTTCAGTTGTCCTGTTACTCCTAATTTCGATTTCAATTCCCTGGTGCAGCACCCATATATGAATTTGTGCATTGGTGTTAGATCATGTTGTATGCTCGCGAGCTTGCTGTGTTGTGATTAGCTGGATTTAATGTCTTGATGGGTTTGATATTAGATGCATCTCTCTCTCTCATCTAGATGAACCCGTAGTTCCTTTCTGTTAATGTTTTAAAGATTATTCGACTCATAAAACATGACTTAGGCTTTGTTTTTTCGACTTATTTTGACTTATTTCAAACAAAGTAAGTTCAAATAAGTTCAGATAATATAAGTTCAGAAAAAATAAGTTTTTTCGTACATTTTCACACACAAATAAGTTTAATGCAGACAAAATAAGTTTTTTTCCAGATAAATTAAGTTCAGTTAAGTTCAGGCAAATAAGTCCAGTAGAACGAAGCCTTAATGCAAGATTAATCATGTCATGGAACAAATCTAAATTGAAGTAAAGTTCTGATAAGTTTCATGTTTTTCTTCCAAGCACACTAGAACAACTTGTTGTCATCCCAGTTCTTTGTACTTGATTGGCAATTCTACACGGCTCCTCGACGTGCTTGTTTGTGTCATTTTAATTCTATGTTTCTGAATATTGACGATCACTGTGTACTCTTTTCTTGTCACAGATATGGATTTCTTCTTTTTCTTTTCACCTTATCTTTATTGCTTATTAGATCAGATCATCAGACCAGATTAGATCAGAAAAAACAAAAAACATTCACAGAAAACATTAAGGGGACCAGGTCTGTAGGCATTTGTGCATCAGTCTGACTCAATTTTATTGCTGGAAGCTAGCTTTATTCATGTCTGCCCTTGGTTTGTACACGTTTGTGTATCATGTTGCTTATTATCACCATCATATTCGTTGCTTTTAGCTGCATTTGGTTGAAAACTTTTCTAAACTTTTTACCTGGTGAATTATGTAGTATGTCACATCAATTGGCTGCAAGAACATGACCAATGTTCCCCGCGAGACATGGGAGATTGAAGCTGTTATGGCAGGGGCGAAATGGCCGCTAGAGAAAGGTGTTGTTGATCATTCATATAATGGAATTAGAAATTCTCATTCATAGAGAGAATGCTAAACTTTGCCAACAGATACAAAGAGATGCTAAACTGTAATACTGGTTCATAAAGAGATGCTAAACTGTACTGTAATACTGTTAGTTGTAATTAGAGCTTCTGTTGCATAGTACAATAAGGTCAGTGTCATTTACACCACTTGACAATCAAGTTCAATAATCTTGCAGAAAGCACATGGATATCGATTATCATTTTGCGCGTAAATAAGTGGCTAATGGTTCTACCAAAGTTCATTGTGTTCCTTTTCAATCAGTGACGGATATAGGGGGTTTAAGTGGGGGTCACGTGGCCCCATTCCCCCCCCCCCCCCCCCCCCCCCCCCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTAATTTTAGTTAACTTTCATTATTTAAAAGACAACTTTACTGTAGCTATTTTTTCATTAATTTTTAAGTAGTGACTTTACTACCTACACTTTCTAGATTCGCCACTGTTTTCAATACTAGTTACCATTTACCACAGGTTTTGAGATGTCAAAGAATTGAGCATATTTCAATCATAATTGTAGGCTCAATTGCACCAAACCTTTCTTAGCTCAAAGATGATCATCAACATTATGAACAAGCACTTCAGTATTTTCGAATGTATTAATTTATGCGTTGTTGTTGGTGTGTTTATGTGCACACACATGTTGATACTTGGACTTTTCTATTTTCTAAACAAAAATGAAATAGGATGTTGTAAATGACATTACTTAGTTGGTAGGTTGTTGTGGCTTGTGCCAAACGACTTTGGGTAACACTTACATTCCAGGTACCGGTACGACATGACTCAAAACGTGATGAGTAGTTTACTGTATCTTGTAATGGTCGTTCCAACATGACTCCATGGGGGCGTCTCTTAGCCCGTACAATATCTAGCATAGTCTACCCAACTAACTTTTCGTCGTAACATTGTCAAATTTGTGATTTTATGAGAGATGTTAGGCAAACTCTATTTGTTATTTTGTATTTAGTCGTATCGTATGTTATCTGGACTCTTCATTTCACCTCAAGTAATCATGTCGGATTCACAATACGCTATCTCCTGACATGGATATGACACTTATTTTTGAGCCAAAGATATGACATTTTTCACAAAATAATCGAGTCAAACACTTACAGAAATACATACTCCGACTCCATACTTGTGTCGAACACTTCTTGATGTTAAGCACACTCTATTTGTTATTTAGTCGTATCGTATGTTATCTGAACTCTTTATTTCACCTCAAGTAATCATGTCAGATTCTTAACACACATATCTCCTAACGTGGATATGCCACTTACTTTTTTAATCCAAAACTTTGATCTTTTTCACAAAATAGTAGAGTCAAACACTTAAGTACACATGTCTTGTCCTCTCCACCCGATTTGGTCTGATTTTTCTAGTTTCCACTCTGACTTGACCTGGTCTAATCTGATCTGAATGTTTTCTACGAGTGTTTTTATTTGTTCCGATCTAGTCTGATCTGATCTCATAAGCGATAAAAATAAGCCGAGGAGAACAAAGCCCTATTTCATCGAACACTTTTCAAGTTTTCGTGTCATTTTTCTCCCAAGTTGGTTTAGTGCTTTATTTGTTTTTGCTGTTACCCTGTAATGTGTTTTTGGTCTGTGCACAAATATCAGTAGTTCTGTCCCTTTTTCCCCATTTCCCTCCACCACAACAGCCCAGGCAGCCTACCACCACCACCACCACCACCACCACCCTTAACGTGCCTCACAGTACCTCTCTTTCTCTCTCCTCTGCTCTTCAATTTCTGACGATCAACAGTTCTACTTCTGCTATTCAATTTCATGCAAATTTAACCCCCACTCTTAAGTAATTTCTTTGACATTTTTCATGTGCTAATTTTCTTGCTGTTTTACAATTTTATTTGTGCAAATTATGCATTGAAGTTAGGTCTCAGATTTATTCATGGGTTTGCAATTCTAGAATCACTTTTTCCAGGAAATGCAAATTCTAGAATCTTTAGATTGAACGAGAATTGATTGGAAACAGGGAGAAAAATCTGGGTTTTTTTGATTGATTGAATTGATAAGCAGCAATTTCTAGGGTTTCAAAGAGTGGTAAGTTAATAAATTCCCCCCACCCCCCCTTTTTTTTTGTTTTACTTTACTTGTTCTTTTTTATGTTTTTTTTTTTGGTGAGATTACATGGTTAATTAAATACCGTTCTGAAACGAATTTGAGTACAATGAGATGAGAATATCAATTTCATTGGGTGTTGAACAATGGCAGATGGATTCGAAACCCGAGACGGGGAATGCAGTTGATGGTGGTGTAGGGGTGTTGAATCATCATATTCAGAGTTGTTCACAGTTATCAGGTCTCAGCATTACATCTGAAAGTTCTGCCCCTAATTTTGTTTATCAAAGAAAGAGGATTCAAAAGAACCCTAATCCTGTTTTCCCAACCCTGTCACCTGATAATATTGCTTATGTCTATGAACGAAGGAAACATCAAAAGAGTGCACCCACTAATTTTACACTTATGGTCTCGGAAAATTCATCACCTATTAAAAGAGGGTTGAATGAAAGGGAGGGTACTAGGGTTTCTTCAATAGAGCAGCCATTGACAACTTCAGAAGCAATTGATGGTTATGTGACCGAGGACCGTGGTTTTGATGCAAAAGAAAAGAGGAACAAGGGTTTTGAGTTGTATAGCATAGATGATAGTTGTTCGTCGTCGTTGCTTAATGTGGATAGTGGTTCTTCTCCATTGAAGAGTAAAGTTGATGAAACTGATGAGTGTTCATCCTCTGGTGCACTTGTTGTAGATGTGATGGGAAACGACAAATCGCTATCAGTTAAAGGTCTTTGTGTTTCTATTCTTCGGAGTCATGGGTTGTTGTCTAATTATGAGTCAAACAGATCTGGTCTTTCTGAGGAAGATGCTCCTTCATGTAGTGGAAGCAGTTGTTCTCGCACTTGTAATGTTTGCGGTGTAGTTGGTACGGCCATTGACCTGCTAATTTGTGATGAGTGTGAGAAAGCATTCCATATATCTTGTTTCAATCCATGCATAAAGAAGATTCCCGATGATGATTGGTATTGTCAACCTTGCTCTAAAAAGAAGCATAACTTGTTAAAAGAGAGAGCCATTAGAAGGTCATCAATTATCAGAACTGGTAAAGGTTCTGTGTCATCTGAAGGAGATTTCAGTCCAATAGCACTGATGTTAAATGATTCTCAACCTTACATAACTGGTGTCCGGGTTGGTAAAGGATTCCAAGCATTTGTGCCTGAATGGTGTGGCCCTGTTGCAAGGTATGTTCCAGGAATTTTTCTTTATGCATCTTCATGACACATTGACACGATTGTATTGTAACCTTTTTCTCTTTGAGGGTTTTGTCTAGCAATTATAACCTTTAATAGGTAAGTGAATGGGTTGCGTTTAGGACCTTTTTCTTTGCATGCCATCGTGCATTGAAGAATGTCTTTTTTTCACCAAGAGGTACTAAGATACAATTTAGTTAGGGTGTCTGTCGGGGTCCCTTGGATATCTTTATGGTCACGGTTATACTCAACATCTTTACAGTTTGCTTCGAAAAGGCATGGGTCCGTGATTTTTCCCATTCGAGGGTTCTTTTGTGGTTAGGGAGAGGAGGTGCTTACTACAGGCACTAGGTAATTTAGTTAACTCTATGAAAGTAGGTCATAGAGAAGAGCATGACGAACAATGTGGCACACATGCAAATTTTGATGTATTTATGGCCAATTAATACAAACCATTTATCCATGAAGTTTCTTTACAGGGGAGAACTTTTATGTTACCTTTGTAGATGGTGTTTGGGAGGGCTTCATTTCTTCTTTAGCCATGTTGCTTGGTTATTCTGTTGGTAGGTTCTGCAATGAGAACCTTCGTTGTTTTAGGGCGGGAATTGCTAGAGAAATCTCATGCTGCTTTCAGTCCGAAGTTCTTTGTTGAGTTTTTGGGTTTATGAAATCTGAGAAGAGTGGAAATTGTCACAGATTCTTAGCTTAACTTCAACCATTCAGAAGAAGTGTGAATGTCTGTGGGATGACATGTTACCTATGTAGCGCTAGTAGTAGATATTTTGAGAAGTTTAATTGGTTTAGGGTATGCAGGTTCGTTGAGCTTATCTTCAGATGCCTTTTCTATTTGCTTGAAGAAGTGGGGTAGTTTTGTGGCCATTCAGGGTATGGAATCTTACTGCTATGCTATTTGGATTTAGGTTATTGGTCCATTCAGATGCCCTCCATGCTTGCTAATATTTTTTCATACTCAATTTGAGAAAGTGGAAGAAATCTCCAGCCAAAAATCTCTGTAATTCTGATTACCCAATTGGCTCTTGTTCTTATTTCCTTGTGTGTTGTTATGTGGTATTTTTCTACTCCTTTAGGATTTTTTGAGTTCTACCTTGTAAGACAAAAGAAGTGCTTGTGTCTTTGAGAAGGAGGATAAAGATGGCATAGATCATTCAAATACGGGGAAGGGTTCCTTTGGCAAGCTTGTAGCGTTTAACTTGAAACTGTGTAAACCAGTTAATCAGTGATGATAAAGCTTCTTCAAAATCAAGGGTTTGTACTCGCTTTTTAAAGTTTTTTGTGTGTTCTTCCCACTTGCTTTTGGAGAGATAGCAAGGAACAAGGGTTAATGGCGGGGCGTTGGAGTAATGTGGTTAGCTTTTCTGAAAGAAACAGTAGATGTTTGTACTTTGTAGGAGCAAAGAATCCCAACAAGGGAAGTGGGTAGGACAAAAAAGGTGGAGGGAATATTTATTTTCTTTTAGTCCCAAGAGCTCCAGGTTTTTCTTTTATTGAATTTGAATACATGACACTTACATGACAAACCATCCAAATAGAAGTTTATCCCAGGTATATGAGTTGACATGGAATAGATTGAGTAAAGCTTGGTTTTCTCTTTCACCTTCAACGATCTTTTCCATATCAGGTGATCTAGTGAAGTAGGAGATTTTTGAAGGTGGTGTTGGGTTAAATTAGACCAGGCCTTTGGTTAATTGCAAGGCCTGGGCAGCTGTCATTCCATGCATTTGAAGCCTTTCCTAATCCTACCTTAACCTTGTTTGAGGGGCATTACAACGTGAAATTATTGTAAATTGAAGAAAAGATCAACATACTGTTTAAAAATGTTATGCCAAACTATCGAAAACTTGTAACAGATTCCTTCAAGAAGTATTTGTTTCGAATTTCTAAGAGACTCCATATTCTTTTAACGTAATATTTTGAAAATAGGAAGAGAATATTGGAACCATTGAGTTTTACTTAATAAGGTTATGGGTTAGATCAATTTCTTCATAAGATGACCAACGGATTTTTGAGGGGATAGCTTCTCCCGTGACAAGGATTCAAGTTCCTTATTAGGTTTTTGTATAAGCGCGAGTCAAGTTTTAGCTTGTGAGTGTCACTTGACTTCTCTTGATTTTGTAGATAAACTACTAGATAGCTTGAGGACATAAGACTTTTTGTCATTTGCCCTCCTTTTATATATAGGTGACATAACGTCTTAGTGGAAGGGGAGAGTAAGCTTTCCATGGAAAGGGGGGGAAGGGGAGGATAAACTATTAGGTAGCTTGAGGACATAGACTTTTTGTCATTTGCCCTCCTTTTATATATGGGTGACACCAGATGTCGTAGTGGAGGGGGAGAGTAAGCTTTCCATCGGAGGGGGGAGCTAATGGGGGAAGAGAGAGAGGAAGAGAAAATGGACATGGGGAGAGGTGAAGTGTGTGCTTGACTACATGTATCATACTCTCAATTCTTTTGTTTTAATTTTGTAATTCTAGCAAACAAAGGATGGAGAAATGGATAATGTTTTCCACAACTAACACCCTCTAGATTCCGAAGTTGGGCTGGGGGAGACTGGGAGAGAAAGACAGATATACCATTGCTTGGATCTCTGGGACTAGAGGTTAGAGATTTCATTCTCAAACACTAGTCAATAAGGAATGCTTGTAAATTGGAGATTATATGTGCATTGCAGGTATAGGCATAGGCAACTAAAAATTTGAACTACAATTTTATCATTTCCAGTGCGCATGATTTTTGTGATACAAGTGCACATTTTTTGTTTGGTTGAGCCTGAGAAATATTTGCTCCCTGAAAATGAATATAGCTCAAATTCTTTTCAATCTTGCTAGGAAAATCAAATAATGTGCTTGCAGCTCCAAACCTTCAATACCTTGCCATTCTTCTCAATTTTAGGACTTATTGGTGTTATTTTTTTAGTGAAGTTGTTATTCCTATGCAAGATGTTATATGTGCTTTTGGTTTGCTGCAGGAGAGGCACTGTCTTTCCTGAACCAATGGAGTTGGATCCATTGCAACCGGTTCATTTGCTGGTGAGGCTTACAAAATTTCAATTTACATTGTCATTGGTTTGTCTTGTATGATTTACTGAAAATAATTCCAAATATCGTCTCCTCAGGGACCAACTTCTGGCAAGCATCGTAAGGCCTTTATTGGTAATTGGCTTCAGTGTAGAGATGTCATTGTTGGTATGGGAGATAGCATTGATGGAACTATATGTGGAAAGTGGCGCAGGTAATATGAAATGTTGAATCTTGTTTGGTGCTTTCTTTTCCTTTAGGCATGTGCACCATCTATGTTACTTCGACACGGCAATATAGCCGTCGTACCCGGGTCGACCCGACGCGACAAGGATCCCGGTACGGAATCCGCACCGGATACTTGTGGTCTGTCGGACACGGCAGCTAAAAGGAAGAGGGAGAATCAATTTATTTTTACTTTTTTTCGATTTTCAAACCCTAGATCTGTTAATTGGGTTGAAGATTACCTATTCTCGTTGAAAATTTGGGGCTTTTATTAGAAACGAAATTCCGAATCTGTTAGTGTAATTTTTGTCGACCACCCCGCCGTTGGTACTCGCCGGCGGCAATGAGAGAGCCGATAGCGGTGGCCTCTGTTCTTGATTTTTGGACAATTTATGTTTTGTTGGCTTGTTGCAGACTTCTTGAAAATATACACGTGAATCACAAAATCACAATACTTCAATTTTTTGTCCTCTTGCAGAGTTGTAGTACAGTTAGACTTTCTATTTTTGGTGGAAAGTATGGGGTGTACTGTATTCAAAAAGGAATATTTAAATAAACCGTAACTTGTTAGTTTTTATCCTTAATTTAAGGCTAACTTTTATTTATTTCTTTTTTTTCTATTTAAAATTTTTATATATATATATATATATATATATATATATTTATTTTCCATATCCCCGTACCCGTGTCTTAATTTTTAATTTAGGACAGGGTCCCCGTGCCTTCAAATTTTGATTTTTTCGAGTCCGACACTCGGATCCGTACCCGTGTCCAACACCCGTACCCGAGTCCGAGTAACATAGTGCACCATCTTCGAATTTGTGAGATAATAACTTAATTGGACCTCAACGTTCTATTAGTACAACAAATCAACTATTATCGGATTTTCCAGCTCACCATTTTTGGTTTGCTTCTAAAGAGATAGATATAGTTGGCTTTCTTCTTTGTTATGACTTATGAAGGACTTGAGTAGGGGCCACTGACTGGCACTGTGCACTAGAGTCTTATGAGTGGATACCAATTACCAAGTGTAATCGTGATCATTGCTAAGGATTTTTAGGTGGACAGATGCTGTCTTTAGATGCTTGGTTGGAAGATTCCAAGTTGTAAGCTAGAAACACAGGATGATTCTTTACTTGTATGTTTGCTGTGGTGTCTGATAATTTGCAATTACATTAGTGATTGCAGTAGGAAAAGTTCTTGGTTGGGTTTCTGGTTTGGTGAATTTATGTAGCTAGTTGCCAATCAAGTGAATGTCTAATTGTCTATAGAGGTTACTGTTTTTCTAGGCATAAATAGATACTTTGGGACACCAATGTTTGTATTCCGTGCACTAAGGGAGATCCGAGAAAACTCTATAATCAGCTGGCTATAAAAAGGGGAAGAACTGTGCATGCTGAAATTCTATCATTTCCATGGTTTAGTGTTTAGGAGAACTAGCGATTGCTGGTTCCTCAGAGCTGTGTCAAAATCAAGGCTTTTCGTCACAATATTTTTAGTATCATGCTGATATATCTGAGTAATTTAGTGACTAAAGCAGCTGCCATGAATCTGTTAAAAGAATTTGTATCAAGAACAGCTCACTGCCAACAGCTTGTTGCCTCTCATCTACTTCATTGCTGACATGAGGTTAAATTTTCTGCAAACCTAACTAGAGAAATATGGGTTACTTTGTTCTTGAGCTCTCTTTGTGATATTGGAACCTTGGAGAGGATCATGGGTCTTAGTAGACCTGGTTATCGGGCGGGGCGGGTCAGGGTCGGTGCGGGTCAAAAGAGGGTCGGGTATTGAAAGGGTCATTTTTAGCGGGTCGATAATGGGCGGGTCAAAAGCGGGTCGCGGGTCAATAGCGGGTCATTAATGGGCGGGTCAATAAACGAGCAATGAAAAATGAAAAATGAAAGAGCCATGTGCAAGTACAAGTAAATACGAAGCTATACGCGTAATTTTTTGTATCATTACTATTTTAATTTTCAAAATAATTGTAGTATAAGTTTGACCCTATAATGACCCTGTCCAATAAAGGCCCTGTCCAATAAGAGCCCTGCCCAATAAAAACCATATTAACTACCCTAACCCTATATCCATTGGACTACCCTGTCCAATAACCAGGTCTAGGTCTTAGTATTGAATTTCGTATTTGTGATTCATCTTATTTTTTGGTGCTCTTGCTGCATTCTTTGTTGAACTTTCAGTTATGAAGTACTTCCTCCGTTTTTTAATACTCGCAACGTTTGTTACTTTCACGCATGTCAATGCACTACTTTGTTCATTTATATCTTAATTTCTTTTTATGCAAAAAATATAAAAAGTTGATATTTTGAAAATACACATTGAGACGAATCTAACAAGATCCTACATGACTATGTTTTATTTTATATAAAAGCACCAAGATTAGTCAAAGTAGATTATATGAATAGTGTAAAAAGTCCAAACGTTGCGAGTATTTAAAAACGGAGGAAGTATATGTTTTGCTACGGCCATGTATGAGTCTTCCTTTGGGGCTCCTTTTAAATTGAACCTATTTTTGTTAAAATTTTGAGGGACTACAAAAGAGGCTCGTGGGATGGAAAGTTAAGCTCTTCCTCTGTTCCCAAAAGGGGATGGATTCATGGATCAACCTTGCTAAAGAGCTACTCATCTAGATTGGCCAATTTTATTTGAACTTGTAAAATGTAGAAAATTAAAAGACACTTCATTTATGTTAGCAAGGATGCCAAAGAGACTTGAGAAGGTGTTCATAGGGTTAGTTGACAAGATTTTATTCGAAATGGGATAGGTGATGTATGTTCAATCCGTGCTATCAGCTTCGATGTACGTTGGACAATGGACCCCTTTATATCATAGGTGACACTGTTGTCTTTGTGGGCTCACTAATCGTAATACACTTTGGGAGACATTCATCGAGCTGAAACATAGCTAATCTGTATTATTCTTGAAGCATCAAAAGGCATTGGTCTGATATTTTTGAAATTGCACACCTTGTATAGTAACAAAAAAGAGGGGGATGCTGTAAAACTTACCACTGAAAACAAGCTATTATATATCTGACATTGTTGTTCAACACCAGGATAGATTAGCTTTCTAGAATTTGGTTACTCTGGTTTCCACATATTCGGTGCTTTCATTGGGGCTTGAAACTTGTTTTTTTTATGTTATTTATATTATTGTTCAATGTGCTTTTCTGTTCAACTTGTCTACATTGTTTGCATAGAGGACTCTTAATTTCTTGATACTTGCTTGTTGGTACCATCCTTGTTTATTGAGCCTACTCAACGAGTCTTTTGCATGCAGGGCTCCTCTGTTTGAGATCCAGACTGACAAATGGGATTGTTTTCGTTCTGTTCTTTGGGATCCTTCGCATGCTGATTGCTCTGTACCTCAGGTAAGAATAGCTTTGCATCTATCAATACCTCTGTACCAGGCTAATCAGTCATTTCTATCACACCTAGTACGTGGCTTTCTTCGTCTTTTTGTGGCGGTTGATTTCCTTTGCATATCTTTATATGTTGATACTTGATAGATGAAGGTTTTAAGGTCTTGTTTGTGTCATGGAAATGGGCAGTATGATGCTATTATCTTGTTTTCCTCTTCAGCTGTGCCTCTGAGTGAAATATAATGTGTGATTTCTCTAAATCATGTTTCTTTTTGCAACTCCCTCTTCAACAATACTAGAATAGTAGTCTCCTAGTGTACATATTGTAGTGAAAGAGAGTATGAGTTTCCATTCCTTTTTTTGTTTGTGTTGTCAAAGACAGAAAGATTGTGAAAAAACACAATACAGTACAGTAATACTGTTAGAGGACAAGGCACACTCTTTAAGAAGAACAGAACTAGAAGCATGCCAGTTGCCGTTGGTATACTGAAAGCAACAAGTAGGAAAAAGACCTGATTCAACATATTACAAAGAGGGCAGGAACATAATAAAATACATACCATCTCTAATTCTCTATCTACTAGCCTTATTACGGAAGATACAAATCATAATCCCAAACGAAGCACTCAAATCACCTCTTTAAACGCGCAACTTCGAAGTTCCAATGTTGTTTGCTACATTGCTTCTCCTGATCACGTGGCGCATAAAAAAAGGTAGAGCACTTGAACTCCCTTTTTAAGTAAAATCCACGCATCATAAAGTACCACTAATGACTAATTTCATAATGAAGGAGGAGATGAGAACATATTCTTCACCTTTGTGACACTGTTTCCAACATTATCAGATAGGATCTAGTTGGCCATCACCACCAGTTCCTCGGATTATCAAGCTGTTTAGTTAATCAAAACTACTTTCCGTACAGAAATAGTGGGAATTGCTGGGTTTAGTGTATTCTGTGAGTTGTTATAGAAGATTTTTCAAGAGAAGTAGGTAGAAGAATCAGATAGCTAGATCAGTGAAAAAAAAATAAGATATGGGTTTTCTTACAAGTGGCCATGTCACCAGCTCCATGTTAGTTTCTTTCGAACAGAAGTTCCAGACGAGTTGTGAGTTTTCCAAATAATGTACAATGAAATTGGCACCTTGGAACCAACCAACTGAATATTTTAGAAAGGATAAATCAGTGAAAACGTGAGAATCTAGGAATTTGTATAAAGAGGCTACATTGCTTTTGCTCCTTCTCATGTAGATTCCTTTTTAAATGGAAGTTTCAAGGCCTTCCTTACAATAAATTGTGAAACAGTAGAAGGTGTGAGGTAATTGATAAAGCTAGGAAAAAATTGAAATGCATGCTCTTTTTCCCAAGAGTTCGTCCACAACTCTATGTCTCAACTCTCAACAACTAATCAAAATTTAATCAAGGAAAAAAGACATCTCTTGTTTCTGGATTGAGTTCAAAAGAGATCTATGAATGTGATGGGTATATTCCCATGAAAAAGAGTTGACTTTCTTTTAGGATGTAATTGGATGTGATCATAATGATCTTTTCGGGTAAATTATCTTTCTTAACAGATATTCTCTTCAATATCAAAATCTGGAGATATCTTTGGTCTGATATCTACAAAAGAAACCCAGCTTAAAAGAGAAAAAAAAGAAAGTCCAGTGGTAAACCTGTACAACCACTTCCTCACTAGATGTGTTTAGACACCTCATTCCCAAATCCAATCCTTAATTGACCAACTTATGAAAACCCCGATCAAATCTGTATAAACTTCTTTGTTTGTTGTAGTAGTGAATAAGGTACTAGAAAAAAAAATTAGGACATCTGAAGTATGGTACTGAGCATTATGCAAAATGAAGTTTGTTCACCAATCTTCTTTGTTTTTTTTATCCCAGTTGACTTTCAATAGCATATGATTGTATATGTTTATTTGAATGCACATTTTGTTTTACGTGTCTAATTGAATTTTGGAGTTGTACGTTTGGATTCTTTGGACTGAAAAGCTTTAATTTTTCAGAGGATGCTTTTAATATTCATCTGAGTTTTTACAACTGGAATGTTTCAGGAACTGGAAACTGAAGAAGTACTTAAGCAATTGAAGTACATTGAAATGGTAAGTTTAATTGCTTAATCTGTCAGTTTCCCGCATGAAAAAGTCCATTTAAATGCTTATTTATTGTGCTGTTGTTCCTCCGTTTAGTCGTGTTTATGAATGGTTTCGAAGAGCATTTTCTTCTTTTGGGGATGGGGAGGGGAGGGAGGGAGGGCTAGGATATCCTTTCTGTCATCTTCCCTGTTGGTTTTTCCTACCTCCTATTCATGGTTCTTTGCTTAGTTATGTTCAGTAGTGCTTCATTTTTGTCTTATGTTTTATAAGGGCTTTGTGATTATTTTTCTCTAAATCAGATCCTTGCAGCCTTCCACTTAGGAGCTAGAGGTGGTTGCTTAGCACTTTAAAATTTCTTTCTTTTTTCTTGGGGTGCAGGTGTGAAGGTAGTGAGCACTTACTTACCAAGGTCTATCAGTCAATGTATAGTAGGTTGGAATAGGAATAATTGGGTGCAGATTAGCACCGTTAAGTCGTTAGCCCCTTCATCTGCTCTAGTTGAGGCTTGAATTTGGGAACTTTTAAGCTTGCTTGGTGACAAGAAGAGTTGGAAAGGGTTGGAAACTTCTTTTTTGATTCCTTTCTGATCTTATTGGTATAGAGTGTTTGCTTAAAGATCTGCATACTGCACTTTTCCAACATCAAAGCTTTTCTGGATGAGGGCGGTGTGGTGGGTGGGTTGTACTTTATATATTGTATCTTTGGTGCTTTCACACCTTTCCTTTGTTTTCCTGCTGAGGTGTGTTGTATAAATCTTTTTTTCCTTTCAGGGAGGGGCATAAGGTGGCAAAGGGTTGTATGTCTTGCACCTTAATTTTTGTGAGTATTGCTAAATTAAGGAGTAAATGGATGAAGCAGATGAGGGTTGAAATAGATTACTTAATGGGCAAATTAGGGTTCTGAAGTAATGAATACTTTTCAAGCACAGCTGTACAGGTTTCTTTTCTTCTGTTCATTGGGCTTTCTTTCTATTGAGTGCCAAAATTTGCTCTCCCGTGTTCTGCATTGACATTTATGGCTACATTGTCTTGGTTAGGAGAGTGAGGTTGAAATTTTGAATTAGGCTAGCTGTTGTTTTTCAAGCAATAAGGACTGAAGTTGTAATTTTTTTATTGAAGTGCAAGAGTAAATTGTGACAATCTGAAGGGAAAAAAGAGTGCATGTTTACAATGGCATGAATGTCATGCTCTTTTGAATTTGAAATACTTCTGTCTTTCCATTCTATCATTAACGCAGTTACCGTAATGCCTCTTTTAGGCTATGGCTTTTTGTGTGTGTGTGTGTGTGGGGGGGGGGGGGGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGTTGAACATTATTTTATCCGTGTTGACTCATGACAGATGTTTGCAATTGAATAAGCAACAACTGTCTGATACTTCCCCATTGAAACCGTCAGCAAGTGTTGTCCAGCTCTGCGACAGTATGACTTATGAGGCTAATCAGGCCACAGAAAAAAATCGCGGATATGGTCGCGATCGCGTTATCGCGGAAACCGCTTGTAACAGATAAATAGAACGGATCTCGGCTAATGCGGTGTGAATTCTCTCAAAAAATCATATTAATAAGTTTAAGCTTTGTTTATACCTCTTAAGTTTAAAAAACATTATTTCAACACAATCTAACAAACCCCAAATCAATTGGAAACAACCTCTCTGGCATTGCAGGGGTAAGGTTGCGCACACCCGACCCCCCTTACCCCGCTCCTTGCGGGAGCCTCTTTGAGGCAATGGGGTAATGATAATAATGATGATAATGAATCTAACAAACCCCATTATAATGCTAGAGGGACATACAAATGATCATACAAAATATATTGTTTTTAAAATACATAACTAATTACTAAAATATTTTGTGTACAATTTGTACAAAAGAAAGGTGTAACGGAACAAAAAAAGCGTTCTAATGCGGCGAGTCACCGTTATGTAACGGGCGATGCGGGTAGAAAGATATGATTTTTGTCGCACCATTATTTAACGGGGGTTTCGCGGAACGGTAAATCCAAAATCCGTTACGGAACGACCGAGTTTTAATTCCATGAATCAGGCTGATTCATGAAGGTTGGGTAGGAAAGTTGGGTTTGAACTACCAGTTTGTAGTGAGAAATATTCACATGGTGCTTTGGCTGGAAGGTTATTTGTTGCTGCAGAAAAAAAATTGATTTATTCTGTTTGGTAATAAACTCGGGTTTCATTTCTTCTTGCAGTTGAGGCCTCGATTGGCAGCAAAGAGGCGGAAGTTTGATGTCTGTACTGCTAATGGCTCTGAGGCTGTTACAGAGGACGTAAGCAAGTGTACAGGCTCTGTTGATGCAGAAAGGAATTTGTAGTTTTGCTGAATTCGTAGTTTTTTTTTTTTTTAGCAAGGGAATTCGTAGTTTTGCTGCGCAATATATTGGTCANAGTTTTTTTTTTTTTTAGCAAGGGAATTCGTAGTTTTGCTGCGCAATATATTGGTCAATTCAATCAACGTAAATTCATGTGTCTGAGTTTGGTGCATGCTTTCTGACCATGTTGTATTCGAAACCAAAAGCAAGCATAGCTAGTGTACATGTAAAAAGGAACTAAGCTAGGTTTCGGTTCTGTCATCTGGGGTCGTTCGTGCCAAAATGTTAGAATCTATCCCTTACGTTTTTGATGTGGTTGTAGGGGAAAAATATCTTTGTATCAAACATAATGCGGTTGTTAAATCAATGAATTTGTTATTCATTTCCCGATTTTAGTATTTCTGGTGCTAACTGCTATCATTCTGTTGTACAGATTTATAAGCAACCAAATTGTGAATTAGGGATTATGAGATGTTTAGCTTGTTTTTATTCAGTTTTTTTAAGGAACAGGCCACACATAAACGGCGTGACAAATCGGCAAATATCGTATCGCCAACTTGAAAATCCCAATTA

mRNA sequence

CAACGAGGCTAGAAAGTTTGCAAAACACACTACCCCCCTTAAACCGTCTTGCTAATGGCGGAAGCTAAAGAGGCCATTCCAATGGCGACTGCAGAAGTACCACGCTCCTCTCTCACTCCTCTTGAAGTTCCAATGCAACTTCACGAAATCAATCGCCAAAAGCTCGTCAAATATCTTCAACAACACCTCACTGATTCTGGTCGCCCAATTCAAGGCATCGTTCTCCTTCAGGGTGGAGAAGAGAAGAATCGTTATTGCACCGATCATACCGAACTTTTCCGGCAGGAGAGTTATTTTGCTTACTTATTTGGAGTGAGGGAGCCAAGTTTCTACGGAGCTGTTGATATTGCCACAGGGAAATCTATCCTGTTTGCTCCACATCTCCCAGCAGAATACGCTGTTTGGATGGGGCAGATAAAACCACTGTCACATTATACGGAAAGGTATATGGTTAACATGGTCTGTTATGTGGATGAGATGGAAGAAGTTTTGCTCAATCTATACAAAGGAGAAGATAAACCTTTGCTGTTTCTATTGCATGGGCTTAATACAGACAGTGGCAATTTCTCAAAACCAGCAGAATTTAAGGGTATTGAGAATTTCAAAACAGATTTGAGTACATTGCATCCTATACTGACCGAGTGTCGTGTTATAAAGTCAAAGTTGGAGCTCGACGTTATCCAATATGCAACTGATATAAGCTCAGAGGCTCATGTTGAGGTAATGAGAAATACAAAAGTGGGCATGAAAGAATATCAATTGGAGAGCATGTTTCTCCATCACACCTATATGTATGGTGGATGTAGGCATTGCTCATACACATGTATTTGTGCCACAGGTGATAATAGTTCTGTTCTCCACTATGGGCATGCAGCAGCTCCCAATGAACGGACTTTTATGGATGGAAATATGGCATTGCTGGATATGGGAGCTGAATATCATTTTTATGGTTCTGACATAACCTGTTCGTTCCCTGTGAATGGAAAATTTACTGTTGATCAGCGTCTTATATACAATGCTGTCCTTGGTGCTCACAATTCTGTCATAGCCGCAATGAAACCTGGAGTAAGTTGGATAAATATGCACAAACTTGCAGAGAAAGTCATACTCGAGGCTTTAAAGAAAGGGAACATCATCACTGGTGATATCGAGGATATGATGACTCAAAGGCTGGGAGCTGTTTTTATGCCTCATGGTCTTGGCCACTTGCTAGGGATCGACACCCATGATCCAGGAGGCTATCCCAAGGGACTGGAGCGACCAAAAGAACCTGGGTTAAGTTCTCTACGCACAGCCCGAGAACTGAAGGAGGACATGGTAATCACAGTGGAGCCTGGGTGCTACTTCATCAGTGCGTTACTGATTCCAGCAATGGAAAGTTCAGAAACGTCAAAATTCTTCAATCCTGATGCTATAAGGAGATTTATGGTCTTTGGCGGTGTTCGAATTGAAAGTGATGTGTATGTCACATCAATTGGCTGCAAGAACATGACCAATGTTCCCCGCGAGACATGGGAGATTGAAGCTGTTATGGCAGGGGCGAAATGGCCGCTAGAGAAAGAAAGCACATGGATATCGATTATCATTTTGCGCCAATTTCTAGGGTTTCAAAGAGTGATGGATTCGAAACCCGAGACGGGGAATGCAGTTGATGGTGGTGTAGGGGTGTTGAATCATCATATTCAGAGTTGTTCACAGTTATCAGGTCTCAGCATTACATCTGAAAGTTCTGCCCCTAATTTTGTTTATCAAAGAAAGAGGATTCAAAAGAACCCTAATCCTGTTTTCCCAACCCTGTCACCTGATAATATTGCTTATGTCTATGAACGAAGGAAACATCAAAAGAGTGCACCCACTAATTTTACACTTATGGTCTCGGAAAATTCATCACCTATTAAAAGAGGGTTGAATGAAAGGGAGGGTACTAGGGTTTCTTCAATAGAGCAGCCATTGACAACTTCAGAAGCAATTGATGGTTATGTGACCGAGGACCGTGGTTTTGATGCAAAAGAAAAGAGGAACAAGGGTTTTGAGTTGTATAGCATAGATGATAGTTGTTCGTCGTCGTTGCTTAATGTGGATAGTGGTTCTTCTCCATTGAAGAGTAAAGTTGATGAAACTGATGAGTGTTCATCCTCTGGTGCACTTGTTGTAGATGTGATGGGAAACGACAAATCGCTATCAGTTAAAGGTCTTTGTGTTTCTATTCTTCGGAGTCATGGGTTGTTGTCTAATTATGAGTCAAACAGATCTGGTCTTTCTGAGGAAGATGCTCCTTCATGTAGTGGAAGCAGTTGTTCTCGCACTTGTAATGTTTGCGGTGTAGTTGGTACGGCCATTGACCTGCTAATTTGTGATGAGTGTGAGAAAGCATTCCATATATCTTGTTTCAATCCATGCATAAAGAAGATTCCCGATGATGATTGGTATTGTCAACCTTGCTCTAAAAAGAAGCATAACTTGTTAAAAGAGAGAGCCATTAGAAGGTCATCAATTATCAGAACTGGTAAAGGTTCTGTGTCATCTGAAGGAGATTTCAGTCCAATAGCACTGATGTTAAATGATTCTCAACCTTACATAACTGGTGTCCGGGTTGGTAAAGGATTCCAAGCATTTGTGCCTGAATGGTGTGGCCCTGTTGCAAGGAGAGGCACTGTCTTTCCTGAACCAATGGAGTTGGATCCATTGCAACCGGTTCATTTGCTGGGACCAACTTCTGGCAAGCATCGTAAGGCCTTTATTGGTAATTGGCTTCAGTGTAGAGATGTCATTGTTGGTATGGGAGATAGCATTGATGGAACTATATGTGGAAAGTGGCGCAGGGCTCCTCTGTTTGAGATCCAGACTGACAAATGGGATTGTTTTCGTTCTGTTCTTTGGGATCCTTCGCATGCTGATTGCTCTGTACCTCAGGAACTGGAAACTGAAGAAGTACTTAAGCAATTGAAGTACATTGAAATGTTGAGGCCTCGATTGGCAGCAAAGAGGCGGAAGTTTGATGTCTGTACTGCTAATGGCTCTGAGGCTGTTACAGAGGACGTAAGCAAGTGTACAGGCTCTGTTGATGCAGAAAGGAATTTGTAGTTTTGCTGAATTCGTAGTTTTTTTTTTTTTTAGCAAGGGAATTCGTAGTTTTGCTGCGCAATATATTGGTCANAGTTTTTTTTTTTTTTAGCAAGGGAATTCGTAGTTTTGCTGCGCAATATATTGGTCAATTCAATCAACGTAAATTCATGTGTCTGAGTTTGGTGCATGCTTTCTGACCATGTTGTATTCGAAACCAAAAGCAAGCATAGCTAGTGTACATGTAAAAAGGAACTAAGCTAGGTTTCGGTTCTGTCATCTGGGGTCGTTCGTGCCAAAATGTTAGAATCTATCCCTTACGTTTTTGATGTGGTTGTAGGGGAAAAATATCTTTGTATCAAACATAATGCGGTTGTTAAATCAATGAATTTGTTATTCATTTCCCGATTTTAGTATTTCTGGTGCTAACTGCTATCATTCTGTTGTACAGATTTATAAGCAACCAAATTGTGAATTAGGGATTATGAGATGTTTAGCTTGTTTTTATTCAGTTTTTTTAAGGAACAGGCCACACATAAACGGCGTGACAAATCGGCAAATATCGTATCGCCAACTTGAAAATCCCAATTA

Coding sequence (CDS)

ATGGCGGAAGCTAAAGAGGCCATTCCAATGGCGACTGCAGAAGTACCACGCTCCTCTCTCACTCCTCTTGAAGTTCCAATGCAACTTCACGAAATCAATCGCCAAAAGCTCGTCAAATATCTTCAACAACACCTCACTGATTCTGGTCGCCCAATTCAAGGCATCGTTCTCCTTCAGGGTGGAGAAGAGAAGAATCGTTATTGCACCGATCATACCGAACTTTTCCGGCAGGAGAGTTATTTTGCTTACTTATTTGGAGTGAGGGAGCCAAGTTTCTACGGAGCTGTTGATATTGCCACAGGGAAATCTATCCTGTTTGCTCCACATCTCCCAGCAGAATACGCTGTTTGGATGGGGCAGATAAAACCACTGTCACATTATACGGAAAGGTATATGGTTAACATGGTCTGTTATGTGGATGAGATGGAAGAAGTTTTGCTCAATCTATACAAAGGAGAAGATAAACCTTTGCTGTTTCTATTGCATGGGCTTAATACAGACAGTGGCAATTTCTCAAAACCAGCAGAATTTAAGGGTATTGAGAATTTCAAAACAGATTTGAGTACATTGCATCCTATACTGACCGAGTGTCGTGTTATAAAGTCAAAGTTGGAGCTCGACGTTATCCAATATGCAACTGATATAAGCTCAGAGGCTCATGTTGAGGTAATGAGAAATACAAAAGTGGGCATGAAAGAATATCAATTGGAGAGCATGTTTCTCCATCACACCTATATGTATGGTGGATGTAGGCATTGCTCATACACATGTATTTGTGCCACAGGTGATAATAGTTCTGTTCTCCACTATGGGCATGCAGCAGCTCCCAATGAACGGACTTTTATGGATGGAAATATGGCATTGCTGGATATGGGAGCTGAATATCATTTTTATGGTTCTGACATAACCTGTTCGTTCCCTGTGAATGGAAAATTTACTGTTGATCAGCGTCTTATATACAATGCTGTCCTTGGTGCTCACAATTCTGTCATAGCCGCAATGAAACCTGGAGTAAGTTGGATAAATATGCACAAACTTGCAGAGAAAGTCATACTCGAGGCTTTAAAGAAAGGGAACATCATCACTGGTGATATCGAGGATATGATGACTCAAAGGCTGGGAGCTGTTTTTATGCCTCATGGTCTTGGCCACTTGCTAGGGATCGACACCCATGATCCAGGAGGCTATCCCAAGGGACTGGAGCGACCAAAAGAACCTGGGTTAAGTTCTCTACGCACAGCCCGAGAACTGAAGGAGGACATGGTAATCACAGTGGAGCCTGGGTGCTACTTCATCAGTGCGTTACTGATTCCAGCAATGGAAAGTTCAGAAACGTCAAAATTCTTCAATCCTGATGCTATAAGGAGATTTATGGTCTTTGGCGGTGTTCGAATTGAAAGTGATGTGTATGTCACATCAATTGGCTGCAAGAACATGACCAATGTTCCCCGCGAGACATGGGAGATTGAAGCTGTTATGGCAGGGGCGAAATGGCCGCTAGAGAAAGAAAGCACATGGATATCGATTATCATTTTGCGCCAATTTCTAGGGTTTCAAAGAGTGATGGATTCGAAACCCGAGACGGGGAATGCAGTTGATGGTGGTGTAGGGGTGTTGAATCATCATATTCAGAGTTGTTCACAGTTATCAGGTCTCAGCATTACATCTGAAAGTTCTGCCCCTAATTTTGTTTATCAAAGAAAGAGGATTCAAAAGAACCCTAATCCTGTTTTCCCAACCCTGTCACCTGATAATATTGCTTATGTCTATGAACGAAGGAAACATCAAAAGAGTGCACCCACTAATTTTACACTTATGGTCTCGGAAAATTCATCACCTATTAAAAGAGGGTTGAATGAAAGGGAGGGTACTAGGGTTTCTTCAATAGAGCAGCCATTGACAACTTCAGAAGCAATTGATGGTTATGTGACCGAGGACCGTGGTTTTGATGCAAAAGAAAAGAGGAACAAGGGTTTTGAGTTGTATAGCATAGATGATAGTTGTTCGTCGTCGTTGCTTAATGTGGATAGTGGTTCTTCTCCATTGAAGAGTAAAGTTGATGAAACTGATGAGTGTTCATCCTCTGGTGCACTTGTTGTAGATGTGATGGGAAACGACAAATCGCTATCAGTTAAAGGTCTTTGTGTTTCTATTCTTCGGAGTCATGGGTTGTTGTCTAATTATGAGTCAAACAGATCTGGTCTTTCTGAGGAAGATGCTCCTTCATGTAGTGGAAGCAGTTGTTCTCGCACTTGTAATGTTTGCGGTGTAGTTGGTACGGCCATTGACCTGCTAATTTGTGATGAGTGTGAGAAAGCATTCCATATATCTTGTTTCAATCCATGCATAAAGAAGATTCCCGATGATGATTGGTATTGTCAACCTTGCTCTAAAAAGAAGCATAACTTGTTAAAAGAGAGAGCCATTAGAAGGTCATCAATTATCAGAACTGGTAAAGGTTCTGTGTCATCTGAAGGAGATTTCAGTCCAATAGCACTGATGTTAAATGATTCTCAACCTTACATAACTGGTGTCCGGGTTGGTAAAGGATTCCAAGCATTTGTGCCTGAATGGTGTGGCCCTGTTGCAAGGAGAGGCACTGTCTTTCCTGAACCAATGGAGTTGGATCCATTGCAACCGGTTCATTTGCTGGGACCAACTTCTGGCAAGCATCGTAAGGCCTTTATTGGTAATTGGCTTCAGTGTAGAGATGTCATTGTTGGTATGGGAGATAGCATTGATGGAACTATATGTGGAAAGTGGCGCAGGGCTCCTCTGTTTGAGATCCAGACTGACAAATGGGATTGTTTTCGTTCTGTTCTTTGGGATCCTTCGCATGCTGATTGCTCTGTACCTCAGGAACTGGAAACTGAAGAAGTACTTAAGCAATTGAAGTACATTGAAATGTTGAGGCCTCGATTGGCAGCAAAGAGGCGGAAGTTTGATGTCTGTACTGCTAATGGCTCTGAGGCTGTTACAGAGGACGTAAGCAAGTGTACAGGCTCTGTTGATGCAGAAAGGAATTTGTAG

Protein sequence

MAEAKEAIPMATAEVPRSSLTPLEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQGGEEKNRYCTDHTELFRQESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQIKPLSHYTERYMVNMVCYVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGIENFKTDLSTLHPILTECRVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYMYGGCRHCSYTCICATGDNSSVLHYGHAAAPNERTFMDGNMALLDMGAEYHFYGSDITCSFPVNGKFTVDQRLIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIEDMMTQRLGAVFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVEPGCYFISALLIPAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMTNVPRETWEIEAVMAGAKWPLEKESTWISIIILRQFLGFQRVMDSKPETGNAVDGGVGVLNHHIQSCSQLSGLSITSESSAPNFVYQRKRIQKNPNPVFPTLSPDNIAYVYERRKHQKSAPTNFTLMVSENSSPIKRGLNEREGTRVSSIEQPLTTSEAIDGYVTEDRGFDAKEKRNKGFELYSIDDSCSSSLLNVDSGSSPLKSKVDETDECSSSGALVVDVMGNDKSLSVKGLCVSILRSHGLLSNYESNRSGLSEEDAPSCSGSSCSRTCNVCGVVGTAIDLLICDECEKAFHISCFNPCIKKIPDDDWYCQPCSKKKHNLLKERAIRRSSIIRTGKGSVSSEGDFSPIALMLNDSQPYITGVRVGKGFQAFVPEWCGPVARRGTVFPEPMELDPLQPVHLLGPTSGKHRKAFIGNWLQCRDVIVGMGDSIDGTICGKWRRAPLFEIQTDKWDCFRSVLWDPSHADCSVPQELETEEVLKQLKYIEMLRPRLAAKRRKFDVCTANGSEAVTEDVSKCTGSVDAERNL
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Spo00502Spo00502gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Spo00502.1Spo00502.1-proteinpolypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo00502.1.utr5p.1Spo00502.1.utr5p.1five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo00502.1.CDS.24Spo00502.1.CDS.24CDS
Spo00502.1.CDS.23Spo00502.1.CDS.23CDS
Spo00502.1.CDS.22Spo00502.1.CDS.22CDS
Spo00502.1.CDS.21Spo00502.1.CDS.21CDS
Spo00502.1.CDS.20Spo00502.1.CDS.20CDS
Spo00502.1.CDS.19Spo00502.1.CDS.19CDS
Spo00502.1.CDS.18Spo00502.1.CDS.18CDS
Spo00502.1.CDS.17Spo00502.1.CDS.17CDS
Spo00502.1.CDS.16Spo00502.1.CDS.16CDS
Spo00502.1.CDS.15Spo00502.1.CDS.15CDS
Spo00502.1.CDS.14Spo00502.1.CDS.14CDS
Spo00502.1.CDS.13Spo00502.1.CDS.13CDS
Spo00502.1.CDS.12Spo00502.1.CDS.12CDS
Spo00502.1.CDS.11Spo00502.1.CDS.11CDS
Spo00502.1.CDS.10Spo00502.1.CDS.10CDS
Spo00502.1.CDS.9Spo00502.1.CDS.9CDS
Spo00502.1.CDS.8Spo00502.1.CDS.8CDS
Spo00502.1.CDS.7Spo00502.1.CDS.7CDS
Spo00502.1.CDS.6Spo00502.1.CDS.6CDS
Spo00502.1.CDS.5Spo00502.1.CDS.5CDS
Spo00502.1.CDS.4Spo00502.1.CDS.4CDS
Spo00502.1.CDS.3Spo00502.1.CDS.3CDS
Spo00502.1.CDS.2Spo00502.1.CDS.2CDS
Spo00502.1.CDS.1Spo00502.1.CDS.1CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo00502.1.utr3p.1Spo00502.1.utr3p.1three_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo00502.1.exon.24Spo00502.1.exon.24exon
Spo00502.1.exon.23Spo00502.1.exon.23exon
Spo00502.1.exon.22Spo00502.1.exon.22exon
Spo00502.1.exon.21Spo00502.1.exon.21exon
Spo00502.1.exon.20Spo00502.1.exon.20exon
Spo00502.1.exon.19Spo00502.1.exon.19exon
Spo00502.1.exon.18Spo00502.1.exon.18exon
Spo00502.1.exon.17Spo00502.1.exon.17exon
Spo00502.1.exon.16Spo00502.1.exon.16exon
Spo00502.1.exon.15Spo00502.1.exon.15exon
Spo00502.1.exon.14Spo00502.1.exon.14exon
Spo00502.1.exon.13Spo00502.1.exon.13exon
Spo00502.1.exon.12Spo00502.1.exon.12exon
Spo00502.1.exon.11Spo00502.1.exon.11exon
Spo00502.1.exon.10Spo00502.1.exon.10exon
Spo00502.1.exon.9Spo00502.1.exon.9exon
Spo00502.1.exon.8Spo00502.1.exon.8exon
Spo00502.1.exon.7Spo00502.1.exon.7exon
Spo00502.1.exon.6Spo00502.1.exon.6exon
Spo00502.1.exon.5Spo00502.1.exon.5exon
Spo00502.1.exon.4Spo00502.1.exon.4exon
Spo00502.1.exon.3Spo00502.1.exon.3exon
Spo00502.1.exon.2Spo00502.1.exon.2exon
Spo00502.1.exon.1Spo00502.1.exon.1exon


Homology
BLAST of Spo00502.1 vs. NCBI nr
Match: gi|902231892|gb|KNA22433.1| (hypothetical protein SOVF_033800 [Spinacia oleracea])

HSP 1 Score: 1013.8 bits (2620), Expect = 2.000e-292
Identity = 492/493 (99.80%), Postives = 493/493 (100.00%), Query Frame = 1

		  

Query: 10  MATAEVPRSSLTPLEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQGGEEKNRYCT 69
           MATAEVPRSSLTPLEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQGGEEKNRYCT
Sbjct: 1   MATAEVPRSSLTPLEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQGGEEKNRYCT 60

Query: 70  DHTELFRQESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQIKPLSHYTE 129
           DHTELFRQESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQIKPLSHYTE
Sbjct: 61  DHTELFRQESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQIKPLSHYTE 120

Query: 130 RYMVNMVCYVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGIENFKTDLST 189
           RYMVNMVCYVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGIENFKTDLST
Sbjct: 121 RYMVNMVCYVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGIENFKTDLST 180

Query: 190 LHPILTECRVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYMYGG 249
           LHPILTECRVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYMYGG
Sbjct: 181 LHPILTECRVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYMYGG 240

Query: 250 CRHCSYTCICATGDNSSVLHYGHAAAPNERTFMDGNMALLDMGAEYHFYGSDITCSFPVN 309
           CRHCSYTCICATGDNSSVLHYGHAAAPNERTFMDG+MALLDMGAEYHFYGSDITCSFPVN
Sbjct: 241 CRHCSYTCICATGDNSSVLHYGHAAAPNERTFMDGDMALLDMGAEYHFYGSDITCSFPVN 300

Query: 310 GKFTVDQRLIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIEDMM 369
           GKFTVDQRLIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIEDMM
Sbjct: 301 GKFTVDQRLIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIEDMM 360

Query: 370 TQRLGAVFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVEPGC 429
           TQRLGAVFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVEPGC
Sbjct: 361 TQRLGAVFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVEPGC 420

Query: 430 YFISALLIPAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMTNVPRETWEI 489
           YFISALLIPAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMTNVPRETWEI
Sbjct: 421 YFISALLIPAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMTNVPRETWEI 480

Query: 490 EAVMAGAKWPLEK 503
           EAVMAGAKWPLEK
Sbjct: 481 EAVMAGAKWPLEK 493

BLAST of Spo00502.1 vs. NCBI nr
Match: gi|902231891|gb|KNA22432.1| (hypothetical protein SOVF_033790 [Spinacia oleracea])

HSP 1 Score: 999.6 bits (2583), Expect = 4.000e-288
Identity = 487/488 (99.80%), Postives = 488/488 (100.00%), Query Frame = 1

		  

Query: 522  MDSKPETGNAVDGGVGVLNHHIQSCSQLSGLSITSESSAPNFVYQRKRIQKNPNPVFPTL 581
            MDSKPETGNAVDGGVGVLNHHIQSCSQLSGLSITSESSAPNFVYQRKRIQKNPNPVFPTL
Sbjct: 1    MDSKPETGNAVDGGVGVLNHHIQSCSQLSGLSITSESSAPNFVYQRKRIQKNPNPVFPTL 60

Query: 582  SPDNIAYVYERRKHQKSAPTNFTLMVSENSSPIKRGLNEREGTRVSSIEQPLTTSEAIDG 641
            SPDNIAYVYERRKHQKSAPTNFTLMVSENSSPIKRGLNEREGTRVSSIEQPLTTSEAIDG
Sbjct: 61   SPDNIAYVYERRKHQKSAPTNFTLMVSENSSPIKRGLNEREGTRVSSIEQPLTTSEAIDG 120

Query: 642  YVTEDRGFDAKEKRNKGFELYSIDDSCSSSLLNVDSGSSPLKSKVDETDECSSSGALVVD 701
            YVTEDRGFDAKEKRNKGF+LYSIDDSCSSSLLNVDSGSSPLKSKVDETDECSSSGALVVD
Sbjct: 121  YVTEDRGFDAKEKRNKGFDLYSIDDSCSSSLLNVDSGSSPLKSKVDETDECSSSGALVVD 180

Query: 702  VMGNDKSLSVKGLCVSILRSHGLLSNYESNRSGLSEEDAPSCSGSSCSRTCNVCGVVGTA 761
            VMGNDKSLSVKGLCVSILRSHGLLSNYESNRSGLSEEDAPSCSGSSCSRTCNVCGVVGTA
Sbjct: 181  VMGNDKSLSVKGLCVSILRSHGLLSNYESNRSGLSEEDAPSCSGSSCSRTCNVCGVVGTA 240

Query: 762  IDLLICDECEKAFHISCFNPCIKKIPDDDWYCQPCSKKKHNLLKERAIRRSSIIRTGKGS 821
            IDLLICDECEKAFHISCFNPCIKKIPDDDWYCQPCSKKKHNLLKERAIRRSSIIRTGKGS
Sbjct: 241  IDLLICDECEKAFHISCFNPCIKKIPDDDWYCQPCSKKKHNLLKERAIRRSSIIRTGKGS 300

Query: 822  VSSEGDFSPIALMLNDSQPYITGVRVGKGFQAFVPEWCGPVARRGTVFPEPMELDPLQPV 881
            VSSEGDFSPIALMLNDSQPYITGVRVGKGFQAFVPEWCGPVARRGTVFPEPMELDPLQPV
Sbjct: 301  VSSEGDFSPIALMLNDSQPYITGVRVGKGFQAFVPEWCGPVARRGTVFPEPMELDPLQPV 360

Query: 882  HLLGPTSGKHRKAFIGNWLQCRDVIVGMGDSIDGTICGKWRRAPLFEIQTDKWDCFRSVL 941
            HLLGPTSGKHRKAFIGNWLQCRDVIVGMGDSIDGTICGKWRRAPLFEIQTDKWDCFRSVL
Sbjct: 361  HLLGPTSGKHRKAFIGNWLQCRDVIVGMGDSIDGTICGKWRRAPLFEIQTDKWDCFRSVL 420

Query: 942  WDPSHADCSVPQELETEEVLKQLKYIEMLRPRLAAKRRKFDVCTANGSEAVTEDVSKCTG 1001
            WDPSHADCSVPQELETEEVLKQLKYIEMLRPRLAAKRRKFDVCTANGSEAVTEDVSKCTG
Sbjct: 421  WDPSHADCSVPQELETEEVLKQLKYIEMLRPRLAAKRRKFDVCTANGSEAVTEDVSKCTG 480

Query: 1002 SVDAERNL 1010
            SVDAERNL
Sbjct: 481  SVDAERNL 488

BLAST of Spo00502.1 vs. NCBI nr
Match: gi|731373074|ref|XP_010666538.1| (PREDICTED: xaa-Pro dipeptidase [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 917.5 bits (2370), Expect = 2.000e-263
Identity = 441/502 (87.85%), Postives = 470/502 (93.63%), Query Frame = 1

		  

Query: 1   MAEAKEAIPMATAEVPRSSLTPLEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQG 60
           M EAKE I MA+   P SSL+P EVPMQLH INR+KLVK LQQHL DSGRPIQG+VLLQG
Sbjct: 1   MTEAKEVISMAS---PCSSLSPPEVPMQLHVINREKLVKSLQQHLADSGRPIQGLVLLQG 60

Query: 61  GEEKNRYCTDHTELFRQESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQ 120
           GEEK RYCTDH+ELFRQESYFAYLFGVREP FYGA+D+ATGKSILFAP LP EYAVWMG 
Sbjct: 61  GEEKTRYCTDHSELFRQESYFAYLFGVREPGFYGAIDVATGKSILFAPRLPKEYAVWMGD 120

Query: 121 IKPLSHYTERYMVNMVCYVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGI 180
           IKP S++ E YMVN+V YVDEMEEVL++ Y GEDK +L+LLHGLNTDSGNFSKPAEFKGI
Sbjct: 121 IKPQSYFMECYMVNLVSYVDEMEEVLVSQYDGEDKCVLYLLHGLNTDSGNFSKPAEFKGI 180

Query: 181 ENFKTDLSTLHPILTECRVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMF 240
           ENF+TDLSTLHPILTECRV+KSKLELDVIQYA DISSEAHVEVMRNTKVGMKEYQ+ES+F
Sbjct: 181 ENFETDLSTLHPILTECRVLKSKLELDVIQYANDISSEAHVEVMRNTKVGMKEYQMESLF 240

Query: 241 LHHTYMYGGCRHCSYTCICATGDNSSVLHYGHAAAPNERTFMDGNMALLDMGAEYHFYGS 300
           LHHTYMYGGCRHCSYTCICATG+NSSVLHYGHAAAPNERTFMDG+MALLDMGAEYHFYGS
Sbjct: 241 LHHTYMYGGCRHCSYTCICATGENSSVLHYGHAAAPNERTFMDGDMALLDMGAEYHFYGS 300

Query: 301 DITCSFPVNGKFTVDQRLIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNI 360
           DITCSFPVNGKFT DQRLIYNAVL AHNSVIAAM+PGVSW+NMHKLAEK+IL+ LKKGNI
Sbjct: 301 DITCSFPVNGKFTADQRLIYNAVLDAHNSVIAAMRPGVSWLNMHKLAEKIILDGLKKGNI 360

Query: 361 ITGDIEDMMTQRLGAVFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKED 420
           ITGDI+DMM+QRLGAVFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGL SLRTARELKED
Sbjct: 361 ITGDIDDMMSQRLGAVFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLKSLRTARELKED 420

Query: 421 MVITVEPGCYFISALLIPAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMT 480
           MVITVEPGCYFISALLIPAME+S+TSKFFNPD IRRFMVFGGVRIESDV+VTS GCKNMT
Sbjct: 421 MVITVEPGCYFISALLIPAMENSDTSKFFNPDTIRRFMVFGGVRIESDVHVTSTGCKNMT 480

Query: 481 NVPRETWEIEAVMAGAKWPLEK 503
           NVPRETWEIEAVMAG KWP EK
Sbjct: 481 NVPRETWEIEAVMAGGKWPPEK 499

BLAST of Spo00502.1 vs. NCBI nr
Match: gi|870842688|gb|KMS96044.1| (hypothetical protein BVRB_002720 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 909.8 bits (2350), Expect = 4.100e-261
Identity = 433/487 (88.91%), Postives = 461/487 (94.66%), Query Frame = 1

		  

Query: 16  PRSSLTPLEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQGGEEKNRYCTDHTELF 75
           P SSL+P EVPMQLH INR+KLVK LQQHL DSGRPIQG+VLLQGGEEK RYCTDH+ELF
Sbjct: 4   PCSSLSPPEVPMQLHVINREKLVKSLQQHLADSGRPIQGLVLLQGGEEKTRYCTDHSELF 63

Query: 76  RQESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQIKPLSHYTERYMVNM 135
           RQESYFAYLFGVREP FYGA+D+ATGKSILFAP LP EYAVWMG IKP S++ E YMVN+
Sbjct: 64  RQESYFAYLFGVREPGFYGAIDVATGKSILFAPRLPKEYAVWMGDIKPQSYFMECYMVNL 123

Query: 136 VCYVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGIENFKTDLSTLHPILT 195
           V YVDEMEEVL++ Y GEDK +L+LLHGLNTDSGNFSKPAEFKGIENF+TDLSTLHPILT
Sbjct: 124 VSYVDEMEEVLVSQYDGEDKCVLYLLHGLNTDSGNFSKPAEFKGIENFETDLSTLHPILT 183

Query: 196 ECRVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYMYGGCRHCSY 255
           ECRV+KSKLELDVIQYA DISSEAHVEVMRNTKVGMKEYQ+ES+FLHHTYMYGGCRHCSY
Sbjct: 184 ECRVLKSKLELDVIQYANDISSEAHVEVMRNTKVGMKEYQMESLFLHHTYMYGGCRHCSY 243

Query: 256 TCICATGDNSSVLHYGHAAAPNERTFMDGNMALLDMGAEYHFYGSDITCSFPVNGKFTVD 315
           TCICATG+NSSVLHYGHAAAPNERTFMDG+MALLDMGAEYHFYGSDITCSFPVNGKFT D
Sbjct: 244 TCICATGENSSVLHYGHAAAPNERTFMDGDMALLDMGAEYHFYGSDITCSFPVNGKFTAD 303

Query: 316 QRLIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIEDMMTQRLGA 375
           QRLIYNAVL AHNSVIAAM+PGVSW+NMHKLAEK+IL+ LKKGNIITGDI+DMM+QRLGA
Sbjct: 304 QRLIYNAVLDAHNSVIAAMRPGVSWLNMHKLAEKIILDGLKKGNIITGDIDDMMSQRLGA 363

Query: 376 VFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVEPGCYFISAL 435
           VFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGL SLRTARELKEDMVITVEPGCYFISAL
Sbjct: 364 VFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLKSLRTARELKEDMVITVEPGCYFISAL 423

Query: 436 LIPAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMTNVPRETWEIEAVMAG 495
           LIPAME+S+TSKFFNPD IRRFMVFGGVRIESDV+VTS GCKNMTNVPRETWEIEAVMAG
Sbjct: 424 LIPAMENSDTSKFFNPDTIRRFMVFGGVRIESDVHVTSTGCKNMTNVPRETWEIEAVMAG 483

Query: 496 AKWPLEK 503
            KWP EK
Sbjct: 484 GKWPPEK 490

BLAST of Spo00502.1 vs. NCBI nr
Match: gi|817499302|gb|AKF43198.1| (metallopeptidase M24 family protein [Hypseocharis bilobata])

HSP 1 Score: 822.8 bits (2124), Expect = 6.700e-235
Identity = 395/505 (78.22%), Postives = 438/505 (86.73%), Query Frame = 1

		  

Query: 1   MAEAKEAIPMATAEVP-RSSLTPLEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQ 60
           M EA+EA  M +A  P RSSLTP EVP +LH  NR+KL+K ++QHL+DS RP+ G VLLQ
Sbjct: 1   MGEAREATAMVSASSPSRSSLTPPEVPFELHVGNREKLLKSIRQHLSDSSRPLHGFVLLQ 60

Query: 61  GGEEKNRYCTDHTELFRQESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMG 120
           GGEEK RYCTDH ELFRQESYFAYLFGVREP FYGA+DIATG SILFAP LPA+YAVW+G
Sbjct: 61  GGEEKTRYCTDHIELFRQESYFAYLFGVREPGFYGAIDIATGNSILFAPRLPADYAVWLG 120

Query: 121 QIKPLSHYTERYMVNMVCYVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKG 180
           +IK LS++ ERYMV++V Y DE+ +VL N Y G  KPLLFLLHGLNTDS NFSKPA+FKG
Sbjct: 121 EIKSLSYFKERYMVSLVYYTDEIAQVLCNQYSGSGKPLLFLLHGLNTDSDNFSKPADFKG 180

Query: 181 IENFKTDLSTLHPILTECRVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESM 240
           +E F+TDL+ LHPILTECRVIKS LEL +IQ+A DISSEAHVEVMR T+  MKEYQLESM
Sbjct: 181 MEKFETDLTALHPILTECRVIKSDLELALIQFANDISSEAHVEVMRKTRANMKEYQLESM 240

Query: 241 FLHHTYMYGGCRHCSYTCICATGDNSSVLHYGHAAAPNERTFMDGNMALLDMGAEYHFYG 300
           FLHHTYMYGGCRHCSYTCICATG N +VLHYGHAAAPNERT  DG+MALLDMGAEYHFYG
Sbjct: 241 FLHHTYMYGGCRHCSYTCICATGANGAVLHYGHAAAPNERTLEDGDMALLDMGAEYHFYG 300

Query: 301 SDITCSFPVNGKFTVDQRLIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGN 360
           SDITCSFPVNGKFT DQ LIYNAVL AHN+VI++MKPGVSWI+MHKLAEK IL +LKKG 
Sbjct: 301 SDITCSFPVNGKFTSDQTLIYNAVLEAHNAVISSMKPGVSWIDMHKLAEKTILGSLKKGC 360

Query: 361 IITGDIEDMMTQRLGAVFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKE 420
           II GD++DMM +RLGA+FMPHGLGH LGIDTHDPGGY KG+ERPKEPGL SLRT+RELKE
Sbjct: 361 IIVGDVDDMMAERLGAIFMPHGLGHFLGIDTHDPGGYSKGMERPKEPGLRSLRTSRELKE 420

Query: 421 DMVITVEPGCYFISALLIPAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNM 480
            MVITVEPGCYFI ALLIPAMESS TSKFFN + I RF  FGGVRIESDV+VT+ GCKNM
Sbjct: 421 GMVITVEPGCYFIDALLIPAMESSNTSKFFNRETIGRFKNFGGVRIESDVHVTANGCKNM 480

Query: 481 TNVPRETWEIEAVMAGAKWPLEKES 505
           T  PRETWEIEAVMAGA WPL+K S
Sbjct: 481 TKCPRETWEIEAVMAGAPWPLDKTS 505

BLAST of Spo00502.1 vs. UniProtKB/TrEMBL
Match: A0A0K9RSJ3_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_033800 PE=3 SV=1)

HSP 1 Score: 1013.8 bits (2620), Expect = 1.400e-292
Identity = 492/493 (99.80%), Postives = 493/493 (100.00%), Query Frame = 1

		  

Query: 10  MATAEVPRSSLTPLEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQGGEEKNRYCT 69
           MATAEVPRSSLTPLEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQGGEEKNRYCT
Sbjct: 1   MATAEVPRSSLTPLEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQGGEEKNRYCT 60

Query: 70  DHTELFRQESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQIKPLSHYTE 129
           DHTELFRQESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQIKPLSHYTE
Sbjct: 61  DHTELFRQESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQIKPLSHYTE 120

Query: 130 RYMVNMVCYVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGIENFKTDLST 189
           RYMVNMVCYVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGIENFKTDLST
Sbjct: 121 RYMVNMVCYVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGIENFKTDLST 180

Query: 190 LHPILTECRVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYMYGG 249
           LHPILTECRVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYMYGG
Sbjct: 181 LHPILTECRVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYMYGG 240

Query: 250 CRHCSYTCICATGDNSSVLHYGHAAAPNERTFMDGNMALLDMGAEYHFYGSDITCSFPVN 309
           CRHCSYTCICATGDNSSVLHYGHAAAPNERTFMDG+MALLDMGAEYHFYGSDITCSFPVN
Sbjct: 241 CRHCSYTCICATGDNSSVLHYGHAAAPNERTFMDGDMALLDMGAEYHFYGSDITCSFPVN 300

Query: 310 GKFTVDQRLIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIEDMM 369
           GKFTVDQRLIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIEDMM
Sbjct: 301 GKFTVDQRLIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIEDMM 360

Query: 370 TQRLGAVFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVEPGC 429
           TQRLGAVFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVEPGC
Sbjct: 361 TQRLGAVFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVEPGC 420

Query: 430 YFISALLIPAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMTNVPRETWEI 489
           YFISALLIPAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMTNVPRETWEI
Sbjct: 421 YFISALLIPAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMTNVPRETWEI 480

Query: 490 EAVMAGAKWPLEK 503
           EAVMAGAKWPLEK
Sbjct: 481 EAVMAGAKWPLEK 493

BLAST of Spo00502.1 vs. UniProtKB/TrEMBL
Match: A0A0K9RUB9_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_033790 PE=4 SV=1)

HSP 1 Score: 999.6 bits (2583), Expect = 2.800e-288
Identity = 487/488 (99.80%), Postives = 488/488 (100.00%), Query Frame = 1

		  

Query: 522  MDSKPETGNAVDGGVGVLNHHIQSCSQLSGLSITSESSAPNFVYQRKRIQKNPNPVFPTL 581
            MDSKPETGNAVDGGVGVLNHHIQSCSQLSGLSITSESSAPNFVYQRKRIQKNPNPVFPTL
Sbjct: 1    MDSKPETGNAVDGGVGVLNHHIQSCSQLSGLSITSESSAPNFVYQRKRIQKNPNPVFPTL 60

Query: 582  SPDNIAYVYERRKHQKSAPTNFTLMVSENSSPIKRGLNEREGTRVSSIEQPLTTSEAIDG 641
            SPDNIAYVYERRKHQKSAPTNFTLMVSENSSPIKRGLNEREGTRVSSIEQPLTTSEAIDG
Sbjct: 61   SPDNIAYVYERRKHQKSAPTNFTLMVSENSSPIKRGLNEREGTRVSSIEQPLTTSEAIDG 120

Query: 642  YVTEDRGFDAKEKRNKGFELYSIDDSCSSSLLNVDSGSSPLKSKVDETDECSSSGALVVD 701
            YVTEDRGFDAKEKRNKGF+LYSIDDSCSSSLLNVDSGSSPLKSKVDETDECSSSGALVVD
Sbjct: 121  YVTEDRGFDAKEKRNKGFDLYSIDDSCSSSLLNVDSGSSPLKSKVDETDECSSSGALVVD 180

Query: 702  VMGNDKSLSVKGLCVSILRSHGLLSNYESNRSGLSEEDAPSCSGSSCSRTCNVCGVVGTA 761
            VMGNDKSLSVKGLCVSILRSHGLLSNYESNRSGLSEEDAPSCSGSSCSRTCNVCGVVGTA
Sbjct: 181  VMGNDKSLSVKGLCVSILRSHGLLSNYESNRSGLSEEDAPSCSGSSCSRTCNVCGVVGTA 240

Query: 762  IDLLICDECEKAFHISCFNPCIKKIPDDDWYCQPCSKKKHNLLKERAIRRSSIIRTGKGS 821
            IDLLICDECEKAFHISCFNPCIKKIPDDDWYCQPCSKKKHNLLKERAIRRSSIIRTGKGS
Sbjct: 241  IDLLICDECEKAFHISCFNPCIKKIPDDDWYCQPCSKKKHNLLKERAIRRSSIIRTGKGS 300

Query: 822  VSSEGDFSPIALMLNDSQPYITGVRVGKGFQAFVPEWCGPVARRGTVFPEPMELDPLQPV 881
            VSSEGDFSPIALMLNDSQPYITGVRVGKGFQAFVPEWCGPVARRGTVFPEPMELDPLQPV
Sbjct: 301  VSSEGDFSPIALMLNDSQPYITGVRVGKGFQAFVPEWCGPVARRGTVFPEPMELDPLQPV 360

Query: 882  HLLGPTSGKHRKAFIGNWLQCRDVIVGMGDSIDGTICGKWRRAPLFEIQTDKWDCFRSVL 941
            HLLGPTSGKHRKAFIGNWLQCRDVIVGMGDSIDGTICGKWRRAPLFEIQTDKWDCFRSVL
Sbjct: 361  HLLGPTSGKHRKAFIGNWLQCRDVIVGMGDSIDGTICGKWRRAPLFEIQTDKWDCFRSVL 420

Query: 942  WDPSHADCSVPQELETEEVLKQLKYIEMLRPRLAAKRRKFDVCTANGSEAVTEDVSKCTG 1001
            WDPSHADCSVPQELETEEVLKQLKYIEMLRPRLAAKRRKFDVCTANGSEAVTEDVSKCTG
Sbjct: 421  WDPSHADCSVPQELETEEVLKQLKYIEMLRPRLAAKRRKFDVCTANGSEAVTEDVSKCTG 480

Query: 1002 SVDAERNL 1010
            SVDAERNL
Sbjct: 481  SVDAERNL 488

BLAST of Spo00502.1 vs. UniProtKB/TrEMBL
Match: A0A0J8B4E6_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_002720 PE=3 SV=1)

HSP 1 Score: 909.8 bits (2350), Expect = 2.900e-261
Identity = 433/487 (88.91%), Postives = 461/487 (94.66%), Query Frame = 1

		  

Query: 16  PRSSLTPLEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQGGEEKNRYCTDHTELF 75
           P SSL+P EVPMQLH INR+KLVK LQQHL DSGRPIQG+VLLQGGEEK RYCTDH+ELF
Sbjct: 4   PCSSLSPPEVPMQLHVINREKLVKSLQQHLADSGRPIQGLVLLQGGEEKTRYCTDHSELF 63

Query: 76  RQESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQIKPLSHYTERYMVNM 135
           RQESYFAYLFGVREP FYGA+D+ATGKSILFAP LP EYAVWMG IKP S++ E YMVN+
Sbjct: 64  RQESYFAYLFGVREPGFYGAIDVATGKSILFAPRLPKEYAVWMGDIKPQSYFMECYMVNL 123

Query: 136 VCYVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGIENFKTDLSTLHPILT 195
           V YVDEMEEVL++ Y GEDK +L+LLHGLNTDSGNFSKPAEFKGIENF+TDLSTLHPILT
Sbjct: 124 VSYVDEMEEVLVSQYDGEDKCVLYLLHGLNTDSGNFSKPAEFKGIENFETDLSTLHPILT 183

Query: 196 ECRVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYMYGGCRHCSY 255
           ECRV+KSKLELDVIQYA DISSEAHVEVMRNTKVGMKEYQ+ES+FLHHTYMYGGCRHCSY
Sbjct: 184 ECRVLKSKLELDVIQYANDISSEAHVEVMRNTKVGMKEYQMESLFLHHTYMYGGCRHCSY 243

Query: 256 TCICATGDNSSVLHYGHAAAPNERTFMDGNMALLDMGAEYHFYGSDITCSFPVNGKFTVD 315
           TCICATG+NSSVLHYGHAAAPNERTFMDG+MALLDMGAEYHFYGSDITCSFPVNGKFT D
Sbjct: 244 TCICATGENSSVLHYGHAAAPNERTFMDGDMALLDMGAEYHFYGSDITCSFPVNGKFTAD 303

Query: 316 QRLIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIEDMMTQRLGA 375
           QRLIYNAVL AHNSVIAAM+PGVSW+NMHKLAEK+IL+ LKKGNIITGDI+DMM+QRLGA
Sbjct: 304 QRLIYNAVLDAHNSVIAAMRPGVSWLNMHKLAEKIILDGLKKGNIITGDIDDMMSQRLGA 363

Query: 376 VFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVEPGCYFISAL 435
           VFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGL SLRTARELKEDMVITVEPGCYFISAL
Sbjct: 364 VFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLKSLRTARELKEDMVITVEPGCYFISAL 423

Query: 436 LIPAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMTNVPRETWEIEAVMAG 495
           LIPAME+S+TSKFFNPD IRRFMVFGGVRIESDV+VTS GCKNMTNVPRETWEIEAVMAG
Sbjct: 424 LIPAMENSDTSKFFNPDTIRRFMVFGGVRIESDVHVTSTGCKNMTNVPRETWEIEAVMAG 483

Query: 496 AKWPLEK 503
            KWP EK
Sbjct: 484 GKWPPEK 490

BLAST of Spo00502.1 vs. UniProtKB/TrEMBL
Match: A0A0G2T3M3_9ROSI (Metallopeptidase M24 family protein OS=Hypseocharis bilobata GN=m24 PE=2 SV=1)

HSP 1 Score: 822.8 bits (2124), Expect = 4.600e-235
Identity = 395/505 (78.22%), Postives = 438/505 (86.73%), Query Frame = 1

		  

Query: 1   MAEAKEAIPMATAEVP-RSSLTPLEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQ 60
           M EA+EA  M +A  P RSSLTP EVP +LH  NR+KL+K ++QHL+DS RP+ G VLLQ
Sbjct: 1   MGEAREATAMVSASSPSRSSLTPPEVPFELHVGNREKLLKSIRQHLSDSSRPLHGFVLLQ 60

Query: 61  GGEEKNRYCTDHTELFRQESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMG 120
           GGEEK RYCTDH ELFRQESYFAYLFGVREP FYGA+DIATG SILFAP LPA+YAVW+G
Sbjct: 61  GGEEKTRYCTDHIELFRQESYFAYLFGVREPGFYGAIDIATGNSILFAPRLPADYAVWLG 120

Query: 121 QIKPLSHYTERYMVNMVCYVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKG 180
           +IK LS++ ERYMV++V Y DE+ +VL N Y G  KPLLFLLHGLNTDS NFSKPA+FKG
Sbjct: 121 EIKSLSYFKERYMVSLVYYTDEIAQVLCNQYSGSGKPLLFLLHGLNTDSDNFSKPADFKG 180

Query: 181 IENFKTDLSTLHPILTECRVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESM 240
           +E F+TDL+ LHPILTECRVIKS LEL +IQ+A DISSEAHVEVMR T+  MKEYQLESM
Sbjct: 181 MEKFETDLTALHPILTECRVIKSDLELALIQFANDISSEAHVEVMRKTRANMKEYQLESM 240

Query: 241 FLHHTYMYGGCRHCSYTCICATGDNSSVLHYGHAAAPNERTFMDGNMALLDMGAEYHFYG 300
           FLHHTYMYGGCRHCSYTCICATG N +VLHYGHAAAPNERT  DG+MALLDMGAEYHFYG
Sbjct: 241 FLHHTYMYGGCRHCSYTCICATGANGAVLHYGHAAAPNERTLEDGDMALLDMGAEYHFYG 300

Query: 301 SDITCSFPVNGKFTVDQRLIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGN 360
           SDITCSFPVNGKFT DQ LIYNAVL AHN+VI++MKPGVSWI+MHKLAEK IL +LKKG 
Sbjct: 301 SDITCSFPVNGKFTSDQTLIYNAVLEAHNAVISSMKPGVSWIDMHKLAEKTILGSLKKGC 360

Query: 361 IITGDIEDMMTQRLGAVFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKE 420
           II GD++DMM +RLGA+FMPHGLGH LGIDTHDPGGY KG+ERPKEPGL SLRT+RELKE
Sbjct: 361 IIVGDVDDMMAERLGAIFMPHGLGHFLGIDTHDPGGYSKGMERPKEPGLRSLRTSRELKE 420

Query: 421 DMVITVEPGCYFISALLIPAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNM 480
            MVITVEPGCYFI ALLIPAMESS TSKFFN + I RF  FGGVRIESDV+VT+ GCKNM
Sbjct: 421 GMVITVEPGCYFIDALLIPAMESSNTSKFFNRETIGRFKNFGGVRIESDVHVTANGCKNM 480

Query: 481 TNVPRETWEIEAVMAGAKWPLEKES 505
           T  PRETWEIEAVMAGA WPL+K S
Sbjct: 481 TKCPRETWEIEAVMAGAPWPLDKTS 505

BLAST of Spo00502.1 vs. UniProtKB/TrEMBL
Match: F6HGR9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_11s0016g02540 PE=3 SV=1)

HSP 1 Score: 820.5 bits (2118), Expect = 2.300e-234
Identity = 393/487 (80.70%), Postives = 432/487 (88.71%), Query Frame = 1

		  

Query: 18  SSLTPLEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQGGEEKNRYCTDHTELFRQ 77
           SSLTP EVPM+LH INR KLVK L QHLT+S  P+ G VLLQGGEE+ R+ TDH ELFRQ
Sbjct: 4   SSLTPPEVPMELHAINRGKLVKSLLQHLTESTHPLHGFVLLQGGEEQTRHDTDHAELFRQ 63

Query: 78  ESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQIKPLSHYTERYMVNMVC 137
           ESYFAYLFGVREP FYGA+DIATGKSILFAP LPAEYAVW+G+IKPLS++ ERYMV+ VC
Sbjct: 64  ESYFAYLFGVREPGFYGAIDIATGKSILFAPRLPAEYAVWLGEIKPLSYFKERYMVSKVC 123

Query: 138 YVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGIENFKTDLSTLHPILTEC 197
           Y DE+  VL + YK + KPLLFLLHGLNTDS NFSKPAEF+GIE FKTDL+TLHPIL EC
Sbjct: 124 YTDEIAGVLHDEYKEQGKPLLFLLHGLNTDSNNFSKPAEFEGIEKFKTDLNTLHPILAEC 183

Query: 198 RVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYMYGGCRHCSYTC 257
           RV KS LEL +IQYA DISSEAHVEVMR T VGMKEYQLESMFLHHTYMYGGCRHCSYTC
Sbjct: 184 RVFKSDLELALIQYANDISSEAHVEVMRKTTVGMKEYQLESMFLHHTYMYGGCRHCSYTC 243

Query: 258 ICATGDNSSVLHYGHAAAPNERTFMDGNMALLDMGAEYHFYGSDITCSFPVNGKFTVDQR 317
           ICATG NS+VLHYGHAAAPN+RTF DG+MALLDMGAEYHFYGSDITCSFPVNGKFT DQR
Sbjct: 244 ICATGGNSAVLHYGHAAAPNDRTFEDGDMALLDMGAEYHFYGSDITCSFPVNGKFTSDQR 303

Query: 318 LIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIEDMMTQRLGAVF 377
           LIYNAVL AHN+VI+AMKPGV+WI+MHKLAEK+IL++LKKG I+ GD++DMM +RLGAVF
Sbjct: 304 LIYNAVLQAHNTVISAMKPGVNWIDMHKLAEKIILDSLKKGCIVVGDVDDMMVKRLGAVF 363

Query: 378 MPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVEPGCYFISALLI 437
           MPHGLGH LGIDTHD GGY +GLERPKEPGL SLRT R+L+E MVITVEPGCYFI ALL 
Sbjct: 364 MPHGLGHFLGIDTHDTGGYLEGLERPKEPGLKSLRTVRDLQEGMVITVEPGCYFIDALLA 423

Query: 438 PAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMTNVPRETWEIEAVMAGAK 497
           PAME+SETSKFFN + I RF  FGGVRIESDV+VTS GCKNMTNVPRETWEIEAVMAG+ 
Sbjct: 424 PAMENSETSKFFNHEIIGRFKSFGGVRIESDVHVTSNGCKNMTNVPRETWEIEAVMAGSP 483

Query: 498 WPLEKES 505
           WPL+K S
Sbjct: 484 WPLDKSS 490

BLAST of Spo00502.1 vs. ExPASy Swiss-Prot
Match: PEPD_HUMAN (Xaa-Pro dipeptidase OS=Homo sapiens GN=PEPD PE=1 SV=3)

HSP 1 Score: 520.4 bits (1339), Expect = 4.400e-146
Identity = 257/474 (54.22%), Postives = 330/474 (69.62%), Query Frame = 1

		  

Query: 23  LEVPMQLHEINRQKLVKYLQQH-LTDSGRPIQGIVLLQGGEEKNRYCTDHTELFRQESYF 82
           L+VP+ L  +NRQ+L + L+++    +G     IV+LQGGEE  RYCTD   LFRQES+F
Sbjct: 16  LKVPLALFALNRQRLCERLRKNPAVQAG----SIVVLQGGEETQRYCTDTGVLFRQESFF 75

Query: 83  AYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQIKPLSHYTERYMVNMVCYVDE 142
            + FGV EP  YG +D+ TGKS LF P LPA +A WMG+I    H+ E+Y V+ V YVDE
Sbjct: 76  HWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEKYAVDDVQYVDE 135

Query: 143 MEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGIENFKTDLSTLHPILTECRVIK 202
           +  VL +    +   +L  L G+NTDSG+  + A F GI  F+ + + LHP + ECRV K
Sbjct: 136 IASVLTS----QKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEIVECRVFK 195

Query: 203 SKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYMYGGCRHCSYTCICAT 262
           + +EL+V++Y   ISSEAH EVM+  KVGMKEY+LES+F H+ Y  GG RH SYTCIC +
Sbjct: 196 TDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSSYTCICGS 255

Query: 263 GDNSSVLHYGHAAAPNERTFMDGNMALLDMGAEYHFYGSDITCSFPVNGKFTVDQRLIYN 322
           G+NS+VLHYGHA APN+RT  +G+M L DMG EY+ + SDITCSFP NGKFT DQ+ +Y 
Sbjct: 256 GENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTADQKAVYE 315

Query: 323 AVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIEDMMTQRLGAVFMPHG 382
           AVL +  +V+ AMKPGV W +MH+LA+++ LE L    I++G ++ M+   LGAVFMPHG
Sbjct: 316 AVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLGAVFMPHG 375

Query: 383 LGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVEPGCYFISALLIPAME 442
           LGH LGID HD GGYP+G+ER  EPGL SLRTAR L+  MV+TVEPG YFI  LL  A+ 
Sbjct: 376 LGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDHLLDEALA 435

Query: 443 SSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMTNVPRETWEIEAVMAG 496
               + F N + ++RF  FGGVRIE DV VT  G + +T VPR   EIEA MAG
Sbjct: 436 DPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMAG 481

BLAST of Spo00502.1 vs. ExPASy Swiss-Prot
Match: PEPD_PONAB (Xaa-Pro dipeptidase OS=Pongo abelii GN=PEPD PE=2 SV=1)

HSP 1 Score: 514.6 bits (1324), Expect = 2.400e-144
Identity = 255/474 (53.80%), Postives = 329/474 (69.41%), Query Frame = 1

		  

Query: 23  LEVPMQLHEINRQKLVKYLQQH-LTDSGRPIQGIVLLQGGEEKNRYCTDHTELFRQESYF 82
           L+VP+ L  +NRQ+L + L+++    +G     IV+LQGGEE  RYCTD   LFRQES+F
Sbjct: 16  LKVPVALFALNRQRLCERLRKNPAVQAG----SIVVLQGGEETLRYCTDTEVLFRQESFF 75

Query: 83  AYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQIKPLSHYTERYMVNMVCYVDE 142
            + FGV EP  YG +D+ TGKS LF P LPA YA WMG+I    H+ E+Y ++ V Y DE
Sbjct: 76  HWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASYATWMGKIHSKEHFKEKYAMDDVQYTDE 135

Query: 143 MEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGIENFKTDLSTLHPILTECRVIK 202
           ++ VL +    +   +L  L G+NTDSG+  + A F GI  F+ + + LHP + ECRV K
Sbjct: 136 IDSVLTS----QKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEIVECRVFK 195

Query: 203 SKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYMYGGCRHCSYTCICAT 262
           + +EL+V++Y   ISSEAH EVM+  KVGMKEY+LES+F H+ Y  GG RH SYTCIC +
Sbjct: 196 TDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSSYTCICGS 255

Query: 263 GDNSSVLHYGHAAAPNERTFMDGNMALLDMGAEYHFYGSDITCSFPVNGKFTVDQRLIYN 322
           G+NS+VLHYGHA APN+RT  +G+M L DMG EY+ + SDITCSFP NGKFT DQ+ +Y 
Sbjct: 256 GENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTADQKAVYE 315

Query: 323 AVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIEDMMTQRLGAVFMPHG 382
           AVL +  +V+ AMKPGV W +M +LA+++ LE L    I++G ++ M+   LGAV MPHG
Sbjct: 316 AVLRSSRAVMGAMKPGVWWPDMRRLADRIHLEELAHTGILSGSVDAMVQAHLGAVSMPHG 375

Query: 383 LGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVEPGCYFISALLIPAME 442
           LGH LGID HD GGYP+G+ER  EPGL SLRTAR L+  MV+TVEPG YFI  LL  A+ 
Sbjct: 376 LGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDHLLDEALA 435

Query: 443 SSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMTNVPRETWEIEAVMAG 496
               + FFN + ++RF  FGGVRIE DV VT  G + +T VPR   EIEA MAG
Sbjct: 436 DPAHACFFNREVLQRFRGFGGVRIEEDVVVTDSGMELLTCVPRTVEEIEACMAG 481

BLAST of Spo00502.1 vs. ExPASy Swiss-Prot
Match: PEPD_RAT (Xaa-Pro dipeptidase OS=Rattus norvegicus GN=Pepd PE=2 SV=1)

HSP 1 Score: 513.1 bits (1320), Expect = 7.100e-144
Identity = 257/489 (52.56%), Postives = 336/489 (68.71%), Query Frame = 1

		  

Query: 10  MATAEVPRSSL--TPLEVPMQLHEINRQKLVKYLQQH-LTDSGRPIQGIVLLQGGEEKNR 69
           MA+   P  SL    L+VP+ L  +NRQ+L + L+++    +G      V+LQGGEE  R
Sbjct: 1   MASTVRPSFSLGNETLKVPLALFALNRQRLCERLRKNGAVQAG----SAVVLQGGEEMQR 60

Query: 70  YCTDHTELFRQESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQIKPLSH 129
           YCTD + +FRQES+F + FGV E   YG +D+ TGKS LF P LPA YA WMG+I    H
Sbjct: 61  YCTDTSIIFRQESFFHWAFGVIESGCYGVIDVDTGKSTLFVPRLPASYATWMGKIHSKEH 120

Query: 130 YTERYMVNMVCYVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGIENFKTD 189
           + E+Y V+ V Y DE+  VL +     +  +L  L G+NTDSGN  + A F+GI  F  +
Sbjct: 121 FKEKYAVDDVQYADEIASVLTS----RNPSVLLTLRGVNTDSGNVCREASFEGISKFTVN 180

Query: 190 LSTLHPILTECRVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYM 249
            + LHP + ECRV K+ +EL+V++Y   ISSEAH EVM+  KVGMKEY++ES+F H+ Y 
Sbjct: 181 NTILHPEIVECRVFKTDMELEVLRYTNRISSEAHREVMKAVKVGMKEYEMESLFQHYCYS 240

Query: 250 YGGCRHCSYTCICATGDNSSVLHYGHAAAPNERTFMDGNMALLDMGAEYHFYGSDITCSF 309
            GG RH SYTCIC +G+N++VLHYGHA APN+RT  DG++ L DMG EY+ + SDITCSF
Sbjct: 241 KGGMRHTSYTCICCSGENAAVLHYGHAGAPNDRTIKDGDICLFDMGGEYYCFASDITCSF 300

Query: 310 PVNGKFTVDQRLIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIE 369
           P NGKFT DQ+ IY AVL +  +V++ MKPGV W +MH+LA+++ LE L +  +++G ++
Sbjct: 301 PANGKFTDDQKAIYEAVLRSCRTVMSTMKPGVWWPDMHRLADRIHLEELTRIGLLSGSVD 360

Query: 370 DMMTQRLGAVFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVE 429
            M+   LGAVFMPHGLGH LG+D HD GGYP+G+ER  EPGL SLRTAR L+  MV+TVE
Sbjct: 361 AMLQVHLGAVFMPHGLGHFLGLDVHDVGGYPEGVERIDEPGLRSLRTARHLEPGMVLTVE 420

Query: 430 PGCYFISALLIPAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMTNVPRET 489
           PG YFI  LL  A+     + FFN + ++RF  FGGVRIE DV VT  G + +T VPR  
Sbjct: 421 PGIYFIDHLLDQALADPAQACFFNQEVLQRFRNFGGVRIEEDVVVTDSGMELLTCVPRTV 480

Query: 490 WEIEAVMAG 496
            EIEA MAG
Sbjct: 481 EEIEACMAG 481

BLAST of Spo00502.1 vs. ExPASy Swiss-Prot
Match: PEPD_MOUSE (Xaa-Pro dipeptidase OS=Mus musculus GN=Pepd PE=1 SV=3)

HSP 1 Score: 507.3 bits (1305), Expect = 3.900e-142
Identity = 253/488 (51.84%), Postives = 333/488 (68.24%), Query Frame = 1

		  

Query: 10  MATAEVPRSSL--TPLEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQGGEEKNRY 69
           MA+   P  SL    L+VP+ L  +NRQ+L + L+++           V+LQGGEE  RY
Sbjct: 1   MASTVRPSFSLGNETLKVPLALFALNRQRLCERLRKN---GAVQAASAVVLQGGEEMQRY 60

Query: 70  CTDHTELFRQESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQIKPLSHY 129
           CTD + +FRQES+F + FGV E   YG +D+ TGKS LF P LP  YA WMG+I    ++
Sbjct: 61  CTDTSIIFRQESFFHWAFGVVESGCYGVIDVDTGKSTLFVPRLPDSYATWMGKIHSKEYF 120

Query: 130 TERYMVNMVCYVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGIENFKTDL 189
            E+Y V+ V Y DE+  VL +     +  +L  L G+NTDSG+  + A F+GI  F  + 
Sbjct: 121 KEKYAVDDVQYTDEIASVLTS----RNPSVLLTLRGVNTDSGSVCREASFEGISKFNVNN 180

Query: 190 STLHPILTECRVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYMY 249
           + LHP + ECRV K+ +EL+V++Y   ISSEAH EVM+  KVGMKEY++ES+F H+ Y  
Sbjct: 181 TILHPEIVECRVFKTDMELEVLRYTNRISSEAHREVMKAVKVGMKEYEMESLFQHYCYSR 240

Query: 250 GGCRHCSYTCICATGDNSSVLHYGHAAAPNERTFMDGNMALLDMGAEYHFYGSDITCSFP 309
           GG RH SYTCIC +G+N++VLHYGHA APN+RT  DG++ L DMG EY+ + SDITCSFP
Sbjct: 241 GGMRHTSYTCICCSGENAAVLHYGHAGAPNDRTIKDGDICLFDMGGEYYCFASDITCSFP 300

Query: 310 VNGKFTVDQRLIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIED 369
            NGKFT DQ+ IY AVL +  +V++ MKPGV W +MH+LA+++ LE L +  +++G ++ 
Sbjct: 301 ANGKFTEDQKAIYEAVLRSCRTVMSTMKPGVWWPDMHRLADRIHLEELARIGLLSGSVDA 360

Query: 370 MMTQRLGAVFMPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVEP 429
           M+   LGAVFMPHGLGH LG+D HD GGYP+G+ER  EPGL SLRTAR L+  MV+TVEP
Sbjct: 361 MLQVHLGAVFMPHGLGHFLGLDVHDVGGYPEGVERIDEPGLRSLRTARHLEPGMVLTVEP 420

Query: 430 GCYFISALLIPAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMTNVPRETW 489
           G YFI  LL  A+     + FFN + ++RF  FGGVRIE DV VT  G + +T VPR   
Sbjct: 421 GIYFIDHLLDQALADPAQACFFNQEVLQRFRNFGGVRIEEDVVVTDSGMELLTCVPRTVE 480

Query: 490 EIEAVMAG 496
           EIEA MAG
Sbjct: 481 EIEACMAG 481

BLAST of Spo00502.1 vs. ExPASy Swiss-Prot
Match: PEPD_DICDI (Xaa-Pro dipeptidase OS=Dictyostelium discoideum GN=pepd PE=1 SV=1)

HSP 1 Score: 456.1 bits (1172), Expect = 1.000e-126
Identity = 232/474 (48.95%), Postives = 305/474 (64.35%), Query Frame = 1

		  

Query: 23  LEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQGGEEKNRYCTDHTELFRQESYFA 82
           L+VP+ LH+ NRQ+LV  +     D  +     +LL+ G+   +Y TDH  LF+QE YF 
Sbjct: 35  LKVPLVLHKENRQRLVSQILSKHKDQVKE-NSFILLESGKSTMQYDTDHEPLFKQERYFF 94

Query: 83  YLFGVREPSFYGAVDI-ATGKSILFAPHLPAEYAVWMGQIKPLSHYTERYMVNMVCYVDE 142
           + FG   P  +G V +     SIL  P LPAEYA WMG+I+   +Y   ++V+ V YVDE
Sbjct: 95  WTFGSDIPDCFGIVGLDEQATSILCIPKLPAEYATWMGEIRSKEYYKSIFLVDQVLYVDE 154

Query: 143 MEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGI-ENFKTDLSTLHPILTECRVI 202
           M + L    K ++   ++ + G NTDSG+     ++ G+ E F  + + L P + ECRVI
Sbjct: 155 MMDYL----KSKNASTIYTILGTNTDSGSTFVEPQYPGLRETFNVNNTLLFPEIAECRVI 214

Query: 203 KSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYMYGGCRHCSYTCICA 262
           KS  E++VI+Y  D S  AH  VMR  KVG+KEYQ ES FLHH Y   GCR+  YTCICA
Sbjct: 215 KSPKEVEVIRYCVDASVSAHKHVMRKVKVGLKEYQCESEFLHHVYNEWGCRNVGYTCICA 274

Query: 263 TGDNSSVLHYGHAAAPNERTFMDGNMALLDMGAEYHFYGSDITCSFPVNGKFTVDQRLIY 322
              NS+VLHYGHA  PN  T  +    L DMGAEYH Y +DITCSFP  GKF+ +QR++Y
Sbjct: 275 ANKNSAVLHYGHAGEPNSATISENGFCLFDMGAEYHSYTADITCSFPATGKFSPEQRVVY 334

Query: 323 NAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIEDMMTQRLGAVFMPH 382
            AVL A  +V+ AM+PGVSW++MHKLAE+ IL AL K  I+ GD++D++  ++G+VF PH
Sbjct: 335 QAVLDASVAVMEAMRPGVSWVDMHKLAERCILAALLKAGILVGDLQDLIANKIGSVFFPH 394

Query: 383 GLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVEPGCYFISALLIPAM 442
           GLGH LG+DTHD GGY        +P + SLRT R LK  MVIT EPGCYFI+ LL  A+
Sbjct: 395 GLGHFLGLDTHDVGGYLGDC----QPKVHSLRTTRTLKAGMVITSEPGCYFINHLLTQAL 454

Query: 443 ESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMT-NVPRETWEIEAVM 494
            + ET+KFFN   + ++   GGVRIE D+ VT  GC N++ N+PR   EIEA M
Sbjct: 455 SNPETAKFFNLTELDKYRNIGGVRIEDDILVTETGCDNLSKNLPRTIDEIEAFM 499

BLAST of Spo00502.1 vs. TAIR (Arabidopsis)
Match: AT4G29490.1 (Metallopeptidase M24 family protein)

HSP 1 Score: 791.2 bits (2042), Expect = 7.600e-229
Identity = 369/485 (76.08%), Postives = 423/485 (87.22%), Query Frame = 1

		  

Query: 18  SSLTPLEVPMQLHEINRQKLVKYLQQHLTDSGRPIQGIVLLQGGEEKNRYCTDHTELFRQ 77
           SSL+P  +PM+LH  NR+KL++ +++ L+ S R + G VLLQGGEEKNRYCTDHTELFRQ
Sbjct: 2   SSLSPPPIPMELHAGNRKKLLESIRRQLSSSNRSLDGFVLLQGGEEKNRYCTDHTELFRQ 61

Query: 78  ESYFAYLFGVREPSFYGAVDIATGKSILFAPHLPAEYAVWMGQIKPLSHYTERYMVNMVC 137
           ESYFAYLFGVREP FYGA+DI +GKSILF P LP +YAVW+G+IKPLSH+ E YMV+MV 
Sbjct: 62  ESYFAYLFGVREPDFYGAIDIGSGKSILFIPRLPDDYAVWLGEIKPLSHFKETYMVDMVF 121

Query: 138 YVDEMEEVLLNLYKGEDKPLLFLLHGLNTDSGNFSKPAEFKGIENFKTDLSTLHPILTEC 197
           YVDE+ +V    +KG  KPLL+LLHGLNTDS NFSKPA F+GI+ F+TDL+TLHPIL EC
Sbjct: 122 YVDEIIQVFNEQFKGSGKPLLYLLHGLNTDSSNFSKPASFEGIDKFETDLTTLHPILAEC 181

Query: 198 RVIKSKLELDVIQYATDISSEAHVEVMRNTKVGMKEYQLESMFLHHTYMYGGCRHCSYTC 257
           RVIKS LEL +IQ+A DISSEAH+EVMR    GMKEYQ+ESMFLHH+YMYGGCRHCSYTC
Sbjct: 182 RVIKSSLELQLIQFANDISSEAHIEVMRKVTPGMKEYQMESMFLHHSYMYGGCRHCSYTC 241

Query: 258 ICATGDNSSVLHYGHAAAPNERTFMDGNMALLDMGAEYHFYGSDITCSFPVNGKFTVDQR 317
           ICATGDNS+VLHYGHAAAPN+RTF DG++ALLDMGAEYHFYGSDITCSFPVNGKFT DQ 
Sbjct: 242 ICATGDNSAVLHYGHAAAPNDRTFEDGDLALLDMGAEYHFYGSDITCSFPVNGKFTSDQS 301

Query: 318 LIYNAVLGAHNSVIAAMKPGVSWINMHKLAEKVILEALKKGNIITGDIEDMMTQRLGAVF 377
           LIYNAVL AHNSVI+AMKPGV+W++MHKLAEK+ILE+LKKG+I+TGD++DMM QRLGAVF
Sbjct: 302 LIYNAVLDAHNSVISAMKPGVNWVDMHKLAEKIILESLKKGSILTGDVDDMMVQRLGAVF 361

Query: 378 MPHGLGHLLGIDTHDPGGYPKGLERPKEPGLSSLRTARELKEDMVITVEPGCYFISALLI 437
           MPHGLGH +GIDTHD GGYPKG+ERPK+PGL SLRTAR+L E MVITVEPGCYFI ALL 
Sbjct: 362 MPHGLGHFMGIDTHDTGGYPKGVERPKKPGLKSLRTARDLLEGMVITVEPGCYFIKALLF 421

Query: 438 PAMESSETSKFFNPDAIRRFMVFGGVRIESDVYVTSIGCKNMTNVPRETWEIEAVMAGAK 497
           PAM ++ TSKFFN + I RF  FGGVRIESD+ VT+ GCKNMTNVPRETWEIEAVMAG  
Sbjct: 422 PAMANATTSKFFNRETIERFRNFGGVRIESDLVVTANGCKNMTNVPRETWEIEAVMAGGP 481

Query: 498 WPLEK 503
           WP  K
Sbjct: 482 WPPTK 486

BLAST of Spo00502.1 vs. TAIR (Arabidopsis)
Match: AT2G19260.1 (RING/FYVE/PHD zinc finger superfamily protein)

HSP 1 Score: 198.7 bits (504), Expect = 1.700e-50
Identity = 115/291 (39.52%), Postives = 152/291 (52.23%), Query Frame = 1

		  

Query: 692 CSSSGALVVDVMGNDKSLSVKGLC-VSILRSHGLLSNYESNRSGLSEEDAPSCSGSSCSR 751
           CSS G        ND   S+K    V+   S     +  S+ SG+SE D      SS  R
Sbjct: 360 CSSDGT-------NDSCSSLKSSSEVNSTSSKSREDDCYSSDSGVSETDTDG--SSSPFR 419

Query: 752 TCNVCGVVGTAIDLLICDECEKAFHISCFNPCIKKIPD-DDWYCQPCSKKKHNLLKERAI 811
            C  C   GT   +LICDECE+A+H  C    +K + + D+W C  C K + +  K    
Sbjct: 420 QCKHCDKPGTVEKMLICDECEEAYHTRCCGVQMKDVAEIDEWLCPSCLKNQSSKTKT--- 479

Query: 812 RRSSIIRTGKGSVSSEGDFSPIALMLNDSQPYITGVRVGKGFQAFVPEWCGPVARRGTVF 871
                    KG +S E  +           P++ G+R+GK FQA VP+W GP     +  
Sbjct: 480 ---------KGRISHERKWRVTV-------PFVIGIRIGKMFQADVPDWSGPTMSDTSFV 539

Query: 872 PEPMELDPLQPVHLLGPTSGKHRKAFIGNWLQCRDVIVGMGDSIDGTICGKWRRAPLFEI 931
            EP+E+   + +H L       ++    NWLQCR+      +  +G ICGKWRRAP  E+
Sbjct: 540 GEPLEIGQSEYMHDLKKAKNSKKQCSAVNWLQCRE------EDTNGVICGKWRRAPRSEV 599

Query: 932 QTDKWDCFRSVLWDPSHADCSVPQELETEEVLKQLKYIEMLRPRLAAKRRK 981
           QT  W+CF    WDPS ADC+VPQELET E+LKQLKYI+MLRPR  AK+RK
Sbjct: 600 QTKDWECFCCFSWDPSRADCAVPQELETSEILKQLKYIKMLRPRSDAKKRK 616

BLAST of Spo00502.1 vs. TAIR (Arabidopsis)
Match: AT5G24330.1 (ARABIDOPSIS TRITHORAX-RELATED PROTEIN 6)

HSP 1 Score: 53.9 bits (128), Expect = 6.600e-7
Identity = 24/72 (33.33%), Postives = 35/72 (48.61%), Query Frame = 1

		  

Query: 744 SGSSCSRTCNVCGVVGTAIDLLICDECEKAFHISCFNPCIKKIPDDDWYCQPCSKKKHNL 803
           S S     C  C        LL+CD+C+K FH+ C  P +  +P   W+C  CS  KH +
Sbjct: 27  SDSDWDTVCEECSSGKQPAKLLLCDKCDKGFHLFCLRPILVSVPKGSWFCPSCS--KHQI 86

Query: 804 LKERAIRRSSII 816
            K   + ++ II
Sbjct: 87  PKSFPLIQTKII 96

BLAST of Spo00502.1 vs. TAIR (Arabidopsis)
Match: AT4G14700.1 (origin recognition complex 1)

HSP 1 Score: 52.4 bits (124), Expect = 1.900e-6
Identity = 19/49 (38.78%), Postives = 29/49 (59.18%), Query Frame = 1

		  

Query: 752 CNVCGVVGTAIDLLICDECEKAFHISCFNPCIKKIPDDDWYCQPCSKKK 801
           C +C    T   ++ CD+C   FH++C  P +K++P+ DW CQ C  KK
Sbjct: 166 CQICFKSHTNTIMIECDDCLGGFHLNCLKPPLKEVPEGDWICQFCEVKK 214

BLAST of Spo00502.1 vs. TAIR (Arabidopsis)
Match: AT5G44800.1 (chromatin remodeling 4)

HSP 1 Score: 52.4 bits (124), Expect = 1.900e-6
Identity = 21/52 (40.38%), Postives = 29/52 (55.77%), Query Frame = 1

		  

Query: 752 CNVCGVVGTAIDLLICDECEKAFHISCFNPCIKKIPDDDWYCQPCSKKKHNL 804
           C +C + G   DLL CD C + +H +C NP +K+IP+  W C  CS     L
Sbjct: 78  CVICDLGG---DLLCCDSCPRTYHTACLNPPLKRIPNGKWICPKCSPNSEAL 126

The following BLAST results are available for this feature:
BLAST of Spo00502.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match NameE-valueIdentityDescription
gi|902231892|gb|KNA22433.1|2.0e-29299.8hypothetical protein SOVF_0338... [more]
gi|902231891|gb|KNA22432.1|4.0e-28899.8hypothetical protein SOVF_0337... [more]
gi|731373074|ref|XP_010666538.1|2.0e-26387.8PREDICTED: xaa-Pro dipeptidase... [more]
gi|870842688|gb|KMS96044.1|4.1e-26188.9hypothetical protein BVRB_0027... [more]
gi|817499302|gb|AKF43198.1|6.7e-23578.2metallopeptidase M24 family pr... [more]
back to top
BLAST of Spo00502.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniprotKB/TrEMBL)
Total hits: 5
Match NameE-valueIdentityDescription
A0A0K9RSJ3_SPIOL1.4e-29299.8Uncharacterized protein OS=Spi... [more]
A0A0K9RUB9_SPIOL2.8e-28899.8Uncharacterized protein OS=Spi... [more]
A0A0J8B4E6_BETVU2.9e-26188.9Uncharacterized protein OS=Bet... [more]
A0A0G2T3M3_9ROSI4.6e-23578.2Metallopeptidase M24 family pr... [more]
F6HGR9_VITVI2.3e-23480.7Putative uncharacterized prote... [more]
back to top
BLAST of Spo00502.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy SwissProt)
Total hits: 5
Match NameE-valueIdentityDescription
PEPD_HUMAN4.4e-14654.2Xaa-Pro dipeptidase OS=Homo sa... [more]
PEPD_PONAB2.4e-14453.8Xaa-Pro dipeptidase OS=Pongo a... [more]
PEPD_RAT7.1e-14452.5Xaa-Pro dipeptidase OS=Rattus ... [more]
PEPD_MOUSE3.9e-14251.8Xaa-Pro dipeptidase OS=Mus mus... [more]
PEPD_DICDI1.0e-12648.9Xaa-Pro dipeptidase OS=Dictyos... [more]
back to top
BLAST of Spo00502.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 5
Match NameE-valueIdentityDescription
AT4G29490.17.6e-22976.0Metallopeptidase M24 family pr... [more]
AT2G19260.11.7e-5039.5RING/FYVE/PHD zinc finger supe... [more]
AT5G24330.16.6e-733.3ARABIDOPSIS TRITHORAX-RELATED ... [more]
AT4G14700.11.9e-638.7origin recognition complex 1[more]
AT5G44800.11.9e-640.3chromatin remodeling 4[more]
back to top
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000994Peptidase M24, structural domainGENE3D3.90.230.10coord: 206..493
score: 3.6
IPR000994Peptidase M24, structural domainPFAMPF00557Peptidase_M24coord: 209..472
score: 4.6
IPR000994Peptidase M24, structural domainunknownSSF55920Creatinase/aminopeptidasecoord: 202..494
score: 2.75
IPR001965Zinc finger, PHD-typeSMARTSM00249PHD_3coord: 751..797
score: 2.
IPR007865Aminopeptidase P, N-terminalPFAMPF05195AMP_Ncoord: 31..155
score: 1.1
IPR007865Aminopeptidase P, N-terminalSMARTSM01011AMP_N_2coord: 25..169
score: 1.6
IPR011011Zinc finger, FYVE/PHD-typeunknownSSF57903FYVE/PHD zinc fingercoord: 749..802
score: 1.64
IPR011124Zinc finger, CW-typePROFILEPS51050ZF_CWcoord: 893..957
score: 11
IPR013083Zinc finger, RING/FYVE/PHD-typeGENE3D3.30.40.10coord: 750..802
score: 3.2
IPR019786Zinc finger, PHD-type, conserved sitePROSITEPS01359ZF_PHD_1coord: 752..796
scor
IPR019787Zinc finger, PHD-fingerPFAMPF00628PHDcoord: 752..798
score: 1.
IPR019787Zinc finger, PHD-fingerPROFILEPS50016ZF_PHD_2coord: 749..799
score:
NoneNo IPR availablePANTHERPTHR10804PROTEASE FAMILY M24 METHIONYL AMINOPEPTIDASE, AMINOPEPTIDASE Pcoord: 23..515
score: 7.4E
NoneNo IPR availablePANTHERPTHR10804:SF100XAA-PRO DIPEPTIDASEcoord: 23..515
score: 7.4E

GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009987 cellular process
biological_process GO:0006508 proteolysis
molecular_function GO:0004177 aminopeptidase activity
molecular_function GO:0030145 manganese ion binding
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0016805 dipeptidase activity