Spo14122.1 (mRNA)

Overview
Name: Spo14122.1
Type: mRNA
Organism: Spinacia oleracea (Spinach)
Description: Apocytochrome f (Precursor)
Location: Super_scaffold_93 : 298263 .. 337087 (-)
Sequence length: 4238
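The location field above follows a `scaffold : start .. end (strand)` pattern. The following is a minimal sketch of how such a string could be parsed and its genomic span computed. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers introduced for illustration, and the sketch assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed feature location."""
    scaffold: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'Super_scaffold_93 : 298263 .. 337087 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    scaffold, start, end, strand = match.groups()
    return Location(scaffold, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Super_scaffold_93 : 298263 .. 337087 (-)")
print(loc.scaffold, loc.strand, span_length(loc))
```

Under that assumption the feature spans 38825 bases on the minus strand of the scaffold. This is much larger than the listed sequence length of 4238, presumably because the listed length excludes introns while the genomic span includes them.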
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, exon, polypeptide, CDS
CCCCTCGTCCCCCTTCGAACGAATTAGGGTTTCTCTTTCTCTCTCTCTCCTCTTTCTCTCTCTTAAATTCTTTATATTTTTCCTCTTTTTTTCTTTTCTCGCCGAAGATTCCATCACATATCATCATCATCAACAACATCAAAAAGAAATTAGAAATCAAAATATTTAGAAAAACCCTAATAAATCTTGACGGACTAAATCTACAGAATTTGATCCATCTGAAATCGTTGACGCTTTTGATCATTGTTCTTCTCACAGATCAGATCTGCTTAAAAATTTGGCCTGATTATTCACGCTTACAAACCCTAATTTTCATTTTTAAATTTTGGGATAATTTTCCCTTTTTTAGTCGGATTTCTGGATTTCTTTGCTCGAATTTGATAGATCCGTGTGTTTACATGAAATTTGAATCTCGACGAAGAAGACGACGAAGAAGAATTGAAAATCAGAAACCCTAACAGTCCAGATCGAACCCAGAAACTTGAAGTCAGAACAATTTCCCCCTTAATTCAATTTTTTATTATTTTTCATTATTAATTTTCGGATTTCCGGCTAATTTTATTTTCATAAATTCACAGAACAGTAACAAAAATCAAGAACAAAAAATGATGAAAACGGTAGTTTATGAGGGTGATAATCTACTGGGTGAAGTAGAGATATATTTTGAGAACAACGCCAACAAGATTGAAATGATGAAGAAGGGGATGTTGATGAGGATAAGTCATTATTCAGAAGCAAGTGAGAGATGTCCACCTCTTGCTGTTCTTCATACTATTACTAAATCAACTGGTGGTGTTTCCTTCAAAATGATGGAGAAGTCGCTCTACTTTCAACAACACAATGATTCCCAGATTTTTGCTTTGCATTCTTCCTGTCTCAGAGGCAACAAGGTAAAGCCCCTTAATCCAACAACAAGAACTTCAAGAACAACAAGAATAACTTAACAGAACGATTACGATTGTAATACGAGTAAAACCCGAACACCACCCAAGCACGAAAGGTGTAATTCAGAACTTGAAAGGCATAATCGAACAGGAATGAACTGTGGTGATTTTATTTGTTAGTTCTGGATTAGTTGTCCAATTTGATATAATTGAGCTTAATTGATTATGACAGATATTATGGGTTAAAATTAGAATGTGGGTTAAATTGGTTGCCATTCAATGTGAACAAATGTAATAATTTGGATTACGGTGGATATTTTGGTTTGACCCAGGATCAATACATTTGGACAAGCGTTAGAATATTGGTTCATAATTTTTTTCCCAGAAAATAAATCCAAGATTGAAACTTTGAACTGGTTTGGAGTTTGGTTTAGTATTCGTGTTGACCGACAACTGTTGTTTAGTCATGGAGGAATTGAAGGGTAGTCAATGTATATGGGAAAATTTATATGCAAATAAAAAACCTAATTTGAGCTCAGTTCTTTTGGGATCATGGTTTAAAATTCTTGGTTTAGTGAACTGGACTGGAGTTATGTAGACATCCATGGTTTACATGTGTATTTTGATTACGTTTAGTATTAAATACTTGTCAAATTTTGCAGACGGCTGTGGTGTCCCTGGGTGAGCAGGAGATTCATCTGGTGGCAATGCGTTCAAGGAGAATGGATGGTGTAACAACCCCTTGCTTTTGGGGTTTCATTGTCATGCCAGGGTTATATGAATCTTGTCTTGGCATGTTAAATCTTAGATGTCTTGGTATTGTGTTTGATCTTGATGAGACGCTGATTGTTGCAAACACACTGCGATCTTTCGAGGATAGAATTGAGGCCTTGCAAAGAAAAATGACTGTAGAAGCTGACCCGCAACGTATGGCGGGTATGATGGCAGAAGTGAAACGATACCAGGAAGATAAGGCTATACTGAAGCAATATGCTGAAACTGACCAGGTGGTGGATAATGGGAAAGTCCATAAAATTCAAGCTGAAGTTATTCCAGCTCTATCTGACAACCACCAAACAGTTGTTCGACCGCTTATTCGGTTACAGGATAAAAATA
TTGTCCTTACTCGAATTAATCCTCAGGTATGTTGCCTGTTTTATCGTGTAGTTATGTAGAAATGGATGAACTATTGCTTGAAATTTTTTGCCCTTCTAATCATTTGTCATATCATCACTGTGTAAACAAGTATTGTCAATTAAATGTGCCTGCTTCACGAGTAAGTGTATTGACATTGACAATGGAACATGTGGTGGTATTTTTGTCTCAATGAGTTAAAATTGGTTTGCAAAAAATTAAATTTATACTCTGTACAAAGTTGGTTTTGTTTCGGCAAATAATTAACTGTTGTATATGTCTAGATACGCGATACAAGTGTTCTTGTAAGATTAAGACCTGCATGGGAAGATCTACGCAGCTATCTAACTGCCAGAGGCCGTAAACGCTTTGAGGTTTATGTTTGTACAATGGCTGAAAGAGATTACGCTTTAGAAATGTGGAGGCTTCTTGATCCTGACTCAAATTTGATTGGTGGGAGGGAACTTTTGGATCGTATTGTGTGTGTCAAATCTGGTAAGAGTCAAAAGAAAACGCTCTGTGTGTAAATGGCTCTAATAATTTGTAATTGTTGCTCCTCCGCTTTTGCTAAGAAGAGGTTGTACAGTTATTTTTCTTTTTCAACTGCAGTGTTCATTTTCTTCATTTCATGTGTTTGATAGGATCAAGGAAGTCGTTGTTTAATGTTTTCCAAGGTGGGATTTGTCACCCCAAAATGGCTTTAGTAATTGATGATCGTCTAAAGGTGTGGGATGAGAAAGATCAGCCACGGGTGCATGTTGTGCCTGCATTTGCTCCTTATTATGCTCCCCAAGCCGAGGTATGTATACCAGAGTTGTCATTGAGCCACTTCTAGCTAACAAACCTTTTTTGCTCATTTTTTCTGTTTGTGCCTCCTTATTTCAGGCAAATAATGCCATCCCAGTTCTCTGCGTGGCTAGGAATGTAGCTTGCAATGTCCGAGGTGGTTTTTTCAAGTAAGTTATTTCCATTTGTACAATATTAGAAATCACTGTAGCAAGGTTATGTTCTTGTGAGGATCTTACATAATTTTGATATTAATCAACCTTGTTACTCTATCAAGTAAAAGTAAATTCTACCTCAAATGGTCTTTAAAATTTTGGCCATTCCCTGGACTAGCTTAAGTAGGTATGTTATATAATATTTTCTAGCTTTTGCTGACATGTGGGTTTTTTCACACCAGAGAATTTGATGAGGGTCTCTTGCAACGAATGTCTGACGTTTATTTTGAAGATGATCCCAAAGATTTTCCTTCCCCCCCTGACGTGAGCAATTACTTGGTATCAGAGGTATTGGGGCTTCCTTTTTCATTAATTTTAATTCATCATGTCTTCTGGGAAGATGTCTGCAATTTTTATTGTTTATTTGATGCTTTGCTTCGTACCCGGGAAACAACCATATCTGTCATTTGTCAGTTTTTTTTCTGCCTGGCAAGTTGTTAACATCAGAACCTGCAGGATGACGGTTCCGGTTCCAATGCAAACAAAGAACCGATTTGTTTTGACGGGATGGCGGATGCCGAGGTTGAAAGAAGGCTTAAGGTAAATTTCATCCCTCTTTGGTTGACAACTATGGGAATTTGACAGCTTCCCTGTCTCCTTTTCCTCCCCCTTTCCGGGTTCGGTTAGGAAAAATTCAGGAATTCGTTCATCATAATTGTGAGTTTCTATGTCACTTTCTTATTGTGAGGAAAATAATTTTGGCTTTCCTCTTTCTCCAGGAAGCAGTTTTATCCTCTTCTTCAGCCCCTTCTCTTCCGTGTGTAACATCTTTGGCGACTGTGAATCTTGATCATAGGCTGGCATCTTCTCTCCCGTTCTCTGTTGCTGCTTCTTCCATGACAATTCCACAACCTGCACCTCAAGCATCAATTGCACCTTTCCATGCTAACCTATTTTCACAAGCAGGTCCTTTAGCGAGAACATTGGCTAGTATTGGTCCCAAGGACCTTGGCCTGCACAGTTCCCCTGCTCGAGAAGAAG
GTGAAGTACCTGAATCTGAGTTAGATCCTGATACAAGGAGACGGCTTCTTATATTGCAGCATGGCCAAGATATGAGAGAAGGCTTACCAAATGAGCCTCCGTTCCCGGGAAGACCTCCAGTTCAAGCTCCTGTTGCAGGTCCTGGTTCTGGTCCTGTTCCAGTCCCTGGTCCAGTGCCTGTTGCAGGTCCTGGCTCAGCTTCAATCTCAGTTCCGGGTCCTGGTCCTGTTCCTATGTCTGGTTCTGTTCCAGCTCCTGCTCCTGTTCCTGTTCCTGTTCCACGGGTACAATCACGCGGGAGTTGGTTTCAAGTCGAGGATCACATGAACCCAAGTCCTCTGGGCCGATCAGCCACTAAAGAATTTCCTATGTCTCCTGATGCTGTACATGTTGAGAAGCAGCGGCCACCTCCCCCTTTTCCTCGAAAAGTGGAGAATCCAGTTTGGTCTGATCGAAGTTTCCCTGAAAAACAAAGACTGCCGAGGGAGGTAACTTTCGTTTTGATTCATACCTCTGGCTTACTACTTAGTAACCTAAAATGTTGGTGTTTATTTGGTATATGCTGAGTTTTCTTCATCTGCAGGCTTCTCGCAGAGATGAGAGATTGAGGTCAAACTATTCAGTGCCTAGTCATCAATCATTTCGAGGTAATCATCGACTTGAAGTCACAGATCTTGCATTTTTGTGTGGTGTAATTGCATTGAGATTTGTAACTGGGGTGCTTGCTGTCGATGACTTTGTAAATATTTTGAGCAGTAATATGGATGCCTTTTCCTTTTGTTATCTGATCCTACCTGGTGTGGGATCAAATATGAATTATCTGTTCCCAAATTTGCATGTATTCCGTGGAATTCTATTTGCAGGTAACATAATGACCCAACCTTCTTTTCGAAGCACTGCCTTTGTGTGTTTTGGATGGCTGTTTTAAATTTGAATAGGACTTAGTTGTTGTTGAACCTTTTACGTTTAAAGTTTCTGATAGGACTTGATGACCTTTGACTATTCGTTAACTTCTTTCTTGTTCTTCCTTAAGCTCTTCCTTTGGAAGGGGTCTGATCCATCTTTTTTCTCCCAACAGCGTTTTTTTAAAAGGCAGATCAAATGTGATTGCTAATTTGGCTATAAGTAGGTCATAGTTTACCATCCGATATAATATGTTGGTTGTCATAGGTGCTGGTATAATTGGTGCGCAATTGCCGAGATCGTCTTGTTTTCGCCACTTTGCAAGGATTTCTGTTCACTATTTGGAGCCATCTCTTAAATGAACTGAATATTTGTATATACCAGTCAAATCTTACATCTAACGAATAATTTGGGTTCATTTCATAACCTTCGCTTTTAATTAATGCATTTTTGACTTTCTTTGTTGTGTCGTGTTTTATTTCTAGGTGGCAGCACAAAAAAGGTAAAATTGATGACTCGTCAGTCGTCATTGATGGAGTTCACTCCAGATATATGCTCTCCTGGGGTAGATGCCCAAGTCCGAGATCATATTTTGTGCCATAGGCCATTTTGTAACTCTTGTGCTTCCTCTAAGCATAAGTGTCATAACTATAGGCATTAGATGTTTTGAGTAGGATCTGAAGTAGGTTTTTTTTGTTGGTGTTTCTTTGTTTTTTAAACAGGGACCTTGTTTATAATACTCCTGAAATAATAAAATATATGAGAGAATGTTAACAGTTTAAAACAGCGTTTTGCTTACTTACAGAATCCTCTAATTTGACTTGAAAAGCATGACCTGTCCATTTTTTGCATTAAAGGGGACAGCGGTCAGTGGTCATATTTTTATCATAGGTGATTTTGTCACATGCTTGCCTCCAAATTCTACTCTTTGGCTAGGTAGTGTCCATATAGTTAAAATAACTTGCGTGATGTTTATCATTTCATTTTGGATCAAACATACGAGTTCCTAGCAGTAGAGTACCTTGTATCCCACTTAATTGCGTAATGCATACCTACATACAGCCAACTGTAGTTTTCACTTAATTCAACTGTAAACT
TTTGCAACAGTTGTATGTCCATGTGTTAGGGTATGTATGAATCTATTATTTACTTGGTTGGGTCATTAACGAGGAGACTGCCTATCATGGTATTCCTGACGATGTGTGTTGCATTTTCTGTTAGCCCGGTATCTATCTAAAAGGTCATGATTTTCATTGAGGGCTAAAGCTATTAACATGAAGAAAATTAACTCCTTAATGTTAGTCTATGACTTTGGCATGTGACGAAATCACTTGCGACTGTACTCTTGACAACAGAGCAGAAACTGTTGTTTAGTAGTGTGATGTAACCCATGCCAGGAAATTGTTGCCTGTTTCTAGGAATTTGCTATTAGGTGTTGGTCTCAGAGAGCATGACCATCATTTTACCTGGATAACTTATCCACCCTTCAAAAATGGTGGTTGGTCATGTTGATGAAAATGTTCAACCAAATGGCTTGGCATACCTATTTTTAGGGGGGAATAAATACTTACGATCTCTTCACGAGACAAAAGGGGAAAAGGTCTGTCTGATTTATTCATTTATTTTATTTTTACTTGTCAAGCTCCTTTGTGCTGCCTTCTTATAATTAAGCATGTGTTTTGAGTCTTTTGACTGGACAGCGGATAATATCCTGATACAGGGACTTGTTTGAACTTTGGGTCATAAATCATAATGTCATATAGGCTGGTTCAGGATTAGCTGCTTATAGGGCAAGAATGTTGAACCAGCAATTAGATATTACCGACCCAAGTTTGGTAATGATTTAAATCAAGTTCCTCTCTGCTTGCGCATATGAACTTTGTGTTCTATGCTATGTGAACTATTTGGATTCTTGATCCTGTTGTACTATTTTTGTATGGGTGCAAGCTGCATGTTTTCTAGCGATCTATGATTAGTTCATACTTGAAATAAGTTGGGCACCTTTGTTTTTGCCAATACTAAGTTTGATGACACACAATCATGTTCTTGTTTTGTATCTTTGCCGAGTCCGGATAATGCAGCTTTTCTTTTGCTTCAACTTGAGGACAAAGCTTGGGTTAAATATTGGAGAATGATAATAATAGCTCCATATGTGCTGCTTACTGTAGATGCTTATGTTTCGTTGTTCGTACTCTTGTGCCTTTTAACAAATACTCTTGTTCGTACTCTTGTTACTCTTGTATGAGAGGCGTATTATGACGGGTATTTTGTTTATAATCTTACTAGTGTATGTTGTATTTAACACCAGCTACTCTTGTATAATCCCCATCCCCTTCTCCTAGTGGCAGCTTGGGAAAGTTCTTGAATGTAATGAACTTTTGGTATAAATTTGTATATAATGAACCTTTTCAATATAGTTTCTAAATTCTTCGTTCTCTGTGGCCTATAGATTTTGTGTTATGTTGTTGAGGACTATTCCTCTTTGATTCATTGTCATTGCTGTTCTCTTTTGAAGGCGATGAAATTTCTTTGAGCCGATCAGTCTCAAGCAACAAGGGTTTCGAAGTTGAACCTGAAAAAGGCAGTTCATTGTCGGAGAATCCTTCAGTTGCTTTACATGACATTGCAATGAGGTGTGGAGCAAAGGTAAACACATAGGTTTTACTGGAAATTGTCATATCTATTTCACATGTTTCTACCTCAGTGACTTATTGCATCTATTTTGTAGGTTGAGTTTAAGCTAGGGTTGGTTGCTACCTCAGAGTTGAAGTTCTTTACGGAGGTTGGTCTACCTTAATATGCTGTATTTGATCATTTTGTTCTTTTTCTATTCCACCGGTTTTCCCTCCTAAGTCCTAACTAGTTTAAAAATTTGCATCATTCTGATGCTCCATTGTAAGCACGTGATAAAATAGCTTGGCTGCTATGATGTCTCCGCTATTATCCATACTAAAGTATTATGTTCAATCTATGATTCATTGTCTGGGCTGGTTAACCTGTGCTAGGTGCTGGTTCAGTAGGGTCCCTCTTCTTAACTATTAGTTGATTTGTTTTTAAAATCTGGTGTTTTTCTTTTATTGAACTGATTGTCTGTTCA
TTTTTTTCCGATGTATCTTGCTTCAACTTCTCTACTATATGAATTTGTAGGCTTATTTTGTTGGAGAGAAAATTGGTGAAGGAACTGGTACAACCAGAAGGGAAGCCCAGTATCGTGCTGCAGAGGCTGCTTTGATGAATCTGGCTGGTAAAATTTCCTGAAAAAAGTTCCCATCTGTTTTGTAGTATGAGTTATTTTCCTCTATGACTCTTCAGTGACTCCTGAGAATTCTGATGCTTTACATCTTTAACAAGCATAGTCTTATTGTTCCATAGCTTTAAATACAATATTTCGGTATGGAGTTGCTGTCCTGAGCATATTCTTATAGTTTCATAGCTTTCAATACAATATTTCCGGTATGGAGTTGCTGCCCTGAAACACTACTTCCTGTATTTCCAATATGGAGATGAATAGATGATGTCCCGTTCCTACGAAATAAACAGTATAAGGTCGTACATGGGACTTTCAAATATGTTTCTTTATCTAATGTCAAAGCCCATCTACCCGTCGCGCAATCAAGGGAACTCCAGCTAATTTATTAAAAATCTTCTTAAACTCCATTTTGAAAACTAATTACACTTCCACCCCTATCAGCGTAGTGTATAGCCTCATAGTTTTTTCACTTCTGTGAGGTGGTTTTAAGTTTTGGTGTTATTTTTCCTTCTTGCTTTGATTAACCATGTGTTGCAGGCTCATCCTTCCTGATTTCACATTGAATGATGTCCATTAGATCGTTATGGTTCCTTTGGATAACTAGTGGTTTTTTGACCAATATTGCACTTTCTTGTGATTACAAATTGTTCTCTGTGGTTCTCGTTTTTGTGTTCTTTTGTATTGTGTACCGTTGAAATGAAACAAATGTTTACATATCATAATCTCTCTCTATATAGTCCTGTTGATGGTTTTGGTCCCTTATGGATTTCATAACATTTTAACATTGGGGAAATAATTATATACTTCGTATCATAATCTAATTGAAACTTAAGAATGAGATTTTTGTAAATGTATTTAACAATTCGTACTATGTGTATTTAACAATTCGTACAATTGCATAGTACAGAGGAAGCCTGCCTCAATTTTAACTTCCGATCGTGGAATCAAGTGACTGATTTCACAACAGAGGATCCTGTACCCTTTACCTTCTGATCCTTCTTACCCTGTTGTGGAATTGAGTGAAGGGGTGGATTTTAAACCAAAGAATAGTAAATCTGTTTGCTTAACTGAACACGGAAAATGGAAAATTTCCCCTGTAAATACATGTAACAGTTTTGTTGTTGCATTTATGGTTTTCTGTGCCATGCGTTTTAATGATCATTCAATATGCTTCTGGCCTGCCGTCTGCTAATTTTGAGAATTCGATGTGATTGTTTTTTATTTTTCTTTTTGTATGATTCGTTGGTGACTGACATTTGAGGCTTGGTAATTGCAGATAGATATTTGACCCATATAAAGTCCGATGCTAGCACTCCACAAAGTGATACAAGTAGGGGTCCGAGTCCAAAGGACATGGGATTTGCAAGTGATGCAAATTCTCAAGGGGATTGCACTTCAAGAAAGGAAGAGACAACAACACCTTCATCGGAGCTTACCAGGCTGGATGATTCTATTCTAGAGGGCTCTAAGGACTCCATGGGCTCTGTTTCCGTTCTTAAAGAATTGGTAGGTGATGCTCATTTGTCTATTCCCATAACTTGTTTTTACTTTAAAAAAAATCTATTTTAATGGTTGAGTAATTGGTTGTGTCATTTCCAGTGCATGATAGAGGGCCTTGGTGTCGAATTTAAAGGTCAGTCTCCGACTTCAACTAATCCAGTCCACGGAGATGAAATACACGCAGAGGTACTGAATCTAGCATAGTGTAGGTTCCTTGTGATTCTGTTTTGTTTTACTTCAACCGTCCAACAATAATAGTCTCTTTAACTATTCTGATTGTGGAGTAGCATAAATGTTTTCACTTTTCCTTCGTTTTTTAGTTCTACGATATTGACTTTTTCTTCACT
ATTCACATTACCCAATTTGAGCTTTTTTTTTTACTAATATATAAAATCAAATGTTATCGTGTAAAATGTTGGATTTATCTCAATGTATATGTCAAAGTATCAAAATTTATAATTATTTACTAATAGATAATTGAAGATATTATTGGACAAAAAATGGCGTGTTGGCAAACGTAAAAAGTGAATTTGGTAGAACAAAAAGAAATGAGGAAGTAATTGTTTAATTTTTACCTTGCATAGAGTGGGTAAAGGCCAAAGGGGGATTGTGTGAATAGTCCAATAACTGAAGGGTCTTTTGTGAAATTATGACGTATTCCTAATCATGGGCGGTTTGGTAAACTGATACTAGTTTCTGTGCCTAATTAATGGTCATGTCTCTGTGGTGTAATTGTAATGTAGGTAGAAATAAATGGACAAGTTCTTGGCAAGGGCACAGGATTGACATGGGATGAGGCAAAGATGCAGGTATGAGCTTTTTCTGTTAAGTCCTTTGCAATCCTCCATCTCTCTAGAAGTCGAGAGACGAATATTTAGCACGTCCACACAAAGAGAGATAGAAAAAAATCGGGGGGAAGGGGGTGATTGTATGGAAGATTGTGTGGTAATTTTTTCTGCCTTTTGTCAGGCTGCTGAGCTTGCTCTTGCAAGTCTTAAATCCATGCTGGGTCAAATTACTAAGCGTCCAAGCTCTCCGCGGTTAGTACCTTCAAATGTTTCTATTAACTAGTGTAGCTCTCCGCGGTTAATATCTTCACATGATCCAGGTCTATATGTAGAATAGTTATGGAGGGATTGTTGGAGGATCGAGAAGCAGGGTAGTAAAACTTGAGTTCTTGATGTATGTGTTTTTTTTCCTCGTGATACTCTATGTATAGCGTAGCATTGAGAGAGGGAGTATTTTTATCTGCTTAAATTAAATTAGGAGCAAATGCAAAATCTTGTACCTAGCCAGCTGTGAGTCGCGCTGACTTAGGCTTGAGCGCAAATATTGAATGAGTTGCTATCAGAGTTTTCCACTGTTTGATCATGTGAATTGCAGGTTGTTGCAAGGGATGGCCAGTAAACGCCTTAAACCAGAATATGCTCGGGTTTTAGAGCATATGCCGTCTTCAAGATATCCAAGGAATGCATCACCTGTGCCTTGAAACCAAGAACATAGAATTGCACTCTTTTTTGATTATATGGTGTGGCCGATGTACAGCAGTTGCCACATCCGGCATCACCAACCAGGCCATGGTTTATTGCCATGAAAGCCCAATCACCCAACCGGCCTTTCCTAGGATACACAATTATCAAGACAAGCTCGAGTTGTTCCAATTGAGTTTGATGTCGAATGTATGCTTAGAATAAAAAACAGGCCATGTTTTATCCAGACGTTCAGACGTTATGTAATGGTCTTATGCTGATCCAATGAATAATCATACGGAGATGGTCTTTAAAGGTGCTGGTGAATCTCAAGGGCGTTGGTGTACTTTGACTTGTACAATGCCTGATGTACAGTTGAGACTCATTGCTGTAGAAGATTGATTTAACAAGCATCTCTCTTCTCTTGGCAATTTTGTTTGTGGTCGCTCAAAGCTTGCAGGATGGGAAGATCATTTTTTAGATACATCTGACAACGGCAACCTTGGCCTGGCTTAATTTTGTTAGAAGCCGCCGGTAACATTGACAATTTTCAATATCATGTACCATAATATGCGTTGTATGATATGATATAGCTAGAATTATAGCTAGTATTTGTTGCACAATACCAGGTTTCCCAAATGAAGCCAAAGAATACTAGTCATTGTTTTTGCCTGATGTGCAAATTTGAGTTGAATATATAAAGAATTTTTTCTCCGGCAATTTCCTGCTTCGTTGCTGGATGTTTTTTTTCTCACATTCACATCTATGATGTACAATTGTGTATAATTGTTGGCTATAAGATATACCGTGTAGATTGTAATAGGCGTAATGCCATAACTTAACTCTTTTGCTGTAATACCCTTCGCACCGAAATAAAT
AAAAAATATGTAAGGCATATACTGAAGGATATAAATCGAGTTCCTTTTAAAAAAAAATACGGATTTTTCATGAAATACTCTCGAGTTTTAGCATAATTCACCAAACGCCCATCAAGTTTCATAAATACATAAAATACCCCTCCTAAATGACTTAATACCCCAAATATCCCTCGATGACGTCATCCACCCTTAATCAACCCAATTAACATCTAATTACCTACAAAATTTTATTTGTGTGACATAAGGAGATGCAAAGTGCGAATTCATCCAAAAAAATGGAGATGAAAACCTAAATAATGCAAAGAAATCAATAATGGAAAAACATGGGTTCATTTTGAAGAAAGATAGTGGATCTGGAGGAGGAGGATGATTGAGTTTAGAGTTGTTGTTGGATGTCGGTTGATGAGATTTGACGCATCATTTGCTTTGAAGAATTGGAATTTGCATAGGAAAAAGATGGTGGAGGGAAGGAGAAAGATCGGGCATGGCGGCGGGCAGAGCGGCGATGGCCCAATGGAGCTTGAAGAAGGAGTTCTGGTTGGGGGTGATAATGGACAGAGTTTTTTTGGTGGTCGGATTATTTTTTTGGGTAAGCATGGAGGAGTGAGAAATCTTGAGCTTGGCTTTAATGGTGGTGGAGGTTGCTAGTGTTTGAAGGGAAGTGTTCATGGAGGAAATGTCAAGGAAGAATGAGGAGGGAGATGGGAAAGGGAAAACAAAATTAAAAAAGATAAAAAAATGGGTTAAAGGGTGGGTTTTAAAGATGGGTTAAAGGATTAATTAGATGTTAATTGGGTTGATTAAGGGTTGATGACTTCATCGAGGGATATTTAGGGTATAGAGCCATTTATGAGGGGTATTTTATGTATTTCTGAAACTTTATGGGCGTTTGGTGAATTGTGCTAAAACTTGGGGGTATTTCATGAAAAACCCAAAAAAAATAAGCAAATATTTTCTTATGGACCAGATTTACAACAATTTTTTGTATTTTTTTGGTGATCGAGAGGGATATATTATCTAAAATGAGAAACAGACTACCAGCTAACATAGCTCAATTGGTAAAGAGAATGGACTACAGAGGCTAAGTTTGAAAATGTATGTCGCGTTCAAGGTTTAAATCTTGTAAAAAATGTAACCTTATGTGCACTGACCAAAGGAGACACATGCTAGTTGCTAGTTGATAATTTACTACAACGGATATTTCATGAAATGCCCCCGAGTTTTGCTGTAATTCACCAAACGCCCATCACGTTTCAACAATTCATAAAATACCCATCACAATCCGTACAAATGCCCACCATACCCTTACTACTAACGTGTCGTTAGCCCGCCGTTAGCTAAGTTTCTAATTCACCAAATACCCCTATTACTAAATTTAATGCACCAAATGCCCCTAATCTCAAACTTATTTCACCAAATGCCCTAACATTAAATTTAGCTAGTTTTGGTGTTCCAACGGCTAGTTTTTGTATATGAAAAGTAGTCGTTGGTTCTGGTAGGCTATAAAAACCCCCTTGTGAATTGGCAATCAGAACGTATATGGGTAGAATTTATAACGGGATCTCGAAAAATAAGTAATTTCTGCTGGGCCTTTATACTTTTTTTAGGTTCATCAGGATTTTTATTGGTTGGAAATTCCAGCTATCTTGGTAAGAATTTTATATCTTTATTTCCCCCCCAGCAAATTCTTTTTTTTCCACAAGGGCTGGTGATGTCTTTCTATGGGATTGCAGGCCTCTTTATTAGCGCCTACTTGTGGTGTGCAATTTCATGGAATGTAGGTAGTGGTTATGATCGATTCGATAGAAAAGAAGGAGCAGTCTATATTTTTCGTTGGGGATTCCCTGGAAAAAATCGTCGCATCTTCTTCGATATCTTATAAAAGATATTCAGTCTATTAGAATAGAACTTAAAGAGGGTATTTATACTCGTCGTGTCCTTTATCTGGAAATCAGAGGTCAAGGAGCCATTCCTTTGACTCGTACTGATGAGAATTTGAC
TCCACGAGAAATGGAACAAAAAGCGGCTGAATTGGCCTATTTCTTGCGTGTACCAATTAAAGTATTTTGAAAAATGGGCTGAAATTATGAATGCTTTCTTAACTTGGCGTAAGATAAAAAATCTAGAATATTTTTTTCTACGGCATACCTTAATCTCATCAGAACGCCCACTCGGGTCAAAACAGACGTATACAGAAGAACCAAAAGGGAAAACATTTTATGCCTACAAATAATAGGTCTTTAAATGGAAATCCATTCAATCCCGTAACAAAAAAATAGTATTTTTTTTGTCCAGTAAGTATTTGTAATTGATATATAAAATACAAATAAATAATGTAAATTTTTTATGTTATCTCTTTTTCAGTAATACTTTCTTTTGTTCGCTTTAATGATCAAATAATTGGATTTATTATAATTTATAACGATCATTATCCAAACTCTTTTGTGTTTTGCCACCCATTTTTATCATCACATTCAGGCCTTCAATTCTTTTTTTGTGCTATGATTAACTTGTGCAATTTTTTAAAATATTTTCTATTTTGGTAGAAAGTGAGTTATTTCATCTTGAAATCGAAATAATATCACTTAATTGAAAAATGAAGTGGTTCTGCCGCGTATTCATTAACAATTTATAGATGAAAAAAAATTGAAAAAAAGAAAGTATTTATTCCTTTTCTATATCTTATATCTATAGTATTTTTACCCTGGTGGATCTATCTATCATTTCAAAAAAGTCTGGAATCTTGGGTTACTACTTGGTGGAATACTAAGCAATCTGAAACTTTTTTGAATGATATTCAAGAAAAAAAGCTTCTCGAAAAATTCATCGAATTAGAGGAGCTCCGCTTGTTGGACGAAATGATAAAGGAATATCCAGAAACACAGTTACAAAAACTCGGTATAGGAATCCACAATGAAATGATTCAATTGATCAAGATGCACAATGAGGATTGTATCCATATGATTTTGCACTTCTCGACAAATTTAATCTGTTTCCTTATTCTAGGCGGTTATTCCATTCTGGGAAATAAAGAACTTATTCTTCTTAATTCTTGGGTTCAGGAATTCCTATATAACTTAAGCGACACAATAAAAGCTTTTTCTATTCTTTTAGTAACCGATTTATGTATCGGATTTCATTCACCCCAGGGTTGGGAACTACTAATTGAATCTATCTACAAAGATTTTGGATTTGCCGATAACGATCAAATTATATCAAGTCTTGTTTCCACTTTTCCAGTCATTCTAGATACAATTTTGAAATATTGGATATTCCGTTCTTTAAATCGTGTATCCCCATCGCTTGTAGTGATTTATCATTCAATGAATGACTGAAAAGAAGAAAAAGGGTATACGGATATAAATCCAATTCAAATTTCTAATTCGAATGTTTGTTGCTTTGTACATAAAAATAATCAAAGCATTACAAATTGCACCCCCTCTTTACTATTTCTACTCGTCTCAGGCGGGGAGTTCCTCCGATATTCCAGTAATATTTTTTTATTAAATTTAGTTTTCAGTTAAAGTAAATAGCAGAATCGTGGATAGGGAACTTTACTAGCAACCTACCCAATTTATTGTATAAATTTTCGGAATCAATGGTTGGACTATGCAAACTATAAATACCTTTTCTTGGATAAAAGAATAGATTACTCGATCCATTTCCATATCACTTATATTATATATAATAACTCGGTCATCCATTGCGAATGCCTATCCCATTTTCGCACAACAAGGTTATGAAAATCCACGAGAGGCGACGGGACGTATTGTATGTGCCAATTTCCATTTAGCTAATAAGCCTGTGGATATTGAGGTTCCACAAGCGGTCCTTCCAGATACTGTATTTGAAGCAGTTGTTCGAATTCCTTATGATATGCAATTAAAACAAGTTTTAGCTAACGGTAAAAAGGGTGGCTTGAATGTGGGGGATGTTCTTATTTTACCTGAGGGATTTGAATTAGCCCCACCCGATCGTATTTCTCCAGAAATGAAGGAAA
AGATAGGGAATCTTTCTTTTCAGAGCTATCGCCCCAATAAACAAAATATTCTTGTGATAGGTCCTGTTCCTTAGGATGTTTGAACTATGTGAAGATATTCGACCCATATGAATAAATCTAAGTAAAGGATGTTTGAACTATGTATAAGCATTTACTAAGTTAAACATCAAAGAGTCTAAATGAGATTCTTAACCTATATTATATGTCAAAGAATTTAGCTGGAATTAGTATCTACTGAAACTAGATGAGCTAAAGTTACATGAATAGAATTAAATTGGGAATTATTCTGCAAAAGAATTTATCATGTATGATATAATATGAGGATCGCCAAAAACGTATCGTATGGCTTTAGGCATGACGAACATATACCAGTCTCTATTGATCTAAGTGAAGATCAACTAGATTGAGATCAAGAAAAACTTATGGTACTTGAAAAGGTACATAGGAATAGTTCTTGATTCAAGGAAATAAAGATATGCTAAATATTGATGCTACACGCATAAACACTGGCAAAGGATCAAGCAAGATTCCCTTAGAATTAACCATTGGCAAGGACGAGCTATAGAGCATCGTGTTTTGAAAATGGCAACATGGGTTGGAGACCATGAGTTGTTGCGTGGGAAATTAAAATATTAATTTCTATGTTCCAAAGATACAGCTGGAAAGTCTTCCACATATCTGTGAACTGCTTGGATAAGTAAGTCAAAACAAAGCATCACTAGCAACCTAAACAGTTGAAGTAAAAGTACTTATTGCCTAAGAAGCAATAAAACAGAGTTGTTTATATTAATGAGTTCTTCATTGAACTTGGGAAGATCACATGTCTGTTGACTTAATGGTTCTTCATTGCAAAATGCGTAGAACCACTAATGTAGCAAGAAAGACTAGATCACAAAATAAACATACTCAAAAGATCTTATCATCATATCTCGAAGAACATTCTATGAAAAGGATATTAAGATTGACAAAGCATAATAACTAAACCTATGCAACAAGTGAGAAGCAACACTCACGTTGTAGCACTGGAAATCAAGCATAGCTTTGAATTCCATGAACTGTTTTAAAGATGGGTTTGAGGCCCATGGTTGTAAAACATTGGGGTTGAACATTTATCGTATATGAAATGTATTTTCATATTCCATTTAATCTTGGTTTAGTATTAAATGATGAGTCCCTTCAATTTGACTAAGAAATGTCTATCAAGTGAACTTGAATGTCAAAAGTTGAAAAGGTCCCTGGTCGGAGTTTTCTATAAAATTGGACGCATAGAAAACGTTAGACGACTAGAATGCAAGATGACTAGTTGTTCTGTTTCTTGAACTATGTGGACATGGCAATGTCATAATCATTTGCATAGATACTTACTTTGGAAAGACTAGTATCGGATAGACCTATGAAACTTTACTGTAAGAGATGAAAATCTGCCATAAGTAAATTTTATTAAAATTATTAGACACTAAATCCTCAATACCTGAGTGATTTGAGATTACTTGTTTGAGAACTGGTTACTTTGACGTTGACCAACCGTCGCACCGTAAAAGGAGGCTATAAAGGAAACGCTCAGGTAATCACCTATCAAACGAAGTCTAATCTCAAGATCGCAAGATTGGGATTGTCCTCCCATAAATTGGGATGAGATGCTAAAAAGTTGTACAAGGCCACTCGGAGAGCTAGAAACTGTAAAATGCATGGCCGTGCTCAGATGAATCATAGGCTATGATTATCTGTTTATTTGATCAGTTGAACTCTGAAACTGAGAAACACCTCTGGACGTAATAAGGATGACAACTCTTACCTTATGTTCAAGAGCAAGCATCGAGCGACAAAGGAATTAGGAAATGCACACTTGTCCCTAAGGACAAGTGGGAGACTGAAGGAAATAATGCCCTTGGTCCAAGTATGCATTTAATGTTAAGTCTAATAAATGCAGTTCAGTAATAATTAACAAGTTAATAATTCAGTGAGATCAAGTGAGCTGAATGCCTAGCTAGAGGCCGCTT
CAGTTCAAGTGGAATTATTGATATTAATCCACATCTTACTCTTGATTGAACCGGTAGGGTCAAACAAATAGTACGTAAACGGATCAAGTATTTAATGGCATTAAATACTCCATCTATGGATATTCGGAATCGACGAATCTTGGTTTCAGTGGGAGCTGAGATCGTCATAAGCAAGAAATGAATACTCCGGAAACGATGATATTGCCGGAAACTGAAATATGGGTCGTGTCGGAAATATAAATATTATCCAAGTCGTAGATGTTGCCGGAAACGGAAACATGGTACGTATCGGAAAATATTATCGGAAATGGAAATATTGCCGGAATCGGAAATATTGCCGGAAACGGAAATATTGTCAGAATCGGAAATATTATCGAAATCGAAAAATAATTCCGGAAACGGAAATATTAAATATTTGTTCGAAACGGAAATTAATTCCGGAATCGGAAATATTAAATATTGTTCGTATCGGAAATGAATTCCGGAATCGAGAATTTAATCGGAAAGGTATCGTACGAATTAGCATCGGACGAGGCCTGCCAGACGAAGGCCCAGCACGAAGCCGGGCCATCGCCCAGCAAGCCAACACGCAACAAACCACACGCCAAGCTCGACCAGGCCCAGCGCAAGGCCAGGCCCAGCCAAGCCTTGGCGCGCGCGGATCATGGGCTGCGGGCAAAGGGGCTGTGCGCCGTGCGTGGGCCGCGAGGCTTACGCATGTGCGTGCGGCTCGTGCGTGCATGAGTGTTTGTGAATCCTAAAACTATCGGGATTCTACATATGATTAAATCCTAATTCTAAAAGATAAAATTAATTGTTTTAGAGTTCTACTAGGATTCTAAGTTAATTAATTCGTATCCTAGTAGGATTATAATTCCTTTCCATAAACTCTAAAATAAGGGCCTAGGGTCACATATTTATCGAGACAATTGAAGTATTCAAAGGTAAGATTTTCAAGAAAAATCAGTCACTCTCTTGCCCCATAATAGCCGAAATTCATACTACCTTAAGGGCGATTCTAGTTGGTCAAGCTTAAGGCGGATCCGGACGTGCTGTGGACTATCTACAGAGGGACGACACTTGGAGTCCAAAAGACTTGTTCTTGTTTGGTTCGAGCGCAGCTAGGGAGGGCACGCTACAAAGTGTATGCATCTGAATTATGCTAAATGATTATGTGTTAATAATATGTTTCCTGGCTTTATGGTTTTTCCGCATGATTTATGTTTATTCATATGTATCATAACCTAACAGAATTGAACTATGTCTCACAATTGAATTGTAATAATTTAAGAATCATTATTCTTCTGAATGAATATTGGTGCTATTGGGAATCCAAGTAAGAGATGGCAAAGATTGAGAGGTTTTTCGACTAGAAAAAGTGAGTTCATTGAGAATCTTGAGGTCGAACTATCGTGAAATATGTGAAAGAGTCAAAAGGTTAAGGGAGTGTTAATTAAGAGTTAATCATAAGTGAATACGTTAATGATCAGGTAAAAGTCCTTGCATAATGGAAGTAGTCCTACTTGAGTTTCTAAGGAGTTGAGGTCTTAGTATTAAACTATATAAGAGATCCTAAAGTAAGATTTAGTGGAACCAAAGACACCACTTTTAATTCTAAATTCTTGTTTTCAATTAAGGTTTATGTTTTCTTTCAATTGGGTCATACTTGTTTTAATTTTCTTAAAAGACGCTTGTTTTCTTAATCAAGTGTTAATAATAAAAGAATTCTTTTAATGTAGTAAATGAGGATCTCTAAGCTATCTTAAGTATAACAAAAGCGGAGTTTTTAATGATTCTTGAATGAGTCGTTTCATGGTTTTATAATCTTTCCTAACAAGCCATATTTATTTTAGAGTTTCCTAAAATTCACATGTTTTTCCTAAAACTATGAATTATGGAACTTAAACTAGCTTTGAACAATATTAAAGTGTTTATATAATGAACTTAAAAGTTGGAAAGGTTATAATGAAAGATATTTTCAGTGTTATTTTGATTTCTTT
AAATGACAATAGTTTGCAAATAAAGAAATCATTAAAAGTTTTAAAAAAGAGTTTATTATAAAGAATACAAATATACTTTAAAATAAATAATCAAGTTTTCATATTTTTATAACCTAATAGTTTGAAATCTATCGAGAGGACATTTCAATGAATAATTAAGATTATGTTTGAGTAATTATATTTTATGTAAATGTTTATAAAATACATTTCAGGCACACTAAGCTATTAAATAATGAATTCGGATTTGAGGAAAGTATGGCCTTTTGATAACCTTGGGATCTTAAATGAGTATCACTCACTAACAGTCAATTACTAATCCATGGTCTAAGAGTATAACCAAGGATAAGGAGTGATTCTAAGTTAAAAGGTAATAATTGAGAAGAAAGTCCAATTTTAAATAGATTGTTACGGTTTGACTAAGTGTAAGAGAAAGTGTCAGCAAGTCTAGAGTTTGAGATTGGAATCGGGTTAAGAAGTCATATAAGATAGAAGCTGAGTTGAATTTAGTTGAGGAAGAATTAATGCATGAATGAGGTAATGCATATTTTGAGAACGAAATTGCATAAATTGGTAATAAAATGCTTAAGTAGGGGAAAAGTAGCCGAGTATGTGTGGCGTAGTGCTCACATGCATCGGTGATGAAAAACATTGGAAGCTAGTGCTACATTAGTAGACTCGTGGTAAATGGGCTTGGCCCATATTTGGAAGCGAGTGCTATATCAATAGACTCGTGGTAAATGGGTTTGACCCATATTTGGAAGCGAGTATTACTCGTGGTAAGCGGGCTTGACCCGTAGTTAGGGACATGTCCTAGTTAAGTCTTGCAAGCGTATGTTTATCAGGAGGGTGATGACCCCCACCTCATAGAGACAGCTGGTCGTGCTGACCCCTCTCTATTCGCGTTCACTTTCCCCAGATTCAAAATATTAAGTTGAAATTGAATTGTGTTTGCTAGTATTTTTTGAAGCAAAGCTGCATTACTTGGCAATGAGTTTCGAATTGAATTGTGAATTGTGTATGAATAAGCACTATGAAGTTCAGTTTCTAGAATAATGGATTCTGCCTTAAGTTATGAATGTCTTATGAATAAGTAATATGAATATCGCCTTGAGGAATAATTTATGGCTTGAAAAGAGTTTATAGTGGAGAGATTCTTGGGCATGATATTGGAATTATATAAAAATTTTATGACATGGTTATGGTTTGACATGATTAAGAGTATGAAGAAATCTAGGTAAATCTTAGGATTTGAGTCTAATTATATTGATGTGATTGTGAAATTAGAAACTTTGTGGTAAGTAAGGTCTGAAGTCATTTAAAGAAATACTAGTGTATCATTTTCTAAATCGTTGTTTGAATAATAAGGATTTCGCGAAATAGATTTCAAAGAAGTTATAACGTTTTATCAAGTTATGTTTTCCAATTGAGTTTGATAAGCGAAGTTGTTTTTTAAGACAATACCTACAAACCAAACAAAAATTATTTTGAGGCACCTATCAAATGCATTTTAGGTTGGATGTGTGTACTCAGATTTCCGCTGATTTTGTTCAATGAACCTGTCCTGGTGGGGGCTGAGTTAAAATATGTTTTGCAGGCAGCGAGAATCGAGTCTCGGTTCAGTTTGAAGGTGCGTAAAGTCCTCCAGAGGCCTTATATGTTTCAAGGCGGGCATATGGATTCTTCAAGTGTATTTAAGTCCTTCCTAATTTATTAATGGGTTGTATTATTTATTTCTAGAGTATTTTTTTTGGTTATTATTTATGGATTATATATCCTTGTTGAGTTTGGTGTCACTCTAATGACGTTTTGGGAAATGTTATATGCTAGAATGATGTAATAGCAGGTGGAAGAAAATCTAACGTAATTTCCTAAACAGTATTTTAGTTGTCCATATTTAGGGGAAGTGCTGCCGAATTTTTTTAATAACTAATTAAGTTTAAGGGTCTTGATTATTGTCTTTGAGATAGGCTAAACCCGTGATTAAATCTTTGAAGTTGTTT
GAAGAGTTAAAGTTTATTTAGTAGGGTTTAAACTTGTAATTTCGTTGAGTTTGTTTAAATTAAGACTAAGTGTGTTTAGAAAGCAGCTCGTTAAGGGCGGTTTCAGGTGGTATCAGAGCGGAGGATGATTTTGGGCTTGCTATGATCCACATATCACGACATATGAATTGGGAATTGAATATTGGGTTGATTTAGAGGCAAGGAATGTATAAATTTCCTTTAGAATGATTTGCTTGGAGGTTGTATCAAATTGTAGGATTAAGAAGTTAAAAAGAAAGATGTGGAGTTTCCAAAGACTTGTGTGCAATTTGAGTTATAAGGCCAAGATGGCCTTATGCAATTATATATCTAAGTTACGGAGATAGGAATGAGTTTTAATGAATGAGAGAAGGATTGTAAACTATTTAAATAAAGGAAGTCGGTTAGGAATATGAGTAGTGGATCTAGTAAGGATTGATTGAGTTAATATTGAGAAGTATTGTGTATTGTTGTTTTTATCTTTAAGCTGATTTTGAAAGGAATTGAGTTCGATGAAGTGGTTTGAGAATATTATAATTATGATGTTCAAAGTTTTCTTGGTTAATTAAAAGTTAAATTGTCTTGTTGGAAAGTCAAATAAGGTGATTGATTTTCATCTTTGGTGTATTAGGTATAAGGTATTGAGTTTTATATTTATATGATGTATTTGGGATGTAAAGGAGTTGAGATTTGAGCAAATGAAATGGAGGAATATGAATGGATAATTCTTATTGCTTGGTTGAATTATAACAGTCATACATCACGGGTTCAGTCTAAATGGTTGCGATTTCAATTGGATTAAATGCTTTTATTTTGAATATTGGAGCAAAAAGACAAACTATGCTAGTTCTTAAGATTTTTAAAGATTGATCTTAGTTAATTTGATTTCGAAAGACCTCCCTAGTTGTTATTAAGGACCTGACATTATTTAATTATGTTAGTTTATTAGATAAAGGCATAAGGATTCCTAGTTAATGTGGTTATTAGAAATTAAGCATGGAATTGGATGTTAGGAATGATGATTTTGGGCTCGTGAAATGTCAATTTGGAATGAATGGCTTGGAAATTAGTTGTAGAGGTTAGAACGTAATTTAAAGGATTAATTATTGCATGGTAGGGTTATCAAAGAAGTAAATTTAAAGTGAGTCTAACTAGAGTTACGCATTTGAGATCTGTGAGCGAGGATTTTAGATTCCTTCATTAATTCCTTCTTGTTTATTCAAATTATTGGAATTAGAGTTGAGAATAGGTGGTACAAGCTGAAGAATTAAGGTCTTTTTGATGTAAGGTTGACTAGTGGATGGAGAATTAAAGGTGAGTTGTACATCTTTATGCTGTTCATAGTAGTATGCACCTGTCGTAATTTTAAATGGTCTTAAAAATGCTATTTTCAAACTTCGAGGATGCGACTTGTTTTTAAGTATTGATAGAAATATTTTATGTATGTAATATTTTATAAAGTAAAATTTTGTTTGAAATCCTTCTTCAATGCCTTAAGAATTGTATATATGTTGTTGAACTTCTACCCTAGAGAGAAAAATCATTTTCTTCATACTCTTGCTAATTGCATCGTGGTTGAAGATGTCAGTGTATTGTTTCATTGTTCATCCATTCAGCTCCCTCCTATTAAATAGGGTTAAGCTTTTCCACCTCTCATCTCACATATCCTTCCACTGGTTCAATAACTATAGCATAACTCAAATTTGTCCAGAGCAATTGAGAATGTATTTCCAAACCTTGGTCACAATTTCTCCTATAAGAATATCTCAAAGGTTTAAACATCTTGACTAGATTACTTGAAATTAAGTAGGTTGGATTACTTTCTACTGTGTGATTCTTGAATGTATTTCCAAATTTATGTTTATCCAAATAAATCCTATGTTTTGAAGTTTCAAGGACGAAACCTATTTTAAGGGTGGTATATTGTAACAAGCCATATTTTAAGCATCTCTTAATCTCGTGCTAATTATGTTTATCATCTT
AAACCGCTCTTAAAAATCGCTTTTTGAAACTACTTTAGGGTTAAGGTTTAGGAATTATATCATTTTGACGAGTGAATTTAAGGTTTTATTATTATCGCTTAAGTGATCAAACCTTAAAGAAATTTTTTGGAATTTGATTGGTTTAATGATTTGAGATTTTGTTATTCCAAAGTAAATTATTTGAAATTTAGTATTAATTGTGAAATTAATCCTAAATAAATTATTTTCGTTTAAAATTATTTGTGTTTAATTATCATTCCTAAATAAATGAATCATAAACTAGTTTGTAAGAATTTGATTTAAGAATAGGGTTATTTATATTTAAGCTTATTTACTCCTAGCTTAAAGTTTTAATTTTATTTATAAATATGATTATGGTTGATAAATTTCAACGTTTGCATCTCCTCATTTGCAACGTGTCAGCTCAAATGGAAGCCAACAACCCATTTACTTGCCAAGTGTCCCAAAAAGTTACCTTATTGTAAATCTACCTAGAAGCTACTACCTTCTTTTGCCTGATTAACAATACTCAGTAAATTAAAAAAAACCCAAGCTCTCACTTCTCTCTCTCTCCTCATTCCCGATCTCCCTTTTCACCTCCCCTCTCCTCGTTCCTTTGTCCCTGTGCCAGCACCTCCTCCGTCGTCTTCGTTCCGTTGCCAGTTCCTCTCCTCTATTTTGCCGGTGCATTTGGGGATTAGGTTCCCTCGCCGGCGGCTTGTGCTTTCCCTTTCGCATTGCTCCGAGTCGTGTCGCCATTGTCGCCGCCGCTAGTGGTGTCCACCAATCCACCATCACTTATTCCTCTTCACCAATTGGTAATCAATTTATCTCCCTCCCTTGATTTACTATTTTAAATTTCAATCTAAAAAAACATAAAACCCTAACTTTCTCTTCTTTGGGTTTTTTTTCTCTCCTCTTGTTCGGTGCACCACTTCCTCCGTCGGAGCCGTGATTTGGTGGTGGTGATGCCCACCCTGTACGTGGTGGTTGTTAGTTGTCTGATGTTGTTGCTCTTCTTCCTCTCTCCACTTGCTCTTCCCTGCCAGGCTCGCCCACCACTGCGGCGGTGCGGCGTAGCTGCTGTTGCTGCTGCGTCTTGGTATTATTAGTGTCCACCTTGTTGTTGTTGATGTGGAAGCTCATGTTGCTAATTCTGTCTTCTATGTTCCTTAAGGGTATAACTCAATCCAAACTTTATTCTATCTACTTTTCAATTTATCCATTTCCATAAAGTTATGAGAATTTAATTTGTTATTGATTATTGGTTTAATTTGATTTTCTAAATATCGAATTAAACGGGTTTTGATAATGTAATTGAGTAATTCAGATTTTTGTTTCGAGAAAGTTTTGAATAGCTATATAATTAAGATTTCCAATTTGATAATTTAAACGACTAATATTCATCAAGTATACAAATTTAAATTGTTGAACTATATGTTTTCAATCCCGGATTATAATTAGCCATCCAATACAGCTTTCTTAAAATAAAATATGTGCTATGTTAGTTCTAAAATCTAAATTAAGTTTTTCGAAAATCAATTAAGATTTCAAAAAGAATTCATTAAGTTCTAGATATATTAGAGTGAGTAGTTTATTAATTTAAGTCTTTCTTGTTTAGGTGAATTCATGAAGAATGGTTTAAGAAGATTCAAGTCATAAAGTTAAGCGAAGAATGATGATCGAATATGTAGCTTTGGGATTTATGAATGCTTATTGAAGGTACACATCGTCCAACCATCTTCAATGTTGAAATCATATTTTAAGTTTAAGTATTATGTGTATATATGTATATGTATAGATTCTGAATTGTGCTTTGCAAATTTAATTGTGCTTTGGAAATTGAATCGTGTTTTGCAAATTGAATTGTGCTTTACAAATTTAAGTATGTTTCATGAATTGAACTATGTCTCGCAATTGAATTGTAATAATTTAAGAATCTTTATTCTTCTGAATGAATATTGGTGCTATTGGGAATCCAAGTAAGAGATGGCAAAGGT
TGAGAGGTTTTTCGACTAGAAAAATTGAGTTCATTGAGTATCTTGAGGTCGAACTTGGAAATCTTGAAATATGTGAAAGAGTCAAAAGTTTAAGGGGGTGTTAATTAAGAGTTAATCATAAGTGAATACGTTAATGATCAGGTAAAATTCCTTGCATAATGGAAGTAGTCCTACTTGAGTTTCTAAGGAGTTGAGGTCTTAGTATTAAACTATATAAGAGATCCTAAAGTAAGATTTAGTGGAACCAAAGACACCACTTTTAATTCTAAATTCTTGTTTTCAATTAAGGTTTATGTTTTCTTTCAATTGGGTCATACTTGTTTTAATTTTCTTAAAAGAGGCTTGTTTTCTTAATCAAGTGTTAATAATAAAAGAATTCTTTTAATGTAGTAAATGAGGATCTCTAAGCTATCTTAAGTATAACAAAAGCGGAGTTTTTAATGATTCTTGAATGGTCGTTTCATGGTTTTATAATCTTTCCTAACAAGCCATATTCATTTTAGAGTTTCCTAAAATTCACATGTTTTTCCTAAAACTATGAATTATGGAACTTAAACTAGCTTTGAACAATATTAAAGTGTTTATATAATGAACTTAAAAGTTGGAAAGGTTATAATGAAAGATATTTTCAGTGTTATTTTGATTTCTTTAAATGACAATAGTTTGCAAATAAAGAAATCATTAAAAGTTTTAAAAAAGAGTTTATTATAAAGAATACAAATATACTTTAAAATAAATAATCAAGTTTTCATATTTTTATAACCTAATAGTTTGAAATCTATCGAGAGGACATTTCAATGAATATTTATGTTTGAGTAATTATGTTTTATGTAAATGTTTATAAAATACATTTCAGGCACACTAAGCTATTAAATAATGAATTCGGATTTGAGGAAAGTATGCCCTATTGATAACCTTGGGATCTTAAATGAGTATCACTCACTAACAGTCAATTACTAATCAATGATCTAAGAGTATAACCAAGGATAAGGAGTGATTCTAAGTTAAAAGGTAATAATTGAGAAGAAAGTCCAATTTTAAATAGATTGTTACGGTTTGACTAAGTGTAAGAGAAAGTGTCAGCAAGTCTAGAGTTTGAGATTGGAATCGGGTTAAGAAGTCATATAAGATAGAAGCTGAGTTGAATTTAGTTGAGGAAGAATTAATGCATGAATGAGGGAATGCATATTTTGAGAACGAAATTGCATAAATTGGTAATAAAATGCTTAAGTAGAGGAAAGTAGCCGAGTATGTGTGGCGTAGTGCTCACATTCATCGGTGATGAAAAACATTGGAAGCTAGTGCTACATTAGTAGACTCGTGGTAAATGGGCTTGGCCCATATTTGGAAGCGAGTGCTATATCAATAGACTCGTGGTAAATGGGCTTGACCCATATTTGGAAGCGAGTATTACTCGTGGTAAGCGGGCTTGACCCGTAGTTAGGGACATGTCCTAGTTAAGTCTTGCAAGCGTATGTTTATCAGGAGGGTGATGACCCCCACCGCATAGAGACAACTGGTCGTGCTGACCCCTCTCTATTTGTGTTCACTTTCCCCAGATTCAAAATATTAAGTTGAAATTGAATTGTGTTTGCTAGTATTTTTTGAAGCAAAGCTGCATTACTTGGCAATGAGTTTCGAATTGAATTGTGAATTGTGTATGAATAAGCACTATGAAGTTCAGTTTCTAGAATAATGGATTCTGCCTTAAGTTATGAATGTCTTATGAATAAGTAATATGAATATCGCCTTGAGGAATAATTTATGGCTTGAAAAGAGTTTATAGTGGAGAGATTCTTGGGCATGATATTGGAATTATATAAAAATTTTATGACATGGTTATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTAATTTAAAGGATTAATTATTGCATGGTAGGGTTATCAAAGAAGTAAATTTAAAGTGAGTCTAACTAGAGTTACGCATTTGAGATCTGTGAGCGAGGATTTTAGATTCCTTCATTAATTCCTTCTTGTTTATTCGAATTATTGGAATTAGAGTTGAGAATAGGTGGTACAAGCTGAAGAATTAAGGTCTTTTTGATGTAAGGTTGACTAGTGGATGGAGAATTAAAGGTGAGTTGTACAT
CTTTATGCTGTTCATAGTAGTATGCACCTGTCGTAATTTTAAATGGTCTTAAAAATGCTATTTTCAAACTTCGAGGATGCGACTTGTTTTTAAGTATTGATAGAAATATTTTATGTATGTAATATTTTATAAAGTAAAATTTTGTTTGAAATCCTTCTTCAATGCCTTAAGAATTGTATATATGTTGTTGAACTTCTACCCTAGAGAGAAAAATCATTTTCTTCATACTCTTGCTAATTGCATCGTGGTTGAAGATGTCAGTGTATTGTTTCATTGTTCATCCATTCAGCTCCCTCCTATTAAATAGGGTTAAGCTTTTCCACCTCTCATCTCACATATCCTTCCACTGGTTCAATAACTATAGCATAACTCAAATTTGTCCAGAGCAATTGAGAATGTATTTCCAAACCTTGGTCACAATTTCTCCTATAAGAATATCTCAAAGGTTTAAACATCTTGACTAGATTACTTGAAATTAAGTAGGTTGGATTACTTTCTACTGTGTGATTCTTGAATGTATTTCCAAATTTATGTTTATCCAAATAAATCCTATGTTTTGAAGTTTCAAGGACGAAACCTATTTTAAGGGTGGTATATTGTAACAAGCCATATTTTAAGCATCTCTTAATCTCGTGCTAATTATGTTTATCATCTTAAACCGCTCTTAAAATTCGCTTTTGAAACTACTTTAGGGTTAAGGTTTAGGAATTATATCATTTTGACGAGTGAATTTAAGGTTTTATTATTATCGCTTAAGTGATCAAACCTTAAAGAAATTTTTTGGAATTTGATTGGTTTAATGATTTGAGATTTTGTTATTCCAAAGTAAATTATTTGAAATTTAGTATTAATTGTGAAATTAATCCTAAATAAAATATTCTCGTTTAAAATGATTTGTGTTTAATTATCATTCCTAAATAAATGAATCATAAACTAGTTTGTAAGAATTTGATTTAAGAATAGGGTTATTTATATTTAAGCTTATTTACTCCTAGCTTAAAGTTTTAATTTTATTTATAAATATGATTATGGTTGATAAATTTCAACGTTTGCATCTCCTCATTTGCAACGTGTCAGCTCAAATGGAAGCCAACAACCCATTTACTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGTATCAGAGCGGAGGATGATTTTGGGCTTGCTATGATCCTCATATCACGACATATGAATTGGGAATTGAATATTGGGTTGATTTAGAGGCAAGGAATGTATAAATTTCCCTTAGAATGATTTGCTTGGAGGTTGTATCAAATTGTAGGATTAAGAAGTTAAAAAGAAAGATGTCGAGTTTCCAAAGACTTGTGTGCAATTTGAGTTATAAGGCCAAGATGGCCTTATGCAATTATATATCTAAGTTACGGAGATAGGAATGAGTTTTAATGAATGAGAGCAGGATTGTAAACTATTTAAATAAAGGAAGTCGGTTAGGAATATGAGTAGTGGATCTAGTAAGGATTGATTGAGTTAATATTGAGAAGTATTGTGTATTGTTGTTTTTATCTTTAAGCTGATTTTGAAAGGAATTGAGTTCGATGAAGTGGTTTGAGAATATTATAATTATGATGTTCAAAGTTTTCTTGGTTAATTAAAAGTTAACTTGTCTTGTTGGAAAGTCAAATAAGGTGATTGATTTTCATCTTTGGTGTATTAGGTACGTATAAGGTATTGTGTT
TTATATTTATATGATGTATTTGGGATGTAAAGGAGTTGAGATTTGAGCAAATGAAATGGAGGAATATGAATGGATACTTCTTATTGCTTGGTTGAATTATAACAGTCATACATCACGGGTTCAGTCTAAATGGTTGCGATTTCAATTGGATTAAATGCTTTTATTTTGAATATTGGAGCAAAAAGACAAACTATGCTAGTTCTTAAGATTTTTAAAGATTGATCTTAGTTAATTTGATTTCGAAAGACCTCCCTAGTTGTTATTAAGGACCTGACATTATTTAATTATGTTAGTTTATTAGATAAAGGCATAAGGATTCCTAGTTAATGTGGTTATTAGAAATTAAGCATGGAATTGGATGTTAGGAATGATGATTTTGGGCTGGTGAAATGTCAATTTGGAATGAATCGCATGGAAATTAGTTGTAGAGGTTAGAACGTAATTTAAAGGATTAATTATTGCATGGTAGGGTTATCAAAGAAGTAAATTTAAAGTGAGTCTAACTAGAGTTACGCATTTGAAATCTGTGAGCGAGGATTTTAGATTCCTTTATTAATTCCTTCTTGTTTATTCGAATTATTGGAATTAGAGTTGAGAATAGGTGGTAAAAGCTTAAGAATTAAGGTCTTTTTGATGTAAGGTTGACTGGTGGATGGAGAATTAAAGGTGAGTTGTACATCTTTATGCTGTTCATAGTAGTATGCACCTGTCGTAATTTTAAATGGTCTTAAAAATGCTATTTTCAAACTTCGAGGATGCGACTTGTTTTTAAGTATTGATAGAAATATTTTATGTATGTAATATTTTATAAAGTAAAATTTTGTTTGAAATCCTTCTTCAATGCCTTAAGAATTGTATATATGTTGTTGAACTTCTACCCTAGAGAGAAAAACCATTTCCTTCATACTCTTGCTAATTGCATCGTGCTTAAAGATGTCAGTGTATTATTTTATTGTTAATCCATTCAGTTCCCTCCTATTAAATAGGGTTAAGCTTTTCCACCTCTCATCTCACATATCCTTCCACTGGTTCAATAACTATAGCATAGCTCGAATTTGTCCAGAGCAATTGAGAATGTATTTCCAAACCTTGGTCACAAATTCTCCTATCAGAATATCTCAAAGGTTTAAACATCTTGACTAGATTACTTGAAATTAAGTAGGTTGGAGTACTTTCTATTGTGTGATTCTTGAATGTATTTCCAAATTTATTGTAACACCCTAATAATTCCTTGCTTTTATAAAACCATTTTCCAACTTAAAATAAAGGAATTACTAAAGCATTACCGCCACCGTGATAACGGTTAAGGCTATTACCAGAATTATGCAGCGGAATTAAATGTCAACTAACTTTTAAAAACATAATTAATGAAATATTGAGGCCTCCTACAATTTGGAACCATAATGGCCCAAAACCCAAAGTTTAAATAGTTCAAACATTTCAAACATAATTAAAGTAAAGTTAAATAGTTCAAAATACGAAAACATGCAACTCTCTCGATCATCCCAAGCCACATGATTCCGATCTACCAACCTGCTATTTTATTCTACTCCCCATCAATGCAAGTGCAAATGATAGATCATCATAGGGTCATTAAGGCAAAGGCCATGACCAAAACACACAAAGCACGTAGTCAGTAAAAGCTGAGTACATACAAGCATAGAGTGAATGAAACTAAAATATGCATCACTCACTAAGTACTAGTCAACATGCAAAAGCCATATAATAATAAACAATCAATCATGAATACAAGACTCAACTCTTGACTCACAATTTAATTAGAATAAGCTTCGAACGGGCCAAAAGAATAATCCACAAAGGGAAAAGGTGATGGGAGCCAACCATACACCAAATATATAATAATAAAATCGGGCCATACCGACGGAATTTCTGCGCATAATATAAATGAAACAACGTCTTGTGTCATTAGTAGGAGTACGAGCTTTCCGGCAAGACTCCCCCCATTGTTCATACTCAAGGTATATACGTTCCAAGATTTT
TGAAGCTTGTTCTGGTTGCACTTTACGTTTATTAATATTTTATTAAAGGTGAACTCAAGACTCGACAATGTACACACATACAATTAATAAACATCAACCAAAACAATCTTATTTTAATGCTTTAAGACTTATGATCAATCAAACAAGACACTTGTGATATTACTACACTAGAGGAAAAAACCTCAAGTGCGGGCTCATTTTAAGGGTTTTAGTGCGGGCTTTTACATGTGCAGCACATGGCCTGCCCGCACTTGATGTTTCTCAAGTGCTGCCAAAATATTGGCAGCACTTGATCGATGTTTTAAGTGCTGCCAATTTGGAAAATGAGCCCGCACTTGACATATTTTGTGCTGCCAATTTTAAAATATAAGCCCGCACTTGATATATTTGGTGCTGCCAATTTTGAACATGAGCCCGCACTTAACATATTTGGTGCTGCCAATATTGAAAATGAGCCCGCACTTGACATATTTGGTGCTGCCAAATTTAAAAATGAGCCCGCACTTGACATATAAATGGGCAGCACAAGCATAAAAATGAGCCGCATTTGGCCTATAAATGAGCTGCACTTCATAGATTTGTTGTATAACCGATTTTCCTGCTTCATATTACAATTGGACCACGCTACTGATTAACCTCCATACGTCATATAAAAAGTATCCAATTATATAGATAATTAAATTAAATAGTATATTATTGTAAATATAAAAGTCGATTACATTACAATCATATGAAAACCAGGCTAGCATCCGTAATTTAAGATTCAATACAAGAAAATATCAAAGACAATCACTGCTAATGAAGTTTTCCAACATTTCTCGAACTTCATCTATCTCCTCAAAGGTGAAAGGTCACTCCCTCGGATTATTGAATACCTTAAAAAAATCGACCATCAAAATATATATATAAACTTTAGTTATATTATAAATATTGAATCGTACAATAAATTAAGACTTAATTTGTTCATAACAAATAAATAATGAACCGTATAACAATAAAATTGGTTAATTTCAGTCCCAACTTTCATCTAATGAACTTAAATAAATATTTTAAAGTCCTTCCATGTTTAATAAACTTAGTTTTATGTTTCAAATACTCATTCTCACCCATTTTGGTGAAGTTATGCACATTTAAACCTCGTTAGTTCATTATCGGTCACATTTTGACTAAAATGGCTAATTTCGGTCCCAACTTTCAACTAATGAACTTAAATGAGTATTTTAAAGTCTTTCCATGTTTAATAAACTTAGTTTTATGTTCTAAAGACTCATTCTCACTATTTTGGTGAAGTTATGCACATTTAAGCTATGTTAGTTCATTATCGGTCACGTTTTGACTAAAATGGCTAATTTCGGTCCCAACTTTCAACTAATGAACTTAAATGAGTATTTTAAAGTCTTTCCATGTTTAATACACTTAGTTTTATTTTCTAAAGAGTCATTCTCACTATTTTGGTGAAGTTATGCACATTTAAGCTACTTTAGTTCATTATCGGTCACGTTTTGACTAAAATGGTTAATTTTGGTCCCAACTTTCAACTAATGAACTTAAATAAGTGTTTTAAAGCCTTTCCATGTTCAGCACACTTAGTTTTATATTTCAAAGACTATCACCCATTTTGGTCAAATTATGCACATATAAGGATTGTTTGATCATTATAGGTCACATTTCGACAAAAATGGCTTATTTCGCTCCCAACTTTCAACTAATGAACCTAAATAAATATTTTAAACCCCTACATGTTTATTAGACTTAGTTTAATGTTTCTAAGACTCGTTCTCACTTACTTTTAGTCAAGTTATATACAGTTATAGTATAATGTGTACCTTTTCGAGATCTTGTGACTATATCATACATGAGCTTCATCACGTAGTAGCCGCACTCCGTATTGCCCGTTTGTTGTGCACACTTGGTATCACAAATAAAATGCATGCAAGACAAATGTAAGTTTTTGAGCTAATAACATCTAATGAAAATATATAACAATTGAAATCATTTAC
CTTCACTGATTTCAAACTTTCTGTCTCGTTTTAGTAATCCAACCACTCACTAACTTAGCCTTGAAATCCCTGAATCTTATGCCCACAACTTTTAGAAATATTTTTTCTTGACTGGATCACTTTCAACATTGAAAAGATTCTTCACAAAACAAGAAAACCAGTAGTCAGTCTATTTTATTAACCAAAATTCAAATGCAAAGCAAATTGGAGCATTTAGATTATGAAAAACGAGAAAGATATTTAATAGAGTGAGCAACAATATTGGGCTTGAACAACATATAGAAAGTAGCAGCAGCAGCATATCATAAGTAGGAAGGAGATACTCGCATCTTCCTGGTGACATTTGCAGTAGGTTGAACTTGACCATTAATGTATATACGTAATGTATGATAATAAACCTTAACATACCAAGGGACAACTTGAAAAACATTGATACGCAAGTTGCACTTTTCCCCATAAGCACCAGTACCCACCAAACTCTGATGATGATTAATAGATTTCAGTGAGATAACTATAACACCCCTTTCATTACCACTTCCCATCAAGAATCTATTAGCAAGTAATGGTGCTTGAGAACAAGACCAAACTACAGGAAGTTTCCAAGTTTGTACTAAATCAAATGGTTTAGCATCAACATATTTCTCGATAACAAACACATACAGAATAGATGACTCCACATCATTTAAGTTATCAACTTCTCTAATCATTTTCTCAGGGGAAGGTGACAATTCAAACGCTGGATTAGTGATGGAAATATCAGAAGTTGCAAAATCAGGGAGAAATCCTTTCTTTTTCATTTCAGAGACTATGCCTCTGTCAAGTTGA

mRNA sequence

CCCCTCGTCCCCCTTCGAACGAATTAGGGTTTCTCTTTCTCTCTCTCTCCTCTTTCTCTCTCTTAAATTCTTTATATTTTTCCTCTTTTTTTCTTTTCTCGCCGAAGATTCCATCACATATCATCATCATCAACAACATCAAAAAGAAATTAGAAATCAAAATATTTAGAAAAACCCTAATAAATCTTGACGGACTAAATCTACAGAATTTGATCCATCTGAAATCGTTGACGCTTTTGATCATTGTTCTTCTCACAGATCAGATCTGCTTAAAAATTTGGCCTGATTATTCACGCTTACAAACCCTAATTTTCATTTTTAAATTTTGGGATAATTTTCCCTTTTTTAGTCGGATTTCTGGATTTCTTTGCTCGAATTTGATAGATCCGTGTGTTTACATGAAATTTGAATCTCGACGAAGAAGACGACGAAGAAGAATTGAAAATCAGAAACCCTAACAGTCCAGATCGAACCCAGAAACTTGAAGTCAGAACAATTTCCCCCTTAATTCAATTTTTTATTATTTTTCATTATTAATTTTCGGATTTCCGGCTAATTTTATTTTCATAAATTCACAGAACAGTAACAAAAATCAAGAACAAAAAATGATGAAAACGGTAGTTTATGAGGGTGATAATCTACTGGGTGAAGTAGAGATATATTTTGAGAACAACGCCAACAAGATTGAAATGATGAAGAAGGGGATGTTGATGAGGATAAGTCATTATTCAGAAGCAAGTGAGAGATGTCCACCTCTTGCTGTTCTTCATACTATTACTAAATCAACTGGTGGTGTTTCCTTCAAAATGATGGAGAAGTCGCTCTACTTTCAACAACACAATGATTCCCAGATTTTTGCTTTGCATTCTTCCTGTCTCAGAGGCAACAAGACGGCTGTGGTGTCCCTGGGTGAGCAGGAGATTCATCTGGTGGCAATGCGTTCAAGGAGAATGGATGGTGTAACAACCCCTTGCTTTTGGGGTTTCATTGTCATGCCAGGGTTATATGAATCTTGTCTTGGCATGTTAAATCTTAGATGTCTTGGTATTGTGTTTGATCTTGATGAGACGCTGATTGTTGCAAACACACTGCGATCTTTCGAGGATAGAATTGAGGCCTTGCAAAGAAAAATGACTGTAGAAGCTGACCCGCAACGTATGGCGGGTATGATGGCAGAAGTGAAACGATACCAGGAAGATAAGGCTATACTGAAGCAATATGCTGAAACTGACCAGGTGGTGGATAATGGGAAAGTCCATAAAATTCAAGCTGAAGTTATTCCAGCTCTATCTGACAACCACCAAACAGTTGTTCGACCGCTTATTCGGTTACAGGATAAAAATATTGTCCTTACTCGAATTAATCCTCAGATACGCGATACAAGTGTTCTTGTAAGATTAAGACCTGCATGGGAAGATCTACGCAGCTATCTAACTGCCAGAGGCCGTAAACGCTTTGAGGTTTATGTTTGTACAATGGCTGAAAGAGATTACGCTTTAGAAATGTGGAGGCTTCTTGATCCTGACTCAAATTTGATTGGTGGGAGGGAACTTTTGGATCGTATTGTGTGTGTCAAATCTGGATCAAGGAAGTCGTTGTTTAATGTTTTCCAAGGTGGGATTTGTCACCCCAAAATGGCTTTAGTAATTGATGATCGTCTAAAGGTGTGGGATGAGAAAGATCAGCCACGGGTGCATGTTGTGCCTGCATTTGCTCCTTATTATGCTCCCCAAGCCGAGGCAAATAATGCCATCCCAGTTCTCTGCGTGGCTAGGAATGTAGCTTGCAATGTCCGAGGTGGTTTTTTCAAAGAATTTGATGAGGGTCTCTTGCAACGAATGTCTGACGTTTATTTTGAAGATGATCCCAAAGATTTTCCTTCCCCCCCTGACGTGAGCAATTACTTGGTATCAGAGGAAGCAGTTTTATCCTCTTCTTCAGCCCCTTCTCTTCCGTGTGTAACATCTTTGGCGACTGTGAATCTTGATCATAGGCTGGCA
TCTTCTCTCCCGTTCTCTGTTGCTGCTTCTTCCATGACAATTCCACAACCTGCACCTCAAGCATCAATTGCACCTTTCCATGCTAACCTATTTTCACAAGCAGGTCCTTTAGCGAGAACATTGGCTAGTATTGGTCCCAAGGACCTTGGCCTGCACAGTTCCCCTGCTCGAGAAGAAGGTGAAGTACCTGAATCTGAGTTAGATCCTGATACAAGGAGACGGCTTCTTATATTGCAGCATGGCCAAGATATGAGAGAAGGCTTACCAAATGAGCCTCCGTTCCCGGGAAGACCTCCAGTTCAAGCTCCTGTTGCAGGTCCTGGTTCTGGTCCTGTTCCAGTCCCTGGTCCAGTGCCTGTTGCAGGTCCTGGCTCAGCTTCAATCTCAGTTCCGGGTCCTGGTCCTGTTCCTATGTCTGGTTCTGTTCCAGCTCCTGCTCCTGTTCCTGTTCCTGTTCCACGGGTACAATCACGCGGGAGTTGGTTTCAAGTCGAGGATCACATGAACCCAAGTCCTCTGGGCCGATCAGCCACTAAAGAATTTCCTATGTCTCCTGATGCTGTACATGTTGAGAAGCAGCGGCCACCTCCCCCTTTTCCTCGAAAAGTGGAGAATCCAGTTTGGTCTGATCGAAGTTTCCCTGAAAAACAAAGACTGCCGAGGGAGGCTTCTCGCAGAGATGAGAGATTGAGGTCAAACTATTCAGTGCCTAGTCATCAATCATTTCGAGGCGATGAAATTTCTTTGAGCCGATCAGTCTCAAGCAACAAGGGTTTCGAAGTTGAACCTGAAAAAGGCAGTTCATTGTCGGAGAATCCTTCAGTTGCTTTACATGACATTGCAATGAGGTGTGGAGCAAAGGTTGAGTTTAAGCTAGGGTTGGTTGCTACCTCAGAGTTGAAGTTCTTTACGGAGGCTTATTTTGTTGGAGAGAAAATTGGTGAAGGAACTGGTACAACCAGAAGGGAAGCCCAGTATCGTGCTGCAGAGGCTGCTTTGATGAATCTGGCTGATAGATATTTGACCCATATAAAGTCCGATGCTAGCACTCCACAAAGTGATACAAGTAGGGGTCCGAGTCCAAAGGACATGGGATTTGCAAGTGATGCAAATTCTCAAGGGGATTGCACTTCAAGAAAGGAAGAGACAACAACACCTTCATCGGAGCTTACCAGGCTGGATGATTCTATTCTAGAGGGCTCTAAGGACTCCATGGGCTCTGTTTCCGTTCTTAAAGAATTGTGCATGATAGAGGGCCTTGGTGTCGAATTTAAAGGTCAGTCTCCGACTTCAACTAATCCAGTCCACGGAGATGAAATACACGCAGAGGTAGAAATAAATGGACAAGTTCTTGGCAAGGGCACAGGATTGACATGGGATGAGGCAAAGATGCAGGCTGCTGAGCTTGCTCTTGCAAGTCTTAAATCCATGCTGGGTCAAATTACTAAGCGTCCAAGCTCTCCGCGGTTGTTGCAAGGGATGGCCAGTAAACGCCTTAAACCAGAATATGCTCGGTCTATTAGAATAGAACTTAAAGAGGGTATTTATACTCGTCGTGTCCTTTATCTGGAAATCAGAGGTCAAGGAGCCATTCCTTTGACTCGTACTGATGAGAATTTGACTCCACGAGAAATGGAACAAAAAGCGGCTGAATTGGCCTATTTCTTGCGTGTACCAATTAAAATTACTCGATCCATTTCCATATCACTTATATTATATATAATAACTCGGTCATCCATTGCGAATGCCTATCCCATTTTCGCACAACAAGGTTATGAAAATCCACGAGAGGCGACGGGACGTATTGTATGTGCCAATTTCCATTTAGCTAATAAGCCTGTGGATATTGAGGTTCCACAAGCGGTCCTTCCAGATACTGTATTTGAAGCAGTTGTTCGAATTCCTTATGATATGCAATTAAAACAAGTTTTAGCTAACGGTAAAAAGGGTGGCTTGAATGTGGGGGATGTTCTTATTTTACCTGAGGGATTTGAATTAGC
CCCACCCGATCGTTCCCTCGCCGGCGGCTTGTGCTTTCCCTTTCGCATTGCTCCGAGTCGTGTCGCCATTGTCGCCGCCGCTAGTGGTGTCCACCAATCCACCATCACTTATTCCTCTTCACCAATTGGGGAAGGTGACAATTCAAACGCTGGATTAGTGATGGAAATATCAGAAGTTGCAAAATCAGGGAGAAATCCTTTCTTTTTCATTTCAGAGACTATGCCTCTGTCAAGTTGA

Coding sequence (CDS)

ATGATGAAAACGGTAGTTTATGAGGGTGATAATCTACTGGGTGAAGTAGAGATATATTTTGAGAACAACGCCAACAAGATTGAAATGATGAAGAAGGGGATGTTGATGAGGATAAGTCATTATTCAGAAGCAAGTGAGAGATGTCCACCTCTTGCTGTTCTTCATACTATTACTAAATCAACTGGTGGTGTTTCCTTCAAAATGATGGAGAAGTCGCTCTACTTTCAACAACACAATGATTCCCAGATTTTTGCTTTGCATTCTTCCTGTCTCAGAGGCAACAAGACGGCTGTGGTGTCCCTGGGTGAGCAGGAGATTCATCTGGTGGCAATGCGTTCAAGGAGAATGGATGGTGTAACAACCCCTTGCTTTTGGGGTTTCATTGTCATGCCAGGGTTATATGAATCTTGTCTTGGCATGTTAAATCTTAGATGTCTTGGTATTGTGTTTGATCTTGATGAGACGCTGATTGTTGCAAACACACTGCGATCTTTCGAGGATAGAATTGAGGCCTTGCAAAGAAAAATGACTGTAGAAGCTGACCCGCAACGTATGGCGGGTATGATGGCAGAAGTGAAACGATACCAGGAAGATAAGGCTATACTGAAGCAATATGCTGAAACTGACCAGGTGGTGGATAATGGGAAAGTCCATAAAATTCAAGCTGAAGTTATTCCAGCTCTATCTGACAACCACCAAACAGTTGTTCGACCGCTTATTCGGTTACAGGATAAAAATATTGTCCTTACTCGAATTAATCCTCAGATACGCGATACAAGTGTTCTTGTAAGATTAAGACCTGCATGGGAAGATCTACGCAGCTATCTAACTGCCAGAGGCCGTAAACGCTTTGAGGTTTATGTTTGTACAATGGCTGAAAGAGATTACGCTTTAGAAATGTGGAGGCTTCTTGATCCTGACTCAAATTTGATTGGTGGGAGGGAACTTTTGGATCGTATTGTGTGTGTCAAATCTGGATCAAGGAAGTCGTTGTTTAATGTTTTCCAAGGTGGGATTTGTCACCCCAAAATGGCTTTAGTAATTGATGATCGTCTAAAGGTGTGGGATGAGAAAGATCAGCCACGGGTGCATGTTGTGCCTGCATTTGCTCCTTATTATGCTCCCCAAGCCGAGGCAAATAATGCCATCCCAGTTCTCTGCGTGGCTAGGAATGTAGCTTGCAATGTCCGAGGTGGTTTTTTCAAAGAATTTGATGAGGGTCTCTTGCAACGAATGTCTGACGTTTATTTTGAAGATGATCCCAAAGATTTTCCTTCCCCCCCTGACGTGAGCAATTACTTGGTATCAGAGGAAGCAGTTTTATCCTCTTCTTCAGCCCCTTCTCTTCCGTGTGTAACATCTTTGGCGACTGTGAATCTTGATCATAGGCTGGCATCTTCTCTCCCGTTCTCTGTTGCTGCTTCTTCCATGACAATTCCACAACCTGCACCTCAAGCATCAATTGCACCTTTCCATGCTAACCTATTTTCACAAGCAGGTCCTTTAGCGAGAACATTGGCTAGTATTGGTCCCAAGGACCTTGGCCTGCACAGTTCCCCTGCTCGAGAAGAAGGTGAAGTACCTGAATCTGAGTTAGATCCTGATACAAGGAGACGGCTTCTTATATTGCAGCATGGCCAAGATATGAGAGAAGGCTTACCAAATGAGCCTCCGTTCCCGGGAAGACCTCCAGTTCAAGCTCCTGTTGCAGGTCCTGGTTCTGGTCCTGTTCCAGTCCCTGGTCCAGTGCCTGTTGCAGGTCCTGGCTCAGCTTCAATCTCAGTTCCGGGTCCTGGTCCTGTTCCTATGTCTGGTTCTGTTCCAGCTCCTGCTCCTGTTCCTGTTCCTGTTCCACGGGTACAATCACGCGGGAGTTGGTTTCAAGTCGAGGATCACATGAACCCAAGTCCTCTGGGCCGATCAGCCACTAAAGAATTTCCTATGTCTCCTGATGCTGTACATGTTGAGAAGCAGCGGCCACCTCCCCCTTTTCCTCGAAA
AGTGGAGAATCCAGTTTGGTCTGATCGAAGTTTCCCTGAAAAACAAAGACTGCCGAGGGAGGCTTCTCGCAGAGATGAGAGATTGAGGTCAAACTATTCAGTGCCTAGTCATCAATCATTTCGAGGCGATGAAATTTCTTTGAGCCGATCAGTCTCAAGCAACAAGGGTTTCGAAGTTGAACCTGAAAAAGGCAGTTCATTGTCGGAGAATCCTTCAGTTGCTTTACATGACATTGCAATGAGGTGTGGAGCAAAGGTTGAGTTTAAGCTAGGGTTGGTTGCTACCTCAGAGTTGAAGTTCTTTACGGAGGCTTATTTTGTTGGAGAGAAAATTGGTGAAGGAACTGGTACAACCAGAAGGGAAGCCCAGTATCGTGCTGCAGAGGCTGCTTTGATGAATCTGGCTGATAGATATTTGACCCATATAAAGTCCGATGCTAGCACTCCACAAAGTGATACAAGTAGGGGTCCGAGTCCAAAGGACATGGGATTTGCAAGTGATGCAAATTCTCAAGGGGATTGCACTTCAAGAAAGGAAGAGACAACAACACCTTCATCGGAGCTTACCAGGCTGGATGATTCTATTCTAGAGGGCTCTAAGGACTCCATGGGCTCTGTTTCCGTTCTTAAAGAATTGTGCATGATAGAGGGCCTTGGTGTCGAATTTAAAGGTCAGTCTCCGACTTCAACTAATCCAGTCCACGGAGATGAAATACACGCAGAGGTAGAAATAAATGGACAAGTTCTTGGCAAGGGCACAGGATTGACATGGGATGAGGCAAAGATGCAGGCTGCTGAGCTTGCTCTTGCAAGTCTTAAATCCATGCTGGGTCAAATTACTAAGCGTCCAAGCTCTCCGCGGTTGTTGCAAGGGATGGCCAGTAAACGCCTTAAACCAGAATATGCTCGGTCTATTAGAATAGAACTTAAAGAGGGTATTTATACTCGTCGTGTCCTTTATCTGGAAATCAGAGGTCAAGGAGCCATTCCTTTGACTCGTACTGATGAGAATTTGACTCCACGAGAAATGGAACAAAAAGCGGCTGAATTGGCCTATTTCTTGCGTGTACCAATTAAAATTACTCGATCCATTTCCATATCACTTATATTATATATAATAACTCGGTCATCCATTGCGAATGCCTATCCCATTTTCGCACAACAAGGTTATGAAAATCCACGAGAGGCGACGGGACGTATTGTATGTGCCAATTTCCATTTAGCTAATAAGCCTGTGGATATTGAGGTTCCACAAGCGGTCCTTCCAGATACTGTATTTGAAGCAGTTGTTCGAATTCCTTATGATATGCAATTAAAACAAGTTTTAGCTAACGGTAAAAAGGGTGGCTTGAATGTGGGGGATGTTCTTATTTTACCTGAGGGATTTGAATTAGCCCCACCCGATCGTTCCCTCGCCGGCGGCTTGTGCTTTCCCTTTCGCATTGCTCCGAGTCGTGTCGCCATTGTCGCCGCCGCTAGTGGTGTCCACCAATCCACCATCACTTATTCCTCTTCACCAATTGGGGAAGGTGACAATTCAAACGCTGGATTAGTGATGGAAATATCAGAAGTTGCAAAATCAGGGAGAAATCCTTTCTTTTTCATTTCAGAGACTATGCCTCTGTCAAGTTGA

Protein sequence

MMKTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKSTGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVTTPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEADPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDDPKDFPSPPDVSNYLVSEEAVLSSSSAPSLPCVTSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLARTLASIGPKDLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGPGSGPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSRGSWFQVEDHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQRLPREASRRDERLRSNYSVPSHQSFRGDEISLSRSVSSNKGFEVEPEKGSSLSENPSVALHDIAMRCGAKVEFKLGLVATSELKFFTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLADRYLTHIKSDASTPQSDTSRGPSPKDMGFASDANSQGDCTSRKEETTTPSSELTRLDDSILEGSKDSMGSVSVLKELCMIEGLGVEFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTGLTWDEAKMQAAELALASLKSMLGQITKRPSSPRLLQGMASKRLKPEYARSIRIELKEGIYTRRVLYLEIRGQGAIPLTRTDENLTPREMEQKAAELAYFLRVPIKITRSISISLILYIITRSSIANAYPIFAQQGYENPREATGRIVCANFHLANKPVDIEVPQAVLPDTVFEAVVRIPYDMQLKQVLANGKKGGLNVGDVLILPEGFELAPPDRSLAGGLCFPFRIAPSRVAIVAAASGVHQSTITYSSSPIGEGDNSNAGLVMEISEVAKSGRNPFFFISETMPLSS
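
The protein sequence above is the conceptual translation of the coding sequence (CDS). The following is a minimal sketch of that translation, assuming the standard nuclear genetic code; the names `translate`, `CODON_TABLE`, and the `to_stop` parameter are illustrative and not part of this database. The two short inputs are the first seven codons and the last four codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 1.

    Codons containing ambiguous bases (e.g. N) become 'X'.
    With to_stop=True, translation ends at the first stop codon.
    """
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATGAAAACGGTAGTTTAT"))  # MMKTVVY (start of the protein)
print(translate("CTGTCAAGTTGA"))           # LSS (end of the protein, stop at TGA)
```

Passing the full CDS string to `translate` should reproduce the protein sequence shown above, provided the record is internally consistent.
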
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name          Unique Name           Type
Spo14122              Spo14122              gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name          Unique Name           Type
Spo14122.1            Spo14122.1-protein    polypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Spo14122.1.utr5p.1    Spo14122.1.utr5p.1    five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Spo14122.1.CDS.20     Spo14122.1.CDS.20     CDS
Spo14122.1.CDS.19     Spo14122.1.CDS.19     CDS
Spo14122.1.CDS.18     Spo14122.1.CDS.18     CDS
Spo14122.1.CDS.17     Spo14122.1.CDS.17     CDS
Spo14122.1.CDS.16     Spo14122.1.CDS.16     CDS
Spo14122.1.CDS.15     Spo14122.1.CDS.15     CDS
Spo14122.1.CDS.14     Spo14122.1.CDS.14     CDS
Spo14122.1.CDS.13     Spo14122.1.CDS.13     CDS
Spo14122.1.CDS.12     Spo14122.1.CDS.12     CDS
Spo14122.1.CDS.11     Spo14122.1.CDS.11     CDS
Spo14122.1.CDS.10     Spo14122.1.CDS.10     CDS
Spo14122.1.CDS.9      Spo14122.1.CDS.9      CDS
Spo14122.1.CDS.8      Spo14122.1.CDS.8      CDS
Spo14122.1.CDS.7      Spo14122.1.CDS.7      CDS
Spo14122.1.CDS.6      Spo14122.1.CDS.6      CDS
Spo14122.1.CDS.5      Spo14122.1.CDS.5      CDS
Spo14122.1.CDS.4      Spo14122.1.CDS.4      CDS
Spo14122.1.CDS.3      Spo14122.1.CDS.3      CDS
Spo14122.1.CDS.2      Spo14122.1.CDS.2      CDS
Spo14122.1.CDS.1      Spo14122.1.CDS.1      CDS


The following exon feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Spo14122.1.exon.20    Spo14122.1.exon.20    exon
Spo14122.1.exon.19    Spo14122.1.exon.19    exon
Spo14122.1.exon.18    Spo14122.1.exon.18    exon
Spo14122.1.exon.17    Spo14122.1.exon.17    exon
Spo14122.1.exon.16    Spo14122.1.exon.16    exon
Spo14122.1.exon.15    Spo14122.1.exon.15    exon
Spo14122.1.exon.14    Spo14122.1.exon.14    exon
Spo14122.1.exon.13    Spo14122.1.exon.13    exon
Spo14122.1.exon.12    Spo14122.1.exon.12    exon
Spo14122.1.exon.11    Spo14122.1.exon.11    exon
Spo14122.1.exon.10    Spo14122.1.exon.10    exon
Spo14122.1.exon.9     Spo14122.1.exon.9     exon
Spo14122.1.exon.8     Spo14122.1.exon.8     exon
Spo14122.1.exon.7     Spo14122.1.exon.7     exon
Spo14122.1.exon.6     Spo14122.1.exon.6     exon
Spo14122.1.exon.5     Spo14122.1.exon.5     exon
Spo14122.1.exon.4     Spo14122.1.exon.4     exon
Spo14122.1.exon.3     Spo14122.1.exon.3     exon
Spo14122.1.exon.2     Spo14122.1.exon.2     exon
Spo14122.1.exon.1     Spo14122.1.exon.1     exon


Homology
BLAST of Spo14122.1 vs. NCBI nr
Match: gi|902176661|gb|KNA08810.1| (hypothetical protein SOVF_159350 isoform A [Spinacia oleracea])

HSP 1 Score: 1875.1 bits (4856), Expect = 0.000e+0
Identity = 969/1000 (96.90%), Positives = 970/1000 (97.00%), Query Frame = 1

Query: 1    MMKTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKS 60
            MMKTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKS
Sbjct: 1    MMKTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKS 60

Query: 61   TGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVT 120
            TGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVT
Sbjct: 61   TGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVT 120

Query: 121  TPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEA 180
            TPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEA
Sbjct: 121  TPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEA 180

Query: 181  DPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLI 240
            DPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLI
Sbjct: 181  DPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLI 240

Query: 241  RLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 300
            RLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM
Sbjct: 241  RLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 300

Query: 301  WRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQ 360
            WRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQ
Sbjct: 301  WRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQ 360

Query: 361  PRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDD 420
            PRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDD
Sbjct: 361  PRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDD 420

Query: 421  PKDFPSPPDVSNYLVSE----------------------------EAVLSSSSAPSLPCV 480
            PKDFPSPPDVSNYLVSE                            EAVLSSSSAPSLPCV
Sbjct: 421  PKDFPSPPDVSNYLVSEDDGSGSNANKEPICFDGMADAEVERRLKEAVLSSSSAPSLPCV 480

Query: 481  TSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLARTLASIGPK 540
            TSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLARTLASIGPK
Sbjct: 481  TSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLARTLASIGPK 540

Query: 541  DLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGP 600
            DLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGP
Sbjct: 541  DLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGP 600

Query: 601  GS--GPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSRGSWFQVE 660
            GS  GPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSRGSWFQVE
Sbjct: 601  GSGPGPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSRGSWFQVE 660

Query: 661  DHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQRLPREASR 720
            DHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQRLPREASR
Sbjct: 661  DHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQRLPREASR 720

Query: 721  RDERLRSNYSVPSHQSFRGDEISLSRSVSSNKGFEVEPEKGSSLSENPSVALHDIAMRCG 780
            RDERLRSNYSVPSHQSFRGDEISLSRSVSSNKGFEVEPEKGSSLSENPSVALHDIAMRCG
Sbjct: 721  RDERLRSNYSVPSHQSFRGDEISLSRSVSSNKGFEVEPEKGSSLSENPSVALHDIAMRCG 780

Query: 781  AKVEFKLGLVATSELKFFTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLADRYLTHIK 840
            AKVEFKLGLVATSELKFFTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLADRYLTHIK
Sbjct: 781  AKVEFKLGLVATSELKFFTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLADRYLTHIK 840

Query: 841  SDASTPQSDTSRGPSPKDMGFASDANSQGDCTSRKEETTTPSSELTRLDDSILEGSKDSM 900
            SDASTPQSDTSRGPSPKDMGFASDANSQGDCTS+KEETTTPSSELTRLDDSILEGSKDSM
Sbjct: 841  SDASTPQSDTSRGPSPKDMGFASDANSQGDCTSKKEETTTPSSELTRLDDSILEGSKDSM 900

Query: 901  GSVSVLKELCMIEGLGVEFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTGLTWDEAKMQ 960
            GSVSVLKELCMIEGLGVEFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTGLTWDEAKMQ
Sbjct: 901  GSVSVLKELCMIEGLGVEFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTGLTWDEAKMQ 960

Query: 961  AAELALASLKSMLGQITKRPSSPRLLQGMASKRLKPEYAR 971
            AAELALASLKSMLGQITKRPSSPRLLQGMASKRLKPEYAR
Sbjct: 961  AAELALASLKSMLGQITKRPSSPRLLQGMASKRLKPEYAR 1000
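
The percentages in an HSP summary line are simply the matched positions divided by the alignment length. As a minimal sketch (the function name `hsp_fractions` is hypothetical and the regular expression assumes the `label = matches/length` layout used on this page), the values for the first hit can be recomputed as follows:

```python
import re


def hsp_fractions(line):
    """Return {label: (matches, length, percent)} for each 'label = a/b' pair."""
    stats = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(num), int(den)
        stats[label] = (matches, length, round(100 * matches / length, 2))
    return stats


# Fractions taken from the first hit (KNA08810.1) reported above.
stats = hsp_fractions("Identity = 969/1000 (96.90%), Query Frame = 1")
print(stats["Identity"])  # (969, 1000, 96.9)
```

Fields without a fraction, such as `Query Frame = 1`, are skipped because the pattern requires a slash.
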

BLAST of Spo14122.1 vs. NCBI nr
Match: gi|902176662|gb|KNA08811.1| (hypothetical protein SOVF_159350 isoform B [Spinacia oleracea])

HSP 1 Score: 1812.7 bits (4694), Expect = 0.000e+0
Identity = 953/1043 (91.37%), Positives = 957/1043 (91.75%), Query Frame = 1

Query: 1    MMKTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKS 60
            MMKTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKS
Sbjct: 1    MMKTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKS 60

Query: 61   TGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVT 120
            TGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVT
Sbjct: 61   TGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVT 120

Query: 121  TPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEA 180
            TPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEA
Sbjct: 121  TPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEA 180

Query: 181  DPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLI 240
            DPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLI
Sbjct: 181  DPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLI 240

Query: 241  RLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 300
            RLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM
Sbjct: 241  RLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 300

Query: 301  WRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQ 360
            WRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQ
Sbjct: 301  WRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQ 360

Query: 361  PRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDD 420
            PRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDD
Sbjct: 361  PRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDD 420

Query: 421  PKDFPSP----------------------------PDVSNYLVSEEAVLSSSSAPSLPCV 480
            PKDFPSP                             D       +EAVLSSSSAPSLPCV
Sbjct: 421  PKDFPSPPDVSNYLVSEDDGSGSNANKEPICFDGMADAEVERRLKEAVLSSSSAPSLPCV 480

Query: 481  TSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLARTLASIGPK 540
            TSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLARTLASIGPK
Sbjct: 481  TSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLARTLASIGPK 540

Query: 541  DLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGP 600
            DLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGP
Sbjct: 541  DLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGP 600

Query: 601  GSGP--VPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSRGSWFQVE 660
            GSGP  VPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSRGSWFQVE
Sbjct: 601  GSGPGPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSRGSWFQVE 660

Query: 661  DHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQRLPREASR 720
            DHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQRLPREASR
Sbjct: 661  DHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQRLPREASR 720

Query: 721  RDERLRSNYS-------------------------------------------VPSHQSF 780
            RDERLRSNYS                                           +  H+ F
Sbjct: 721  RDERLRSNYSVPSHQSFRGGSTKKVKLMTRQSSLMEFTPDICSPGVDAQVRDHILCHRPF 780

Query: 781  RGDEISLSRSVSSNKGFEVEPEKGSSLSENPSVALHDIAMRCGAKVEFKLGLVATSELKF 840
              DEISLSRSVSSNKGFEVEPEKGSSLSENPSVALHDIAMRCGAKVEFKLGLVATSELKF
Sbjct: 781  C-DEISLSRSVSSNKGFEVEPEKGSSLSENPSVALHDIAMRCGAKVEFKLGLVATSELKF 840

Query: 841  FTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLADRYLTHIKSDASTPQSDTSRGPSPK 900
            FTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLADRYLTHIKSDASTPQSDTSRGPSPK
Sbjct: 841  FTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLADRYLTHIKSDASTPQSDTSRGPSPK 900

Query: 901  DMGFASDANSQGDCTSRKEETTTPSSELTRLDDSILEGSKDSMGSVSVLKELCMIEGLGV 960
            DMGFASDANSQGDCTS+KEETTTPSSELTRLDDSILEGSKDSMGSVSVLKELCMIEGLGV
Sbjct: 901  DMGFASDANSQGDCTSKKEETTTPSSELTRLDDSILEGSKDSMGSVSVLKELCMIEGLGV 960

Query: 961  EFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTGLTWDEAKMQAAELALASLKSMLGQIT 971
            EFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTGLTWDEAKMQAAELALASLKSMLGQIT
Sbjct: 961  EFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTGLTWDEAKMQAAELALASLKSMLGQIT 1020

BLAST of Spo14122.1 vs. NCBI nr
Match: gi|731350518|ref|XP_010686545.1| (PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 1512.3 bits (3914), Expect = 0.000e+0
Identity = 800/1016 (78.74%), Positives = 867/1016 (85.33%), Query Frame = 1

Query: 1    MMKTVVYEGDNLLGEVEIYFENNANK--IEMMKKGMLMRISHYSEASERCPPLAVLHTIT 60
            M+K+VVYEG+NLLGEVEIYF+NN N   +E+MK    MRISHYSE SERCPPLAVLHTIT
Sbjct: 1    MIKSVVYEGENLLGEVEIYFQNNNNNKNLELMKG---MRISHYSEMSERCPPLAVLHTIT 60

Query: 61   KSTGGVSFKMMEKS------------LYFQQHN-DSQIFALHSSCLRGNKTAVVSLGEQE 120
            KS+GG+ FKMME S             YFQQ   +SQ+ A+HS+C+R NKTAVV LGEQE
Sbjct: 61   KSSGGICFKMMESSSHTSSNNNNNNKFYFQQQQQESQLLAMHSNCIRDNKTAVVPLGEQE 120

Query: 121  IHLVAMRSRRMDGVTTPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSF 180
            IHLVA+RSRRM GVT PCFWGF V PGLYESCLG+LNLRCLGIVFDLDETLIVANTLRSF
Sbjct: 121  IHLVALRSRRMAGVT-PCFWGFSVAPGLYESCLGLLNLRCLGIVFDLDETLIVANTLRSF 180

Query: 181  EDRIEALQRKMTVEADPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVI 240
            EDRIEALQRK++VEADPQR+AGM+AEVKRYQEDK+ILKQYAETDQVVDNGKVHKIQAEVI
Sbjct: 181  EDRIEALQRKISVEADPQRIAGMVAEVKRYQEDKSILKQYAETDQVVDNGKVHKIQAEVI 240

Query: 241  PALSDNHQTVVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFE 300
            PALSDNHQTV+RPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFE
Sbjct: 241  PALSDNHQTVIRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFE 300

Query: 301  VYVCTMAERDYALEMWRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMA 360
            VYVCTMAERDYALEMWRLLDPDSNLI  RELLDRIVCVKSGS+KSLFNVF GGICHPKMA
Sbjct: 301  VYVCTMAERDYALEMWRLLDPDSNLICARELLDRIVCVKSGSKKSLFNVFHGGICHPKMA 360

Query: 361  LVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFD 420
            LVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFD
Sbjct: 361  LVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFD 420

Query: 421  EGLLQRMSDVYFEDDPKDFPSPPDVSNYLV----------------------------SE 480
            EGLLQR+S+V FEDDP+D PSPPDVSNYLV                             +
Sbjct: 421  EGLLQRVSEVSFEDDPRDIPSPPDVSNYLVSEDDGSGSNGIKESMTFDGMADAEVERRLK 480

Query: 481  EAVLSSSSAPSLPCVTSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFS 540
            EAVLSSSSA  LP   + ATVN DHRLASSLPF+VA S++ IPQPAPQA+I P+H NLFS
Sbjct: 481  EAVLSSSSASPLPSANTPATVNFDHRLASSLPFAVATSALAIPQPAPQATITPYHNNLFS 540

Query: 541  QAGPLARTLASIGPKDLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEP 600
            QAGPLAR L +IGP+D+GLH+SPAREEGEVPESELDPDTRRRLLILQHGQDMREG PNEP
Sbjct: 541  QAGPLARPLGNIGPQDIGLHNSPAREEGEVPESELDPDTRRRLLILQHGQDMREGPPNEP 600

Query: 601  PFPGRPPVQAPVAGPGSGPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPV 660
            PFP R PVQAPV GP   PV VPGPVPV GPG   +SVP PGP+P    VP    VP P 
Sbjct: 601  PFPARTPVQAPVTGPV--PVSVPGPVPVPGPGP--VSVPVPGPIPSPVPVPVSGTVPGPG 660

Query: 661  PRVQSRGSWFQVEDHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPP-FPRKVENPVWSDR 720
            PRVQSRGSWF VEDH++  PL R A KEFP++PDA  VEKQRPPPP FPRKVE+  WSDR
Sbjct: 661  PRVQSRGSWFPVEDHISQGPLSRVAAKEFPVAPDASPVEKQRPPPPSFPRKVESLGWSDR 720

Query: 721  SFPEKQRLPREASRRDERLRSNYSVPSHQSFRGDEISLSRSVSSNKGFEVEPEKGSSLSE 780
            ++ EKQRLPREA RRD+RLRSNYS+PSHQSFRGDEISLSRS SSNK FEVEPE+GSS +E
Sbjct: 721  NYAEKQRLPREALRRDDRLRSNYSLPSHQSFRGDEISLSRSASSNKDFEVEPERGSSFAE 780

Query: 781  NPSVALHDIAMRCGAKVEFKLGLVATSELKFFTEAYFVGEKIGEGTGTTRREAQYRAAEA 840
            +PS+ALHDIAM+CG KVEFK GLVAT ELKF  EAYF G+KIGEGTGTTRREAQ+RAAEA
Sbjct: 781  SPSIALHDIAMKCGTKVEFKTGLVATPELKFLLEAYFAGDKIGEGTGTTRREAQHRAAEA 840

Query: 841  ALMNLADRYLTHIKSDASTPQSDTSRGPSPKDMGFASDANSQGDCTSRKEETTTPSSELT 900
            ALMNLAD+YLTHIKSD+STPQSDTSRG SP D GF SDANS GD  SRKE+   PSSE+T
Sbjct: 841  ALMNLADKYLTHIKSDSSTPQSDTSRGHSPIDTGFVSDANSHGDGISRKED-IIPSSEMT 900

Query: 901  RLDDSILEGSKDSMGSVSVLKELCMIEGLGVEFKGQSPTSTNPVHGDEIHAEVEINGQVL 960
             LDDS ++GSK+SMGSVSVLKELC+ EGLGV+FKGQSPTSTN V  DEIHAEVEINGQVL
Sbjct: 901  GLDDSNVDGSKNSMGSVSVLKELCLREGLGVDFKGQSPTSTNSVDRDEIHAEVEINGQVL 960

Query: 961  GKGTGLTWDEAKMQAAELALASLKSMLGQITKRPSSPRLLQGMASKRLKPEYARSI 973
            GKGTGLTWDEAKMQAAE+AL SL SM+GQ  KRPSSPRLLQGM +KRLKPEY R +
Sbjct: 961  GKGTGLTWDEAKMQAAEMALTSLNSMIGQFNKRPSSPRLLQGMPNKRLKPEYPRVV 1007

BLAST of Spo14122.1 vs. NCBI nr
Match: gi|590624710|ref|XP_007025680.1| (C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao])

HSP 1 Score: 1018.8 bits (2633), Expect = 7.600e-294
Identity = 589/1023 (57.58%), Positives = 708/1023 (69.21%), Query Frame = 1

Query: 1   MMKTVVYEGDNLLGEVEIY------------FENNANKIEMMKKGML-MRISHYSEASER 60
           M K+VVY G+ +LGEVEIY             E +  KI +M++ M  +RI + ++ SER
Sbjct: 4   MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63

Query: 61  CPPLAVLHTITKSTGGVSFKM--MEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQE 120
           CPPLAVLHTIT S  G+ FKM   + + Y    +   +  LHS C+R NKTAV+ +G+ E
Sbjct: 64  CPPLAVLHTITSS--GICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCE 123

Query: 121 IHLVAMRSRRMDGVTTPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSF 180
           +HLVAM SR  D    PCFWGF V  GLY+SCL MLNLRCLGIVFDLDETLIVANT+RSF
Sbjct: 124 LHLVAMYSRNSD---RPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSF 183

Query: 181 EDRIEALQRKMTVEADPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVI 240
           EDRIEALQRKMT E DPQR+AGM+AE+KRYQ+DKAILKQYAE DQVV+NGKV KIQ+EV+
Sbjct: 184 EDRIEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVV 243

Query: 241 PALSDNHQTVVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFE 300
           PALSDNHQ ++RPLIRLQ+KNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFE
Sbjct: 244 PALSDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFE 303

Query: 301 VYVCTMAERDYALEMWRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMA 360
           VYVCTMAERDYALEMWRLLDP+SNLI  +ELLDRIVCVKSGSRKSLFNVFQ GICHPKMA
Sbjct: 304 VYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMA 363

Query: 361 LVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFD 420
           LVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANN IPVLCVARNVACNVRGGFF+EFD
Sbjct: 364 LVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFD 423

Query: 421 EGLLQRMSDVYFEDDPKDFPSPPDVSNYLVSEE--AVLSSSSAP---------------- 480
           EGLLQR+ ++ +EDD KD PSPPDV NYLVSE+  + L+ +  P                
Sbjct: 424 EGLLQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLK 483

Query: 481 ---SLPCVTSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLAR 540
              S     S A +NLD RL  SL +++ +SS +IP  A Q SI  F    F  A P+ +
Sbjct: 484 EAISATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVK 543

Query: 541 TLASIGPKDLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPG-RP 600
            +A +   +  L SSPAREEGEVPESELDPDTRRRLLILQHGQD R+  P EP FP  RP
Sbjct: 544 PVAPVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRP 603

Query: 601 PVQAPVAGPGSGPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSR 660
            +Q          V VP      G    S         P   +  AP   P+   R+   
Sbjct: 604 TMQ----------VSVP-----RGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSERM--- 663

Query: 661 GSWFQVEDHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQR 720
                +E H +P                           PF  KVE+ + SDR   E QR
Sbjct: 664 ----HIEKHRHP---------------------------PFFPKVESSIPSDRLLRENQR 723

Query: 721 LPREASRRDERLRSNYSVPSHQSFRGDEISLSRSVSSNKGFEVEPEKGSSLSENPSVALH 780
           L +EA  RD+RL  N++  S+ SF G+E+ LS+S SS++  + E  +  +  E  +  L 
Sbjct: 724 LSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGRTVTSGETSAGVLQ 783

Query: 781 DIAMRCGAKVEFKLGLVATSELKFFTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLAD 840
           DIAM+CGAKVEF+  LVA+ +L+F  EA+F GEK+GEG G TRREAQ +AAE ++ NLA+
Sbjct: 784 DIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQAAEESIKNLAN 843

Query: 841 RYLTHIKSDASTPQSDTSRGPSPKDMGFASDANSQGDCTSRKEETT--TPSSELTRLDDS 900
            YL+ IK D+ + + D SR  +  D GF S+ NS G+    KEE+   + +SE +RL D 
Sbjct: 844 TYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKEESLSFSTASEQSRLADP 903

Query: 901 ILEGSKDSMGSVSVLKELCMIEGLGVEFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTG 960
            LEGSK SMGSV+ LKELCM+EGLGV F+ Q P+S+N +  DE++A+VEI+GQVLGKGTG
Sbjct: 904 RLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDEVYAQVEIDGQVLGKGTG 963

Query: 961 LTWDEAKMQAAELALASLKSMLGQIT-KRPSSPRLLQGMASKRLKPEYARSIRIELKEGI 984
           LTW+EAKMQAAE AL SL+SMLGQ + KR  SPR LQGM +KRLKPE+ R ++     G 
Sbjct: 964 LTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPRSLQGMQNKRLKPEFPRVLQRMPSSGR 972

BLAST of Spo14122.1 vs. NCBI nr
Match: gi|731439813|ref|XP_002267987.3| (PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Vitis vinifera])

HSP 1 Score: 1010.4 bits (2611), Expect = 2.700e-291
Identity = 588/1003 (58.62%), Positives = 694/1003 (69.19%), Query Frame = 1

		  

Query: 1   MMKTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKS 60
           M K++VYEGD+++GEVEIY +N    +E+MK+   +RISHYS+ SERCPPLAVLHTIT  
Sbjct: 1   MYKSIVYEGDDVVGEVEIYPQNQG--LELMKE---IRISHYSQPSERCPPLAVLHTITSC 60

Query: 61  TGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVT 120
             GV FKM        Q  D+ ++ LHS+C+R NKTAV+SLGE+E+HLVAM S++ DG  
Sbjct: 61  --GVCFKMESSKA---QSQDTPLYLLHSTCIRENKTAVMSLGEEELHLVAMYSKKKDG-Q 120

Query: 121 TPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEA 180
            PCFWGF V  GLY SCL MLNLRCLGIVFDLDETLIVANT+RSFEDRI+ALQRK+  E 
Sbjct: 121 YPCFWGFNVALGLYSSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEV 180

Query: 181 DPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLI 240
           DPQR++GM AEV+RYQ+D+ ILKQYAE DQVV+NGK+ K Q E++PALSDNHQ +VRPLI
Sbjct: 181 DPQRISGMAAEVRRYQDDRNILKQYAENDQVVENGKLFKTQPEIVPALSDNHQPIVRPLI 240

Query: 241 RLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 300
           RLQ+KNI+LTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM
Sbjct: 241 RLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 300

Query: 301 WRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQ 360
           WRLLDP+SNLI  +ELLDRIVCVKSGSRKSLFNVFQ GICHPKMALVIDDRLKVWDEKDQ
Sbjct: 301 WRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRLKVWDEKDQ 360

Query: 361 PRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDD 420
           PRVHVVPAFAPYYAPQAEANNAI VLCVARNVACNVRGGFFKEFDEGLLQR+ ++ +EDD
Sbjct: 361 PRVHVVPAFAPYYAPQAEANNAISVLCVARNVACNVRGGFFKEFDEGLLQRIPEISYEDD 420

Query: 421 PKDFPSPPDVSNYLVSEEAVLSSSSAPSLPCVTSLATV-----------------NLDHR 480
            KD  S PDVSNYLVSE+    S+     PC   +A V                 +LD R
Sbjct: 421 IKDIRSAPDVSNYLVSEDDASVSNGNRDQPCFDGMADVEVERKLKDAISAPSTVTSLDPR 480

Query: 481 LASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLARTLASIGPKDLGLHSSPARE 540
           L+  L F+VAASS   PQPA Q SI PF    F Q+  L + LA     +  + SSPARE
Sbjct: 481 LSPPLQFAVAASSGLAPQPAAQGSIMPFSNKQFPQSASLIKPLA----PEPTMQSSPARE 540

Query: 541 EGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGPGSGPVPVPGPV 600
           EGEVPESELDPDTRRRLLILQHGQD RE   ++PPFP RPP+Q          V VP   
Sbjct: 541 EGEVPESELDPDTRRRLLILQHGQDTREHASSDPPFPVRPPIQ----------VSVP--- 600

Query: 601 PVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSRGSWFQVEDHMNPSPLGRSAT 660
            V   GS   +     P  ++ +VP   P+               +E H    P      
Sbjct: 601 RVQSRGSWFPADEEMSPRQLNRAVPKEFPLD---------SDTMHIEKHRPHHPSFFHKV 660

Query: 661 KEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQRLPREASRRDERLRSNYSVPS 720
           +    S   +H              EN           QRL +E   RD+RLR N+S+P 
Sbjct: 661 ESSASSDRILH--------------EN-----------QRLSKEVLHRDDRLRLNHSLPG 720

Query: 721 HQSFRGDEISLSRSVSSNKGFEVEPEKGSSLSENPSVALHDIAMRCGAKVEFKLGLVATS 780
           + SF G+E+ L RS SSN+  + E  +G+  +E P+V L +IAM+CG K+EF+  LVA +
Sbjct: 721 YHSFSGEEVPLGRS-SSNRDLDFESGRGAPYAETPAVGLQEIAMKCGTKLEFRPSLVAAT 780

Query: 781 ELKFFTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLADRYLTHIKSDASTPQSDTSRG 840
           EL+F  E +F GEKIGEGTG TRREAQ +AAEA+LM L+ RYL            D +R 
Sbjct: 781 ELQFSIEVWFAGEKIGEGTGKTRREAQCQAAEASLMYLSYRYL----------HGDVNRF 840

Query: 841 PSPKDMGFASDANSQGDCTSRKEETT--TPSSELTRLDDSILEGSKDSMGSVSVLKELCM 900
           P+  D  F SD NS G  +  KE +   + +SE +RL D  LE SK SMGS+S LKELCM
Sbjct: 841 PNASDNNFMSDTNSFGYQSFPKEGSMSFSTASESSRLLDPRLESSKKSMGSISALKELCM 900

Query: 901 IEGLGVEFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTGLTWDEAKMQAAELALASLKS 960
           +EGLGVEF  Q P S+N    +EI A+VEI+GQVLGKGTG TWD+AKMQAAE AL SLKS
Sbjct: 901 MEGLGVEFLSQPPLSSNSTQKEEICAQVEIDGQVLGKGTGSTWDDAKMQAAEKALGSLKS 929

Query: 961 MLGQIT-KRPSSPRLLQGMASKRLKPEYARSIRIELKEGIYTR 984
           MLGQ + KR  SPR LQGM  KRLK E+ R ++     G Y++
Sbjct: 961 MLGQFSQKRQGSPRSLQGM-GKRLKSEFTRGLQRTPSSGRYSK 929

BLAST of Spo14122.1 vs. UniProtKB/TrEMBL
Match: A0A0K9QNK5_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_159350 PE=4 SV=1)

HSP 1 Score: 1875.1 bits (4856), Expect = 0.000e+0
Identity = 969/1000 (96.90%), Positives = 970/1000 (97.00%), Query Frame = 1

		  

Query: 1    MMKTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKS 60
            MMKTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKS
Sbjct: 1    MMKTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKS 60

Query: 61   TGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVT 120
            TGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVT
Sbjct: 61   TGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVT 120

Query: 121  TPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEA 180
            TPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEA
Sbjct: 121  TPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEA 180

Query: 181  DPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLI 240
            DPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLI
Sbjct: 181  DPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLI 240

Query: 241  RLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 300
            RLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM
Sbjct: 241  RLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 300

Query: 301  WRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQ 360
            WRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQ
Sbjct: 301  WRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQ 360

Query: 361  PRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDD 420
            PRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDD
Sbjct: 361  PRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDD 420

Query: 421  PKDFPSPPDVSNYLVSE----------------------------EAVLSSSSAPSLPCV 480
            PKDFPSPPDVSNYLVSE                            EAVLSSSSAPSLPCV
Sbjct: 421  PKDFPSPPDVSNYLVSEDDGSGSNANKEPICFDGMADAEVERRLKEAVLSSSSAPSLPCV 480

Query: 481  TSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLARTLASIGPK 540
            TSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLARTLASIGPK
Sbjct: 481  TSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLARTLASIGPK 540

Query: 541  DLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGP 600
            DLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGP
Sbjct: 541  DLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGP 600

Query: 601  GS--GPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSRGSWFQVE 660
            GS  GPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSRGSWFQVE
Sbjct: 601  GSGPGPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSRGSWFQVE 660

Query: 661  DHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQRLPREASR 720
            DHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQRLPREASR
Sbjct: 661  DHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQRLPREASR 720

Query: 721  RDERLRSNYSVPSHQSFRGDEISLSRSVSSNKGFEVEPEKGSSLSENPSVALHDIAMRCG 780
            RDERLRSNYSVPSHQSFRGDEISLSRSVSSNKGFEVEPEKGSSLSENPSVALHDIAMRCG
Sbjct: 721  RDERLRSNYSVPSHQSFRGDEISLSRSVSSNKGFEVEPEKGSSLSENPSVALHDIAMRCG 780

Query: 781  AKVEFKLGLVATSELKFFTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLADRYLTHIK 840
            AKVEFKLGLVATSELKFFTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLADRYLTHIK
Sbjct: 781  AKVEFKLGLVATSELKFFTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLADRYLTHIK 840

Query: 841  SDASTPQSDTSRGPSPKDMGFASDANSQGDCTSRKEETTTPSSELTRLDDSILEGSKDSM 900
            SDASTPQSDTSRGPSPKDMGFASDANSQGDCTS+KEETTTPSSELTRLDDSILEGSKDSM
Sbjct: 841  SDASTPQSDTSRGPSPKDMGFASDANSQGDCTSKKEETTTPSSELTRLDDSILEGSKDSM 900

Query: 901  GSVSVLKELCMIEGLGVEFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTGLTWDEAKMQ 960
            GSVSVLKELCMIEGLGVEFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTGLTWDEAKMQ
Sbjct: 901  GSVSVLKELCMIEGLGVEFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTGLTWDEAKMQ 960

Query: 961  AAELALASLKSMLGQITKRPSSPRLLQGMASKRLKPEYAR 971
            AAELALASLKSMLGQITKRPSSPRLLQGMASKRLKPEYAR
Sbjct: 961  AAELALASLKSMLGQITKRPSSPRLLQGMASKRLKPEYAR 1000

BLAST of Spo14122.1 vs. UniProtKB/TrEMBL
Match: A0A0K9QQJ6_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_159350 PE=4 SV=1)

HSP 1 Score: 1812.7 bits (4694), Expect = 0.000e+0
Identity = 953/1043 (91.37%), Positives = 957/1043 (91.75%), Query Frame = 1

		  

Query: 1    MMKTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKS 60
            MMKTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKS
Sbjct: 1    MMKTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKS 60

Query: 61   TGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVT 120
            TGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVT
Sbjct: 61   TGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVT 120

Query: 121  TPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEA 180
            TPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEA
Sbjct: 121  TPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEA 180

Query: 181  DPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLI 240
            DPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLI
Sbjct: 181  DPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLI 240

Query: 241  RLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 300
            RLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM
Sbjct: 241  RLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 300

Query: 301  WRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQ 360
            WRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQ
Sbjct: 301  WRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQ 360

Query: 361  PRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDD 420
            PRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDD
Sbjct: 361  PRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDD 420

Query: 421  PKDFPSP----------------------------PDVSNYLVSEEAVLSSSSAPSLPCV 480
            PKDFPSP                             D       +EAVLSSSSAPSLPCV
Sbjct: 421  PKDFPSPPDVSNYLVSEDDGSGSNANKEPICFDGMADAEVERRLKEAVLSSSSAPSLPCV 480

Query: 481  TSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLARTLASIGPK 540
            TSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLARTLASIGPK
Sbjct: 481  TSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLARTLASIGPK 540

Query: 541  DLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGP 600
            DLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGP
Sbjct: 541  DLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGP 600

Query: 601  GSGP--VPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSRGSWFQVE 660
            GSGP  VPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSRGSWFQVE
Sbjct: 601  GSGPGPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSRGSWFQVE 660

Query: 661  DHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQRLPREASR 720
            DHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQRLPREASR
Sbjct: 661  DHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQRLPREASR 720

Query: 721  RDERLRSNYS-------------------------------------------VPSHQSF 780
            RDERLRSNYS                                           +  H+ F
Sbjct: 721  RDERLRSNYSVPSHQSFRGGSTKKVKLMTRQSSLMEFTPDICSPGVDAQVRDHILCHRPF 780

Query: 781  RGDEISLSRSVSSNKGFEVEPEKGSSLSENPSVALHDIAMRCGAKVEFKLGLVATSELKF 840
              DEISLSRSVSSNKGFEVEPEKGSSLSENPSVALHDIAMRCGAKVEFKLGLVATSELKF
Sbjct: 781  C-DEISLSRSVSSNKGFEVEPEKGSSLSENPSVALHDIAMRCGAKVEFKLGLVATSELKF 840

Query: 841  FTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLADRYLTHIKSDASTPQSDTSRGPSPK 900
            FTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLADRYLTHIKSDASTPQSDTSRGPSPK
Sbjct: 841  FTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLADRYLTHIKSDASTPQSDTSRGPSPK 900

Query: 901  DMGFASDANSQGDCTSRKEETTTPSSELTRLDDSILEGSKDSMGSVSVLKELCMIEGLGV 960
            DMGFASDANSQGDCTS+KEETTTPSSELTRLDDSILEGSKDSMGSVSVLKELCMIEGLGV
Sbjct: 901  DMGFASDANSQGDCTSKKEETTTPSSELTRLDDSILEGSKDSMGSVSVLKELCMIEGLGV 960

Query: 961  EFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTGLTWDEAKMQAAELALASLKSMLGQIT 971
            EFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTGLTWDEAKMQAAELALASLKSMLGQIT
Sbjct: 961  EFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTGLTWDEAKMQAAELALASLKSMLGQIT 1020

BLAST of Spo14122.1 vs. UniProtKB/TrEMBL
Match: A0A0J8BRZ7_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_8g183610 PE=4 SV=1)

HSP 1 Score: 1512.3 bits (3914), Expect = 0.000e+0
Identity = 800/1016 (78.74%), Positives = 867/1016 (85.33%), Query Frame = 1

		  

Query: 1    MMKTVVYEGDNLLGEVEIYFENNANK--IEMMKKGMLMRISHYSEASERCPPLAVLHTIT 60
            M+K+VVYEG+NLLGEVEIYF+NN N   +E+MK    MRISHYSE SERCPPLAVLHTIT
Sbjct: 1    MIKSVVYEGENLLGEVEIYFQNNNNNKNLELMKG---MRISHYSEMSERCPPLAVLHTIT 60

Query: 61   KSTGGVSFKMMEKS------------LYFQQHN-DSQIFALHSSCLRGNKTAVVSLGEQE 120
            KS+GG+ FKMME S             YFQQ   +SQ+ A+HS+C+R NKTAVV LGEQE
Sbjct: 61   KSSGGICFKMMESSSHTSSNNNNNNKFYFQQQQQESQLLAMHSNCIRDNKTAVVPLGEQE 120

Query: 121  IHLVAMRSRRMDGVTTPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSF 180
            IHLVA+RSRRM GVT PCFWGF V PGLYESCLG+LNLRCLGIVFDLDETLIVANTLRSF
Sbjct: 121  IHLVALRSRRMAGVT-PCFWGFSVAPGLYESCLGLLNLRCLGIVFDLDETLIVANTLRSF 180

Query: 181  EDRIEALQRKMTVEADPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVI 240
            EDRIEALQRK++VEADPQR+AGM+AEVKRYQEDK+ILKQYAETDQVVDNGKVHKIQAEVI
Sbjct: 181  EDRIEALQRKISVEADPQRIAGMVAEVKRYQEDKSILKQYAETDQVVDNGKVHKIQAEVI 240

Query: 241  PALSDNHQTVVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFE 300
            PALSDNHQTV+RPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFE
Sbjct: 241  PALSDNHQTVIRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFE 300

Query: 301  VYVCTMAERDYALEMWRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMA 360
            VYVCTMAERDYALEMWRLLDPDSNLI  RELLDRIVCVKSGS+KSLFNVF GGICHPKMA
Sbjct: 301  VYVCTMAERDYALEMWRLLDPDSNLICARELLDRIVCVKSGSKKSLFNVFHGGICHPKMA 360

Query: 361  LVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFD 420
            LVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFD
Sbjct: 361  LVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFD 420

Query: 421  EGLLQRMSDVYFEDDPKDFPSPPDVSNYLV----------------------------SE 480
            EGLLQR+S+V FEDDP+D PSPPDVSNYLV                             +
Sbjct: 421  EGLLQRVSEVSFEDDPRDIPSPPDVSNYLVSEDDGSGSNGIKESMTFDGMADAEVERRLK 480

Query: 481  EAVLSSSSAPSLPCVTSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFS 540
            EAVLSSSSA  LP   + ATVN DHRLASSLPF+VA S++ IPQPAPQA+I P+H NLFS
Sbjct: 481  EAVLSSSSASPLPSANTPATVNFDHRLASSLPFAVATSALAIPQPAPQATITPYHNNLFS 540

Query: 541  QAGPLARTLASIGPKDLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEP 600
            QAGPLAR L +IGP+D+GLH+SPAREEGEVPESELDPDTRRRLLILQHGQDMREG PNEP
Sbjct: 541  QAGPLARPLGNIGPQDIGLHNSPAREEGEVPESELDPDTRRRLLILQHGQDMREGPPNEP 600

Query: 601  PFPGRPPVQAPVAGPGSGPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPV 660
            PFP R PVQAPV GP   PV VPGPVPV GPG   +SVP PGP+P    VP    VP P 
Sbjct: 601  PFPARTPVQAPVTGPV--PVSVPGPVPVPGPGP--VSVPVPGPIPSPVPVPVSGTVPGPG 660

Query: 661  PRVQSRGSWFQVEDHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPP-FPRKVENPVWSDR 720
            PRVQSRGSWF VEDH++  PL R A KEFP++PDA  VEKQRPPPP FPRKVE+  WSDR
Sbjct: 661  PRVQSRGSWFPVEDHISQGPLSRVAAKEFPVAPDASPVEKQRPPPPSFPRKVESLGWSDR 720

Query: 721  SFPEKQRLPREASRRDERLRSNYSVPSHQSFRGDEISLSRSVSSNKGFEVEPEKGSSLSE 780
            ++ EKQRLPREA RRD+RLRSNYS+PSHQSFRGDEISLSRS SSNK FEVEPE+GSS +E
Sbjct: 721  NYAEKQRLPREALRRDDRLRSNYSLPSHQSFRGDEISLSRSASSNKDFEVEPERGSSFAE 780

Query: 781  NPSVALHDIAMRCGAKVEFKLGLVATSELKFFTEAYFVGEKIGEGTGTTRREAQYRAAEA 840
            +PS+ALHDIAM+CG KVEFK GLVAT ELKF  EAYF G+KIGEGTGTTRREAQ+RAAEA
Sbjct: 781  SPSIALHDIAMKCGTKVEFKTGLVATPELKFLLEAYFAGDKIGEGTGTTRREAQHRAAEA 840

Query: 841  ALMNLADRYLTHIKSDASTPQSDTSRGPSPKDMGFASDANSQGDCTSRKEETTTPSSELT 900
            ALMNLAD+YLTHIKSD+STPQSDTSRG SP D GF SDANS GD  SRKE+   PSSE+T
Sbjct: 841  ALMNLADKYLTHIKSDSSTPQSDTSRGHSPIDTGFVSDANSHGDGISRKED-IIPSSEMT 900

Query: 901  RLDDSILEGSKDSMGSVSVLKELCMIEGLGVEFKGQSPTSTNPVHGDEIHAEVEINGQVL 960
             LDDS ++GSK+SMGSVSVLKELC+ EGLGV+FKGQSPTSTN V  DEIHAEVEINGQVL
Sbjct: 901  GLDDSNVDGSKNSMGSVSVLKELCLREGLGVDFKGQSPTSTNSVDRDEIHAEVEINGQVL 960

Query: 961  GKGTGLTWDEAKMQAAELALASLKSMLGQITKRPSSPRLLQGMASKRLKPEYARSI 973
            GKGTGLTWDEAKMQAAE+AL SL SM+GQ  KRPSSPRLLQGM +KRLKPEY R +
Sbjct: 961  GKGTGLTWDEAKMQAAEMALTSLNSMIGQFNKRPSSPRLLQGMPNKRLKPEYPRVV 1007

BLAST of Spo14122.1 vs. UniProtKB/TrEMBL
Match: A0A061GGL6_THECC (C-terminal domain phosphatase-like 1 isoform 1 OS=Theobroma cacao GN=TCM_029910 PE=4 SV=1)

HSP 1 Score: 1018.8 bits (2633), Expect = 5.300e-294
Identity = 589/1023 (57.58%), Positives = 708/1023 (69.21%), Query Frame = 1

		  

Query: 1   MMKTVVYEGDNLLGEVEIY------------FENNANKIEMMKKGML-MRISHYSEASER 60
           M K+VVY G+ +LGEVEIY             E +  KI +M++ M  +RI + ++ SER
Sbjct: 4   MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63

Query: 61  CPPLAVLHTITKSTGGVSFKM--MEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQE 120
           CPPLAVLHTIT S  G+ FKM   + + Y    +   +  LHS C+R NKTAV+ +G+ E
Sbjct: 64  CPPLAVLHTITSS--GICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCE 123

Query: 121 IHLVAMRSRRMDGVTTPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSF 180
           +HLVAM SR  D    PCFWGF V  GLY+SCL MLNLRCLGIVFDLDETLIVANT+RSF
Sbjct: 124 LHLVAMYSRNSD---RPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSF 183

Query: 181 EDRIEALQRKMTVEADPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVI 240
           EDRIEALQRKMT E DPQR+AGM+AE+KRYQ+DKAILKQYAE DQVV+NGKV KIQ+EV+
Sbjct: 184 EDRIEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVV 243

Query: 241 PALSDNHQTVVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFE 300
           PALSDNHQ ++RPLIRLQ+KNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFE
Sbjct: 244 PALSDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFE 303

Query: 301 VYVCTMAERDYALEMWRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMA 360
           VYVCTMAERDYALEMWRLLDP+SNLI  +ELLDRIVCVKSGSRKSLFNVFQ GICHPKMA
Sbjct: 304 VYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMA 363

Query: 361 LVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFD 420
           LVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANN IPVLCVARNVACNVRGGFF+EFD
Sbjct: 364 LVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFD 423

Query: 421 EGLLQRMSDVYFEDDPKDFPSPPDVSNYLVSEE--AVLSSSSAP---------------- 480
           EGLLQR+ ++ +EDD KD PSPPDV NYLVSE+  + L+ +  P                
Sbjct: 424 EGLLQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLK 483

Query: 481 ---SLPCVTSLATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLAR 540
              S     S A +NLD RL  SL +++ +SS +IP  A Q SI  F    F  A P+ +
Sbjct: 484 EAISATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVK 543

Query: 541 TLASIGPKDLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPG-RP 600
            +A +   +  L SSPAREEGEVPESELDPDTRRRLLILQHGQD R+  P EP FP  RP
Sbjct: 544 PVAPVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRP 603

Query: 601 PVQAPVAGPGSGPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSR 660
            +Q          V VP      G    S         P   +  AP   P+   R+   
Sbjct: 604 TMQ----------VSVP-----RGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSERM--- 663

Query: 661 GSWFQVEDHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQR 720
                +E H +P                           PF  KVE+ + SDR   E QR
Sbjct: 664 ----HIEKHRHP---------------------------PFFPKVESSIPSDRLLRENQR 723

Query: 721 LPREASRRDERLRSNYSVPSHQSFRGDEISLSRSVSSNKGFEVEPEKGSSLSENPSVALH 780
           L +EA  RD+RL  N++  S+ SF G+E+ LS+S SS++  + E  +  +  E  +  L 
Sbjct: 724 LSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGRTVTSGETSAGVLQ 783

Query: 781 DIAMRCGAKVEFKLGLVATSELKFFTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLAD 840
           DIAM+CGAKVEF+  LVA+ +L+F  EA+F GEK+GEG G TRREAQ +AAE ++ NLA+
Sbjct: 784 DIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQAAEESIKNLAN 843

Query: 841 RYLTHIKSDASTPQSDTSRGPSPKDMGFASDANSQGDCTSRKEETT--TPSSELTRLDDS 900
            YL+ IK D+ + + D SR  +  D GF S+ NS G+    KEE+   + +SE +RL D 
Sbjct: 844 TYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKEESLSFSTASEQSRLADP 903

Query: 901 ILEGSKDSMGSVSVLKELCMIEGLGVEFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTG 960
            LEGSK SMGSV+ LKELCM+EGLGV F+ Q P+S+N +  DE++A+VEI+GQVLGKGTG
Sbjct: 904 RLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDEVYAQVEIDGQVLGKGTG 963

Query: 961 LTWDEAKMQAAELALASLKSMLGQIT-KRPSSPRLLQGMASKRLKPEYARSIRIELKEGI 984
           LTW+EAKMQAAE AL SL+SMLGQ + KR  SPR LQGM +KRLKPE+ R ++     G 
Sbjct: 964 LTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPRSLQGMQNKRLKPEFPRVLQRMPSSGR 972

BLAST of Spo14122.1 vs. UniProtKB/TrEMBL
Match: A0A067GKB1_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g002166mg PE=4 SV=1)

HSP 1 Score: 1002.7 bits (2591), Expect = 3.900e-289
Identity = 585/1010 (57.92%), Positives = 690/1010 (68.32%), Query Frame = 1

		  

Query: 1   MMKTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLM----RISHYSEASERCPPLAVLHT 60
           M KTV Y G  +LGEVEIY +      E  +K   +    RIS++SEASERCPPLAVLHT
Sbjct: 1   MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHT 60

Query: 61  ITKSTGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLG-EQEIHLVAMRSRR 120
           IT S  G+ FKM  KS      ++ Q+  LHSSC+R NKTAV+ LG  +E+HLVAM SR 
Sbjct: 61  ITAS--GICFKMESKS-----SDNIQLHLLHSSCIRENKTAVMPLGLTEELHLVAMYSRN 120

Query: 121 MDGVTTPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRK 180
            +    PCFW F V  GLY SCL MLNLRCLGIVFDLDETLIVANT+RSFEDRIEAL RK
Sbjct: 121 NEK-QYPCFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRK 180

Query: 181 MTVEADPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTV 240
           ++ E DPQR+AGM AEVKRYQ+DK ILKQYAE DQV +NGKV K+Q+EV+PALSD+HQ +
Sbjct: 181 ISTEVDPQRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQAL 240

Query: 241 VRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERD 300
           VRPLIRLQ+KNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERD
Sbjct: 241 VRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERD 300

Query: 301 YALEMWRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVW 360
           YALEMWRLLDP+SNLI  +ELLDRIVCVKSGSRKSLFNVFQ G CHPKMALVIDDRLKVW
Sbjct: 301 YALEMWRLLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVW 360

Query: 361 DEKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDV 420
           D+KDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARN+ACNVRGGFFKEFDEGLLQR+ ++
Sbjct: 361 DDKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEI 420

Query: 421 YFEDDPKDFPSPPDVSNYLVSEEAVLSSSS---------------------APSLPCVTS 480
            +EDD KD PSPPDVSNYLVSE+   +++                      A +     S
Sbjct: 421 SYEDDVKDIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKEAIAASATIS 480

Query: 481 LATVNLDHRLASSLPFSVAASSMTIPQPAPQASIAPFHANLFSQAGPLARTLASIGPKDL 540
            A  NLD RLA    +++ +SS T   P  QA++ P     F  A  L + L  +GP + 
Sbjct: 481 SAVANLDPRLAP-FQYTMPSSSSTTTLPTSQAAVMPLANMQFPPATSLVKPLGHVGPPEQ 540

Query: 541 GLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGPGS 600
            L SSPAREEGEVPESELDPDTRRRLLILQHG D RE  P+E PFP R  +Q        
Sbjct: 541 SLQSSPAREEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPFPARTQMQ-------- 600

Query: 601 GPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRVQSRGSWFQVEDHMN 660
             V VP    V   GS         P  ++ +VP   P+              Q+E H  
Sbjct: 601 --VSVP---RVPSRGSWFPVEEEMSPRQLNRAVPKEFPL---------NSEAMQIEKHRP 660

Query: 661 PSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPEKQRLPREASRRDER 720
           P                          P F  K+ENP  SDR   E QR+P+EA RRD+R
Sbjct: 661 PH-------------------------PSFFPKIENPSTSDRPH-ENQRMPKEALRRDDR 720

Query: 721 LRSNYSVPSHQSFRGDEISLSRSVSSNKGFEVEPEKGSSLSENPSVALHDIAMRCGAKVE 780
           LR N+++  +QSF G+EI LSRS SS++  + E  +  S +E PS  L DIAM+CG KVE
Sbjct: 721 LRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGRDVSSTETPSGVLQDIAMKCGTKVE 780

Query: 781 FKLGLVATSELKFFTEAYFVGEKIGEGTGTTRREAQYRAAEAALMNLADRYLTHIKSDAS 840
           F+  LVA++EL+F  EA+F GEKIGEG G TRREAQ +AAE ++ +LA+ Y+  +KSD+ 
Sbjct: 781 FRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKHLANVYMLRVKSDSG 840

Query: 841 TPQSDTSRGPSPKDMGFASDANSQGDCTSRKEETTTPSSELTRLDDSILEGSKDSMGSVS 900
           +   D SR  +  +  F  + NS G     K+E+   SSE ++L D  LEGSK  MGSVS
Sbjct: 841 SGHGDGSRFSNANENCFMGEINSFGGQPLAKDESL--SSEPSKLVDPRLEGSKKLMGSVS 900

Query: 901 VLKELCMIEGLGVEFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGTGLTWDEAKMQAAEL 960
            LKELCM EGLGV F+ Q P+S N V  DE++A+VEI+GQVLGKG G TWDEAKMQAAE 
Sbjct: 901 ALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGKGIGSTWDEAKMQAAEK 951

Query: 961 ALASLKSMLGQI-TKRPSSPRLLQGMASKRLKPEYARSIRIELKEGIYTR 984
           AL SL+SM GQ   K   SPR LQGM +KRLKPE+ R ++     G Y +
Sbjct: 961 ALGSLRSMFGQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPPSGRYPK 951

BLAST of Spo14122.1 vs. ExPASy Swiss-Prot
Match: CPL1_ARATH (RNA polymerase II C-terminal domain phosphatase-like 1 OS=Arabidopsis thaliana GN=CPL1 PE=1 SV=1)

HSP 1 Score: 837.8 bits (2163), Expect = 1.500e-241
Identity = 498/1023 (48.68%), Positives = 648/1023 (63.34%), Query Frame = 1

		  

Query: 6   VYEGDNLLGEVEIYFENNANK----------------IEMMKKGMLMRISHYSEASERCP 65
           V+ GD  LGE+EIY     N+                +E+ K G+  RISH+S++ ERCP
Sbjct: 9   VFHGDGRLGELEIYPSRELNQQQDDVMKQRKKKQREVMELAKMGI--RISHFSQSGERCP 68

Query: 66  PLAVLHTITKSTGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLV 125
           PLA+L TI  S+ G+ FK+       Q+     +   +SSCLR NKTAV+ LG +E+HLV
Sbjct: 69  PLAILTTI--SSCGLCFKLEASPSPAQE----SLSLFYSSCLRDNKTAVMLLGGEELHLV 128

Query: 126 AMRSRRMDGVTTPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRI 185
           AM S  +     PCFW F V PG+Y+SCL MLNLRCLGIVFDLDETL+VANT+RSFED+I
Sbjct: 129 AMYSENIKN-DRPCFWAFSVAPGIYDSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKI 188

Query: 186 EALQRKMTVEADPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALS 245
           +  QR++  E DPQR+A ++AE+KRYQ+DK +LKQY E+DQVV+NG+V K+Q+E++PALS
Sbjct: 189 DGFQRRINNEMDPQRLAVIVAEMKRYQDDKNLLKQYIESDQVVENGEVIKVQSEIVPALS 248

Query: 246 DNHQTVVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVC 305
           DNHQ +VRPLIRLQ+KNI+LTRINP IRDTSVLVR+RP+WE+LRSYLTA+GRKRFEVYVC
Sbjct: 249 DNHQPLVRPLIRLQEKNIILTRINPMIRDTSVLVRMRPSWEELRSYLTAKGRKRFEVYVC 308

Query: 306 TMAERDYALEMWRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVID 365
           TMAERDYALEMWRLLDP+ NLI   +LL RIVCVKSG +KSLFNVF  G CHPKMALVID
Sbjct: 309 TMAERDYALEMWRLLDPEGNLINTNDLLARIVCVKSGFKKSLFNVFLDGTCHPKMALVID 368

Query: 366 DRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLL 425
           DRLKVWDEKDQPRVHVVPAFAPYY+PQAEA  A PVLCVARNVAC VRGGFF++FD+ LL
Sbjct: 369 DRLKVWDEKDQPRVHVVPAFAPYYSPQAEA-AATPVLCVARNVACGVRGGFFRDFDDSLL 428

Query: 426 QRMSDVYFEDDPKDFPSPPDVSNYLVSEEAVLSSSSAPSLPCVTSLATVNLDHRLASSLP 485
            R++++ +E+D +D PSPPDVS+YLVSE+     +          +A   ++ RL  ++ 
Sbjct: 429 PRIAEISYENDAEDIPSPPDVSHYLVSEDDTSGLNGNKDPLSFDGMADTEVERRLKEAIS 488

Query: 486 FSVA-----------ASSMTIPQPAPQASIAPFHANLFSQA-GPLARTLASIG------- 545
            S A           A+ +  P  +  +   P    +  QA  P A    SI        
Sbjct: 489 ASSAVLPAANIDPRIAAPVQFPMASASSVSVPVPVQVVQQAIQPSAMAFPSIPFQQPQQP 548

Query: 546 --------PKDLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGR 605
                   P +  L SSPAREEGEVPESELDPDTRRRLLILQHGQD R+  P+EP FP R
Sbjct: 549 TSIAKHLVPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDPAPSEPSFPQR 608

Query: 606 PPVQAPVAGPGS--GPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRV 665
           PPVQAP +   S  G  PV   +                P  +  +V    P+       
Sbjct: 609 PPVQAPPSHVQSRNGWFPVEEEM---------------DPAQIRRAVSKEYPLD------ 668

Query: 666 QSRGSWFQVEDHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPE 725
                   +E H    P   S       S   +H E +RP                    
Sbjct: 669 ---SEMIHMEKHRPRHPSFFSKIDNSTQSDRMLH-ENRRP-------------------- 728

Query: 726 KQRLPREASRRDERLRSNYSVPSHQSFRGDEISLSRSVSSNKGFEVEPEKGSSLSENPSV 785
               P+E+ RRDE+LRSN ++P    F G++ S ++S S N   +  PE+  S +E  + 
Sbjct: 729 ----PKESLRRDEQLRSNNNLPDSHPFYGEDASWNQSSSRNSDLDFLPERSVSATETSAD 788

Query: 786 ALHDIAMRCGAKVEFKLGLVATSELKFFTEAYFVGEKIGEGTGTTRREAQYRAAEAALMN 845
            LH IA++CGAKVE+K  LV++++L+F  EA+   +KIGEG G +RREA ++AAEA++ N
Sbjct: 789 VLHGIAIKCGAKVEYKPSLVSSTDLRFSVEAWLSNQKIGEGIGKSRREALHKAAEASIQN 848

Query: 846 LADRYLTHIKSDASTPQSDTSRGPSPKDMGFASDANSQGDCTSRKEETTTPSSELTRLDD 905
           LAD Y+   +++     S     P   +     +AN+  +    ++ET  P S  +R  D
Sbjct: 849 LADGYM---RANGDPGPSHRDATPFTNENISMGNANALNNQPFARDETALPVS--SRPTD 908

Query: 906 SILEGSKDSMGSVSVLKELCMIEGLGVEFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGT 965
             LEGS    GS++ L+ELC  EGL + F+ Q    ++ VH DE+HA+VEI+G+V+G+G 
Sbjct: 909 PRLEGSMRHTGSITALRELCASEGLEMAFQSQRQLPSDMVHRDELHAQVEIDGRVVGEGV 967

Query: 966 GLTWDEAKMQAAELALASLKSMLGQ-ITKRPSSPRLLQGMASKRLKPEYARSIRIELKEG 983
           G TWDEA+MQAAE AL+S++SMLGQ + KR  SPR   GM++KRLKP++ RS++     G
Sbjct: 969 GSTWDEARMQAAERALSSVRSMLGQPLHKRQGSPRSFGGMSNKRLKPDFQRSLQRMPSSG 967

BLAST of Spo14122.1 vs. ExPASy Swiss-Prot
Match: CPL2_ARATH (RNA polymerase II C-terminal domain phosphatase-like 2 OS=Arabidopsis thaliana GN=CPL2 PE=1 SV=3)

HSP 1 Score: 500.0 bits (1286), Expect = 7.400e-140
Identity = 303/618 (49.03%), Positives = 387/618 (62.62%), Query Frame = 1

		  

Query: 3   KTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKSTG 62
           K+VVY GD  LGE+++   +++++       +  RI H S A ERCPPLA+L TI     
Sbjct: 7   KSVVYHGDLRLGELDVNHVSSSHEFRFPNDEI--RIHHLSPAGERCPPLAILQTIA---- 66

Query: 63  GVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVTTP 122
             SF +  K          ++  LH+ C    KTAVV LG++EIHLVAM S+       P
Sbjct: 67  --SFAVRCKLESSAPVKSQELMHLHAVCFHELKTAVVMLGDEEIHLVAMPSKEKK---FP 126

Query: 123 CFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEADP 182
           CFW F V  GLY+SCL MLN RCL IVFDLDETLIVANT++SFEDRIEAL+  ++ E DP
Sbjct: 127 CFWCFSVPSGLYDSCLRMLNTRCLSIVFDLDETLIVANTMKSFEDRIEALKSWISREMDP 186

Query: 183 QRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLIRL 242
            R+ GM AE+KRY +D+ +LKQY + D   DNG + K Q E +   SD  + V RP+IRL
Sbjct: 187 VRINGMSAELKRYMDDRMLLKQYIDNDYAFDNGVLLKAQPEEVRPTSDGQEKVCRPVIRL 246

Query: 243 QDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 302
            +KN VLTRI P+IRDTSVLV+LRPAWE+LRSYLTA+ RKRFEVYVCTMAERDYALEMWR
Sbjct: 247 PEKNTVLTRIKPEIRDTSVLVKLRPAWEELRSYLTAKTRKRFEVYVCTMAERDYALEMWR 306

Query: 303 LLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQPR 362
           LLDP+++LI  +EL DRIVCVK  ++KSL +VF GGICHPKMA+VIDDR+KVW++KDQPR
Sbjct: 307 LLDPEAHLISLKELRDRIVCVKPDAKKSLLSVFNGGICHPKMAMVIDDRMKVWEDKDQPR 366

Query: 363 VHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDDPK 422
           VHVV A+ PYYAPQAE    +P LCVARNVACNVRG FFKEFDE L+  +S VY+EDD +
Sbjct: 367 VHVVSAYLPYYAPQAETALVVPHLCVARNVACNVRGYFFKEFDESLMSSISLVYYEDDVE 426

Query: 423 DFPSPPDVSNYLVSEEAVLSSSSAPSLPCVT-SLATVNLDHRLASSLPFSVAASSMTIP- 482
           + P  PDVSNY+V E+   +S+   + P +   +    ++ RL      + AA   T+P 
Sbjct: 427 NLPPSPDVSNYVVIEDPGFASNGNINAPPINEGMCGGEVERRLNQ----AAAADHSTLPA 486

Query: 483 ---------QPAPQASIAPFHANLFSQAGPLARTLASIGPKDLGLHSSPAREEGEVPESE 542
                     P PQ ++ P +A+        A  L S  P  LG   +P R+     +  
Sbjct: 487 TSNAEQKPETPKPQIAVIPNNAS----TATAAALLPSHKPSLLG---APRRDGFTFSDG- 546

Query: 543 LDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGPGS-GPVPVPGPVPVAGPGS 602
                  R L+++ G D+R    N+PP   + P+Q P +   S G   V      + PG 
Sbjct: 547 ------GRPLMMRPGVDIRNQNFNQPPILAKIPMQPPSSSMHSPGGWLVDDENRPSFPGR 595

Query: 603 ASISVPGPGPVPMSGSVP 609
            S   P   P    GS P
Sbjct: 607 PSGLYPSQFPHGTPGSAP 595

BLAST of Spo14122.1 vs. ExPASy Swiss-Prot
Match: CYF_SPIOL (Cytochrome f OS=Spinacia oleracea GN=petA PE=3 SV=3)

HSP 1 Score: 215.7 bits (548), Expect = 2.800e-54
Identity = 116/151 (76.82%), Positives = 122/151 (80.79%), Query Frame = 1

Query: 1026 KITRSISISLILYIITRSSIANAYPIFAQQGYENPREATGRIVCANFHLANKPVDIEVPQ 1085
            +ITRSISISLILYIITRSSIANAYPIFAQQGYENPREATGRIVCAN HLANKPVDIEVPQ
Sbjct: 13   QITRSISISLILYIITRSSIANAYPIFAQQGYENPREATGRIVCANCHLANKPVDIEVPQ 72

Query: 1086 AVLPDTVFEAVVRIPYDMQLKQVLANGKKGGLNVGDVLILPEGFELAPPDR------SLA 1145
            AVLPDTVFEAVVRIPYDMQLKQVLANGKKGGLNVG VLILPEGFELAPPDR         
Sbjct: 73   AVLPDTVFEAVVRIPYDMQLKQVLANGKKGGLNVGAVLILPEGFELAPPDRISPEMKEKM 132

Query: 1146 GGLCF-PFRIAPSRVAIVAAASGVHQSTITY 1170
            G L F  +R     + ++    G   S IT+
Sbjct: 133  GNLSFQSYRPNKQNILVIGPVPGQKYSEITF 163

BLAST of Spo14122.1 vs. ExPASy Swiss-Prot
Match: CYF_CARPA (Cytochrome f OS=Carica papaya GN=petA PE=3 SV=1)

HSP 1 Score: 209.5 bits (532), Expect = 2.000e-52
Identity = 110/151 (72.85%), Positives = 122/151 (80.79%), Query Frame = 1

Query: 1026 KITRSISISLILYIITRSSIANAYPIFAQQGYENPREATGRIVCANFHLANKPVDIEVPQ 1085
            +ITRSIS+SL++YIIT +SI+NAYPIFAQQGYENPREATGRIVCAN HLANKPVDIEVPQ
Sbjct: 13   EITRSISVSLMIYIITWASISNAYPIFAQQGYENPREATGRIVCANCHLANKPVDIEVPQ 72

Query: 1086 AVLPDTVFEAVVRIPYDMQLKQVLANGKKGGLNVGDVLILPEGFELAPPDR------SLA 1145
            AVLPDTVFEAVVRIPYDMQLKQVLANGKKGGLNVG VLILPEGFELAPPDR         
Sbjct: 73   AVLPDTVFEAVVRIPYDMQLKQVLANGKKGGLNVGAVLILPEGFELAPPDRISPEMKEKI 132

Query: 1146 GGLCFP-FRIAPSRVAIVAAASGVHQSTITY 1170
            G L F  +R    ++ ++    G   S IT+
Sbjct: 133  GNLSFQNYRPTQKKILVIGPVPGQKYSEITF 163

BLAST of Spo14122.1 vs. ExPASy Swiss-Prot
Match: CYF_IPOPU (Cytochrome f OS=Ipomoea purpurea GN=petA PE=3 SV=1)

HSP 1 Score: 208.8 bits (530), Expect = 3.400e-52
Identity = 111/151 (73.51%), Positives = 122/151 (80.79%), Query Frame = 1

Query: 1026 KITRSISISLILYIITRSSIANAYPIFAQQGYENPREATGRIVCANFHLANKPVDIEVPQ 1085
            +ITRSIS+SL+LYIITR+SIA+AYPIFAQQG+ENPREATGRIVCAN HLANKPVDIEVPQ
Sbjct: 13   QITRSISVSLMLYIITRTSIASAYPIFAQQGFENPREATGRIVCANCHLANKPVDIEVPQ 72

Query: 1086 AVLPDTVFEAVVRIPYDMQLKQVLANGKKGGLNVGDVLILPEGFELAPPDR------SLA 1145
            AVLPDTVFEAVVRIPYDMQLKQVL+NGKKGGLNVG VLILPEGFELAPPDR         
Sbjct: 73   AVLPDTVFEAVVRIPYDMQLKQVLSNGKKGGLNVGAVLILPEGFELAPPDRLSTEMKEKI 132

Query: 1146 GGLCF-PFRIAPSRVAIVAAASGVHQSTITY 1170
            G L F  +R     + +V    G   S IT+
Sbjct: 133  GNLSFQSYRPNKKNILVVGPVPGKKYSEITF 163

BLAST of Spo14122.1 vs. TAIR (Arabidopsis)
Match: AT4G21670.1 (C-terminal domain phosphatase-like 1)

HSP 1 Score: 837.8 bits (2163), Expect = 8.500e-243
Identity = 498/1023 (48.68%), Positives = 648/1023 (63.34%), Query Frame = 1

Query: 6   VYEGDNLLGEVEIYFENNANK----------------IEMMKKGMLMRISHYSEASERCP 65
           V+ GD  LGE+EIY     N+                +E+ K G+  RISH+S++ ERCP
Sbjct: 9   VFHGDGRLGELEIYPSRELNQQQDDVMKQRKKKQREVMELAKMGI--RISHFSQSGERCP 68

Query: 66  PLAVLHTITKSTGGVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLV 125
           PLA+L TI  S+ G+ FK+       Q+     +   +SSCLR NKTAV+ LG +E+HLV
Sbjct: 69  PLAILTTI--SSCGLCFKLEASPSPAQE----SLSLFYSSCLRDNKTAVMLLGGEELHLV 128

Query: 126 AMRSRRMDGVTTPCFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRI 185
           AM S  +     PCFW F V PG+Y+SCL MLNLRCLGIVFDLDETL+VANT+RSFED+I
Sbjct: 129 AMYSENIKN-DRPCFWAFSVAPGIYDSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDKI 188

Query: 186 EALQRKMTVEADPQRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALS 245
           +  QR++  E DPQR+A ++AE+KRYQ+DK +LKQY E+DQVV+NG+V K+Q+E++PALS
Sbjct: 189 DGFQRRINNEMDPQRLAVIVAEMKRYQDDKNLLKQYIESDQVVENGEVIKVQSEIVPALS 248

Query: 246 DNHQTVVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVC 305
           DNHQ +VRPLIRLQ+KNI+LTRINP IRDTSVLVR+RP+WE+LRSYLTA+GRKRFEVYVC
Sbjct: 249 DNHQPLVRPLIRLQEKNIILTRINPMIRDTSVLVRMRPSWEELRSYLTAKGRKRFEVYVC 308

Query: 306 TMAERDYALEMWRLLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVID 365
           TMAERDYALEMWRLLDP+ NLI   +LL RIVCVKSG +KSLFNVF  G CHPKMALVID
Sbjct: 309 TMAERDYALEMWRLLDPEGNLINTNDLLARIVCVKSGFKKSLFNVFLDGTCHPKMALVID 368

Query: 366 DRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLL 425
           DRLKVWDEKDQPRVHVVPAFAPYY+PQAEA  A PVLCVARNVAC VRGGFF++FD+ LL
Sbjct: 369 DRLKVWDEKDQPRVHVVPAFAPYYSPQAEA-AATPVLCVARNVACGVRGGFFRDFDDSLL 428

Query: 426 QRMSDVYFEDDPKDFPSPPDVSNYLVSEEAVLSSSSAPSLPCVTSLATVNLDHRLASSLP 485
            R++++ +E+D +D PSPPDVS+YLVSE+     +          +A   ++ RL  ++ 
Sbjct: 429 PRIAEISYENDAEDIPSPPDVSHYLVSEDDTSGLNGNKDPLSFDGMADTEVERRLKEAIS 488

Query: 486 FSVA-----------ASSMTIPQPAPQASIAPFHANLFSQA-GPLARTLASIG------- 545
            S A           A+ +  P  +  +   P    +  QA  P A    SI        
Sbjct: 489 ASSAVLPAANIDPRIAAPVQFPMASASSVSVPVPVQVVQQAIQPSAMAFPSIPFQQPQQP 548

Query: 546 --------PKDLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDMREGLPNEPPFPGR 605
                   P +  L SSPAREEGEVPESELDPDTRRRLLILQHGQD R+  P+EP FP R
Sbjct: 549 TSIAKHLVPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDPAPSEPSFPQR 608

Query: 606 PPVQAPVAGPGS--GPVPVPGPVPVAGPGSASISVPGPGPVPMSGSVPAPAPVPVPVPRV 665
           PPVQAP +   S  G  PV   +                P  +  +V    P+       
Sbjct: 609 PPVQAPPSHVQSRNGWFPVEEEM---------------DPAQIRRAVSKEYPLD------ 668

Query: 666 QSRGSWFQVEDHMNPSPLGRSATKEFPMSPDAVHVEKQRPPPPFPRKVENPVWSDRSFPE 725
                   +E H    P   S       S   +H E +RP                    
Sbjct: 669 ---SEMIHMEKHRPRHPSFFSKIDNSTQSDRMLH-ENRRP-------------------- 728

Query: 726 KQRLPREASRRDERLRSNYSVPSHQSFRGDEISLSRSVSSNKGFEVEPEKGSSLSENPSV 785
               P+E+ RRDE+LRSN ++P    F G++ S ++S S N   +  PE+  S +E  + 
Sbjct: 729 ----PKESLRRDEQLRSNNNLPDSHPFYGEDASWNQSSSRNSDLDFLPERSVSATETSAD 788

Query: 786 ALHDIAMRCGAKVEFKLGLVATSELKFFTEAYFVGEKIGEGTGTTRREAQYRAAEAALMN 845
            LH IA++CGAKVE+K  LV++++L+F  EA+   +KIGEG G +RREA ++AAEA++ N
Sbjct: 789 VLHGIAIKCGAKVEYKPSLVSSTDLRFSVEAWLSNQKIGEGIGKSRREALHKAAEASIQN 848

Query: 846 LADRYLTHIKSDASTPQSDTSRGPSPKDMGFASDANSQGDCTSRKEETTTPSSELTRLDD 905
           LAD Y+   +++     S     P   +     +AN+  +    ++ET  P S  +R  D
Sbjct: 849 LADGYM---RANGDPGPSHRDATPFTNENISMGNANALNNQPFARDETALPVS--SRPTD 908

Query: 906 SILEGSKDSMGSVSVLKELCMIEGLGVEFKGQSPTSTNPVHGDEIHAEVEINGQVLGKGT 965
             LEGS    GS++ L+ELC  EGL + F+ Q    ++ VH DE+HA+VEI+G+V+G+G 
Sbjct: 909 PRLEGSMRHTGSITALRELCASEGLEMAFQSQRQLPSDMVHRDELHAQVEIDGRVVGEGV 967

Query: 966 GLTWDEAKMQAAELALASLKSMLGQ-ITKRPSSPRLLQGMASKRLKPEYARSIRIELKEG 983
           G TWDEA+MQAAE AL+S++SMLGQ + KR  SPR   GM++KRLKP++ RS++     G
Sbjct: 969 GSTWDEARMQAAERALSSVRSMLGQPLHKRQGSPRSFGGMSNKRLKPDFQRSLQRMPSSG 967

BLAST of Spo14122.1 vs. TAIR (Arabidopsis)
Match: AT5G01270.2 (carboxyl-terminal domain (ctd) phosphatase-like 2)

HSP 1 Score: 500.0 bits (1286), Expect = 4.200e-141
Identity = 303/618 (49.03%), Positives = 387/618 (62.62%), Query Frame = 1

Query: 3   KTVVYEGDNLLGEVEIYFENNANKIEMMKKGMLMRISHYSEASERCPPLAVLHTITKSTG 62
           K+VVY GD  LGE+++   +++++       +  RI H S A ERCPPLA+L TI     
Sbjct: 7   KSVVYHGDLRLGELDVNHVSSSHEFRFPNDEI--RIHHLSPAGERCPPLAILQTIA---- 66

Query: 63  GVSFKMMEKSLYFQQHNDSQIFALHSSCLRGNKTAVVSLGEQEIHLVAMRSRRMDGVTTP 122
             SF +  K          ++  LH+ C    KTAVV LG++EIHLVAM S+       P
Sbjct: 67  --SFAVRCKLESSAPVKSQELMHLHAVCFHELKTAVVMLGDEEIHLVAMPSKEKK---FP 126

Query: 123 CFWGFIVMPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQRKMTVEADP 182
           CFW F V  GLY+SCL MLN RCL IVFDLDETLIVANT++SFEDRIEAL+  ++ E DP
Sbjct: 127 CFWCFSVPSGLYDSCLRMLNTRCLSIVFDLDETLIVANTMKSFEDRIEALKSWISREMDP 186

Query: 183 QRMAGMMAEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQTVVRPLIRL 242
            R+ GM AE+KRY +D+ +LKQY + D   DNG + K Q E +   SD  + V RP+IRL
Sbjct: 187 VRINGMSAELKRYMDDRMLLKQYIDNDYAFDNGVLLKAQPEEVRPTSDGQEKVCRPVIRL 246

Query: 243 QDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 302
            +KN VLTRI P+IRDTSVLV+LRPAWE+LRSYLTA+ RKRFEVYVCTMAERDYALEMWR
Sbjct: 247 PEKNTVLTRIKPEIRDTSVLVKLRPAWEELRSYLTAKTRKRFEVYVCTMAERDYALEMWR 306

Query: 303 LLDPDSNLIGGRELLDRIVCVKSGSRKSLFNVFQGGICHPKMALVIDDRLKVWDEKDQPR 362
           LLDP+++LI  +EL DRIVCVK  ++KSL +VF GGICHPKMA+VIDDR+KVW++KDQPR
Sbjct: 307 LLDPEAHLISLKELRDRIVCVKPDAKKSLLSVFNGGICHPKMAMVIDDRMKVWEDKDQPR 366

Query: 363 VHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRMSDVYFEDDPK 422
           VHVV A+ PYYAPQAE    +P LCVARNVACNVRG FFKEFDE L+  +S VY+EDD +
Sbjct: 367 VHVVSAYLPYYAPQAETALVVPHLCVARNVACNVRGYFFKEFDESLMSSISLVYYEDDVE 426

Query: 423 DFPSPPDVSNYLVSEEAVLSSSSAPSLPCVT-SLATVNLDHRLASSLPFSVAASSMTIP- 482
           + P  PDVSNY+V E+   +S+   + P +   +    ++ RL      + AA   T+P 
Sbjct: 427 NLPPSPDVSNYVVIEDPGFASNGNINAPPINEGMCGGEVERRLNQ----AAAADHSTLPA 486

Query: 483 ---------QPAPQASIAPFHANLFSQAGPLARTLASIGPKDLGLHSSPAREEGEVPESE 542
                     P PQ ++ P +A+        A  L S  P  LG   +P R+     +  
Sbjct: 487 TSNAEQKPETPKPQIAVIPNNAS----TATAAALLPSHKPSLLG---APRRDGFTFSDG- 546

Query: 543 LDPDTRRRLLILQHGQDMREGLPNEPPFPGRPPVQAPVAGPGS-GPVPVPGPVPVAGPGS 602
                  R L+++ G D+R    N+PP   + P+Q P +   S G   V      + PG 
Sbjct: 547 ------GRPLMMRPGVDIRNQNFNQPPILAKIPMQPPSSSMHSPGGWLVDDENRPSFPGR 595

Query: 603 ASISVPGPGPVPMSGSVP 609
            S   P   P    GS P
Sbjct: 607 PSGLYPSQFPHGTPGSAP 595

BLAST of Spo14122.1 vs. TAIR (Arabidopsis)
Match: ATCG00540.1 (photosynthetic electron transfer A)

HSP 1 Score: 199.9 bits (507), Expect = 8.900e-51
Identity = 106/151 (70.20%), Positives = 118/151 (78.15%), Query Frame = 1

Query: 1026 KITRSISISLILYIITRSSIANAYPIFAQQGYENPREATGRIVCANFHLANKPVDIEVPQ 1085
            +ITRSIS+SLI+YIIT +SI++AYPIFAQQ YENPREATGRIVCAN HLANKPVDIEVPQ
Sbjct: 13   EITRSISVSLIIYIITWASISSAYPIFAQQNYENPREATGRIVCANCHLANKPVDIEVPQ 72

Query: 1086 AVLPDTVFEAVVRIPYDMQLKQVLANGKKGGLNVGDVLILPEGFELAPPDR------SLA 1145
             VLPDTVFEAVV+IPYDMQLKQVLANGKKG LNVG VLILPEGFELAPPDR         
Sbjct: 73   TVLPDTVFEAVVKIPYDMQLKQVLANGKKGALNVGAVLILPEGFELAPPDRISPEMKEKI 132

Query: 1146 GGLCFP-FRIAPSRVAIVAAASGVHQSTITY 1170
            G L F  +R     + ++    G   S IT+
Sbjct: 133  GNLSFQNYRPNKKNILVIGPVPGQKYSEITF 163

BLAST of Spo14122.1 vs. TAIR (Arabidopsis)
Match: ATCG00520.1 (unfolded protein binding)

HSP 1 Score: 94.7 bits (234), Expect = 4.000e-19
Identity = 46/58 (79.31%), Positives = 53/58 (91.38%), Query Frame = 1

Query: 970  RSIRIELKEGIYTRRVLYLEIRGQGAIPLTRTDENLTPREMEQKAAELAYFLRVPIKI 1028
            +SIRIE+KEG+  RRVLY+EIRGQGAIPL RTDEN T RE+EQKAAELAYFLRVPI++
Sbjct: 126  QSIRIEVKEGVSARRVLYMEIRGQGAIPLIRTDENFTTREIEQKAAELAYFLRVPIEV 183
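
Each hit above is introduced by a two-line HSP summary (bit score, E-value, identity and positives counts). The following is a minimal sketch, assuming that exact text layout, of how those summaries could be parsed and the percentages recomputed from the raw counts. The names `HSP_RE` and `parse_hsp` are hypothetical helpers for illustration and are not part of any BLAST tool.

```python
import re

# Pattern for the two-line HSP summary as it appears on this page.
# "Pos[i]?tives" accepts either spelling of the positives label.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\), "
    r"Expect = (?P<evalue>[\d.eE+-]+)\s+"
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \([\d.]+%\), "
    r"Pos[i]?tives = (?P<pos>\d+)/\d+ \([\d.]+%\)"
)


def parse_hsp(text):
    """Return bit score, E-value and recomputed percentages for one HSP summary."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    length = int(m["length"])  # alignment length, including gaps
    return {
        "bits": float(m["bits"]),
        "evalue": float(m["evalue"]),
        "identity_pct": round(100 * int(m["ident"]) / length, 2),
        "positives_pct": round(100 * int(m["pos"]) / length, 2),
    }


# Summary copied from the CPL2_ARATH hit listed on this page.
example = (
    "HSP 1 Score: 500.0 bits (1286), Expect = 7.400e-140\n"
    "Identity = 303/618 (49.03%), Positives = 387/618 (62.62%), Query Frame = 1"
)
stats = parse_hsp(example)
```

Recomputing the percentages from the counts is a cheap consistency check: the values derived from 303/618 and 387/618 should agree with the percentages printed in parentheses.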

The following BLAST results are available for this feature:
BLAST of Spo14122.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match Name                        E-value   Identity (%)  Description
gi|902176661|gb|KNA08810.1|       0.0e+0    96.9          hypothetical protein SOVF_1593...
gi|902176662|gb|KNA08811.1|       0.0e+0    91.3          hypothetical protein SOVF_1593...
gi|731350518|ref|XP_010686545.1|  0.0e+0    78.7          PREDICTED: RNA polymerase II C...
gi|590624710|ref|XP_007025680.1|  7.6e-294  57.5          C-terminal domain phosphatase-...
gi|731439813|ref|XP_002267987.3|  2.7e-291  58.6          PREDICTED: RNA polymerase II C...
BLAST of Spo14122.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniProtKB/TrEMBL)
Total hits: 5
Match Name        E-value   Identity (%)  Description
A0A0K9QNK5_SPIOL  0.0e+0    96.9          Uncharacterized protein OS=Spi...
A0A0K9QQJ6_SPIOL  0.0e+0    91.3          Uncharacterized protein OS=Spi...
A0A0J8BRZ7_BETVU  0.0e+0    78.7          Uncharacterized protein OS=Bet...
A0A061GGL6_THECC  5.3e-294  57.5          C-terminal domain phosphatase-...
A0A067GKB1_CITSI  3.9e-289  57.9          Uncharacterized protein OS=Cit...
BLAST of Spo14122.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy Swiss-Prot)
Total hits: 5
Match Name  E-value   Identity (%)  Description
CPL1_ARATH  1.5e-241  48.6          RNA polymerase II C-terminal d...
CPL2_ARATH  7.4e-140  49.0          RNA polymerase II C-terminal d...
CYF_SPIOL   2.8e-54   76.8          Cytochrome f OS=Spinacia olera...
CYF_CARPA   2.0e-52   72.8          Cytochrome f OS=Carica papaya ...
CYF_IPOPU   3.4e-52   73.5          Cytochrome f OS=Ipomoea purpur...
BLAST of Spo14122.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 4
Match Name   E-value   Identity (%)  Description
AT4G21670.1  8.5e-243  48.6          C-terminal domain phosphatase-...
AT5G01270.2  4.2e-141  49.0          carboxyl-terminal domain (ctd)...
ATCG00540.1  8.9e-51   70.2          photosynthetic electron transf...
ATCG00520.1  4.0e-19   79.3          unfolded protein binding
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR Term   IPR Description                     Source   Source Term    Source Description                                      Alignment
IPR002325  Cytochrome f                        PRINTS   PR00610        CYTOCHROMEF                                             coord: 1098..1117, score: 1.6E-51; coord: 1044..1055, score: 1.6E-51; coord: 1077..1097, score: 1.6E-51; coord: 1118..1138, score: 1.6E-51; coord: 1057..1076, score: 1.6
IPR002325  Cytochrome f                        PROFILE  PS51010        CYTF                                                    coord: 1048..1136, score: 42
IPR003359  Photosystem I Ycf4, assembly        PRODOM   PD003698                                                               coord: 970..1024, score: 5.0
IPR003359  Photosystem I Ycf4, assembly        PFAM     PF02392        Ycf4                                                    coord: 967..1025, score: 5.5
IPR004274  FCP1 homology domain                PFAM     PF03031        NIF                                                     coord: 251..366, score: 9.
IPR004274  FCP1 homology domain                SMART    SM00577        forpap2                                                 coord: 200..376, score: 1.4
IPR004274  FCP1 homology domain                PROFILE  PS50969        FCP1                                                    coord: 141..389, score: 1
IPR014720  Double-stranded RNA-binding domain  GENE3D   3.30.160.20                                                            coord: 873..940, score: 2.4E-8; coord: 736..803, score: 2.
IPR014720  Double-stranded RNA-binding domain  PFAM     PF00035        dsrm                                                    coord: 873..939, score: 2.
IPR014720  Double-stranded RNA-binding domain  SMART    SM00358        DRBM_3                                                  coord: 738..802, score: 6.9E-5; coord: 872..940, score: 2.
IPR014720  Double-stranded RNA-binding domain  PROFILE  PS50137        DS_RBD                                                  coord: 737..803, score: 12.038; coord: 871..941, score: 12
IPR023214  HAD-like domain                     GENE3D   3.40.50.1000                                                           coord: 255..365, score: 1.4E-11; coord: 146..174, score: 1.4
IPR023214  HAD-like domain                     unknown  SSF56784       HAD-like                                                coord: 259..393, score: 4.33E-20; coord: 135..179, score: 4.33
IPR024094  Cytochrome f large domain           GENE3D   2.60.40.830                                                            coord: 1049..1137, score: 1.8
IPR024094  Cytochrome f large domain           PFAM     PF16639        Apocytochr_F_N                                          coord: 1049..1136, score: 4.8
IPR024094  Cytochrome f large domain           unknown  SSF49441       Cytochrome f, large domain                              coord: 1049..1161, score: 1.22
None       No IPR available                    PANTHER  PTHR23081      RNA POLYMERASE II CTD PHOSPHATASE                       coord: 1..438, score: 0.0; coord: 634..1013, score: 0.0; coord: 498..539, score:
None       No IPR available                    PANTHER  PTHR23081:SF7  RNA POLYMERASE II C-TERMINAL DOMAIN PHOSPHATASE-LIKE 1  coord: 634..1013, score: 0.0; coord: 498..539, score: 0.0; coord: 1..438, score:
None       No IPR available                    unknown  SSF54768       dsRNA-binding domain-like                               coord: 870..946, score: 1.24E-14; coord: 737..801, score: 1.8

GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
biological_process  GO:0015979      photosynthesis
biological_process  GO:0006950      response to stress
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0031361      integral component of thylakoid membrane
cellular_component  GO:0009522      photosystem I
cellular_component  GO:0009579      thylakoid
molecular_function  GO:0009055      electron transfer activity
molecular_function  GO:0020037      heme binding
molecular_function  GO:0005506      iron ion binding
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0016791      phosphatase activity
molecular_function  GO:0005515      protein binding
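
The GO rows above each consist of a category, an accession and a free-text term name. A minimal sketch, assuming whitespace-separated rows in that order, of grouping the terms by category follows. `group_go_terms` is a hypothetical helper, and the sample rows are a subset copied from the list above.

```python
from collections import defaultdict


def group_go_terms(lines):
    """Group 'category accession name' rows into {category: [(accession, name), ...]}."""
    groups = defaultdict(list)
    for line in lines:
        # Split on the first two runs of whitespace only, since term names contain spaces.
        category, accession, name = line.split(None, 2)
        groups[category].append((accession, name.strip()))
    return dict(groups)


rows = [
    "biological_process GO:0015979 photosynthesis",
    "cellular_component GO:0009579 thylakoid",
    "molecular_function GO:0020037 heme binding",
    "molecular_function GO:0016791 phosphatase activity",
]
grouped = group_go_terms(rows)
```

Limiting the split to two separators keeps multi-word term names such as "phosphatase activity" intact.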