Spo27314.1 (mRNA)

Overview
Name: Spo27314.1
Type: mRNA
Organism: Spinacia oleracea (Spinach)
Description: pre-mRNA-processing protein 40A
Location: Super_scaffold_133 : 316349 .. 345706 (+)
Sequence length: 3932
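
The location and sequence length fields above can be related with a little arithmetic. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state. Under that assumption the feature spans 29358 positions on the scaffold, far more than the reported sequence length of 3932, which is what one would expect if the sequence length counts only the spliced transcript.

```python
import re


def parse_location(location: str):
    """Split a 'scaffold : start .. end (strand)' string into its parts.

    The format is taken from the Location field of this record; the
    function name and return shape are illustrative assumptions.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    scaffold, start, end, strand = match.groups()
    return scaffold, int(start), int(end), strand


def span_length(start: int, end: int) -> int:
    """Number of positions covered, assuming 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end, strand = parse_location(
    "Super_scaffold_133 : 316349 .. 345706 (+)"
)
genomic_span = span_length(start, end)

# Sequence length reported for the mRNA in the overview above.
mrna_length = 3932

print(scaffold, strand, genomic_span, genomic_span - mrna_length)
```

If the coordinates were instead 0-based and half-open, `span_length` would drop the `+ 1`; check the database's conventions before relying on either figure.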
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptide, exon, CDS, three_prime_UTR
ATGGAAGAAGGTGGATTCTTTTGGGTGGTGGAAATGATGTACGGGGTAAGAAGTTTGGAGAAGGTTACGGAGCAATTGATGAGCAGTTGGGCGGTGGAAATGGTATACAGAGTAATTGAAGGATTTTTCTTTATTTTGTGATTTGTTTTTTAATTGTTTTTTATTTTTATTTTGTGGGTGTATGGGTTGTACACTTAACACTTATACTCCTATATCTTCATTTTATAATTTTTGTGATTTGTATATGGGTTGTACACTTGTACTCCTGTACACTTCTTTATTTTATGATTTATTTTGTAAGAAATATGAGTTCAATTTTGTAATGTTTTGTTTTTTGGGTGTTTTGCCATGAATTAGTAATGAGTTTAATTACTTATGTTCGTCTAAAGCTATACAAAGTACATTTTTTTATAAGGATCATTATACATTTGTAATTTAAAACAACTCACTACTATTTACATTATAATTAAAAAAAATATTGTAAAAGAAAATAATGTTAACAATGTCCATAAGTGTCATTTTATATAAAAAAAAATAGTGAAAAACTTTTAAAAAACCTCTAAGTGTTACCAAACACACTTATATAAACAGCTAGTCAAATCCGCAGTCAAATCTGCTAAAAAAAAAAAAAAGTAATCAGCTAACTGCTATTGTGAACCGTAAACCGCTAAACGCTAACCCCATATTGAGCCATTATAATTGCAGTCATTATAATTTATCGCATATCTTAAGTTCCCAAAGTTCTCTTTCCGAGCTCCAGTTAATTTCGCCGCCGTCCACAAATCGCAGCCGACTACGAATACAGAGTTTGAGCCTTTGAGGTATTTTGTCTCTTCAGTTCCTCTTTTATTTATTGGGTGAATAATCAAACCGAGGCAAAATAACCACCGAAGTACACCCAGATCCGCAGCATACCGAGGCGACTAAGCGAGGTCGCGAGGGAGAGGAACCGAGTCACCGAGGAAGAGGAGGAGCCGTCGAGCCGGCGAAAACCGAGGAAGAGGAGGAGCCGTCGTCCTTCGAGCCGAGCACCGTGTTCATCATTTCCAAGCACCGAGGACTCGAGGAGACGAGGAGGAGGAGCCGTCCCCGTCGAGCCGAAAGTCCGAAACCCGATTCAGCAGACAGCAAGTGCAACGTGAGGTATAATTAGAACATTTACTCATTGTTAACCCTTAGATGGAAGATTAATTTGTTAATGTTCAATCTCTGGTTTTTGTTGGGTACAATTAGTTAGTTTTAGTCCCGGATAGTTTTGACACCCAGATGTTAACTGTTAAGTATGATTTATTTTAATTGATAATCTTTTCTCTTGTTAAAACAGCTTCTGCCCCCTAAATAGCTTCTATGGGCACTGCCCAAAACTATGGCCCTCCTATGTCCACACAGGTAGATTGTTGGTGTTTTTCGAGCTTTATTATTATGGTGAAAATTTGCTGAATTTTGCTCATGTATTATTGCTTTTGCAGTATCGTCCAGCAGTCCCAGGCCAGCAAGGGCAGCCATATTTACCTGGGTCTGCACAACAGTTTCTAGCACCAGGACAGAACATTCCTTCTGGTCATAATCAACCTATGCAGTTCTCCCAACCCATGCAATTCTCTCAACCCATGCAGCAGTTACCTCCTAGACCTGGAATGCCTGGTGTTCCTATGTCGTCACAAGGTATGGCTATGCCATATGGTCAGCCAAACAGGCCTATGACATTGGGTGCACAGCAGAATCAGCATTCTGCACCGCCTTTTGGCAATCACCCATCTGGTGTAGGTGGCATGGGAATTCCTTTTTCTTCGTCATATACTGTAAGATATTTTTCATAAACAGTCTTGAGTTATGCTATAGATTTGTCATATAAGTAGTTTACTGGATAAGTTTGTTGGTGAATTTAATTGATCGCCTTTTCACAGTTTCAACCAGTATCCCATTTGTCAACACCTGCAGCATCAGTTGGCGGTCAACCATGGTTATCATCTGGAAGTCAAAGTGCTGCGCCTTTTGTA
CCAGTTCCACAAACAGCAGACCAGTCTTCAGGTTCCGCTGCCCCCGTTCCTGTAAGTTCTCTTGCTTGACTCTCAGTCAGTTTTTTATACAATGATATATATAGATAAGCCTCACATCTGCTTTCTTACTGAAGCTTGTAAAAAAATTAAAATGTGTTAGAAACTAAAGGAGCTAAGTAAGAGGGGAGAAAACTTTCTTTCATTGAAGAATTTGAAGTCTAAAGTTCAGAATGAGGCTTTTTATGATGGGGTAAAGTAGAGGCCCAAAATATGGGAGGGGAGGCTTTGATGGAATGATGGTGTGTGCATTAAAGGAATCCGATACCCCAGTTTAATTGTTAATGTGTTAATGGTGTGTAAGGATAATTGAAGAGATTATGGGAGAGTAAGAACAAGTCTTAAGAACCTGAAGAGTACTTTATAGAACTTATTGCCCTATAGAGCTTACTAAGTGGCTGTTACATCCGAGGGTTTTAAGAAGGGGAAAGATAACATTTTGGGGTAATTTTTTGGCTCCCATCCCCTTTGGGATTTTTTGGGTAATTTTTTGGCTCCCATTTCTTAATATCTTTTACTCCCTCCGTTCTTGAATATTAGTCCTTTTTTGTGTGAGTTGTTTTTGTTTGTTAGTCCCTTTTTGGACATTTCTTATTTGTCCAACATTTATCCCAATTATACCCTTATAAATATCCAAATGACCAACTTGCTTACTCCCCAGTAAACCTTTATCCCCATAAACCCCTTAATTATTTCTCTCTCATGGTTACTCTATTGGGAGTTTGGGATTCATGTGACAATTGTACTTATGTTGGACTTTCTTAGATTCCAAAAGGTACTAATGTTCAAGAATGGAGGGAGTAAGAAAAAATTGCCAACAACTACCTTTTATGCATGTGGAATTAATAAAACACCTTTTAATGACCTTTTTTGGGGTATACAAGGACTACAACGTCCTAATACAAACTTTTTGTTTAGGTAACACACATTTCACTTGATTCCAACCAAGATCCAAGATCCAGAGTCCAGATAGGTGGATCCCAATCTTTTTTCATTTACCTCCTTTTTCCCCCTCCTCTTGTTCCCTTCCATTTCACGATAAACCAGAGTATATTATCCCATTCACTCCCCCCAAAACTAGCACACCCATATTTTCTCACGTTCGAACTATTTGCTATACGAATCTGAAGTTGAAGTCTTCAACAACTAAGTAATCATGCTAAAGTCGTTAATCACGCTAAGCTGCTTAGAAATTCACCCAACAACATGAAATTACGTTTTAGGGATTATGAGTTGCTCTTGGCATGAATTTGGGGCTTTTCACATAGGTTATTAAAATCAGAAATCGAGTATAATTAAGGGTTCCATTGGTGTAAAAAAGGCGTTGGTGCTTACTCAATTGTGGATTTAAATTCGTAGAATGAAAATTTCATGCTCCAAAAGTCATCTCTCTGTTTTGGCATCTTCCATGACGTTGGCATTCCAAAACACACTTTTCATCCGAAAAAAATCTCCCATCGATGCGTTATTTTGACAAATTTTTGTCATCAATAACTGATAGATAGTGTGAGAACGAGATAATGTTTGGGTGAAAGAGAGGGGGGGGNGGGGGGGGGGGGGAGAAAGTGAGAGGGAGAGAGAGAGACTTGCTGAGAACCAAAGAGGGGAAGGAGAAAAGGATGAACTTCATAAATTTTGGTCGGTGGTGTATTGGTCATTTTTAGGTAATTGTAAAATGGAGGGGTAATGAGAGATAGTTGAATAACAATGTATCAAATTGGGTAATTCACTCATATTGGCTGTGGTAGTTCATTTCATTAACAGGTGGTGTTACCTAAAAAAAAAGTTGTTATTAGGTGGTTGTAGACAAACTAATACTTTTAAAGGTGGTTTTGTTCGATTTCAGAAACATAAGGTTAATTTGTCACAGTGATACCCGGAACGTGGTACGTTGCCTCGAACCCCGATTCTGATTCGTTCATTCCGGGTTAACCATGGTACGC
GACCTGGATAGCCGAAAGAGATGGGTGGCATTCTTAAATGAATTAAATTCCAAAAGAATCTTCTCTTGAGTTTCAACATCCGGAACTAACCTTCTCCTTAGCCCTATCCATGGCTTCATAAATATATCCCATAGGAGGCTTTTTCTCTCCATCAACAATTTTAAGCACCTTAACTAGTGCACCGGCCAACTTAAGAGCATATAAGACATTTCTTCAAAAAGTGTCTTGCAACAAATATGTTTCTATTTTCTTTTCTCCCACATCCCTTGACCACTTAGAGGTACTCCACTCTTGAGAAGTAACTATCTTCCTCAAGCTACTCTTTTGCTTATGAAATTGAGCAAGAGTGATGAAAGATGTAGCAAATCTTGTAACCGCGGCCTATGCAAATTCCTTTGATTTGTGAATTTCCTCATCAAGTTCACAAGTGATATATGGTTATATATATATAACCATTCATAAAGTTGCATTTGTTCATAGAATTCTTGACCTTGGGAATTTTCCCAATATCTTCCAACATCAAGTCCAAACAATGTGCGATACAAGGAGTCCAATAAAGATGTCTTTTAGCCTCCAACAATCTTCCTACAAATTTATAACAAAAGAGAAGGCATTTAGTTAATTAGTTTTAAATAAAAGGAAACGAAATGTAACATATAAAGTAAGAAGAAGATCTATATATCTTACCTGCCTTAACATAATTGGATGCGTTATCCGTCACCACTTGGATTATATTTTGTTTCCAACTTCCTCTACCATGTCATCAATCATATGAAACAACAAATTCACAACTTTCACAACATCGGAAACATCCATGAATTTTATGAAGAAATAGCCTTTTGGAGAGTTGACAAGAAAGTTCACTATATCTTTTGAGGCAATCGAATCACGCCATCAATCGGACATAGTTGAACAACCTTTTGTTGCCCACTCTTTCTGGTGCTCTTCCTTTAGTTTATCAATCTCCTCTACTTCACTCTTGAGGAGAGGAACTCTTAGATCATGCATGCTTGGTGGCTTGAATCCCATGCCATATTGTGCAACGACATCAAGCATGCCTTTGAAACTATTATAAGTAGCCGCATGGAAGGCAATCCCCGCTTCCAAAAACCACTTTACAACCGCTTTACAAGTCTTGTCTCTCAATTGCTTATCGCAAGTACCAAAGATACCTTTTCTATCATTTCTACCTATCAAGACATTTTCGGAAGAATCCATAGGATCCTTAACTTTAGGGTTTACGAAGTGGTTGGCTACTTTTTGAGAGTTCATGATTTCACAATCTTGTTATTCCTTATCATCGTCGTGATCAAATTGGTTAGCCATTGACATCATTTGACTCACAATTTTAGCTTGTGCTTTCTTGATCATAAAAGATTTCTCTTCCTCCCTCACATGCCCGGGGAAAGTAGATTTAGTCACATTTCTAAAACCTCCTACCAAATGTTGCTTCACTCGTAGAGCCCCTCCATTTGTTACTTTACCACAAAAGTCAAAATGCCAATTGGTCTTGTTAGTAGGTTCCGGTTTACCATAATTTCTTGGCAGATCCATGGTGTCTAAGGAAGGTGTGCTTGAACCCCCCTCGTTGGTAGCCATCCTAATATTAAAAAAAGAAGTTAGAAACCAATACAAGAAAGCTGAGCAAAAAAAAAAAGCTACTGTTGAGTGTTGCCTACCTAGGTTGCACGGAAACGGGTACGGGGACGGATACGGGGACAGAAAACGGCAAAATTAAAATTGCAAAAAACGGGTACGGCATGGATTCGGCAAAAAAAAAAAAAAAACTAATTTAAACATCCGCATATAAATAGTTCAAAGTTAAAAATACCAAAAAAATAACTAAATTTAATATCTAAATAACCTTTATGGGACTTTAAATTTTAGAATCTTTAAAAATTAAAACCATAAAACAAAATAAATAACCTCTCAATAAATAAATGTTTATATCACGTGTTCCTTACTTGTATAAAAAATTAAATGAATAAAGAAAAATAAAT
AAAGAAAAAAATACTCTTTTTGATAAGACAAAACAGCTGTATAAAAAATTAAATGAATAAAGAAAAATAAATAAAGAAAAAAATACTCTTTTTGATAAGACAAAACAGCTGGCAAAGGCTTGGTCAATGTGACAAAACAATAAAGTACTAACAGGCGTAATGAGTAACAGCTGTACATCCTCATCCTCATAATATTAAATAAAGAATCTAACTGCCTTGCTGTACAGCTGTACGCTTGTAAACACTTAACAGAAAACTAAAATAATGGAGTACTTCCGTAATAAATTAATAGAAGGACAAGGCTCGTCTTCTTCATCCTTTTTCCCCATCGCCATTATTAACAAAAAAAAACAGTCGTCTTCTTCTTCCTTCGCCATCTTTTTCAATGGACACTGTATCATTGGGCGAGACGACCGGTGACCGGCGGGCCTCTCAGATTTCATGAAACCTATAGAGTACGTGTCTTCCTCCTTAGAAAATGTACCCACCCCGTCCCCGACTTCACCTTAGCCGTCCCCAAACATAACTCGCGTACCCGGCCCTTTGGATTCGTTTTGGGTACGAATCCCAGGCGAATCCGTCCCGTACCCGTACCGTACCCGTACCCGGTACGGGAAACGGCGCCTGGGAGGCGTACCCGTGCTTCATAGTTGCCTACTAATTAGTTTGAAAAAGAAGCAGAGAGAAAAAGTATGAAAAAATAATTAAAAACAGAATATTACAAAATCAGAATATAAAAAATAGTTAAAAACAAAATTAAAAAAAACAGAATAAAAAAAGAGCCTAGAAAAAAGCAGAGAAATAAAATAACAGAGAGACAAGAAGAAAGAAAAATAAAAAAAAGAGAGAAGAAGACTGAAAAAAATCGAAAAAACAAAGAGAGAAGATGAATGAAAAAGAAAAAAATTCAAAAGAAGATTTACCGTAACCTAAAGATGCGGAATTTTCCGGCGAGTCCTGGGCTTGATGGCTGGAGAAATCCGGCAACAACAAGGTGGTTGGACAACCTCCAAAACCGCAGCTAGGGTTTCTCCAAAACCCCTAAATTCGTAGGTAAGGTTTTCTCCTAGTTACTCAGACACGAGTGTCGGTGTCGTGTCCGACAAGGTGTCGGAGTGTACTCCGATCCAAATATTTGCAGACACTCCAACACGGTCAAAGGAGTGTCTTGACTTGTTTTTTAGACACGGCTCCCTTACAAAGTGTCGGAAAAAAATTTGACACTTTTAATAAAAAAACAAAAATAGTTGTTAAATATTAATAAAAGAAAAAATTAAAATTACCAAACAAGGTAAAAGTTGTTTAAAGGGGTTAAAGATACAGAGAATTGAAGTGTAAACGTGGGTTTGAATTGGAAAGTACTCCGTAATAAATAAAGAATTGAAAAGTAATAAATAAATATAAAAGAGACTGACTTTTTTTTGGTAAAGATAACAGTTGGCACGTGGGTCTGTAGTCTGTTCCCACTTCCCACCTTTTTTTTCGTTTTTAATTAATTAATTTTATTTTATGTGAGTGGTGCCCTAGTAGGCCTGGGCTATTGAAAAAAAAAGGTTAAAAGAAAAAGAAAAAGGTGGAAAGCCCACCCACTCACCCTCACCAACGACGCAAAACAGAGAAACGGGAAGGAAACGCAGAGCTGGATAGCCATGCATGGCTGATTCTCAAAGTCAATTAGTGAAAACAAAAATTGAAGAGTGCAATAAGGGCGTTTTTTCTTCAACAACAAAAGCCCCAATATAGTTGTTCTTTTCTCGCTTTCCCCGACCTTCAATTGCTAAATCTAGGGTTTCGAATATTTACCCTTTTCTTTTCTCCTCTCGATTCCGACACTACATTTCAGACTCTTTTCCGACACCGGTGTCGACACCTCCCCGGACACGTGTCGAATGTCGTGTCGCGGGTCGGGTCGTGTCGACACCGACACCCACCGTCGGAAGAAGTGTCGGAGTAACATAGGTTTTCTCTCTCCTAAACTTAATATTCTGCCCATTAAACAA
AACAAAACTCACTAAAAATTGGTCACGTGAGTATTACGTGTTAATTTATTTAATTCAAGCTTGCTTCTGCGCCCATAACACAAGAAACGTGCGCTTCAACAAAAGCCCGCTTCCACATAAAGCGCACAAAGCACGCGCACCTTGTACCTCGCTACGCTATTCGTACCTGGCGCGTTCTCAATTGCGCCCCCAGTCGCCCTAGGCACAATTTTTAAAACTAAGCTCACCGGTCACCAGTAAACGCTACCTTGATGAACTCCAATTATCAACAAAACACCACCATTCTTGCAACCGTAGGAGCAAAGTTTTTTCAAGAAAAAGAGGTTTAGTCCTTTTTTTTTTTTTCTGGATTACTACGGTTGCCAATGAGATGTAGGATTTGATTCCATTCCGTCTGGTGATTTTTCGAGGTTGTTGGGGATACAATTGTATGAACGGGATTGAATTAGAAGTGTTCAATCTAATTTGTAATAACAAACATTAACAATTACGAAGATCGATTAAGCAAGCAAACCAAGCATGAAAATGGGGACAAGGAAATTTACCGTGGGAAACCCGAGCACTAGGAGAAAAACCCACCAACCCACTAGAAGTTATGATGAAGATTTACAAAGAGATTATCAAGCTAAGCAAGCTTCACCTCACAACTCACCCTCACTTAAACCTTAAGTGTATGAGCTTTCAATGAGGAGTTTCTCTCTTATAAATTAACAACCCTAACTCTAAGACAAGCATATTATGAGTATATATAGTGCTAGAGTATACAAGACAATCCAAGGGTTCTAAACAAGCCATATCCCTTGATCAAAACATAAAGAGAGGAAGCTAGCAAGAGCCGTCACGAAAACCAGCCCGCTGGCCACTCGCTTGTTCACAAAAGGAAGAAGAAATTTGCAACTGCAATGCCCGCTGGAAACCAAGCGGCCCCGCTGGATTTTCCTACGTAACCTTTCTTCCACCTTCCAAGCCTCTCCTAATGCCTCCTTAGTTTATTGCCCATCAAACCCATCTCCACACTACATCAAACCCCAACAAGTGTCACCTCCATCTCTGGCTTCATATTTTAATAATAGATCTCAACCATCCATCTTGTACATCTTGCTGGTTATGTGTCCTTGTATACCACAGTTTCTGCACATCAATGCCACCTGATCCGTTTTTTCTTTCTCTATCATGTATCTGTGTTCATGTGTTTGCGACTTTATTTTGAGGACTTGAGATGGTGAGGTAAGAGAATGTTTGAGAAAAACCTTGAAGGTAGACATCTTTTTGATTGTTGGCAATAGAAAATGAGACATTGTTTTCGCATTCGTGGACAGTTCTTATGATCTTGCTTTTATTGGATATTTTTTCGTCGTGGAGTCTAGTAGTGTAGTAGAACTGTGGTTGTGATCAACCTTTTTTTTACTACTCTTGATTTCAGGCTGTTAATCCCCCCGATACCAGCCAACAATCCTCATCTGATTGGCAAGAGCACAACACTCCAGATGGAAGAAGGTTCTGATTCGTTTCTCCTTAGCATTATTATTAATTTTTTCTGTCATTATTGGTGTACCTATGTCGATAACTGAATACATAGTCTACATTCATCATTTTGCTAGTCTATTACTTGTAGTTTTGTCTTTTCATCTATGGCCGGGGTCTGAAAAATAAAGATTAAATTTTAACTTATACATAGTGTACTGTTTGGCTCCCTATTTAACGATCAACACTTTGTTCATTTCTTTGTTATTCTATCTTGTACGGCGCAGATATTATTATAACAAGAAAACTAAGCAATCTAGCTGGGAGAAGCCTGTTGAGTTAATGACACCAACAGAGGTGAGTTATTTTCATCTTAGCATGCTTCTTTTTTAATTGTCTAATCTTTAGTTAAACGTTGATTTTTTTTCTTTTTTGTTTGGGTATTATTTGCTTGGTATGTTCCGGTTTATGGAATTATGGTTGTAGACTTCTATAACTTGAATAGAAGGAATACTTAATTGGACAAGATTGTTTTA
TTCCAATTCTCCTTTTGTTTGCATGTTCCCACTGGTTTGTAGATGTCATGGCAAGTCAGGAGACTACTCTATATGCTCACCTTGAAGGCGGATTCACTTTTGCAAGGCAGGCAATGATTCGCTATATACATTATTTTCTTAGATAGGAATATCTTCTCTTTTATGCTTATTTTTCATGTTGAATGGTTTCACATATGTTCTTGTTTTAAGATTGCTTCTCCTTCTTTCCGCAATTGACCTTCCCATTAAAAAATACTTCCTCTGCTCTTATACATGACACAATTATTTAGCCACGTTTTCCAATGCACGATTTCAAACATTAATATCTTCCAAAAATTATAAAAAATTGATATTTAAAAAATATTTATTGAGACGAATCTAACAAGACCCCACACGACTATGTTATTTCTTATGTATAAATCACAATAGGAAGTCAAATGAACTTTTGTGAATAGTGTCCAAAACCGAAGCTTATGGTGTAGGTACGTAGGTAAGCTCTAAAAAAGAGTCCCTGTCAACAACTTTCTTTCAAATACATTCATTTGTGAGAACCCAATTGTGTCATGTAAATAAGAATGGAGGAAGTAATTTATAACTTCGACCAACTGCTGAGCATCCTGTAAATTAATGGAATTAATATTTTTGGGTATTGTTAAAGTCTTAGGCCATTCACGTTTTTCTTTAATGCTCCCTTTCAAAATTACCACAATTTTCTTGTGAAGGCTACACATGTAAGTTGAAATGTCATCAATTACATGTCTGCCATAAAAGGGAACTAGTTTAGCCAAACTTTGCTAAGTGATTGCACACATTCGGCTGAAAAGGTGTTCCAGAATATTTTCTTGAATCTAATTAGAGCTGCTATGTCAAAATTTTTGGCGCAAAGAGCCTACTAAACTGAAAAGTTTCATAAACTCTTTGACCCCCCAAATTGAAGTATCCTACTTCCATACAATTCACGGTATTCAACAAATAATAAATATCTTAGTGTCGTAGCTGTCTTAAAGCACATGTTAAAATAGGTGGAGCCAAAGGGAAAAACTGAAAAATATCGCGGGCACATGCAAAATAATGATGTGGATAAAATAAACAGGCTGGAATCATTTAGTAAGTTGGGGAAAAATAAGTGGAAGTCCCAACCGACTTATGTCAGCCTCACCTGTGGTAATCTGATCAGTGATGTTCTTCCCACTTTATTTTTCTTTCCTTCTGGAATCGTGAACTTTAACTTTATTTGGCTGGAACCTAGAAGTCACTAGTCATCTGCAGCAGCGTGAGATTGACCAATATGGACATGAGCTGTCCCATGATTATCCCATAATAAAAGCAGTTGTTGATGATATCTTCTTCTGAGGTCCTTTTTCTCCCGTTGGTACTCTGGTAGAATGGAGTAGGGATATAGTGGTTGACTACAAAATGTTTTTTAATTGTGGATTGTTTGTTCATTAGCATGGGTTCACGGCCAAATGTGTTTCTAGTCTACGTGGAAAGAGTGGTTTTGGATTAAGGATTTGTGTCACACTGGCACCCTCTTTTTCTTTTTTTCAATCTCAATTCAAATGTGTTTCTAGTCTACGAGGGAAAGAGTGGTTTTGGATTAAGGATTTGTGTCACACTGTCACCCTCCTTTTCTTTTTTTTTCAACACCTAGAATATTTTGCACTGTAGTTTATGACTGGTTCAAAGAACTTTGAGATTCAGGCGACTCTTAAAATAAAAATCAAAGCCAAGTCGCGTAGTTAGAAAATTCAAAAATCATGCTACAAGAATTCAAAAACCTTTTATGCTATTGATTTGATTCAAACGGGGTGGGGGACTGCAACGGGCTATGCCACCCAAAACAAAAGTGGGAAGAACTGTTCAATGCTGTTACTTGGTTGGGACTTTGTTTGTGGAACTAATAAGTAATAAACGCAAACACTTAAAAATCTGTTTGACATGCCGATTCACATCTCTTTCAACCAACACAATACTCAATGTTTGGAGCATAAGCTATTTGT
TTAATAAGGTTGCTGTAATAAAGATGAGTATGCCCATGAGTTGGCTATACCCAGAATTAGGAATAAAATTAGACGAGGCCACTTTATGATATAGTGGTAGCTTTGGAACAATTAGGGGATTCTGTTTGGAACAATTAGGCGATTCAGTTTGGAACAATTAGCTTGCGCGGTCCTAAGATATTGGAATCGATTTGCAAAACTAACTGAGTAAATAGGTTGATAATTAGACTAGGAGTAAATGATGTATCAAATGATAAGTAAATATGTTATGGTTGCTAATAATACCTTGCAAATGAAAGAGCCCGATATTTCAATTGTTACGTAAATATATAATCCACGCATAAAAAGAAGCATAAATTATTGAAAAGAACGACTATCATGGAAGTCTCGTAAGGAAAGGAATAGAAAATAAGTGAGAAAACGATCGCATAAACTTCTCCTTTCCAATGACAATAGCTTCCTGTCCCTTATTCGCGTCATCATCATCATCACCAGAGAGGTCCACAAAAGTAGGCGAAGCTGTTTTTACTCCCTTAGCACCAATATGCATACTATTCCGAGTTCCCTTAGGTGGAGTATTCTTCTTCCCAGATTGCTTGCTCTTCTCGTTCTAACAAGAACTCGAATTAGGCATTTTTCTATTGTGGGATTGGCTGAGGATAATTGTTAAAATTCTGTAGGGTGATCGAATTTGGAGAGGAACTGAAGATTGGATTAGGGAAGTATCCATAGGAATTGGTGAGGAAAACTTTTGTTTGCGTTTGAGTTAGGTGAATAGTTGATGTTTGTGTTTCTTGATATTAGAGAATGATGCTAAACATGCTTGTATAGTGATGCTTAATGTTACAAAATTGTAAAAAATAATTATGGGTGATGTTACAAAGGTATCAACATGAGTTAATTTAATGAGTGATGTTACAACCACGATGACCATGAATAGTGTTAACAGAAACTTTAATGTAGAGGGTCATAACTTTCAAATTAGTATCGAACCAATATGGATGATAGATAGGGCCAAGTATAATAAAAGTAAATGAGTTGTAAAATTTACCAAAAGGGGAAGTGATGCAATTTTTTTGAAACACCGAAGTAGCAAGTGTTTGAATTTTAGTGAAATGGAGGAAGTAAACAAATTGGGTGGTTATTATGCAAAATTCCAAAACAATATGAATTAGACATGCGATTTGGGGGATGATATTGAGTAGTTCAGATTTCCGAACACAAGGATGATAATCTTTTTTATTGGGTGTTGATATAATAATTTTTTTATTACTCGGTATTATTTTGTTATTCATTCTGAAACGTATAACAATTTGAACATGCATTTTTATTGTTTTAGCCTTTTAGGTGACTATAGACTAGAGTCCCACTCTTACGAATCATTATCGTGAGGTTTTTCTTTTGATATGATACTGGAATGTTGTCTACTTATTGTGTAAAACCAGAAAACAGGAGTTGTTTCAGTTAGTTGTCTCTGGTTAAGCTATTCTCCGTAACTGGTAGTGCATTGGCATATCTAAGATGGATTCTAAGATGAGCTTTGCAATGTGTTTGCAGCTTCATTGTTATTCAGATTTGTGTTTGAGGTTGATGTTTGTGTAAAGCTATATATCTTCGGAGTTAGTTCCTAGTAGAGTTAGTGCCTAACATATAAAAAGACATTGTGTAACTTAATCAAAACATGACCTATTAGTTTGTTGGTTTGTTTGTGCTCTTCGTCATGCTTTTAGCTGTTAGTTCGACATCAATGATTGGTATTGTCTCAGATGTTCTAGTTCCAAGACTTCCAACTGACATCTCATTTTTGTTATTGGTGAAATGACTGAGAATGTATATAAAAATGTTTCTGATAATTGTGCCGTGATTCATGTTGTAACAAGAATATGCCTTGTCTAAACCGAGGAAATCAGGTATCTTGCTTCTTTTTCTGTGTGCTAAAACATCTTATGAGGCTGTTAATAGCTGTCTTGGGTTACGTTAGTTGACACAAAGATTGTTTA
CGTGTCTGTATGACCGTTTGTTCTCTTAAACAGTCGATGATTTGGAGCTTGGCATGCTCTTTATGATTTTTGTATGTGCACGTGTTCACTCAATATCTATTCTTACCCATTATGGTGCTGCAACAAAGATCAGTGATCTTTATAGACACAGAACCACCACATCACGAACTGAATGCGGATACTCTTGTTCGTTCATTTATAAGAAGCAACAAGTTCTTTTCGCATGCTTTATGTAATAATTGATTGTACAATGCTATTGTCAATTTTTTTGGCTTTTGGTAGTCGTGTCTCTTTCTCTGTGAGTTTGTGAGGGCTCCCGCTTGTCTGTATGTTTACATTTTTTTCCTTTTGAGTATCGTAAGGGATTCTGATGTTCAATAAAATTATTTGAATCCTTCATTAAAGGAAAATAACTTAATGTTATTTTGAATCTAATATCTTTTACCTGTCACAGAGAGCTGATGCGTCGACTGTATGGAAGGAGTTCACTACTCCAGAAGGGAAGAAGTATGTTCAATGTTCCATTCTATTATTCATGAGTGTTTCATGGGTTTGATTAACCTGTTGGTTCATTGTAGATTCGTTGCTCTGATGATCATGCTTTGTCTGTGCTGCTTACTAGGTATTACTTTAACAAGGTTACAAAGGAATCAAAATGGACCATACCTGAAGATTTGAAGGTATATTAAAGTGTTTGGAGAATAAACTAGGACTTGCTGCCCATGGGCTTGTAACACTAATTAGATGCTGGACTTTGCAGTTGGCTCGTGAACAAGCAGCAAAAGCAGTTAGTCAGGGAGCACAGTTAGATGCAGGAATGAAATCTCATCCTACAACAACTGGAGGTTTTACCTCTGAAGTAGCACCATCAACTAATCCAGTTTCCGGCAGCTCCAATGGGTCATCATCTGTTTCTGGTGCAAGTTCAATCCCGTTAAGTTTTCCGTCAGGTGTTAACCCAGCACCTGTAGTTAATGATGCGTCATCAGAACTCCTTATGGAGCACTCAGCTGCTCCGACCAGCATGGCAGTGACAAATGCTCTTGCAATGACTCCTTTGTCTGCTTCTATTTCTGGAGATGATGCTCTTCCTGCTGCATTAAATGCCTCCTCCATTACAGTGTATACATTTCTACTCCTGATCTATCTTTAATCTTGCTTTTCTTGGTCACATATTTGCCTGTTATATCTACTAACGGGGTTAGATTTAAACCTCACTTTTTTTGGGTGGTAGTACCAACGAAGAAGTAAATGTATACTGTTGCAGTATCTAGGAAGACTAGCTGATAATGGTCTGCTATTTTTTAGCGTTCTTACAAGGACTAGAGTGCTGTTGAACTCTTTATACAAAAGAAATGCTAAATAATCAACCTTCCTTTGCTAGTAAGTCTACTAGTTTTATCACCCTGGATCCATGCTTGGTCATCATCTGGCCCAATCTACTTTCATAATATTTGCATTCTGCTGAATGGGGGCCAAATCATTTAACAAGATAGTGTTAGCCTTCCAACTTGTCTATCTGGGGGTATGAGTCTTGTCAATCCTTGGTAACTTTGATCTGGAAAGTTAACAATGCTACAAAGCTCATTTCTTTATGTTCGAGAGGACATACATTCACTGAGGAGTGAGGACCCAATTCTAAGCGCTGTTTCAAATAGACATGCAGTAGAGTTGGTCCTAAGACCTTGATGCCAACCTTTATCCTTTTTTCTGGAGAAGAGGGGACATAGAGCTTGCTTAAAATTTAAAATCCAGACAAATACCCAACTGGCCAACTATTCCTCATTGTTGATAGAGGGGGGGGGNGGGGGGGGGTCAAAGACTCAAAGTTTTATTGTCAAACACGAAGAGGATGTACTAAGGTAAGTTGTTAAATGTTAGTGGATTGTTTTGAGGTTAGGTTGTGTTGTTTTGAAATCCCAAGTGGAAAAGGTAACAAAGATAGGAAGTTTAATAAGCGAAGGAGATTTAAGGGGATCAGGGGATTTCTTACGCCAATT
CAAATTAAGTCATCATGAAGACTGAAAGCAGTGAGATTATGCTGTCCTGGTAGTGAGACACTGTTGTTGATAACAGCGTCAGGAGGGGATGGATACAGAAGAGGTGTTCCCCATGTTAATTTCAAAATATCTGTTACTTCTGAAAAGAATTCACTTTTCGTTTTTGGTTCATACACTTGGTCACTGAAGGACTAAACCCAGTCTATTCATCTTCTCAGAGGTCTGTAATGTCCTGCTAGTTCTAATTATATTTTTCTTGACAGGAATGCTTCAGACAAATTACCATCTCAAGAAATTTCAACTTCTACAGATGGAGGTTCTGCACATGATCTTGAGGTATGTGGGTAGGCATGAATATGTTACTCCCTCCGTCCCAGAATACTTGTTACAGTTTTTTTTTTTTTTTTTTTATATGTCCGGGAATACTTGTTACATTTTCCTTTTTTCATATGTCCTAGAATACTTGTTACCACTTCCTTTTTTCTAATATGTCCTAGAATACTTTTTAAACTTCCATGTTAGGAATGAACCCACAAATATTTTAGTATCTGTCTCTTGACACACTCACTTAATCGAAAAAAGAAAAAACCTACTAACTCATGTCACATCTATTTTTGACTAAAATAACAATTGATAACCAAGCAACCATGTATCATCAAAGACTATGTTCAAAGATGAGTGTAACAAGTATTTTGGAAGGGAGGGTGTATAACAGGTGATTAGACATATGACTTGCAAATATTGCAATCTGGATTCTGGTATACACCATACTGCAGTAAATTCTTTGTGCATTTGTGCTTATATCGACTCAAACTGAATTTTAGTTTGATGCCTTACTGAATTCTTGTGTACATATTAGGAATTACGTAAAGGGATTACTGCAGGGAAAGTTAGTTTGAGCGAGAAACCAACCAATGATGAACCTTTGGTTTTTGCCAACAAGCAGGTATGATACCTTCAGATGACTTAGTCTAGCATGTTCTGTTGCTTTTTCTTTGGCATTCATCTAGTAAGTTTTGAGTTTGTTTCTGCACGTCTGTATGATTATTTCTGCACTGACCCATTTTTTCTTTTCTTAGGAAGCAAAAGGCGCGTTTAAGTCACTTCTTGAGTCTGCAAATGTCCACTCTGACTGGAATTGGGATCAGGTATGGAGCTGATTTTTTTTGTGAATAAATGAAATAGCACAAAGTCTGTCCGGTTAGCCTGCCTATGGTTTAATGTTTTATTTAAATATATTACCAGGCTATGAGGGTGATAGTCAATGACAAGAGGTATGGTGCTCTGAAAACATTAGGGGAGCGGAAGCAAGCTTTTAACGAGGTAGTATTACATTGATGATGCTTTTCTGCAACCTGTATTGCTTTTCTTGGTTTGTTTGGATCTCTAGGTTATCTTTCTGTTTATTTTGTGTTCAGAAGACAATTTTTATTGTTATTAATATCAAGTTCGTTCACGACTTCCTTTTTTTTCACTAGTATTTAGGACAGAGGAGGAAACAGGAGGCTGAGGAGCGGCGCTTGAGGCAGAAGAAAGCAAAGGAAGAGTTTACAAAAATGTTGGAAGTATGCTGCTTTCCTTCGATGCTTTTTCTCCTAATTTTTTCCATTGCATTTTTTCCTTTTATCATTTTGTTTGCTATGTTGGGGAACGGGCTGTGAGTTTGATAATAACATAATTTTTTTATTTGCTTTAAAGGAGTCAGAGGTACTTGCATCATCGATGAAATGGAGGTATATGTTTTGTGAGTTTACCATTTACAACTTGCAAGCTTGATCTGCTTCCTCTTAATTGTCTCATATTGAAGCAGATGTTGTGATTCTTATTTAGATTCGTTCTGGTCTGTTTAATTGTTCTGATTAGAGTTTGATGTCATCAGTAAAGCTGTTACTATGTTTGAAGATGATGAGCGGTTCAAAGCTGTCGAAAAGCCGAAAGATCGGCAGGAGCTTTTCGATAATTACTTGGTGGAACTCCAAAAAAAGGCAAGTATTGGATTT
AAATAGCATCTTGTGAAGGACAAAAGTTCCGAAATAATGTGACTAACAGTTACTATAACGAAAAATGCTAAATATGCTTCCCTTAAACACTTATGCTGTCATGTGGATGATGTGTCACTTTAGTAAACAAGATTTAATACCCTTTTTCAATATCAATCTAAAAATCATGCTGTAATTTTAAAAAAATAATTCTCTAAAACACAGTAGCAAAGTGGCTCTTTCAGTTACATGAAGGTTTCATTCTTTTTAACGAAGTCTTTCAAACCACATTTGGCACTTTCAATCATGATAATTTAGATGTTTGAGAAGGGTGCGTTAGTGAACCAACATTCTAGAAATCATGTCACTTTTTATTTTGGTTTATAGTCTCAGCTATTCGCCTCTTCTTTCAGTTTTCGTTTTGATTTCAGCTTAATCAGGTATACTTCTTGGCTGCTCATGTGTATGCTTCTTGTGCTGAGTTTATTCAGGAAAAGGAGAAGGCCGATGAAGAGTACCGACGGAATAGAGAAGATTACTGGACGTTTCTGGAATCTTGCGACTTTATTGAGGTGCATTATGATGCTTCTTAATTTGTTAGTTTCCTTTTACTAGGCATATCATAATGGTATATGTACTTATTTTTGATGTTTAGGCAAACAGCCTATGGAGAAAAGTTCAGGACCGCTTGGAGGATGACGAGAGATGCTCACGTCTTGAAAAGATTGACCGCTTAGAAATATTCCAGGTTTATAGCATATCTTATTGTCTGCTTATGGAGTTATATCTACTGGAACGTCTTGTTTAATAGTGGTGTTTAACAGGATTACATTCGTTTCTTGGAGAAGGGAGAAGAGGAGCAGAAGAAGCTTCAAAAAGTTTGCATACTACTATATTTTTCTCTCCCTTTACCGTTTATCATCCCCCACATTTTTTTGGAGGCCATTGACTATTTCCACAGATCTTACTTGGCTTGACACTAATACTCATGGTTATTAAAATCGCGATTCAAATCATAGAATCTTACGATTTTACGATTCAAATATGTCTTACGATTCGATTTACGATTCTACACGATTCTACACTTTTTTCTTAACTACCGAAAAATCTCGTAAATATTGCTTGTATAACTATTAAACAACTATATATTTCAATTATTAAATTTTTTCCTATAATATAAGCATTTTAAATGAAATTTGAGAATAAAATCTTGCTTATATTTGTTTGTAGAATCTTACGATTCTACGATTTTATTTTACGGTTCGATTCTAATAGTCCCTTCCGATTCAAAGTAGAATCACGATTTTGATAACCTTGCTAATACTATCAGTTTGAGATTTGAGTTATGCTTTTGTGATTAATTAATTACAAACTTATAGGAACAACTGAGGAGAGCAGAGCGAAAAAATCGTGATGAATTTCGCAAGGTGTTGGAGGCTGATGTTGCGGCTGGTGTTCTTACTGCAAAATCAATCTGGCGCGATTATTGTGCGAAGGTACCTAGTCTTTCCTTTTTATTTATAAATCAAGACGGAGGGCTGGAAATTCCCTACAGTATACATCGTTTTTGAAATTTGTTTTCTTCTTGAACTACAGGTCAAAGAATCTCCTGCTTATCTTGCTGTGGCACGTAACATATCAGGGTCTACTCCTAAAGATTTGTTTGAGGATGCGGTTGATGAATTGGAGAACCAGGTACATACCTAGCATCCACAGTTCCATATCTTATACCTATCCTCTGTTTGTCCACTTGTTTTCATATCTATTTCTACAAAATTATGCATTTTTTCTGTTAGACTTTTGAATTGGCCTACATCAGCCAATTTAATTATTTCTTACAAGGAATTTCTTTATTGAAACAACAAAGTAAGGCGTACAACAGTAAGACAAGGAGCCCTCAGGGAAAATTTGTATTCAGAAATCGACCTTGTTCCTTTGTGTCTTCCCAATGAAATGATTTTGGAAACCCCTGCTGCCTTAGCCCATAAAGATGTTATCAATGTTATTTATCCCAGAATCTA
TTATAGGAACCAGCAGTTCCCTTGAAAATATGGGCATTCTCCTTCAGCCACATAATCCATAGTATAGCCATCACAGTGTTTTGCCACGATAACTTTTCCTCCACCTTACCGAACCCCGCAAATCAATTTGCAACCAATCCTTTATATTTGAAGGTTGAACCTAGTTCTCTCTGAAAGTCTTCTTCACAGATACTTGTCTACTCACAGTGCGGAAAAGAGCAAGACTCACTTAACTCTGAAAACAAGCAGATTTGAGTGAGATAGCCATCGCCAGCCTTTGTTTTCGGAGCATATCATTCCCTATTAATCCTCTTCAAAGCTTTCCAGAACTCCACATGAATGCCCGCACCTTAAGAGGAACCATGGCTTTCTTGAATCTTGATAGCGTTTTCAAATGGGATTGGTGAATATTAAACCTGTGCAATGGGGCGGGGCTGTATTAAAGGCCCTGAACCATCCCTGAAACTAAGGGCCAGGCATGTCCCCTCACATTAACGGGTTTACGGGGTAGGTCTGCATTTTATGGGATTATAAGTTTATAACAAGGTAAGCCAGGCTTTGGGCACTGCACGGTCTGCATTATATGTTTTTATGGGAGACTTTTTTCTGTTTGACTTTGTGGGCTACTGTTCTAAAACCTCGTGCTATCTCTTTCTATATCAGCATTCAGTATCAGTCTTTCTCTAATCTCTACTCTACTGTTTTCGGTTTCATCAAATGTAAGTTCTTCATTAATTAAAATGCTATTCTTCGATTTGTTTCCCTACCCGTCAAACCTCCAAAACCTTACTTTCTTTCACCTGCCCTTCTCTTTCTGTCCTCTAATCTCTACTCAACCCACTCATTATTTTTAGCTCTCTTATCCTTTTTTTTATGGTTCAGCCTATCCTTGAAATTTGTGGAAGTTTTGAGGGGTGAAGAGAAAGAGAGAGTAGAGGAAGTGATGTGGTTATGTACTTGGAGCTGCTGAAGTATTAATTTTACTCCATGAGTTATGCCTCAATTCTTTTTGATGGGGGATGTTAGATAAAAACTGGTTTGATTGAGCAGGTGTGACCCGCCCTGCTATGTTATTTCATTATTCTTGACCCGTCCTGCCCCTTTCCTGTTTTACCCCATTGTGCAGGCCTAGGGATATTGTATAGGTTAACCGAAAAAGCTAATAAAGGATTTACAGGATATCTGAAGAATCTCCCACACATACACTGGAGTCCTTCACATCCATAAACTGAAATACTAAGTAGCCAAGTGGTCATGTAGAATCGGCCCCCTATAACTCCCGATCATGGGATAGTGTCCTGAAATGTAATTCCCACGCAGTACCCTATGAAAACCTGGAGCATCATCTAGAGAAGATAAACATAATTATCGCGGGTAACGGGCACAAAAACACGTCTCTTCTATCCAAATATCCTCGCAATAGTGAATCCTCTTTCCGTCCCATATTTTGTACCTATCAAAGGGTTGAAAGATGAACAGAAGTTTGAGATTAATTTTTATGGACAGTTATGAGTCAGCTGTCTGTATCCCGAGCATCCCACCCATTTCTTACGTAATCCATACTTGCTCTTGATGATGTCATACCAAAGAGACCAAGGCTCAGATGGAAACCTCCACCACCACTTACATCAGCCAATGAAGACACTTCATAATCTTCATGAACTTCCTGTTATATTTTTTTTAAAGAAAAATTTTGGTGTTTTGTTTTTTCCAGCTTACAATTTCTCTATGCTATAATATTAATGGTTTGCCATTATGTTCATTCTTGTTGGTATAATTAATATTATGTGTACTGTTTAATTGTTGTGAGATATGCTGTTGACTATTTGTTGTTCCTTCCAATGTTCTGTAAAGTAGGTCTTTAGAGGAAGAGCTAGTTTTCTGTCCTAATTAGTAATTACTATACAAATATCTGAAATGTCTGAGTTTTTTTTATCTTCTGACAGTATCATGAGGACAAAAGTAGAATAAAAGATGTGATGAAGCTGAGCAAGGTAAGA
TGAAATGTGTTATTTGCTTGCACTGTGTATTTTAATTGTCTAGAATTTGCTTTAAGCTCTGTCAATAGGTTTACATGAATGTATTTATTTATTGCAGATGACTATGACTTCTACATGGACTTTTGACGAATTTAAGGAATCTGTTTCCCCGGATCTTGGGTCTCCACCGATTTCTGATATTAATTTCAAGGTATTTATAGATAATTTAGAGTTGTAGAATAGTTAAATGGGCTGTTGAATATCTGAATTGCCTGTTAGGGTTATGTCTGCTGACTTTTGCAACTTTATTGTGTTTTCTCCCTGCTAAGGAGCTGTTGGGTTGAAGGGTAAGGAAAGTAATTAAGAAAGAGAAAGAAAAGGAAGTAAATCTACATCCTTTTAATCCAAATTTTGTTTGGTTTGAAAAAGAGAAGGGGTTTCACTTCCTGATAGTCTTTGGTTTACCAGATTGATTCCCAAAAAAAAAAAAAAAAATTGTCGTAAATGGTGGTAAGGAGTGGTCATAAAGCTCCTCAAATTTTAGAAGAATCTTTAAAATGAAAACAATACGGACTATTACTAATCTGATAATGATGGGATATTGTTTCTTTTTTCTTTCTTCTTTCTTCATATTACGGAGTAGTTTTCTTACCCGTAGTTTTTTTACTATCATTTTGAAAAAAATATAAAACCAAAACTATAACCTGATTTTAGTTTTTTAAAACTAAAAACTGAAAATAATGCCGAACGGCCCTTAGTTGGTTTCTTTTTACATTTGTTGCTGTTGTTATAATATGATTTCTATCTACTTGTTTGCAGCTGGTTTTTGAAGATCAGCTTGAAAGGATCAAAGAGAAGGAAGAGAAAGAAGCTAAAAAGCGCCAGCATCTTGTAGATGACTTTACTGATCTATTGCGGTCTCTTAAGGTACCTATGAATTCATATATCTTGTGACCGACTGTTTTATAAGACCTTTTATTTATTTCTGTGTTGTTTTGTAGGACATAACTGCATCTTCTACTTGGGAAGATTCTAGACAACTCTTCGAGGATAGTGAAGAGTACAGGTATACCTAATTTAATGATGCTGGTAGTTCCAGTTTTATATGAGGTTCTGAATGATATATTGAATCTGCTATCAAACCATATACTTTATCCAGGGAAATTGGGAATGAAACACTGGCAATGGAAACCTTTAAGGACTGTGTTGTTTACCTGCAAGAGAAGGCAAAAGAGAAGGAACGAAAGCGTGAGGATGAAAAGGTATAGTATCCACTTCATTGGGATTGCTGTCAGTGTTTTTGTTGATGCAGATCTGTGAATAAATATAATTGTTGGCGTAACCAAAATTTCATCATGTATAATTGTTGGTGAAACTGGAATTTCTTCCCATTGCACTACCTGCTATTAATTCAAGGATATAATTCATTTGCCTAGCCCAATCTCAGTGCATTGGAAAGATATGGTGTATTACTCAAACCTCATACAAATTGGGTTCCAAGTTTATAGGACATGCAAGGAAGTCTAACTCTCTCCTCAACCTGAGCTCTCTATCTCTCTTCTCTGAGTCTTATCACTTGCTTACTGCCAGTGTGATAGCACTGCAACTCTGCAGGTCAATGAAGACCAAAAACTTTACCTGTGATTAAAGTTTCCAGTTTGCATCTTCTTTATCACTCTGATGGGGGGCAAGGTTTTTGGGACGGGAAAGTTCCTTGTCTTTCTGCTGTTAGCGAAACTCAGAAATATTTGCAATGAACGGATTGATTTTTGTTTGAACTGGTCTTTTCTGCTGATCCAGAGTTTACTGTAACCAGCTTTAACTTGGTATTATAATGTTTATACCCGTGTAATTGATTCCTTCCTTACATGATCTTAGATACCTTAGCTGTTTGTTAATGTATATAGCCCTTATTTTTGGTTCTGAGCAATGAGTGACCATTTTATCACCTAGATAAGGGAGTATAATGAGACTTCTAATCTACATTGTAATACAACTCTGACAAGTTACCAGTATTGATT
GGGTCTTTCAGGTTTCCTTGAAGTTGAATATAACAAGGTGCTTGTGTGTTATTATTTAGGATATTTTAGGAAAATCAAGTAATAGAAGCAACTTCTATATTAGTTAGTGATTCGAACAACATTTTGTTTTTTTTCAAGTGTGAAGGTCACAGTTTTTTAGTTAGGTTGATATCTTTACGTTGCAAACATCTTTGTACATATTATTTTTTGTTAACTCAATCCTTATATGACTTTTATGATCAATCCTTTGTTAAATGTGATGTTTAAATTGAATGTTCCCCAAGTATTTAGATGTGTGTTTGCAAACGGTATATTTGTCAAGAACACCTTATATACACTTCTATTAATATAACAACCTTTTTAAAAAGCTTTATGACTAACCACCTTTTTAAATTAATGAATTTAAGTTTTTGCAACCAGAGTCCGCCGATTTTTCATTTAAAGTGGTAGCAAAAACTTGTCTGGACTTAAAATCATCACAATTGGCTAGACTTAGTTTGGGTCTCAAAAACCCCTTCATGTGGCTCGAATCTATATTAGTTTTTTGGGACTTATTTAAGACTCAAATGCCTAATGGTGACACACTTTCATGAGGCTCGAAAAAACAGTTCTCAATTTTTATTGAAGCTCCAAAAATATTAATCAATTTTTTTGGAAAAAAAGGATTTTCCTTTCCGATTTTCTAGAATGTTATCAGCTAAAAAGGTTCTAAAATTTGCTATCTTTTCTCATTTTTGCTAATCTATCTAATGGATGGGTCTACTTTTGACAATTCTTTCTCTCCTCTTGATCCCCTTCTTAAGCCTCCCATGTAGATGCTCTTATAAGTTAGTTGTTTGAATATAAACAAAGTTTCACCATGCGATTTAAATGTGATACGTCGTACTTGATGAATTTCCATTAAGTAGTAGCAAACATTGAAGTTTCTTGATTTTTTAAGGTAGTTATTCAAAAAGTTAAATTTAGAAGGTAGTTATCTGAATAAAAGTGTATAGATAAGCTGATAGGTAATATTTGACAAATTTACCTTTGTGAAATGAAATTTATATTGTGTATACTATAGCTTTACAAACTCTTATTATTGATTTCTATTTCCGTTTCTCACTTCCACAGGGTTATAAGTAGGTTGTCCTAAGATTTTTCTGTGCCATGTGAAATAAGTAATGTGGGAGTATAGAAATTGGTGATATAGATTTTCTTGTGATTTCTTTCATGTCACCTTTATTATAACTTTACTTCCTTCAGATTCACTCTAAAGTCTATGTTCGCTCTCAATCATTCTCCCACAGAGTCTTTTCTCCGTGTGGTTGTCTTTCTTCCACCTGATTCCCTTTCCTGTTGGGTTTAATTGATGAAATCAGTGGGATTATAAAACATGTTTCCTGTCTCCTACAATGTCTGTTGGATGCCATTTTTATATTCTTTTTGCTATCTCTTTCTGAATCCCTTCTTTTTCTTCTCTTCTAGATTACCCTCATCCTCTTTTGTCTTTCAAGTAATTTTCAGCGCCTACTATGTTTGAACGTTTCTATGCCCACTGTATTGCTAATCTCTGTCGGTTTCCTTTTAGTACTTGATCTCTTGCAGGTGGAAGCGATAAGCCACTACTTCACTTCAGCTGCTATTTCCATAAATAGGGTAGTTGTCACTACTCTCTTTTGTCATTGTGCCCTTGGTCGCCAAACTCCCTTACCGGGTTTTCGATGTTGGAGAGGTCAGCCATGTATTGATATGGATGGTTGGGTTGCCAAAAACACGTCTTCTTAAATAGTGTAAAAGGGTGTTTGTCATTTGCCATTCTACCTGTAAGCCCTGAGTGGTTCGTGCATAGGGTGATGAGTTTAGGATTCGGATATTGAAGATGAACCGCTCAATTTAAAGAATTGGTTATTATACCTTTGTAGACTTCTTGTTAGGACAGTAAGGGGGCGTTTGGTTCAAAGGGGGTAAAGGGAATGGAATCAGGAAAGGGTTTGGAAGGGAAAAGAAAGGAAAACCCT
CAAATGCGTTATGTTGTTTGTTTTGACACTATAAGGGAATCAATCCAATTCTGAGTTCTCAACCACCACCAACCATCCCTTTTCTTCAATACACAACAACCACCGCCAATCTACAACAACCACCACCCCCTCTACCTTCGGCTACTTTCCTCCCCTCTGACGCGGCCTCTTCGGTCTTTACCGCGGAACCGCTGCCTCTTCGGCCTTCACCGCTGCTCTCCTCGTGTACCCGCTCTCCTCGCGCCGCTGCACTCTTCGGCTCTCCTCTCCCCTCCGACATCGCTGCTCTCCTCGCGCCGCTGCACTCTTCGGCTCTCCTCGCGGCCCGGCTTCATCAAGATCTATTTATTTTATTTTTAGATCTAGTTATTTTATTGGATTTTCTTAATGATTGAGGTTATTTTTTGGGTAGTTATGTGATTTCAAGTGGTTAGAATTGGTTTTGGGGTGACTTGGATGGTTTCCGGCAAGTTTAGAGCTTTTTTTTGGGGGTGGTTGGGATGAATTTCGGGGGTAGGGTGGTTGTTTTCGTGGCGGGAAAGGTGGTTTTGGTGGTGGTTGGATGATTTCGGCGAGGAATAGGGTGGTTTCGGTGGTGGTTGGGTGATTCCGGTGGGAAGTAGGGTGGTTTCGGTGGTGGTTAGGTGATTCCGGTGATAAGTGGGGTGATTTCGGTGGTGGTTGGGTGGTTGAGTAGGTTTGTTGGGGGCGGCGTAGGTGGTTGGAGTGGTGGGTCTGATGGTGTTTTGGTGAAGATGTTGGTCAAAGATGGTCAAAGTGGTAGTTAAGTTGGTTGGGTGAAGAAGAGTTTGGGGGTTAGCATGATTCCTTGGGGTAGGGTGAAGAATCAGAGTAGAGGGAAATGGGGGAATCACAAGGGTAAAGGGAATGATTGAATTGGTATATCAAACAACAACAAAGGGAATCAAGGTAGTTGATTGCTTTTCCCTTTGTTGATACCCCCCAACCAAACGCCCCCTAATATAAGATGCTTACTGGTAATGTTAGAAATTTGGTTAACTGTAGTGTAGCAGTTTATTAAAGCTGCATAACAAAAATTGAACTGCGACTTGCGAGATTTGACTTTTTAAAAGTAGGTTAAAATAAATTATTGAAATCAATCTGATGCTTACTAAGGCTAGTTGCCTTGTCCTTTGGCTTGTTTCTATTGTACCTTGCATGTTTTCCCTTTGGGGCTGTCTGCCTAAAAGAGAGCTAGGAAAGGTACCCCTATCTTTTTTGTTAAAATAGAGAATATCCATAGATGTTTCAACTCTCTTTCTCTGTCCACAGGGACTCCGTTTCAATATCTTTCCCTATGCTGGAAGATGATCATAACCTCTCTGCACTTCCTTGGATGTTCATATATTATGCTATTTATGTTATATTCAGCAGATTGGCAATATCACATTAGGATGCTGATGAGAGAGTTTCCTCTCTTTTTACGAATATCTATCAGTACGGATGGTGACATGACCAGTGTTTCCTTTGCATTTCCCAATGCTAACCTACTAAAACCAATCTAGGTTCTGACAAGCAAAGCATGTTGTGATTGGCTTACTTGTCAAAACAAAGGGTAATAGGATTATCCGTGAAGGAAGGAGTATGTTAAGTGTTAGATGTTGCGAAATATTAATTACCGTCCACCTTTTAGTGCATGATTGATTATTGGGAGAGAAAATTGCCTATTTTGTTATTTAGAGCATTCACGTTTTGCACGTGAAAGTTTTCTATCTTTTGGTCTATGGCTGCAATGTTCAGGATTTAGAGTTTTGATTTGTGTGTGTGGTGGTCCGCACTAGTAATTAAGTGTATGTTTCTGCAGGTTCTTCTATGTATTGATTAGGAGTTTCGCTGTAGTTCATGTTTTGGATTGTTTCTCATGAGAGGAGTTTTATTCAGATGTACTACGTATTTATTACTTCATTTCATATTATGTGACTTTGTTTCAAAGATTAAGCTGACATAAGAACATGTTAGTCCTGCCTAGCCATATTTAA
TTAGTGTACAGGTTGTTTGAAGTTATAGATGAATGGTTTGAGAAAAGGGGATTAATGGATTAAGTGTAGAGATATTATGCAATCAATGGATTAACTTTTTTTTCATTCATTATTTTCTTTGATTAAACTGCAGTATCATGTCTTAACTTTTCCTGGTGTTAAATGGATATTCAGGTGAAAAAGGAGAAAGAAAGAGAGGAAAAGGAGAAGAGGAAGGAGAAGGATAGGAAGGAGAAAGATAAAGATCGTGACAGAGAGAAGCGGAAAGAACGCTCTAAGAAAGAGGAGTCAGATGATGAGGCTGTGGATGCAACTGATAGCCATAGCAACAAGGATGAAAATAGGAAGGAGAAGGAAAAAGACAGAGACAGGAAGCATAAAAAACGGCACCACGATAATACAATTGATGATGGAAGTTCTGACAGGGATGATGACAAGGAGAGACGTCACCGAAAGAGGCATCATGATACCACAGATGATGTCAGTTCCGACAAAGATGATAAAGAAGATCACAAGAAATCACGCCGACATGGTAGTGACCGGAAAAAATCTAGGAAGGTATGATAAGTAGTACCCAGTTTCTTGAGGACTTGAATATTGGAAAAAAGGAATTTTTAATGACACCACCATGACATTGATAACTGGATAATTTAAGCTAAACCTTAATTAAACAAAAATCTTTGAAAATTATTATTGATTAAATGTTTTTCTGTACTTTTACAGCATGAACACTCACCGGATTCAGATGGTGAAAGTAAACATAAAAGGCATAAGAGGGATCACCGTGATGGCTCCAGGAGAACTGGTGCTCATGATGATCTTGAAGACGGTGAGCTTGGTGAGGACGGGGAAATCTCGTAGTCTTCATGTAGTAGTCATTCTCTACTCTCAGCTCCTAGTGTTTTTTTATGAATAGAATTTAAATGTTATGCATGTGTTCTTCAACAAAGTTTTGATCTCTTCGCATGCCTGTCATATGATTACCAATTTAGATGGTTGTGTTAAAATAATTACATAGTTTGTTCTTGTAAAAAAAAACAATTTGATGGTTTTGTGTTCTTCTGTCTTCTGGGCGACAATGTTGTCTTTCGTGTTTATTTGGCTTTTGCTGTAGTTTTTGGATTATGTTTAAGAGAGTCCAGGAAAATAATAATAAATGAAAAGTAAACGGATAGCTACACTACCGTCATGGCGCCACTGTCAGTTGCCGAAAACTACTCCGTATTTGTCAATAATTTCAACTCCAACTTTTTATTTGGTTGATGGGTGTGTTGAAATTTTACTATGATAAAAGTGCCTGCTTGTATAATTAAGAAGTCACATTGGCTTTTAAGTTGGATGGCTATGTGATACGGTCC

mRNA sequence

ATGGAAGAAGGCAAAATAACCACCGAAGTACACCCAGATCCGCAGCATACCGAGGCGACTAAGCGAGGTCGCGAGGGAGAGGAACCGAGTCACCGAGGAAGAGGAGGAGCCGTCGAGCCGGCGAAAACCGAGGAAGAGGAGGAGCCGTCGTCCTTCGAGCCGAGCACCGTGTTCATCATTTCCAAGCACCGAGGACTCGAGGAGACGAGGAGGAGGAGCCGTCCCCGTCGAGCCGAAAGTCCGAAACCCGATTCAGCAGACAGCAAGTGCAACGTGAGCTTCTGCCCCCTAAATAGCTTCTATGGGCACTGCCCAAAACTATGGCCCTCCTATGTCCACACAGGTAGATTGTTGGTGTTTTTCGAGCTTTATTATTATGGTGAAAATTTGCTGAATTTTGCTCATGTATTATTGCTTTTGCAGTATCGTCCAGCAGTCCCAGGCCAGCAAGGGCAGCCATATTTACCTGGGTCTGCACAACAGTTTCTAGCACCAGGACAGAACATTCCTTCTGGTCATAATCAACCTATGCAGTTCTCCCAACCCATGCAATTCTCTCAACCCATGCAGCAGTTACCTCCTAGACCTGGAATGCCTGGTGTTCCTATGTCGTCACAAGGTATGGCTATGCCATATGGTCAGCCAAACAGGCCTATGACATTGGGTGCACAGCAGAATCAGCATTCTGCACCGCCTTTTGGCAATCACCCATCTGGTGTAGGTGGCATGGGAATTCCTTTTTCTTCGTCATATACTTTTCAACCAGTATCCCATTTGTCAACACCTGCAGCATCAGTTGGCGGTCAACCATGGTTATCATCTGGAAGTCAAAGTGCTGCGCCTTTTGTACCAGTTCCACAAACAGCAGACCAGTCTTCAGGTTCCGCTGCCCCCGTTCCTGCTGTTAATCCCCCCGATACCAGCCAACAATCCTCATCTGATTGGCAAGAGCACAACACTCCAGATGGAAGAAGATATTATTATAACAAGAAAACTAAGCAATCTAGCTGGGAGAAGCCTGTTGAGTTAATGACACCAACAGAGAGAGCTGATGCGTCGACTGTATGGAAGGAGTTCACTACTCCAGAAGGGAAGAAGTATTACTTTAACAAGGTTACAAAGGAATCAAAATGGACCATACCTGAAGATTTGAAGTTGGCTCGTGAACAAGCAGCAAAAGCAGTTAGTCAGGGAGCACAGTTAGATGCAGGAATGAAATCTCATCCTACAACAACTGGAGGTTTTACCTCTGAAGTAGCACCATCAACTAATCCAGTTTCCGGCAGCTCCAATGGGTCATCATCTGTTTCTGGTGCAAGTTCAATCCCGTTAAGTTTTCCGTCAGGTGTTAACCCAGCACCTGTAGTTAATGATGCGTCATCAGAACTCCTTATGGAGCACTCAGCTGCTCCGACCAGCATGGCAGTGACAAATGCTCTTGCAATGACTCCTTTGTCTGCTTCTATTTCTGGAGATGATGCTCTTCCTGCTGCATTAAATGCCTCCTCCATTACAGTGAATGCTTCAGACAAATTACCATCTCAAGAAATTTCAACTTCTACAGATGGAGGTTCTGCACATGATCTTGAGGAATTACGTAAAGGGATTACTGCAGGGAAAGTTAGTTTGAGCGAGAAACCAACCAATGATGAACCTTTGGTTTTTGCCAACAAGCAGGAAGCAAAAGGCGCGTTTAAGTCACTTCTTGAGTCTGCAAATGTCCACTCTGACTGGAATTGGGATCAGGCTATGAGGGTGATAGTCAATGACAAGAGGTATGGTGCTCTGAAAACATTAGGGGAGCGGAAGCAAGCTTTTAACGAGTATTTAGGACAGAGGAGGAAACAGGAGGCTGAGGAGCGGCGCTTGAGGCAGAAGAAAGCAAAGGAAGAGTTTACAAAAATGTTGGAAGAGTCAGAGGTACTTGCATCATCGATGAAATGGAGTAAAGCTGTTACTATGTTTGAAGATGATGAGCGGTTCAAAGCTGTCGAAAAGCC
GAAAGATCGGCAGGAGCTTTTCGATAATTACTTGGTGGAACTCCAAAAAAAGGAAAAGGAGAAGGCCGATGAAGAGTACCGACGGAATAGAGAAGATTACTGGACGTTTCTGGAATCTTGCGACTTTATTGAGGCAAACAGCCTATGGAGAAAAGTTCAGGACCGCTTGGAGGATGACGAGAGATGCTCACGTCTTGAAAAGATTGACCGCTTAGAAATATTCCAGGATTACATTCGTTTCTTGGAGAAGGGAGAAGAGGAGCAGAAGAAGCTTCAAAAAGAACAACTGAGGAGAGCAGAGCGAAAAAATCGTGATGAATTTCGCAAGGTGTTGGAGGCTGATGTTGCGGCTGGTGTTCTTACTGCAAAATCAATCTGGCGCGATTATTGTGCGAAGGTCAAAGAATCTCCTGCTTATCTTGCTGTGGCACGTAACATATCAGGGTCTACTCCTAAAGATTTGTTTGAGGATGCGGTTGATGAATTGGAGAACCAGTATCATGAGGACAAAAGTAGAATAAAAGATGTGATGAAGCTGAGCAAGATGACTATGACTTCTACATGGACTTTTGACGAATTTAAGGAATCTGTTTCCCCGGATCTTGGGTCTCCACCGATTTCTGATATTAATTTCAAGCTGGTTTTTGAAGATCAGCTTGAAAGGATCAAAGAGAAGGAAGAGAAAGAAGCTAAAAAGCGCCAGCATCTTGTAGATGACTTTACTGATCTATTGCGGTCTCTTAAGGACATAACTGCATCTTCTACTTGGGAAGATTCTAGACAACTCTTCGAGGATAGTGAAGAGTACAGGGAAATTGGGAATGAAACACTGGCAATGGAAACCTTTAAGGACTGTGTTGTTTACCTGCAAGAGAAGGCAAAAGAGAAGGAACGAAAGCGTGAGGATGAAAAGGTGAAAAAGGAGAAAGAAAGAGAGGAAAAGGAGAAGAGGAAGGAGAAGGATAGGAAGGAGAAAGATAAAGATCGTGACAGAGAGAAGCGGAAAGAACGCTCTAAGAAAGAGGAGTCAGATGATGAGGCTGTGGATGCAACTGATAGCCATAGCAACAAGGATGAAAATAGGAAGGAGAAGGAAAAAGACAGAGACAGGAAGCATAAAAAACGGCACCACGATAATACAATTGATGATGGAAGTTCTGACAGGGATGATGACAAGGAGAGACGTCACCGAAAGAGGCATCATGATACCACAGATGATGTCAGTTCCGACAAAGATGATAAAGAAGATCACAAGAAATCACGCCGACATGGTAGTGACCGGAAAAAATCTAGGAAGCATGAACACTCACCGGATTCAGATGGTGAAAGTAAACATAAAAGGCATAAGAGGGATCACCGTGATGGCTCCAGGAGAACTGGTGCTCATGATGATCTTGAAGACGGTGAGCTTGGTGAGGACGGGGAAATCTCGTAGTCTTCATGTAGTAGTCATTCTCTACTCTCAGCTCCTAGTGTTTTTTTATGAATAGAATTTAAATGTTATGCATGTGTTCTTCAACAAAGTTTTGATCTCTTCGCATGCCTGTCATATGATTACCAATTTAGATGGTTGTGTTAAAATAATTACATAGTTTGTTCTTGTAAAAAAAAACAATTTGATGGTTTTGTGTTCTTCTGTCTTCTGGGCGACAATGTTGTCTTTCGTGTTTATTTGGCTTTTGCTGTAGTTTTTGGATTATGTTTAAGAGAGTCCAGGAAAATAATAATAAATGAAAAGTAAACGGATAGCTACACTACCGTCATGGCGCCACTGTCAGTTGCCGAAAACTACTCCGTATTTGTCAATAATTTCAACTCCAACTTTTTATTTGGTTGATGGGTGTGTTGAAATTTTACTATGATAAAAGTGCCTGCTTGTATAATTAAGAAGTCACATTGGCTTTTAAGTTGGATGGCTATGTGATACGGTCC

Coding sequence (CDS)

ATGGAAGAAGGCAAAATAACCACCGAAGTACACCCAGATCCGCAGCATACCGAGGCGACTAAGCGAGGTCGCGAGGGAGAGGAACCGAGTCACCGAGGAAGAGGAGGAGCCGTCGAGCCGGCGAAAACCGAGGAAGAGGAGGAGCCGTCGTCCTTCGAGCCGAGCACCGTGTTCATCATTTCCAAGCACCGAGGACTCGAGGAGACGAGGAGGAGGAGCCGTCCCCGTCGAGCCGAAAGTCCGAAACCCGATTCAGCAGACAGCAAGTGCAACGTGAGCTTCTGCCCCCTAAATAGCTTCTATGGGCACTGCCCAAAACTATGGCCCTCCTATGTCCACACAGGTAGATTGTTGGTGTTTTTCGAGCTTTATTATTATGGTGAAAATTTGCTGAATTTTGCTCATGTATTATTGCTTTTGCAGTATCGTCCAGCAGTCCCAGGCCAGCAAGGGCAGCCATATTTACCTGGGTCTGCACAACAGTTTCTAGCACCAGGACAGAACATTCCTTCTGGTCATAATCAACCTATGCAGTTCTCCCAACCCATGCAATTCTCTCAACCCATGCAGCAGTTACCTCCTAGACCTGGAATGCCTGGTGTTCCTATGTCGTCACAAGGTATGGCTATGCCATATGGTCAGCCAAACAGGCCTATGACATTGGGTGCACAGCAGAATCAGCATTCTGCACCGCCTTTTGGCAATCACCCATCTGGTGTAGGTGGCATGGGAATTCCTTTTTCTTCGTCATATACTTTTCAACCAGTATCCCATTTGTCAACACCTGCAGCATCAGTTGGCGGTCAACCATGGTTATCATCTGGAAGTCAAAGTGCTGCGCCTTTTGTACCAGTTCCACAAACAGCAGACCAGTCTTCAGGTTCCGCTGCCCCCGTTCCTGCTGTTAATCCCCCCGATACCAGCCAACAATCCTCATCTGATTGGCAAGAGCACAACACTCCAGATGGAAGAAGATATTATTATAACAAGAAAACTAAGCAATCTAGCTGGGAGAAGCCTGTTGAGTTAATGACACCAACAGAGAGAGCTGATGCGTCGACTGTATGGAAGGAGTTCACTACTCCAGAAGGGAAGAAGTATTACTTTAACAAGGTTACAAAGGAATCAAAATGGACCATACCTGAAGATTTGAAGTTGGCTCGTGAACAAGCAGCAAAAGCAGTTAGTCAGGGAGCACAGTTAGATGCAGGAATGAAATCTCATCCTACAACAACTGGAGGTTTTACCTCTGAAGTAGCACCATCAACTAATCCAGTTTCCGGCAGCTCCAATGGGTCATCATCTGTTTCTGGTGCAAGTTCAATCCCGTTAAGTTTTCCGTCAGGTGTTAACCCAGCACCTGTAGTTAATGATGCGTCATCAGAACTCCTTATGGAGCACTCAGCTGCTCCGACCAGCATGGCAGTGACAAATGCTCTTGCAATGACTCCTTTGTCTGCTTCTATTTCTGGAGATGATGCTCTTCCTGCTGCATTAAATGCCTCCTCCATTACAGTGAATGCTTCAGACAAATTACCATCTCAAGAAATTTCAACTTCTACAGATGGAGGTTCTGCACATGATCTTGAGGAATTACGTAAAGGGATTACTGCAGGGAAAGTTAGTTTGAGCGAGAAACCAACCAATGATGAACCTTTGGTTTTTGCCAACAAGCAGGAAGCAAAAGGCGCGTTTAAGTCACTTCTTGAGTCTGCAAATGTCCACTCTGACTGGAATTGGGATCAGGCTATGAGGGTGATAGTCAATGACAAGAGGTATGGTGCTCTGAAAACATTAGGGGAGCGGAAGCAAGCTTTTAACGAGTATTTAGGACAGAGGAGGAAACAGGAGGCTGAGGAGCGGCGCTTGAGGCAGAAGAAAGCAAAGGAAGAGTTTACAAAAATGTTGGAAGAGTCAGAGGTACTTGCATCATCGATGAAATGGAGTAAAGCTGTTACTATGTTTGAAGATGATGAGCGGTTCAAAGCTGTCGAAAAGCC
GAAAGATCGGCAGGAGCTTTTCGATAATTACTTGGTGGAACTCCAAAAAAAGGAAAAGGAGAAGGCCGATGAAGAGTACCGACGGAATAGAGAAGATTACTGGACGTTTCTGGAATCTTGCGACTTTATTGAGGCAAACAGCCTATGGAGAAAAGTTCAGGACCGCTTGGAGGATGACGAGAGATGCTCACGTCTTGAAAAGATTGACCGCTTAGAAATATTCCAGGATTACATTCGTTTCTTGGAGAAGGGAGAAGAGGAGCAGAAGAAGCTTCAAAAAGAACAACTGAGGAGAGCAGAGCGAAAAAATCGTGATGAATTTCGCAAGGTGTTGGAGGCTGATGTTGCGGCTGGTGTTCTTACTGCAAAATCAATCTGGCGCGATTATTGTGCGAAGGTCAAAGAATCTCCTGCTTATCTTGCTGTGGCACGTAACATATCAGGGTCTACTCCTAAAGATTTGTTTGAGGATGCGGTTGATGAATTGGAGAACCAGTATCATGAGGACAAAAGTAGAATAAAAGATGTGATGAAGCTGAGCAAGATGACTATGACTTCTACATGGACTTTTGACGAATTTAAGGAATCTGTTTCCCCGGATCTTGGGTCTCCACCGATTTCTGATATTAATTTCAAGCTGGTTTTTGAAGATCAGCTTGAAAGGATCAAAGAGAAGGAAGAGAAAGAAGCTAAAAAGCGCCAGCATCTTGTAGATGACTTTACTGATCTATTGCGGTCTCTTAAGGACATAACTGCATCTTCTACTTGGGAAGATTCTAGACAACTCTTCGAGGATAGTGAAGAGTACAGGGAAATTGGGAATGAAACACTGGCAATGGAAACCTTTAAGGACTGTGTTGTTTACCTGCAAGAGAAGGCAAAAGAGAAGGAACGAAAGCGTGAGGATGAAAAGGTGAAAAAGGAGAAAGAAAGAGAGGAAAAGGAGAAGAGGAAGGAGAAGGATAGGAAGGAGAAAGATAAAGATCGTGACAGAGAGAAGCGGAAAGAACGCTCTAAGAAAGAGGAGTCAGATGATGAGGCTGTGGATGCAACTGATAGCCATAGCAACAAGGATGAAAATAGGAAGGAGAAGGAAAAAGACAGAGACAGGAAGCATAAAAAACGGCACCACGATAATACAATTGATGATGGAAGTTCTGACAGGGATGATGACAAGGAGAGACGTCACCGAAAGAGGCATCATGATACCACAGATGATGTCAGTTCCGACAAAGATGATAAAGAAGATCACAAGAAATCACGCCGACATGGTAGTGACCGGAAAAAATCTAGGAAGCATGAACACTCACCGGATTCAGATGGTGAAAGTAAACATAAAAGGCATAAGAGGGATCACCGTGATGGCTCCAGGAGAACTGGTGCTCATGATGATCTTGAAGACGGTGAGCTTGGTGAGGACGGGGAAATCTCGTAG

Protein sequence

MEEGKITTEVHPDPQHTEATKRGREGEEPSHRGRGGAVEPAKTEEEEEPSSFEPSTVFIISKHRGLEETRRRSRPRRAESPKPDSADSKCNVSFCPLNSFYGHCPKLWPSYVHTGRLLVFFELYYYGENLLNFAHVLLLLQYRPAVPGQQGQPYLPGSAQQFLAPGQNIPSGHNQPMQFSQPMQFSQPMQQLPPRPGMPGVPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIPFSSSYTFQPVSHLSTPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPVPAVNPPDTSQQSSSDWQEHNTPDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTTPEGKKYYFNKVTKESKWTIPEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSEVAPSTNPVSGSSNGSSSVSGASSIPLSFPSGVNPAPVVNDASSELLMEHSAAPTSMAVTNALAMTPLSASISGDDALPAALNASSITVNASDKLPSQEISTSTDGGSAHDLEELRKGITAGKVSLSEKPTNDEPLVFANKQEAKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGALKTLGERKQAFNEYLGQRRKQEAEERRLRQKKAKEEFTKMLEESEVLASSMKWSKAVTMFEDDERFKAVEKPKDRQELFDNYLVELQKKEKEKADEEYRRNREDYWTFLESCDFIEANSLWRKVQDRLEDDERCSRLEKIDRLEIFQDYIRFLEKGEEEQKKLQKEQLRRAERKNRDEFRKVLEADVAAGVLTAKSIWRDYCAKVKESPAYLAVARNISGSTPKDLFEDAVDELENQYHEDKSRIKDVMKLSKMTMTSTWTFDEFKESVSPDLGSPPISDINFKLVFEDQLERIKEKEEKEAKKRQHLVDDFTDLLRSLKDITASSTWEDSRQLFEDSEEYREIGNETLAMETFKDCVVYLQEKAKEKERKREDEKVKKEKEREEKEKRKEKDRKEKDKDRDREKRKERSKKEESDDEAVDATDSHSNKDENRKEKEKDRDRKHKKRHHDNTIDDGSSDRDDDKERRHRKRHHDTTDDVSSDKDDKEDHKKSRRHGSDRKKSRKHEHSPDSDGESKHKRHKRDHRDGSRRTGAHDDLEDGELGEDGEIS
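
The protein sequence above can be checked against the CDS by translating it in frame 1. The following is a minimal sketch, not part of the database record: the function name `translate` is hypothetical, and it assumes the standard genetic code (NCBI translation table 1) applies to this nuclear transcript.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First eight and last seven codons of the CDS shown above.
print(translate("ATGGAAGAAGGCAAAATAACCACC"))  # MEEGKITT
print(translate("GAGGACGGGGAAATCTCGTAG"))     # EDGEIS
```

Because whitespace is stripped before translation, a CDS that is wrapped over several lines can be passed in as-is.
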
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name          Unique Name           Type
Spo27314              Spo27314              gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name          Unique Name           Type
Spo27314.1            Spo27314.1-protein    polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Spo27314.1.exon.1     Spo27314.1.exon.1     exon
Spo27314.1.exon.2     Spo27314.1.exon.2     exon
Spo27314.1.exon.3     Spo27314.1.exon.3     exon
Spo27314.1.exon.4     Spo27314.1.exon.4     exon
Spo27314.1.exon.5     Spo27314.1.exon.5     exon
Spo27314.1.exon.6     Spo27314.1.exon.6     exon
Spo27314.1.exon.7     Spo27314.1.exon.7     exon
Spo27314.1.exon.8     Spo27314.1.exon.8     exon
Spo27314.1.exon.9     Spo27314.1.exon.9     exon
Spo27314.1.exon.10    Spo27314.1.exon.10    exon
Spo27314.1.exon.11    Spo27314.1.exon.11    exon
Spo27314.1.exon.12    Spo27314.1.exon.12    exon
Spo27314.1.exon.13    Spo27314.1.exon.13    exon
Spo27314.1.exon.14    Spo27314.1.exon.14    exon
Spo27314.1.exon.15    Spo27314.1.exon.15    exon
Spo27314.1.exon.16    Spo27314.1.exon.16    exon
Spo27314.1.exon.17    Spo27314.1.exon.17    exon
Spo27314.1.exon.18    Spo27314.1.exon.18    exon
Spo27314.1.exon.19    Spo27314.1.exon.19    exon
Spo27314.1.exon.20    Spo27314.1.exon.20    exon
Spo27314.1.exon.21    Spo27314.1.exon.21    exon
Spo27314.1.exon.22    Spo27314.1.exon.22    exon
Spo27314.1.exon.23    Spo27314.1.exon.23    exon
Spo27314.1.exon.24    Spo27314.1.exon.24    exon
Spo27314.1.exon.25    Spo27314.1.exon.25    exon
Spo27314.1.exon.26    Spo27314.1.exon.26    exon
Spo27314.1.exon.27    Spo27314.1.exon.27    exon
Spo27314.1.exon.28    Spo27314.1.exon.28    exon


The following CDS feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Spo27314.1.CDS.1      Spo27314.1.CDS.1      CDS
Spo27314.1.CDS.2      Spo27314.1.CDS.2      CDS
Spo27314.1.CDS.3      Spo27314.1.CDS.3      CDS
Spo27314.1.CDS.4      Spo27314.1.CDS.4      CDS
Spo27314.1.CDS.5      Spo27314.1.CDS.5      CDS
Spo27314.1.CDS.6      Spo27314.1.CDS.6      CDS
Spo27314.1.CDS.7      Spo27314.1.CDS.7      CDS
Spo27314.1.CDS.8      Spo27314.1.CDS.8      CDS
Spo27314.1.CDS.9      Spo27314.1.CDS.9      CDS
Spo27314.1.CDS.10     Spo27314.1.CDS.10     CDS
Spo27314.1.CDS.11     Spo27314.1.CDS.11     CDS
Spo27314.1.CDS.12     Spo27314.1.CDS.12     CDS
Spo27314.1.CDS.13     Spo27314.1.CDS.13     CDS
Spo27314.1.CDS.14     Spo27314.1.CDS.14     CDS
Spo27314.1.CDS.15     Spo27314.1.CDS.15     CDS
Spo27314.1.CDS.16     Spo27314.1.CDS.16     CDS
Spo27314.1.CDS.17     Spo27314.1.CDS.17     CDS
Spo27314.1.CDS.18     Spo27314.1.CDS.18     CDS
Spo27314.1.CDS.19     Spo27314.1.CDS.19     CDS
Spo27314.1.CDS.20     Spo27314.1.CDS.20     CDS
Spo27314.1.CDS.21     Spo27314.1.CDS.21     CDS
Spo27314.1.CDS.22     Spo27314.1.CDS.22     CDS
Spo27314.1.CDS.23     Spo27314.1.CDS.23     CDS
Spo27314.1.CDS.24     Spo27314.1.CDS.24     CDS
Spo27314.1.CDS.25     Spo27314.1.CDS.25     CDS
Spo27314.1.CDS.26     Spo27314.1.CDS.26     CDS
Spo27314.1.CDS.27     Spo27314.1.CDS.27     CDS
Spo27314.1.CDS.28     Spo27314.1.CDS.28     CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Spo27314.1.utr3p.1    Spo27314.1.utr3p.1    three_prime_UTR


Homology
BLAST of Spo27314.1 vs. NCBI nr
Match: gi|902205333|gb|KNA15150.1| (hypothetical protein SOVF_100850 [Spinacia oleracea])

HSP 1 Score: 1859.0 bits (4814), Expect = 0.000e+0
Identity = 1004/1004 (100.00%), Positives = 1004/1004 (100.00%), Query Frame = 1

Query: 141  QYRPAVPGQQGQPYLPGSAQQFLAPGQNIPSGHNQPMQFSQPMQFSQPMQQLPPRPGMPG 200
            QYRPAVPGQQGQPYLPGSAQQFLAPGQNIPSGHNQPMQFSQPMQFSQPMQQLPPRPGMPG
Sbjct: 14   QYRPAVPGQQGQPYLPGSAQQFLAPGQNIPSGHNQPMQFSQPMQFSQPMQQLPPRPGMPG 73

Query: 201  VPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIPFSSSYTFQPVSHLS 260
            VPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIPFSSSYTFQPVSHLS
Sbjct: 74   VPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIPFSSSYTFQPVSHLS 133

Query: 261  TPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPVPAVNPPDTSQQSSSDWQEHNT 320
            TPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPVPAVNPPDTSQQSSSDWQEHNT
Sbjct: 134  TPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPVPAVNPPDTSQQSSSDWQEHNT 193

Query: 321  PDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTTPEGKKYYFNKVTKESKWTI 380
            PDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTTPEGKKYYFNKVTKESKWTI
Sbjct: 194  PDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTTPEGKKYYFNKVTKESKWTI 253

Query: 381  PEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSEVAPSTNPVSGSSNGSSSVSGAS 440
            PEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSEVAPSTNPVSGSSNGSSSVSGAS
Sbjct: 254  PEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSEVAPSTNPVSGSSNGSSSVSGAS 313

Query: 441  SIPLSFPSGVNPAPVVNDASSELLMEHSAAPTSMAVTNALAMTPLSASISGDDALPAALN 500
            SIPLSFPSGVNPAPVVNDASSELLMEHSAAPTSMAVTNALAMTPLSASISGDDALPAALN
Sbjct: 314  SIPLSFPSGVNPAPVVNDASSELLMEHSAAPTSMAVTNALAMTPLSASISGDDALPAALN 373

Query: 501  ASSITVNASDKLPSQEISTSTDGGSAHDLEELRKGITAGKVSLSEKPTNDEPLVFANKQE 560
            ASSITVNASDKLPSQEISTSTDGGSAHDLEELRKGITAGKVSLSEKPTNDEPLVFANKQE
Sbjct: 374  ASSITVNASDKLPSQEISTSTDGGSAHDLEELRKGITAGKVSLSEKPTNDEPLVFANKQE 433

Query: 561  AKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGALKTLGERKQAFNEYLGQRRKQEAEE 620
            AKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGALKTLGERKQAFNEYLGQRRKQEAEE
Sbjct: 434  AKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGALKTLGERKQAFNEYLGQRRKQEAEE 493

Query: 621  RRLRQKKAKEEFTKMLEESEVLASSMKWSKAVTMFEDDERFKAVEKPKDRQELFDNYLVE 680
            RRLRQKKAKEEFTKMLEESEVLASSMKWSKAVTMFEDDERFKAVEKPKDRQELFDNYLVE
Sbjct: 494  RRLRQKKAKEEFTKMLEESEVLASSMKWSKAVTMFEDDERFKAVEKPKDRQELFDNYLVE 553

Query: 681  LQKKEKEKADEEYRRNREDYWTFLESCDFIEANSLWRKVQDRLEDDERCSRLEKIDRLEI 740
            LQKKEKEKADEEYRRNREDYWTFLESCDFIEANSLWRKVQDRLEDDERCSRLEKIDRLEI
Sbjct: 554  LQKKEKEKADEEYRRNREDYWTFLESCDFIEANSLWRKVQDRLEDDERCSRLEKIDRLEI 613

Query: 741  FQDYIRFLEKGEEEQKKLQKEQLRRAERKNRDEFRKVLEADVAAGVLTAKSIWRDYCAKV 800
            FQDYIRFLEKGEEEQKKLQKEQLRRAERKNRDEFRKVLEADVAAGVLTAKSIWRDYCAKV
Sbjct: 614  FQDYIRFLEKGEEEQKKLQKEQLRRAERKNRDEFRKVLEADVAAGVLTAKSIWRDYCAKV 673

Query: 801  KESPAYLAVARNISGSTPKDLFEDAVDELENQYHEDKSRIKDVMKLSKMTMTSTWTFDEF 860
            KESPAYLAVARNISGSTPKDLFEDAVDELENQYHEDKSRIKDVMKLSKMTMTSTWTFDEF
Sbjct: 674  KESPAYLAVARNISGSTPKDLFEDAVDELENQYHEDKSRIKDVMKLSKMTMTSTWTFDEF 733

Query: 861  KESVSPDLGSPPISDINFKLVFEDQLERIKEKEEKEAKKRQHLVDDFTDLLRSLKDITAS 920
            KESVSPDLGSPPISDINFKLVFEDQLERIKEKEEKEAKKRQHLVDDFTDLLRSLKDITAS
Sbjct: 734  KESVSPDLGSPPISDINFKLVFEDQLERIKEKEEKEAKKRQHLVDDFTDLLRSLKDITAS 793

Query: 921  STWEDSRQLFEDSEEYREIGNETLAMETFKDCVVYLQEKAKEKERKREDEKVKKEKEREE 980
            STWEDSRQLFEDSEEYREIGNETLAMETFKDCVVYLQEKAKEKERKREDEKVKKEKEREE
Sbjct: 794  STWEDSRQLFEDSEEYREIGNETLAMETFKDCVVYLQEKAKEKERKREDEKVKKEKEREE 853

Query: 981  KEKRKEKDRKEKDKDRDREKRKERSKKEESDDEAVDATDSHSNKDENRKEKEKDRDRKHK 1040
            KEKRKEKDRKEKDKDRDREKRKERSKKEESDDEAVDATDSHSNKDENRKEKEKDRDRKHK
Sbjct: 854  KEKRKEKDRKEKDKDRDREKRKERSKKEESDDEAVDATDSHSNKDENRKEKEKDRDRKHK 913

Query: 1041 KRHHDNTIDDGSSDRDDDKERRHRKRHHDTTDDVSSDKDDKEDHKKSRRHGSDRKKSRKH 1100
            KRHHDNTIDDGSSDRDDDKERRHRKRHHDTTDDVSSDKDDKEDHKKSRRHGSDRKKSRKH
Sbjct: 914  KRHHDNTIDDGSSDRDDDKERRHRKRHHDTTDDVSSDKDDKEDHKKSRRHGSDRKKSRKH 973

Query: 1101 EHSPDSDGESKHKRHKRDHRDGSRRTGAHDDLEDGELGEDGEIS 1145
            EHSPDSDGESKHKRHKRDHRDGSRRTGAHDDLEDGELGEDGEIS
Sbjct: 974  EHSPDSDGESKHKRHKRDHRDGSRRTGAHDDLEDGELGEDGEIS 1017
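
The percentages on each HSP summary line follow directly from the counts given beside them (for example, 1004 identical positions out of 1004 aligned gives 100.00%). The sketch below recomputes them from such a line. The helper name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool; the pattern accepts the standard spelling "Positives" as well as the variant "Postives".

```python
import re

# Matches e.g. "Identity = 817/1021 (80.02%), Positives = 883/1021 (86.48%)".
# "Posi?tives" also tolerates the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives) for one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, aligned, positive, aligned_pos = map(int, match.groups())
    return (
        round(100.0 * identical / aligned, 2),
        round(100.0 * positive / aligned_pos, 2),
    )


line = "Identity = 817/1021 (80.02%), Positives = 883/1021 (86.48%), Query Frame = 1"
print(hsp_percentages(line))  # (80.02, 86.48)
```
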

BLAST of Spo27314.1 vs. NCBI nr
Match: gi|731361861|ref|XP_010692586.1| (PREDICTED: pre-mRNA-processing protein 40A isoform X1 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 1458.0 bits (3773), Expect = 0.000e+0
Identity = 817/1021 (80.02%), Positives = 883/1021 (86.48%), Query Frame = 1

Query: 132  NFAHVLLLLQYRPAVPGQQGQPYLPGSAQQFLA-----PGQNIPSGHNQPMQFSQPMQFS 191
            NF    +  Q+RPAVPGQQGQPYLPGS+ QF +     PGQN+PSG NQPM  +QPMQFS
Sbjct: 6    NFGPPPVPAQFRPAVPGQQGQPYLPGSSPQFPSAGQNMPGQNMPSGPNQPMHHNQPMQFS 65

Query: 192  QPMQQLPPRPGMPGVPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIP 251
            QPMQQLPPRPGMPGVP SSQGM M YGQ NRPMTLGAQQNQHSAPPFGNH SGVGGMG+P
Sbjct: 66   QPMQQLPPRPGMPGVPFSSQGMGMSYGQQNRPMTLGAQQNQHSAPPFGNHTSGVGGMGVP 125

Query: 252  FSSSYTFQPVSHLSTPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPVPAVNPPD 311
            FSSSYTFQPVSHL TPAA V G PWLSSG QS+APFVPVP TA+QS  S+    AVN PD
Sbjct: 126  FSSSYTFQPVSHLPTPAAPVAGHPWLSSGGQSSAPFVPVPPTAEQSPVSSTIGLAVNLPD 185

Query: 312  TSQQSSSDWQEHNTPDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTTPEGKK 371
            TSQQSSSDWQEH TPDGRRYYYNKKTKQSSWEKP ELMTP ERADASTVWKEFTTPEGKK
Sbjct: 186  TSQQSSSDWQEHTTPDGRRYYYNKKTKQSSWEKPSELMTPLERADASTVWKEFTTPEGKK 245

Query: 372  YYFNKVTKESKWTIPEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSEVAPSTNPV 431
            YY+NK TKESKWTIPE+LKLARE+A KA SQGAQL+ G+ S     G  T E A +   V
Sbjct: 246  YYYNKATKESKWTIPEELKLARERAEKAASQGAQLETGVNSQSKPAGSLTPEEARTAAAV 305

Query: 432  SGSSNGSSSVSGASSIPLSFPSGVNPAPVVNDASSELLMEHSAAPTSMAVTNALAMT-PL 491
              SSN SSS +      LS P  VNP+P+VN AS+E+ + HSAAPTSMAVTN  A+T PL
Sbjct: 306  LASSNVSSSAT------LSVPPAVNPSPLVNAASAEIPVGHSAAPTSMAVTNTPAVTAPL 365

Query: 492  SASISGDDALPAALNASSITVNASDKLPSQEISTSTDGGSAHDLEELRKG-ITAGKVSLS 551
            SA  +GD  LPA+ NA  ITVNA+DKL SQE  T  DG    D EE  KG + A K  L 
Sbjct: 366  SAFTTGDPGLPASSNAPLITVNAADKLQSQETLTPMDGAPVQDHEEAHKGMLAAAKGDLI 425

Query: 552  EKPTNDEPLVFANKQEAKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGALKTLGERKQ 611
            EKP +DEPLVFA+KQEAK AFKSLLESANV SDW WDQAMRVIVNDKRYGALKTLGERKQ
Sbjct: 426  EKPADDEPLVFASKQEAKAAFKSLLESANVQSDWTWDQAMRVIVNDKRYGALKTLGERKQ 485

Query: 612  AFNEYLGQRRKQEAEERRLRQKKAKEEFTKMLEESEVLASSMKWSKAVTMFEDDERFKAV 671
            AFNEYLGQR+KQEAEERR+RQKKAKEEF KMLEESEVL SS+KWSKA+TMFEDDERFKAV
Sbjct: 486  AFNEYLGQRKKQEAEERRMRQKKAKEEFMKMLEESEVLTSSIKWSKAITMFEDDERFKAV 545

Query: 672  EKPKDRQELFDNYLVELQKKEKEKADEEYRRNREDYWTFLESCDFIEANSLWRKVQDRLE 731
            EKPKDRQELF+NYLVELQKKEKEKA+EEYRRNRE+YW FLESCDFIEANSLWRKVQDRLE
Sbjct: 546  EKPKDRQELFENYLVELQKKEKEKAEEEYRRNREEYWKFLESCDFIEANSLWRKVQDRLE 605

Query: 732  DDERCSRLEKIDRLEIFQDYIRFLEKGEEEQKKLQKEQLRRAERKNRDEFRKVLEADVAA 791
            DDERCSRLEKIDRLEIFQDYIR+LEK EEEQKKLQKEQLRR ERKNRDEFRK+LE D AA
Sbjct: 606  DDERCSRLEKIDRLEIFQDYIRYLEKEEEEQKKLQKEQLRRVERKNRDEFRKMLEVDTAA 665

Query: 792  GVLTAKSIWRDYCAKVKESPAYLAVARNISGSTPKDLFEDAVDELENQYHEDKSRIKDVM 851
            G LTAK+ WR+YC KVK+SPAYLAVARN SGSTPKDLFED VDELENQYH+DKSRIKD M
Sbjct: 666  GALTAKTSWREYCGKVKDSPAYLAVARNTSGSTPKDLFEDVVDELENQYHDDKSRIKDAM 725

Query: 852  KLSKMTMTSTWTFDEFKESVSPDLGSPPISDINFKLVFEDQLERIKEKEEKEAKKRQHLV 911
            KL K+ MTSTWTF+EFK S+S +LGSP I+DIN KLVFE+QLER+KEKEEKEAKKRQ L+
Sbjct: 726  KLCKIVMTSTWTFEEFKASISLELGSPLIADINLKLVFEEQLERVKEKEEKEAKKRQRLL 785

Query: 912  DDFTDLLRSLKDITASSTWEDSRQLFEDSEEYREIGNETLAMETFKDCVVYLQEKAKEKE 971
            DDFTDLLRSLK+IT SS W+DS+QLFE+SEEYREI NE LA E F+D VVYLQEKAKEKE
Sbjct: 786  DDFTDLLRSLKEITVSSLWDDSKQLFEESEEYREIDNEILAKEAFEDYVVYLQEKAKEKE 845

Query: 972  RKREDEKVKKEKEREEKEKRKEKDRKEKDKDRDREKRKERSKKEESDDEAVDATDSH-SN 1031
            RKRE+EK KKEKEREEKEKRKEKDRKEKDKDRDREKRKERSKKEE+DDEA+DATDSH S 
Sbjct: 846  RKREEEKAKKEKEREEKEKRKEKDRKEKDKDRDREKRKERSKKEETDDEAMDATDSHNSY 905

Query: 1032 KDENRKEKEKDRDRKHKKRHHDNTIDDGSSDRDDDKERRHRKRHHDTTDDVSSDKDDKED 1091
            KDE+RKEKEKDRDRKH+KRHHD T DD +SDRDDD+ERR RKRHHDTTDDVSS++DDKE+
Sbjct: 906  KDESRKEKEKDRDRKHRKRHHDTT-DDVTSDRDDDRERRQRKRHHDTTDDVSSERDDKEE 965

Query: 1092 HKKSRRHGSDRKKSRKHEHSPDSDGESKHKRHKRDHRDGSRRTGAHDDLEDGELGEDGEI 1145
            HKKSRRHGSDRKK RKHEHSPDSDGESKHKRHKRDHRDGSRR+GAHDDLEDGELGEDGEI
Sbjct: 966  HKKSRRHGSDRKKPRKHEHSPDSDGESKHKRHKRDHRDGSRRSGAHDDLEDGELGEDGEI 1019

BLAST of Spo27314.1 vs. NCBI nr
Match: gi|731361863|ref|XP_010692587.1| (PREDICTED: pre-mRNA-processing protein 40A isoform X2 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 1128.2 bits (2917), Expect = 0.000e+0
Identity = 641/800 (80.12%), Positives = 700/800 (87.50%), Query Frame = 1

Query: 348  ERADASTVWKEFTTPEGKKYYFNKVTKESKWTIPEDLKLAREQAAKAVSQGAQLDAGMKS 407
            +RADASTVWKEFTTPEGKKYY+NK TKESKWTIPE+LKLARE+A KA SQGAQL+ G+ S
Sbjct: 21   QRADASTVWKEFTTPEGKKYYYNKATKESKWTIPEELKLARERAEKAASQGAQLETGVNS 80

Query: 408  HPTTTGGFTSEVAPSTNPVSGSSNGSSSVSGASSIPLSFPSGVNPAPVVNDASSELLMEH 467
                 G  T E A +   V  SSN SSS +      LS P  VNP+P+VN AS+E+ + H
Sbjct: 81   QSKPAGSLTPEEARTAAAVLASSNVSSSAT------LSVPPAVNPSPLVNAASAEIPVGH 140

Query: 468  SAAPTSMAVTNALAMT-PLSASISGDDALPAALNASSITVNASDKLPSQEISTSTDGGSA 527
            SAAPTSMAVTN  A+T PLSA  +GD  LPA+ NA  ITVNA+DKL SQE  T  DG   
Sbjct: 141  SAAPTSMAVTNTPAVTAPLSAFTTGDPGLPASSNAPLITVNAADKLQSQETLTPMDGAPV 200

Query: 528  HDLEELRKG-ITAGKVSLSEKPTNDEPLVFANKQEAKGAFKSLLESANVHSDWNWDQAMR 587
             D EE  KG + A K  L EKP +DEPLVFA+KQEAK AFKSLLESANV SDW WDQAMR
Sbjct: 201  QDHEEAHKGMLAAAKGDLIEKPADDEPLVFASKQEAKAAFKSLLESANVQSDWTWDQAMR 260

Query: 588  VIVNDKRYGALKTLGERKQAFNEYLGQRRKQEAEERRLRQKKAKEEFTKMLEESEVLASS 647
            VIVNDKRYGALKTLGERKQAFNEYLGQR+KQEAEERR+RQKKAKEEF KMLEESEVL SS
Sbjct: 261  VIVNDKRYGALKTLGERKQAFNEYLGQRKKQEAEERRMRQKKAKEEFMKMLEESEVLTSS 320

Query: 648  MKWSKAVTMFEDDERFKAVEKPKDRQELFDNYLVELQKKEKEKADEEYRRNREDYWTFLE 707
            +KWSKA+TMFEDDERFKAVEKPKDRQELF+NYLVELQKKEKEKA+EEYRRNRE+YW FLE
Sbjct: 321  IKWSKAITMFEDDERFKAVEKPKDRQELFENYLVELQKKEKEKAEEEYRRNREEYWKFLE 380

Query: 708  SCDFIEANSLWRKVQDRLEDDERCSRLEKIDRLEIFQDYIRFLEKGEEEQKKLQKEQLRR 767
            SCDFIEANSLWRKVQDRLEDDERCSRLEKIDRLEIFQDYIR+LEK EEEQKKLQKEQLRR
Sbjct: 381  SCDFIEANSLWRKVQDRLEDDERCSRLEKIDRLEIFQDYIRYLEKEEEEQKKLQKEQLRR 440

Query: 768  AERKNRDEFRKVLEADVAAGVLTAKSIWRDYCAKVKESPAYLAVARNISGSTPKDLFEDA 827
             ERKNRDEFRK+LE D AAG LTAK+ WR+YC KVK+SPAYLAVARN SGSTPKDLFED 
Sbjct: 441  VERKNRDEFRKMLEVDTAAGALTAKTSWREYCGKVKDSPAYLAVARNTSGSTPKDLFEDV 500

Query: 828  VDELENQYHEDKSRIKDVMKLSKMTMTSTWTFDEFKESVSPDLGSPPISDINFKLVFEDQ 887
            VDELENQYH+DKSRIKD MKL K+ MTSTWTF+EFK S+S +LGSP I+DIN KLVFE+Q
Sbjct: 501  VDELENQYHDDKSRIKDAMKLCKIVMTSTWTFEEFKASISLELGSPLIADINLKLVFEEQ 560

Query: 888  LERIKEKEEKEAKKRQHLVDDFTDLLRSLKDITASSTWEDSRQLFEDSEEYREIGNETLA 947
            LER+KEKEEKEAKKRQ L+DDFTDLLRSLK+IT SS W+DS+QLFE+SEEYREI NE LA
Sbjct: 561  LERVKEKEEKEAKKRQRLLDDFTDLLRSLKEITVSSLWDDSKQLFEESEEYREIDNEILA 620

Query: 948  METFKDCVVYLQEKAKEKERKREDEKVKKEKEREEKEKRKEKDRKEKDKDRDREKRKERS 1007
             E F+D VVYLQEKAKEKERKRE+EK KKEKEREEKEKRKEKDRKEKDKDRDREKRKERS
Sbjct: 621  KEAFEDYVVYLQEKAKEKERKREEEKAKKEKEREEKEKRKEKDRKEKDKDRDREKRKERS 680

Query: 1008 KKEESDDEAVDATDSHSN-KDENRKEKEKDRDRKHKKRHHDNTIDDGSSDRDDDKERRHR 1067
            KKEE+DDEA+DATDSH++ KDE+RKEKEKDRDRKH+KRHHD T DD +SDRDDD+ERR R
Sbjct: 681  KKEETDDEAMDATDSHNSYKDESRKEKEKDRDRKHRKRHHDTT-DDVTSDRDDDRERRQR 740

Query: 1068 KRHHDTTDDVSSDKDDKEDHKKSRRHGSDRKKSRKHEHSPDSDGESKHKRHKRDHRDGSR 1127
            KRHHDTTDDVSS++DDKE+HKKSRRHGSDRKK RKHEHSPDSDGESKHKRHKRDHRDGSR
Sbjct: 741  KRHHDTTDDVSSERDDKEEHKKSRRHGSDRKKPRKHEHSPDSDGESKHKRHKRDHRDGSR 800

Query: 1128 RTGAHDDLEDGELGEDGEIS 1145
            R+GAHDDLEDGELGEDGEIS
Sbjct: 801  RSGAHDDLEDGELGEDGEIS 813

BLAST of Spo27314.1 vs. NCBI nr
Match: gi|719971225|ref|XP_010273523.1| (PREDICTED: pre-mRNA-processing protein 40A [Nelumbo nucifera])

HSP 1 Score: 1087.4 bits (2811), Expect = 0.000e+0
Identity = 633/1035 (61.16%), Positives = 785/1035 (75.85%), Query Frame = 1

Query: 140  LQYRPAVPGQQGQPYLPGSAQQFLAPGQNIP-SGHNQPMQFSQPMQFSQPMQQLPPRPGM 199
            +Q+RP VP QQ QP++P ++QQF    Q I  S    P    Q +QFSQP QQLPPRPG 
Sbjct: 32   MQFRPVVPMQQPQPFIPAASQQFRPVSQGISVSNVGTPPAQGQQLQFSQPTQQLPPRPGQ 91

Query: 200  PGV-PMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIPFSSSYTF---- 259
            PG  P SSQ + MPY QPNR +   + Q Q +A P     SG+GGMG+P SSSYTF    
Sbjct: 92   PGSGPPSSQAIPMPYPQPNRSILSVSSQPQQNAQPM----SGIGGMGMPLSSSYTFTISS 151

Query: 260  --------------QPVSHLSTPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPV 319
                          QP S +  P     GQPWLSSGSQS     PV Q   Q S + A V
Sbjct: 152  YGQPQSNINTSTQYQPASQIHAPVVPAAGQPWLSSGSQSVPLVTPVQQNIQQPSITTAQV 211

Query: 320  PAVNP-PDTSQQSSSDWQEHNTPDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKE 379
            P     P  +QQSSSDWQEH + DGRRYYYNKKT+QSSWEKPVELMTP ERADASTVWKE
Sbjct: 212  PVTTAQPSPAQQSSSDWQEHTSADGRRYYYNKKTRQSSWEKPVELMTPIERADASTVWKE 271

Query: 380  FTTPEGKKYYFNKVTKESKWTIPEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSE 439
            FTTPEG+KYY+NK TK+SKWT+P++LKLAREQA KA SQG Q +  M S   +T   +S 
Sbjct: 272  FTTPEGRKYYYNKATKQSKWTMPDELKLAREQAEKAASQGIQSETTMASQTASTVTVSSV 331

Query: 440  VAPST--NPVSGSSNGSSSVSGASSIPLS-FPSGVNPAPVVNDASSELLMEHSAAPTSMA 499
              PS   NP+ G+++  +S+  +S +P++   + +NP PVV+  S  + +   A  TS  
Sbjct: 332  ETPSAAANPL-GATSAVTSIVASSPVPVTPVVAAINPLPVVSSGSQAVPVVPGAVTTSAV 391

Query: 500  VTN--ALAMTPLSASISGDDALPAAL-NASSITVNASDKLPSQEISTSTDGGSAHDLEEL 559
              N  A A+  +  S+ G   +P A  +A++   N+++ + +Q+++ S DG SA DLEE 
Sbjct: 392  GVNSPAAAIATVPVSVPGGAGVPVAFASANNTATNSAENVSTQDVAASVDGASAQDLEEA 451

Query: 560  RKGIT-AGKVSLS---EKPTNDEPLVFANKQEAKGAFKSLLESANVHSDWNWDQAMRVIV 619
            +KG+  AGK++++   EK  +DEPLV+ANK EAK AFK+LLESANV SDW W+QAMRVI+
Sbjct: 452  KKGMAVAGKINITPVEEKTIDDEPLVYANKLEAKNAFKALLESANVESDWTWEQAMRVII 511

Query: 620  NDKRYGALKTLGERKQAFNEYLGQRRKQEAEERRLRQKKAKEEFTKMLEESEVLASSMKW 679
            NDKRYGALKTLGERKQAFNEYLGQR+K EAEERR++QK+A+EEFTKMLEES+ L SS +W
Sbjct: 512  NDKRYGALKTLGERKQAFNEYLGQRKKLEAEERRMKQKRAREEFTKMLEESKELTSSTRW 571

Query: 680  SKAVTMFEDDERFKAVEKPKDRQELFDNYLVELQKKEKEKADEEYRRNREDYWTFLESCD 739
            SKA++MFEDDERFKAVE+P+DR++LF+NYLVELQKKE+ KA EE++RN  +Y  FLESCD
Sbjct: 572  SKAISMFEDDERFKAVERPRDREDLFENYLVELQKKERAKAQEEHKRNIMEYRQFLESCD 631

Query: 740  FIEANSLWRKVQDRLEDDERCSRLEKIDRLEIFQDYIRFLEKGEEEQKKLQKEQLRRAER 799
            FI+ANS WRKVQDRLEDDERCSRLEKIDRLEIFQ+YIR LEK EEEQ+K+QKEQLRRAER
Sbjct: 632  FIKANSQWRKVQDRLEDDERCSRLEKIDRLEIFQEYIRDLEKEEEEQRKIQKEQLRRAER 691

Query: 800  KNRDEFRKVLEADVAAGVLTAKSIWRDYCAKVKESPAYLAVARNISGSTPKDLFEDAVDE 859
            KNRDEFRK++E  V AG LTAK+ WRDYC KVK+ PAY+AV+ N SGSTPKDLFED  +E
Sbjct: 692  KNRDEFRKLMEEHVTAGTLTAKTHWRDYCMKVKDLPAYVAVSSNTSGSTPKDLFEDVSEE 751

Query: 860  LENQYHEDKSRIKDVMKLSKMTMTSTWTFDEFKESVSPDLGSPPISDINFKLVFEDQLER 919
            LE QYH+DK+RIKD +KL K++M STWT ++FK +++  +GSP ISDINFKLVF++ LER
Sbjct: 752  LEKQYHDDKTRIKDAIKLGKISMASTWTIEDFKAAIAEHVGSPSISDINFKLVFDELLER 811

Query: 920  IKEKEEKEAKKRQHLVDDFTDLLRSLKDITASSTWEDSRQLFEDSEEYREIGNETLAMET 979
            IKEKEEKEAKKRQ L DDF++LL S+K+ITASS WE+S+ LFEDS+EYR I +E    E 
Sbjct: 812  IKEKEEKEAKKRQRLADDFSELLHSIKEITASSKWEESKPLFEDSQEYRSISDENFRREI 871

Query: 980  FKDCVVYLQEKAKEKERKREDEKVKKEKEREEKEKRKEKDRKEKDKDRDREKRKERSKKE 1039
            F++ +++LQ KAKEKERKRE+EKVKKEKEREEKEKRKEK+RKEK+K+R+REK KERSKK+
Sbjct: 872  FEEYIIHLQGKAKEKERKREEEKVKKEKEREEKEKRKEKERKEKEKEREREKGKERSKKD 931

Query: 1040 ESDDEAVDATDSHSNKDENRKEKEKDRDRKHKKRHHDNTIDDGSSDRDDDKERRHRKRHH 1099
            ++++E +D TD H +K++ ++EK+KDRD+                DRD DK+R+HRKRHH
Sbjct: 932  DTENENIDVTDIHVSKEDKKREKDKDRDK----------------DRDKDKDRKHRKRHH 991

Query: 1100 DTTDDVSSDKDDKEDHKKSRRHGSDRKKSRKHEHSPDSDGESKHKRHKRDHRDGSRRTGA 1144
             T DDVSS+KD+KED KKSRRH SDRKKSRKH ++P+SD ES+HKRHKRDHRDGSRR GA
Sbjct: 992  STADDVSSEKDEKEDSKKSRRHNSDRKKSRKHAYTPESDSESRHKRHKRDHRDGSRRNGA 1045

BLAST of Spo27314.1 vs. NCBI nr
Match: gi|731428902|ref|XP_010664484.1| (PREDICTED: pre-mRNA-processing protein 40A [Vitis vinifera])

HSP 1 Score: 1068.5 bits (2762), Expect = 7.800e-309
Identity = 635/1037 (61.23%), Positives = 763/1037 (73.58%), Query Frame = 1

Query: 138  LLLQYRPAVPGQQGQPYLPGSAQQFLAPGQNI-------PSGHNQPMQFSQPMQFSQPMQ 197
            L +Q+RPAVPGQQG P++P ++QQF   GQNI       PSG N      QP QFSQ MQ
Sbjct: 30   LSMQFRPAVPGQQGHPFIPAASQQFRPIGQNISSPNVGGPSGQN------QPPQFSQAMQ 89

Query: 198  QLPPRPGMPG-VPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIPFSS 257
            QLPPRP  PG +  SSQ + MPY Q NRP+T  + Q   +APP  +H  G+ G G+PFSS
Sbjct: 90   QLPPRPNQPGPIAPSSQPIPMPYIQQNRPLTSSSPQPNQTAPPLNSHMPGLAGPGMPFSS 149

Query: 258  SYTFQPVSH---------------LSTPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSG 317
            SYTF P S                +S   A VGGQPWLSSGSQS A   PV Q   Q S 
Sbjct: 150  SYTFAPASFGQPQSTINASAQFQPISQMHAPVGGQPWLSSGSQSGALVTPVHQAGQQPS- 209

Query: 318  SAAPVPAVNPPDTSQQSSSDWQEHNTPDGRRYYYNKKTKQSSWEKPVELMTPTERADAST 377
              A +PA N P+ + QSSSDWQEH + DGRRYYYNKKT+ SSWEKP+ELMTP ERADAST
Sbjct: 210  VTADIPAGNVPNPTHQSSSDWQEHTSADGRRYYYNKKTRLSSWEKPLELMTPIERADAST 269

Query: 378  VWKEFTTPEGKKYYFNKVTKESKWTIPEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGG 437
            VWKEFTTPEG+KYY+NKVTK+SKWTIPE+LKLAREQA K+VSQ  Q + G  S+      
Sbjct: 270  VWKEFTTPEGRKYYYNKVTKQSKWTIPEELKLAREQAEKSVSQETQSEMGTTSNEPAVVA 329

Query: 438  FTSEVAPSTNPVSGSSNGSSSVSGASSIPLSFP---SGVNPAPVVNDASSELLMEHSAAP 497
             +    PST  VS SS  SS++SG +S P+      + VNP PVV   +S + +  SA  
Sbjct: 330  VSLAETPSTASVSVSSTTSSTISGMTSSPVPVTPVVAVVNPPPVVVSGTSAIPIAQSAVT 389

Query: 498  TSMAVTNALAMTPLSASISGDDALPAALNASSITVNASDKLPSQEIST-STDGGSAHDLE 557
            TS         TPL A++SG   + AA     I  NA+     + +S  +T+G S  D+E
Sbjct: 390  TSAVGVQPSMGTPLPAAVSGSTGVAAAF----INPNATSMTSFENLSADATNGASMQDIE 449

Query: 558  ELRKGI-TAGKVS---LSEKPTNDEPLVFANKQEAKGAFKSLLESANVHSDWNWDQAMRV 617
            E +KG+  AGK++   L EK  +DEPLV++ K EAK AFK+LLESANV SDW WDQAM+ 
Sbjct: 450  EAKKGVAVAGKINVTPLEEKTLDDEPLVYSTKLEAKNAFKALLESANVESDWTWDQAMKA 509

Query: 618  IVNDKRYGALKTLGERKQAFNEYLGQRRKQEAEERRLRQKKAKEEFTKMLEESEVLASSM 677
            I+NDKRYGALKTLGERKQAFNEYLGQR+K EAEERR+RQKKA+EEFT MLEE + L SS+
Sbjct: 510  IINDKRYGALKTLGERKQAFNEYLGQRKKIEAEERRMRQKKAREEFTTMLEECKELTSSI 569

Query: 678  KWSKAVTMFEDDERFKAVEKPKDRQELFDNYLVELQKKEKEKADEEYRRNREDYWTFLES 737
            KWSKAV MF+DDERFKAVE+ +DR++LF+N+++ELQKKE+ KA EE +RNR +Y  FLES
Sbjct: 570  KWSKAVDMFQDDERFKAVERSRDREDLFENFIMELQKKERTKALEEQKRNRMEYRQFLES 629

Query: 738  CDFIEANSLWRKVQDRLEDDERCSRLEKIDRLEIFQDYIRFLEKGEEEQKKLQKEQLRRA 797
            CDFI+ NS WRKVQDRLEDDERCSRLEKIDRLEIFQ+YIR LE+ EEEQ+K+QKEQLRRA
Sbjct: 630  CDFIKVNSQWRKVQDRLEDDERCSRLEKIDRLEIFQEYIRDLEREEEEQRKIQKEQLRRA 689

Query: 798  ERKNRDEFRKVLEADVAAGVLTAKSIWRDYCAKVKESPAYLAVARNISGSTPKDLFEDAV 857
            ERKNRDEFRK++E  VAAG LTAK+ WRDYC KVK+S  YLAVA N SGSTPKDLFED  
Sbjct: 690  ERKNRDEFRKLMEEHVAAGTLTAKTHWRDYCMKVKDSSPYLAVASNTSGSTPKDLFEDVA 749

Query: 858  DELENQYHEDKSRIKDVMKLSKMTMTSTWTFDEFKESVSPDLGSPPISDINFKLVFEDQL 917
            +ELE QYHEDK+RIKD MKLSK+T+ STWTF +FK ++  D+GSP ISD+N KLVFE+ L
Sbjct: 750  EELEKQYHEDKARIKDAMKLSKVTIASTWTFGDFKAAILDDVGSPNISDVNLKLVFEELL 809

Query: 918  ERIKEKEEKEAKKRQHLVDDFTDLLRSLKDITASSTWEDSRQLFEDSEEYREIGNETLAM 977
            +RIKEKEEKEAKKRQ L DDF DLLRS K+ITASS WED + LFE+S+EYR IG E+   
Sbjct: 810  DRIKEKEEKEAKKRQRLADDFNDLLRSKKEITASSNWEDCKPLFEESQEYRSIGEESFGR 869

Query: 978  ETFKDCVVYLQEKAKEKERKREDEKVKKEKEREEKEKRKEKDRKEKDKDRDREKRKERSK 1037
            E F++ + +LQEKAKEKERKRE+EK KKEKEREEKEKRKEK+RKEKD+DR+REK KERS+
Sbjct: 870  EIFEEYIAHLQEKAKEKERKREEEKAKKEKEREEKEKRKEKERKEKDRDREREKGKERSR 929

Query: 1038 KEESDDEAVDATDSHSNKDENRKEKEKDRDRKHKKRHHDNTIDDGSSDRDDDKERRHRKR 1097
            K+E++ E VD T S+  K++ ++EK                          DK+R+HRKR
Sbjct: 930  KDETESENVDVTGSYGYKEDKKREK--------------------------DKDRKHRKR 989

Query: 1098 HHDTTDDVSSDKDDKEDHKKSRRHGSDRKKSRKHEHSPDSDGESKHKRHKRDHRDGSRRT 1144
            H    DD SSDK++KE+ KKSRRHGSDRKKSRKH ++P+SD ES+HKRHKR+H DGSRR 
Sbjct: 990  HQSAVDDASSDKEEKEESKKSRRHGSDRKKSRKHAYTPESDTESRHKRHKREHWDGSRRN 1029

BLAST of Spo27314.1 vs. UniProtKB/TrEMBL
Match: A0A0K9R6L6_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_100850 PE=4 SV=1)

HSP 1 Score: 1859.0 bits (4814), Expect = 0.000e+0
Identity = 1004/1004 (100.00%), Positives = 1004/1004 (100.00%), Query Frame = 1

Query: 141  QYRPAVPGQQGQPYLPGSAQQFLAPGQNIPSGHNQPMQFSQPMQFSQPMQQLPPRPGMPG 200
            QYRPAVPGQQGQPYLPGSAQQFLAPGQNIPSGHNQPMQFSQPMQFSQPMQQLPPRPGMPG
Sbjct: 14   QYRPAVPGQQGQPYLPGSAQQFLAPGQNIPSGHNQPMQFSQPMQFSQPMQQLPPRPGMPG 73

Query: 201  VPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIPFSSSYTFQPVSHLS 260
            VPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIPFSSSYTFQPVSHLS
Sbjct: 74   VPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIPFSSSYTFQPVSHLS 133

Query: 261  TPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPVPAVNPPDTSQQSSSDWQEHNT 320
            TPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPVPAVNPPDTSQQSSSDWQEHNT
Sbjct: 134  TPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPVPAVNPPDTSQQSSSDWQEHNT 193

Query: 321  PDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTTPEGKKYYFNKVTKESKWTI 380
            PDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTTPEGKKYYFNKVTKESKWTI
Sbjct: 194  PDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTTPEGKKYYFNKVTKESKWTI 253

Query: 381  PEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSEVAPSTNPVSGSSNGSSSVSGAS 440
            PEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSEVAPSTNPVSGSSNGSSSVSGAS
Sbjct: 254  PEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSEVAPSTNPVSGSSNGSSSVSGAS 313

Query: 441  SIPLSFPSGVNPAPVVNDASSELLMEHSAAPTSMAVTNALAMTPLSASISGDDALPAALN 500
            SIPLSFPSGVNPAPVVNDASSELLMEHSAAPTSMAVTNALAMTPLSASISGDDALPAALN
Sbjct: 314  SIPLSFPSGVNPAPVVNDASSELLMEHSAAPTSMAVTNALAMTPLSASISGDDALPAALN 373

Query: 501  ASSITVNASDKLPSQEISTSTDGGSAHDLEELRKGITAGKVSLSEKPTNDEPLVFANKQE 560
            ASSITVNASDKLPSQEISTSTDGGSAHDLEELRKGITAGKVSLSEKPTNDEPLVFANKQE
Sbjct: 374  ASSITVNASDKLPSQEISTSTDGGSAHDLEELRKGITAGKVSLSEKPTNDEPLVFANKQE 433

Query: 561  AKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGALKTLGERKQAFNEYLGQRRKQEAEE 620
            AKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGALKTLGERKQAFNEYLGQRRKQEAEE
Sbjct: 434  AKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGALKTLGERKQAFNEYLGQRRKQEAEE 493

Query: 621  RRLRQKKAKEEFTKMLEESEVLASSMKWSKAVTMFEDDERFKAVEKPKDRQELFDNYLVE 680
            RRLRQKKAKEEFTKMLEESEVLASSMKWSKAVTMFEDDERFKAVEKPKDRQELFDNYLVE
Sbjct: 494  RRLRQKKAKEEFTKMLEESEVLASSMKWSKAVTMFEDDERFKAVEKPKDRQELFDNYLVE 553

Query: 681  LQKKEKEKADEEYRRNREDYWTFLESCDFIEANSLWRKVQDRLEDDERCSRLEKIDRLEI 740
            LQKKEKEKADEEYRRNREDYWTFLESCDFIEANSLWRKVQDRLEDDERCSRLEKIDRLEI
Sbjct: 554  LQKKEKEKADEEYRRNREDYWTFLESCDFIEANSLWRKVQDRLEDDERCSRLEKIDRLEI 613

Query: 741  FQDYIRFLEKGEEEQKKLQKEQLRRAERKNRDEFRKVLEADVAAGVLTAKSIWRDYCAKV 800
            FQDYIRFLEKGEEEQKKLQKEQLRRAERKNRDEFRKVLEADVAAGVLTAKSIWRDYCAKV
Sbjct: 614  FQDYIRFLEKGEEEQKKLQKEQLRRAERKNRDEFRKVLEADVAAGVLTAKSIWRDYCAKV 673

Query: 801  KESPAYLAVARNISGSTPKDLFEDAVDELENQYHEDKSRIKDVMKLSKMTMTSTWTFDEF 860
            KESPAYLAVARNISGSTPKDLFEDAVDELENQYHEDKSRIKDVMKLSKMTMTSTWTFDEF
Sbjct: 674  KESPAYLAVARNISGSTPKDLFEDAVDELENQYHEDKSRIKDVMKLSKMTMTSTWTFDEF 733

Query: 861  KESVSPDLGSPPISDINFKLVFEDQLERIKEKEEKEAKKRQHLVDDFTDLLRSLKDITAS 920
            KESVSPDLGSPPISDINFKLVFEDQLERIKEKEEKEAKKRQHLVDDFTDLLRSLKDITAS
Sbjct: 734  KESVSPDLGSPPISDINFKLVFEDQLERIKEKEEKEAKKRQHLVDDFTDLLRSLKDITAS 793

Query: 921  STWEDSRQLFEDSEEYREIGNETLAMETFKDCVVYLQEKAKEKERKREDEKVKKEKEREE 980
            STWEDSRQLFEDSEEYREIGNETLAMETFKDCVVYLQEKAKEKERKREDEKVKKEKEREE
Sbjct: 794  STWEDSRQLFEDSEEYREIGNETLAMETFKDCVVYLQEKAKEKERKREDEKVKKEKEREE 853

Query: 981  KEKRKEKDRKEKDKDRDREKRKERSKKEESDDEAVDATDSHSNKDENRKEKEKDRDRKHK 1040
            KEKRKEKDRKEKDKDRDREKRKERSKKEESDDEAVDATDSHSNKDENRKEKEKDRDRKHK
Sbjct: 854  KEKRKEKDRKEKDKDRDREKRKERSKKEESDDEAVDATDSHSNKDENRKEKEKDRDRKHK 913

Query: 1041 KRHHDNTIDDGSSDRDDDKERRHRKRHHDTTDDVSSDKDDKEDHKKSRRHGSDRKKSRKH 1100
            KRHHDNTIDDGSSDRDDDKERRHRKRHHDTTDDVSSDKDDKEDHKKSRRHGSDRKKSRKH
Sbjct: 914  KRHHDNTIDDGSSDRDDDKERRHRKRHHDTTDDVSSDKDDKEDHKKSRRHGSDRKKSRKH 973

Query: 1101 EHSPDSDGESKHKRHKRDHRDGSRRTGAHDDLEDGELGEDGEIS 1145
            EHSPDSDGESKHKRHKRDHRDGSRRTGAHDDLEDGELGEDGEIS
Sbjct: 974  EHSPDSDGESKHKRHKRDHRDGSRRTGAHDDLEDGELGEDGEIS 1017

BLAST of Spo27314.1 vs. UniProtKB/TrEMBL
Match: A0A0J8BHY3_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_1g021310 PE=4 SV=1)

HSP 1 Score: 1458.0 bits (3773), Expect = 0.000e+0
Identity = 817/1021 (80.02%), Positives = 883/1021 (86.48%), Query Frame = 1

Query: 132  NFAHVLLLLQYRPAVPGQQGQPYLPGSAQQFLA-----PGQNIPSGHNQPMQFSQPMQFS 191
            NF    +  Q+RPAVPGQQGQPYLPGS+ QF +     PGQN+PSG NQPM  +QPMQFS
Sbjct: 6    NFGPPPVPAQFRPAVPGQQGQPYLPGSSPQFPSAGQNMPGQNMPSGPNQPMHHNQPMQFS 65

Query: 192  QPMQQLPPRPGMPGVPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIP 251
            QPMQQLPPRPGMPGVP SSQGM M YGQ NRPMTLGAQQNQHSAPPFGNH SGVGGMG+P
Sbjct: 66   QPMQQLPPRPGMPGVPFSSQGMGMSYGQQNRPMTLGAQQNQHSAPPFGNHTSGVGGMGVP 125

Query: 252  FSSSYTFQPVSHLSTPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPVPAVNPPD 311
            FSSSYTFQPVSHL TPAA V G PWLSSG QS+APFVPVP TA+QS  S+    AVN PD
Sbjct: 126  FSSSYTFQPVSHLPTPAAPVAGHPWLSSGGQSSAPFVPVPPTAEQSPVSSTIGLAVNLPD 185

Query: 312  TSQQSSSDWQEHNTPDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTTPEGKK 371
            TSQQSSSDWQEH TPDGRRYYYNKKTKQSSWEKP ELMTP ERADASTVWKEFTTPEGKK
Sbjct: 186  TSQQSSSDWQEHTTPDGRRYYYNKKTKQSSWEKPSELMTPLERADASTVWKEFTTPEGKK 245

Query: 372  YYFNKVTKESKWTIPEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSEVAPSTNPV 431
            YY+NK TKESKWTIPE+LKLARE+A KA SQGAQL+ G+ S     G  T E A +   V
Sbjct: 246  YYYNKATKESKWTIPEELKLARERAEKAASQGAQLETGVNSQSKPAGSLTPEEARTAAAV 305

Query: 432  SGSSNGSSSVSGASSIPLSFPSGVNPAPVVNDASSELLMEHSAAPTSMAVTNALAMT-PL 491
              SSN SSS +      LS P  VNP+P+VN AS+E+ + HSAAPTSMAVTN  A+T PL
Sbjct: 306  LASSNVSSSAT------LSVPPAVNPSPLVNAASAEIPVGHSAAPTSMAVTNTPAVTAPL 365

Query: 492  SASISGDDALPAALNASSITVNASDKLPSQEISTSTDGGSAHDLEELRKG-ITAGKVSLS 551
            SA  +GD  LPA+ NA  ITVNA+DKL SQE  T  DG    D EE  KG + A K  L 
Sbjct: 366  SAFTTGDPGLPASSNAPLITVNAADKLQSQETLTPMDGAPVQDHEEAHKGMLAAAKGDLI 425

Query: 552  EKPTNDEPLVFANKQEAKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGALKTLGERKQ 611
            EKP +DEPLVFA+KQEAK AFKSLLESANV SDW WDQAMRVIVNDKRYGALKTLGERKQ
Sbjct: 426  EKPADDEPLVFASKQEAKAAFKSLLESANVQSDWTWDQAMRVIVNDKRYGALKTLGERKQ 485

Query: 612  AFNEYLGQRRKQEAEERRLRQKKAKEEFTKMLEESEVLASSMKWSKAVTMFEDDERFKAV 671
            AFNEYLGQR+KQEAEERR+RQKKAKEEF KMLEESEVL SS+KWSKA+TMFEDDERFKAV
Sbjct: 486  AFNEYLGQRKKQEAEERRMRQKKAKEEFMKMLEESEVLTSSIKWSKAITMFEDDERFKAV 545

Query: 672  EKPKDRQELFDNYLVELQKKEKEKADEEYRRNREDYWTFLESCDFIEANSLWRKVQDRLE 731
            EKPKDRQELF+NYLVELQKKEKEKA+EEYRRNRE+YW FLESCDFIEANSLWRKVQDRLE
Sbjct: 546  EKPKDRQELFENYLVELQKKEKEKAEEEYRRNREEYWKFLESCDFIEANSLWRKVQDRLE 605

Query: 732  DDERCSRLEKIDRLEIFQDYIRFLEKGEEEQKKLQKEQLRRAERKNRDEFRKVLEADVAA 791
            DDERCSRLEKIDRLEIFQDYIR+LEK EEEQKKLQKEQLRR ERKNRDEFRK+LE D AA
Sbjct: 606  DDERCSRLEKIDRLEIFQDYIRYLEKEEEEQKKLQKEQLRRVERKNRDEFRKMLEVDTAA 665

Query: 792  GVLTAKSIWRDYCAKVKESPAYLAVARNISGSTPKDLFEDAVDELENQYHEDKSRIKDVM 851
            G LTAK+ WR+YC KVK+SPAYLAVARN SGSTPKDLFED VDELENQYH+DKSRIKD M
Sbjct: 666  GALTAKTSWREYCGKVKDSPAYLAVARNTSGSTPKDLFEDVVDELENQYHDDKSRIKDAM 725

Query: 852  KLSKMTMTSTWTFDEFKESVSPDLGSPPISDINFKLVFEDQLERIKEKEEKEAKKRQHLV 911
            KL K+ MTSTWTF+EFK S+S +LGSP I+DIN KLVFE+QLER+KEKEEKEAKKRQ L+
Sbjct: 726  KLCKIVMTSTWTFEEFKASISLELGSPLIADINLKLVFEEQLERVKEKEEKEAKKRQRLL 785

Query: 912  DDFTDLLRSLKDITASSTWEDSRQLFEDSEEYREIGNETLAMETFKDCVVYLQEKAKEKE 971
            DDFTDLLRSLK+IT SS W+DS+QLFE+SEEYREI NE LA E F+D VVYLQEKAKEKE
Sbjct: 786  DDFTDLLRSLKEITVSSLWDDSKQLFEESEEYREIDNEILAKEAFEDYVVYLQEKAKEKE 845

Query: 972  RKREDEKVKKEKEREEKEKRKEKDRKEKDKDRDREKRKERSKKEESDDEAVDATDSH-SN 1031
            RKRE+EK KKEKEREEKEKRKEKDRKEKDKDRDREKRKERSKKEE+DDEA+DATDSH S 
Sbjct: 846  RKREEEKAKKEKEREEKEKRKEKDRKEKDKDRDREKRKERSKKEETDDEAMDATDSHNSY 905

Query: 1032 KDENRKEKEKDRDRKHKKRHHDNTIDDGSSDRDDDKERRHRKRHHDTTDDVSSDKDDKED 1091
            KDE+RKEKEKDRDRKH+KRHHD T DD +SDRDDD+ERR RKRHHDTTDDVSS++DDKE+
Sbjct: 906  KDESRKEKEKDRDRKHRKRHHDTT-DDVTSDRDDDRERRQRKRHHDTTDDVSSERDDKEE 965

Query: 1092 HKKSRRHGSDRKKSRKHEHSPDSDGESKHKRHKRDHRDGSRRTGAHDDLEDGELGEDGEI 1145
            HKKSRRHGSDRKK RKHEHSPDSDGESKHKRHKRDHRDGSRR+GAHDDLEDGELGEDGEI
Sbjct: 966  HKKSRRHGSDRKKPRKHEHSPDSDGESKHKRHKRDHRDGSRRSGAHDDLEDGELGEDGEI 1019

BLAST of Spo27314.1 vs. UniProtKB/TrEMBL
Match: A0A061FG70_THECC (Pre-mRNA-processing protein 40A isoform 1 OS=Theobroma cacao GN=TCM_034659 PE=4 SV=1)

HSP 1 Score: 1038.1 bits (2683), Expect = 8.000e-300
Identity = 613/1028 (59.63%), Positives = 756/1028 (73.54%), Query Frame = 1

Query: 141  QYRPAVPGQQGQPYLPGSAQQFLAPGQNIPSGHNQPMQFSQPMQFSQPMQQLPPRPGMPG 200
            Q+RP VP QQGQ ++P ++QQF   GQ   S    P   +Q MQFSQPMQQ PPRP  PG
Sbjct: 33   QFRPVVPMQQGQHFVPAASQQFRPVGQVPSSNVGMPAVQNQQMQFSQPMQQFPPRPNQPG 92

Query: 201  VPM-SSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIPFSSSYT------- 260
            +   S+Q M +P+GQ NRP+T G+ Q+  +APP  +H  G+G  G+P SSSY+       
Sbjct: 93   LSAPSAQPMHVPFGQTNRPLTSGSPQSHQTAPPLNSHMPGLGAPGMPPSSSYSYVPSSFG 152

Query: 261  -----------FQPVSHLSTPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPVPA 320
                       FQP S +    A V GQPWLSSG+QS +  +P+ QT  Q    ++   A
Sbjct: 153  QPQNNVSASSQFQPTSQVHASVAPVAGQPWLSSGNQSVSLAIPIQQTGQQPPLISSADTA 212

Query: 321  VNPPDTSQQSSSDWQEHNTPDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTT 380
             N P  +  S+SDWQEH + DGRRYYYNKKT+QSSWEKP+ELMTP ERADASTVWKEFTT
Sbjct: 213  ANAPIHTPPSASDWQEHTSADGRRYYYNKKTRQSSWEKPLELMTPIERADASTVWKEFTT 272

Query: 381  PEGKKYYFNKVTKESKWTIPEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSEVAP 440
            PEG+KYY+NKVTK+SKWTIPE+LKLAREQA    SQGA  D G+ S     G  +S   P
Sbjct: 273  PEGRKYYYNKVTKQSKWTIPEELKLAREQAQVVASQGAPSDTGVASQAPVAGAVSSAEMP 332

Query: 441  STNPVSGSSNGSSSVSGASSIPLSFPSGVNPAPVVNDASSELLMEHSAAPTSMAVTN-AL 500
            +   +  SSN S + S  S  P++  +  NP+P +   S+ + +  SAA  +  V + A+
Sbjct: 333  AA-AIPVSSNTSQASSPVSVTPVA--AVANPSPTLVSGSTVVPVSQSAATNASEVQSPAV 392

Query: 501  AMTPLSASISGDDALP-AALNASSITVNASDKLPSQEISTSTDGGSAHDLEELRKGI-TA 560
            A+TPL A  SG    P  ++NA++  + + +   SQ+    T+G SA D+EE +KG+ TA
Sbjct: 393  AVTPLPAVSSGGSTTPVTSVNANTTMIRSLESTASQDSVHFTNGASAQDIEEAKKGMATA 452

Query: 561  GKVSLS---EKPTNDEPLVFANKQEAKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGA 620
            GKV+++   EK  +DEPLV+ANKQEAK AFKSLLESANV SDW W+Q MR I+NDKRYGA
Sbjct: 453  GKVNVTPVEEKVPDDEPLVYANKQEAKNAFKSLLESANVQSDWTWEQTMREIINDKRYGA 512

Query: 621  LKTLGERKQAFNEYLGQRRKQEAEERRLRQKKAKEEFTKMLEESEVLASSMKWSKAVTMF 680
            LKTLGERKQAFNEYLGQR+K EAEERR+RQKKA+EEFTKMLEES+ L SSM+WSKA ++F
Sbjct: 513  LKTLGERKQAFNEYLGQRKKLEAEERRMRQKKAREEFTKMLEESKELTSSMRWSKAQSLF 572

Query: 681  EDDERFKAVEKPKDRQELFDNYLVELQKKEKEKADEEYRRNREDYWTFLESCDFIEANSL 740
            E+DERFKAVE+ +DR++LF+NY+VEL++KE+E A EE RRN  +Y  FLESCDFI+ANS 
Sbjct: 573  ENDERFKAVERARDREDLFENYIVELERKERENAAEEKRRNIAEYRKFLESCDFIKANSQ 632

Query: 741  WRKVQDRLEDDERCSRLEKIDRLEIFQDYIRFLEKGEEEQKKLQKEQLRRAERKNRDEFR 800
            WRKVQDRLEDDERCSRLEKIDRL +FQDYI  LEK EEE+KK+QKEQLRRAERKNRD FR
Sbjct: 633  WRKVQDRLEDDERCSRLEKIDRLVMFQDYIHDLEKEEEEKKKMQKEQLRRAERKNRDAFR 692

Query: 801  KVLEADVAAGVLTAKSIWRDYCAKVKESPAYLAVARNISGSTPKDLFEDAVDELENQYHE 860
            K+++  V  G LTAK+ WRDYC KVK+ P YLAVA N SGSTPKDLFED V+ELE QY +
Sbjct: 693  KLMDEHVVDGTLTAKTYWRDYCLKVKDLPPYLAVASNTSGSTPKDLFEDVVEELEKQYQQ 752

Query: 861  DKSRIKDVMKLSKMTMTSTWTFDEFKESVSPDLGSPPISDINFKLVFEDQLERIKEKEEK 920
            DK+ IKD MK  K++M STWT ++FK ++S D+GS PISDIN KLV+E+ L+  KEKEEK
Sbjct: 753  DKTHIKDAMKSGKISMVSTWTVEDFKAAISEDVGSLPISDINLKLVYEELLKSAKEKEEK 812

Query: 921  EAKKRQHLVDDFTDLLRSLKDITASSTWEDSRQLFEDSEEYREIGNETLAMETFKDCVVY 980
            EAKKRQ L DDFT LL + K+ITASS WEDSR LFE+S+EYR I  E+L  E F++ + Y
Sbjct: 813  EAKKRQRLADDFTKLLHTYKEITASSDWEDSRPLFEESQEYRSIAEESLRREIFEEYIAY 872

Query: 981  LQEKAKEKERKREDEKVKKEKEREEKEKRKEKDRKEKDKDRDREKRKERSKKEESDDEAV 1040
            LQEKAKEKERKRE+EK KKEKEREEKEKRKEK+RKEK+++R+REK KER+KK+E+D E +
Sbjct: 873  LQEKAKEKERKREEEKAKKEKEREEKEKRKEKERKEKEREREREKGKERTKKDETDSENL 932

Query: 1041 DATDSHSNKDENRKEKEKDRDRKHKKRHHDNTIDDGSSDRDDDKERRHRKRHHDTTDDVS 1100
            D +DSH +K++ +KEKEKD                          R+HRKRH    DD S
Sbjct: 933  DISDSHGHKEDKKKEKEKD--------------------------RKHRKRHQSGGDDGS 992

Query: 1101 SDKDDKEDHKKSRRHGSDRKKSRKHEHSPDSDGESKHKRHKRDHRDGSRRTGAHDDLEDG 1144
            SDKDD+E+ KKSRRHGSDRKKSRKH HSP+SD ES+HK+HKRDHRDGSRR   +++LEDG
Sbjct: 993  SDKDDREESKKSRRHGSDRKKSRKHAHSPESDNESRHKKHKRDHRDGSRRNSGYEELEDG 1031

BLAST of Spo27314.1 vs. UniProtKB/TrEMBL
Match: A0A061FFL7_THECC (Pre-mRNA-processing protein 40A isoform 3 OS=Theobroma cacao GN=TCM_034659 PE=4 SV=1)

HSP 1 Score: 1030.8 bits (2664), Expect = 1.300e-297
Identity = 613/1037 (59.11%), Positives = 756/1037 (72.90%), Query Frame = 1

Query: 141  QYRPAVPGQQGQPYLPGSAQQFLAPGQNIPSGHNQPMQFSQPMQFSQPMQQLPPRPGMPG 200
            Q+RP VP QQGQ ++P ++QQF   GQ   S    P   +Q MQFSQPMQQ PPRP  PG
Sbjct: 33   QFRPVVPMQQGQHFVPAASQQFRPVGQVPSSNVGMPAVQNQQMQFSQPMQQFPPRPNQPG 92

Query: 201  VPM-SSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIPFSSSYT------- 260
            +   S+Q M +P+GQ NRP+T G+ Q+  +APP  +H  G+G  G+P SSSY+       
Sbjct: 93   LSAPSAQPMHVPFGQTNRPLTSGSPQSHQTAPPLNSHMPGLGAPGMPPSSSYSYVPSSFG 152

Query: 261  -----------FQPVSHLSTPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPVPA 320
                       FQP S +    A V GQPWLSSG+QS +  +P+ QT  Q    ++   A
Sbjct: 153  QPQNNVSASSQFQPTSQVHASVAPVAGQPWLSSGNQSVSLAIPIQQTGQQPPLISSADTA 212

Query: 321  VNPPDTSQQSSSDWQEHNTPDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTT 380
             N P  +  S+SDWQEH + DGRRYYYNKKT+QSSWEKP+ELMTP ERADASTVWKEFTT
Sbjct: 213  ANAPIHTPPSASDWQEHTSADGRRYYYNKKTRQSSWEKPLELMTPIERADASTVWKEFTT 272

Query: 381  PEGKKYYFNKVTKESKWTIPEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSEVAP 440
            PEG+KYY+NKVTK+SKWTIPE+LKLAREQA    SQGA  D G+ S     G  +S   P
Sbjct: 273  PEGRKYYYNKVTKQSKWTIPEELKLAREQAQVVASQGAPSDTGVASQAPVAGAVSSAEMP 332

Query: 441  STNPVSGSSNGSSSVSGASSIPLSFPSGVNPAPVVNDASSELLMEHSAAPTSMAVTN-AL 500
            +   +  SSN S + S  S  P++  +  NP+P +   S+ + +  SAA  +  V + A+
Sbjct: 333  AA-AIPVSSNTSQASSPVSVTPVA--AVANPSPTLVSGSTVVPVSQSAATNASEVQSPAV 392

Query: 501  AMTPLSASISGDDALP-AALNASSITVNASDKLPSQEISTSTDGGSAHDLEELRKGI-TA 560
            A+TPL A  SG    P  ++NA++  + + +   SQ+    T+G SA D+EE +KG+ TA
Sbjct: 393  AVTPLPAVSSGGSTTPVTSVNANTTMIRSLESTASQDSVHFTNGASAQDIEEAKKGMATA 452

Query: 561  GKVSLS---EKPTNDEPLVFANKQEAKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGA 620
            GKV+++   EK  +DEPLV+ANKQEAK AFKSLLESANV SDW W+Q MR I+NDKRYGA
Sbjct: 453  GKVNVTPVEEKVPDDEPLVYANKQEAKNAFKSLLESANVQSDWTWEQTMREIINDKRYGA 512

Query: 621  LKTLGERKQAFNEYLGQRRKQEAEERRLRQKKAKEEFTKMLEESEVLASSMKWSKAVTMF 680
            LKTLGERKQAFNEYLGQR+K EAEERR+RQKKA+EEFTKMLEES+ L SSM+WSKA ++F
Sbjct: 513  LKTLGERKQAFNEYLGQRKKLEAEERRMRQKKAREEFTKMLEESKELTSSMRWSKAQSLF 572

Query: 681  EDDERFKAVEKPKDRQELFDNYLVELQKKEKEKADEEYRRNREDYWTFLESCDF------ 740
            E+DERFKAVE+ +DR++LF+NY+VEL++KE+E A EE RRN  +Y  FLESCDF      
Sbjct: 573  ENDERFKAVERARDREDLFENYIVELERKERENAAEEKRRNIAEYRKFLESCDFIKVQHF 632

Query: 741  ---IEANSLWRKVQDRLEDDERCSRLEKIDRLEIFQDYIRFLEKGEEEQKKLQKEQLRRA 800
               I+ANS WRKVQDRLEDDERCSRLEKIDRL +FQDYI  LEK EEE+KK+QKEQLRRA
Sbjct: 633  QKRIQANSQWRKVQDRLEDDERCSRLEKIDRLVMFQDYIHDLEKEEEEKKKMQKEQLRRA 692

Query: 801  ERKNRDEFRKVLEADVAAGVLTAKSIWRDYCAKVKESPAYLAVARNISGSTPKDLFEDAV 860
            ERKNRD FRK+++  V  G LTAK+ WRDYC KVK+ P YLAVA N SGSTPKDLFED V
Sbjct: 693  ERKNRDAFRKLMDEHVVDGTLTAKTYWRDYCLKVKDLPPYLAVASNTSGSTPKDLFEDVV 752

Query: 861  DELENQYHEDKSRIKDVMKLSKMTMTSTWTFDEFKESVSPDLGSPPISDINFKLVFEDQL 920
            +ELE QY +DK+ IKD MK  K++M STWT ++FK ++S D+GS PISDIN KLV+E+ L
Sbjct: 753  EELEKQYQQDKTHIKDAMKSGKISMVSTWTVEDFKAAISEDVGSLPISDINLKLVYEELL 812

Query: 921  ERIKEKEEKEAKKRQHLVDDFTDLLRSLKDITASSTWEDSRQLFEDSEEYREIGNETLAM 980
            +  KEKEEKEAKKRQ L DDFT LL + K+ITASS WEDSR LFE+S+EYR I  E+L  
Sbjct: 813  KSAKEKEEKEAKKRQRLADDFTKLLHTYKEITASSDWEDSRPLFEESQEYRSIAEESLRR 872

Query: 981  ETFKDCVVYLQEKAKEKERKREDEKVKKEKEREEKEKRKEKDRKEKDKDRDREKRKERSK 1040
            E F++ + YLQEKAKEKERKRE+EK KKEKEREEKEKRKEK+RKEK+++R+REK KER+K
Sbjct: 873  EIFEEYIAYLQEKAKEKERKREEEKAKKEKEREEKEKRKEKERKEKEREREREKGKERTK 932

Query: 1041 KEESDDEAVDATDSHSNKDENRKEKEKDRDRKHKKRHHDNTIDDGSSDRDDDKERRHRKR 1100
            K+E+D E +D +DSH +K++ +KEKEKD                          R+HRKR
Sbjct: 933  KDETDSENLDISDSHGHKEDKKKEKEKD--------------------------RKHRKR 992

Query: 1101 HHDTTDDVSSDKDDKEDHKKSRRHGSDRKKSRKHEHSPDSDGESKHKRHKRDHRDGSRRT 1144
            H    DD SSDKDD+E+ KKSRRHGSDRKKSRKH HSP+SD ES+HK+HKRDHRDGSRR 
Sbjct: 993  HQSGGDDGSSDKDDREESKKSRRHGSDRKKSRKHAHSPESDNESRHKKHKRDHRDGSRRN 1040

BLAST of Spo27314.1 vs. UniProtKB/TrEMBL
Match: B9IBN8_POPTR (FF domain-containing family protein OS=Populus trichocarpa GN=POPTR_0014s01360g PE=4 SV=2)

HSP 1 Score: 1004.2 bits (2595), Expect = 1.300e-289
Identity = 606/1031 (58.78%), Positives = 749/1031 (72.65%), Query Frame = 1

Query: 141  QYRPAVPGQQGQPYLPGSAQQFLAPGQNIPSGH-NQPMQFSQPMQFSQPMQQLPPRPGMP 200
            Q+RP VP QQGQP++  ++QQF   GQ +PS H   P   SQ +QFSQP+QQLPP P  P
Sbjct: 11   QFRPMVPTQQGQPFIQVASQQFRPVGQGMPSSHVGMPAAQSQHLQFSQPIQQLPPWPNQP 70

Query: 201  GVPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIPFSSSYT------- 260
            G P S+Q ++MPYGQ NRP+T  + Q Q +APP  NH   VG  G+P SS Y        
Sbjct: 71   GAP-SAQALSMPYGQLNRPLT--SSQPQQNAPPLSNHMHVVGTSGVPNSSPYAFAPSSFG 130

Query: 261  -----------FQPVSHLSTPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPVPA 320
                       F P+S +      +GGQPWLSSGS  A+   PV     Q S S++    
Sbjct: 131  LTQNSASALPQFPPMSQMHAHVVPMGGQPWLSSGSHGASLVPPVQPAVVQPSISSSSDST 190

Query: 321  VNPPDTSQQSSSDWQEHNTPDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTT 380
            V     SQQS SDWQEH   DGRRYYYN++TKQSSW+KP ELMTP ERADASTVWKEFTT
Sbjct: 191  VAVSSNSQQSLSDWQEHTASDGRRYYYNRRTKQSSWDKPFELMTPIERADASTVWKEFTT 250

Query: 381  PEGKKYYFNKVTKESKWTIPEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSEVAP 440
             EGKKYY+NKVTK+SKW+IPE+LK+AREQA + V QG Q +    S+  T    TS    
Sbjct: 251  QEGKKYYYNKVTKQSKWSIPEELKMAREQAQQTVGQGNQSETDAASNVPTAVAVTSS-ET 310

Query: 441  STNPVSGSSNGSSSVSGASSIPLSFPSGVNPAPVVNDASSELLMEHSAAPTSMAVTNALA 500
            ST  VS SS+ S  + G SS P+S  +  NP PVV   S  L + HS   T+ AV    +
Sbjct: 311  STTAVSVSSS-SVMLPGVSSSPISVTAVANPPPVVVSGSPALPVAHST--TASAVGVQPS 370

Query: 501  MTPLSASIS-GDDALPAALNASSITVNASDKLPSQEISTSTDGGSAHDLEELRK-GITAG 560
            +TPL  ++S G  A  AA++A + ++++ D L SQ  + S DG S  D  E  K  +  G
Sbjct: 371  VTPLPTAVSVGTGAPAAAVDAKTTSLSSIDNLLSQSAANSVDGASMMDTAEFNKVSMDMG 430

Query: 561  KVS---LSEKPTNDEPLVFANKQEAKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGAL 620
            K +   L EK  ++EPLVFANK EAK AFK+LLESANV SDW W+Q MR I+NDKRY AL
Sbjct: 431  KTNASPLEEKTPDEEPLVFANKLEAKNAFKALLESANVQSDWTWEQTMREIINDKRYAAL 490

Query: 621  KTLGERKQAFNEYLGQRRKQEAEERRLRQKKAKEEFTKMLEESEVLASSMKWSKAVTMFE 680
            KTLGERKQAFNEYLGQR+K EAEERR+RQKKA+EEF KMLEES+ L SSMKWSKA+++FE
Sbjct: 491  KTLGERKQAFNEYLGQRKKLEAEERRVRQKKAREEFAKMLEESKELTSSMKWSKAISLFE 550

Query: 681  DDERFKAVEKPKDRQELFDNYLVELQKKEKEKADEEYRRNREDYWTFLESCDFIEANSLW 740
            +DER+KA+E+ +DR++LFD+Y+V+L++KEKEKA E+ RRN  +Y  FLESCDFI+A+S W
Sbjct: 551  NDERYKALERARDREDLFDSYIVDLERKEKEKAAEDRRRNVAEYRKFLESCDFIKASSQW 610

Query: 741  RKVQDRLEDDERCSRLEKIDRLEIFQDYIRFLEKGEEEQKKLQKEQLRRAERKNRDEFRK 800
            RK+QDRLEDDERC  LEK+DRL IFQDYIR LEK EEEQKK+QKEQLRRAERKNRDEFRK
Sbjct: 611  RKIQDRLEDDERCLCLEKLDRLLIFQDYIRDLEKEEEEQKKIQKEQLRRAERKNRDEFRK 670

Query: 801  VLEADVAAGVLTAKSIWRDYCAKVKESPAYLAVARNISGSTPKDLFEDAVDELENQYHED 860
            +LE  VA+G LTAK+ W DYC KVK+ P Y AVA N SGS PKDLFED  +ELE QYH+D
Sbjct: 671  LLEEHVASGSLTAKTHWLDYCLKVKDLPPYQAVATNTSGSKPKDLFEDVSEELEKQYHDD 730

Query: 861  KSRIKDVMKLSKMTMTSTWTFDEFKESVSPDLGSPPISDINFKLVFEDQLERIKEKEEKE 920
            K+RIKD MKL K+TM STWTF++FK +V+ D+GSPPISDIN KL++E+ +ER KEKEEKE
Sbjct: 731  KTRIKDAMKLGKITMVSTWTFEDFKGAVADDIGSPPISDINLKLLYEELVERAKEKEEKE 790

Query: 921  AKKRQHLVDDFTDLLRSLKDITASSTWEDSRQLFEDSEEYREIGNETLAMETFKDCVVYL 980
            AKK+Q L DDFT LL +LK++T SS WED + LFE+S+EYR IG E+L+ E F++ V +L
Sbjct: 791  AKKQQRLADDFTKLLYTLKEVTPSSNWEDCKPLFEESQEYRSIGEESLSKEIFEEYVTHL 850

Query: 981  QEKAKEKERKREDEKVKKEKEREEKEKRKEKDR----KEKDKDRDREKRKERSKKEESDD 1040
            QEKAKEKERKRE+EK +KEKEREEK+KRKEK+R    KEK+K+R+REK K+R+KK E+D 
Sbjct: 851  QEKAKEKERKREEEKARKEKEREEKDKRKEKERKEKEKEKEKEREREKGKQRTKKNETDG 910

Query: 1041 EAVDATDSHSNKDENRKEKEKDRDRKHKKRHHDNTIDDGSSDRDDDKERRHRKRHHDTTD 1100
            E VDA+D + +KD+ ++EK                          DK+R+HRKRH    D
Sbjct: 911  ENVDASDGYGHKDDKKREK--------------------------DKDRKHRKRHQSAID 970

Query: 1101 DVSSDKDDKEDHKKSRRHGSDRKKSRKHEHSPDSDGESKHKRHKRDHRDGSRRTGAHDDL 1144
            DV+SDKD+KE+ KKSR+H SDRKKSRKH ++P+SDGES+HKRHKRDHRDGSRR G++++L
Sbjct: 971  DVNSDKDEKEESKKSRKHSSDRKKSRKHTYTPESDGESQHKRHKRDHRDGSRRNGSNEEL 1008

BLAST of Spo27314.1 vs. ExPASy Swiss-Prot
Match: PR40A_ARATH (Pre-mRNA-processing protein 40A OS=Arabidopsis thaliana GN=PRP40A PE=1 SV=1)

HSP 1 Score: 841.6 bits (2173), Expect = 9.800e-243
Identity = 537/1030 (52.14%), Positives = 706/1030 (68.54%), Query Frame = 1

Query: 141  QYRPAVPGQQGQPYLPGSAQQFLAPGQNIPSGHNQPMQFSQPMQFSQPMQQ---LPPRPG 200
            Q+RP VPGQQGQ ++P ++Q F       P GH  P   SQP Q+SQP+QQ    P RPG
Sbjct: 12   QFRPMVPGQQGQHFVPAASQPFH------PYGHVPPNVQSQPPQYSQPIQQQQLFPVRPG 71

Query: 201  MP-GVPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIPFSSSYTFQPV 260
             P  +  SSQ +++PY Q N+ +T G+ Q Q +APP     +G    G PFSS YTF P 
Sbjct: 72   QPVHITSSSQAVSVPYIQTNKILTSGSTQPQPNAPPM----TGFATSGPPFSSPYTFVPS 131

Query: 261  SHLSTPAAS---------VGGQP-----WLSSGSQSAAPFVPVPQTADQSSGSAAPVPAV 320
            S+      S         V G P     W    +QS +   PV QT  Q+  + +     
Sbjct: 132  SYPQQQPTSLVQPNSQMHVAGVPPAANTWPVPVNQSTSLVSPVQQTGQQTPVAVS----T 191

Query: 321  NPPDTSQQSSSDWQEHNTPDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTTP 380
            +P + + QS+SDWQEH + DGR+YYYNK+TKQS+WEKP+ELMTP ERADASTVWKEFTTP
Sbjct: 192  DPGNLTPQSASDWQEHTSADGRKYYYNKRTKQSNWEKPLELMTPLERADASTVWKEFTTP 251

Query: 381  EGKKYYFNKVTKESKWTIPEDLKLAREQAAKAVSQGAQLDAG---MKSHPTTTGGFTSEV 440
            EGKKYY+NKVTKESKWTIPEDLKLAREQA  A  + +  +AG   +  H  ++       
Sbjct: 252  EGKKYYYNKVTKESKWTIPEDLKLAREQAQLASEKTSLSEAGSTPLSHHAASSSDLAVST 311

Query: 441  APSTNPVSGSS---NGSSSVSGASSIPLSFPSGVNPAPVVNDASSELLMEHSAAPTSMAV 500
              S  P + S+   + SS +    ++P++ P  V P                  PTS A+
Sbjct: 312  VTSVVPSTSSALTGHSSSPIQAGLAVPVTRPPSVAP----------------VTPTSGAI 371

Query: 501  TNALAMTPLSASISGDDALPAALNASSITVNASDKLPSQEISTSTDGGSAHDLEELRKGI 560
            ++  A T     I GD+                  L S+    S DG +A + E   K +
Sbjct: 372  SDTEATT-----IKGDN------------------LSSRGADDSNDGATAQNNEAENKEM 431

Query: 561  TA-GKVSLS---EKPTNDEPLVFANKQEAKGAFKSLLESANVHSDWNWDQAMRVIVNDKR 620
            +  GK +LS   +K   +EP+V+A KQEAK AFKSLLES NVHSDW W+Q ++ IV+DKR
Sbjct: 432  SVNGKANLSPAGDKANVEEPMVYATKQEAKAAFKSLLESVNVHSDWTWEQTLKEIVHDKR 491

Query: 621  YGALKTLGERKQAFNEYLGQRRKQEAEERRLRQKKAKEEFTKMLEESEVLASSMKWSKAV 680
            YGAL+TLGERKQAFNEYLGQR+K EAEERR RQKKA+EEF KMLEE E L+SS+KWSKA+
Sbjct: 492  YGALRTLGERKQAFNEYLGQRKKVEAEERRRRQKKAREEFVKMLEECEELSSSLKWSKAM 551

Query: 681  TMFEDDERFKAVEKPKDRQELFDNYLVELQKKEKEKADEEYRRNREDYWTFLESCDFIEA 740
            ++FE+D+RFKAV++P+DR++LFDNY+VEL++KE+EKA EE+R+   DY  FLE+CD+I+A
Sbjct: 552  SLFENDQRFKAVDRPRDREDLFDNYIVELERKEREKAAEEHRQYMADYRKFLETCDYIKA 611

Query: 741  NSLWRKVQDRLEDDERCSRLEKIDRLEIFQDYIRFLEKGEEEQKKLQKEQLRRAERKNRD 800
             + WRK+QDRLEDD+RCS LEKIDRL  F++YI  LEK EEE K+++KE +RRAERKNRD
Sbjct: 612  GTQWRKIQDRLEDDDRCSCLEKIDRLIGFEEYILDLEKEEEELKRVEKEHVRRAERKNRD 671

Query: 801  EFRKVLEADVAAGVLTAKSIWRDYCAKVKESPAYLAVARNISGSTPKDLFEDAVDELENQ 860
             FR +LE  VAAG+LTAK+ W DYC ++K+ P Y AVA N SGSTPKDLFED  +ELE Q
Sbjct: 672  AFRTLLEEHVAAGILTAKTYWLDYCIELKDLPQYQAVASNTSGSTPKDLFEDVTEELEKQ 731

Query: 861  YHEDKSRIKDVMKLSKMTMTSTWTFDEFKESVSPDLGSPPISDINFKLVFEDQLERIKEK 920
            YHEDKS +KD MK  K++M S+W F++FK ++S DL +  ISDIN KL+++D + R+KEK
Sbjct: 732  YHEDKSYVKDAMKSRKISMVSSWLFEDFKSAISEDLSTQQISDINLKLIYDDLVGRVKEK 791

Query: 921  EEKEAKKRQHLVDDFTDLLRSLKDITASSTWEDSRQLFEDSEEYREIGNETLAMETFKDC 980
            EEKEA+K Q L ++FT+LL + K+IT +S WEDS+QL E+S+EYR IG+E+++   F++ 
Sbjct: 792  EEKEARKLQRLAEEFTNLLHTFKEITVASNWEDSKQLVEESQEYRSIGDESVSQGLFEEY 851

Query: 981  VVYLQEKAKEKERKREDEKVKKEKEREEKEKRKEKD--RKEKDKDRDREKRKERSKKEES 1040
            +  LQEKAKEKERKR++EKV+KEKER+EKEKRK+KD  R+EK+++R++EK KERSK+EES
Sbjct: 852  ITSLQEKAKEKERKRDEEKVRKEKERDEKEKRKDKDKERREKEREREKEKGKERSKREES 911

Query: 1041 DDE-AVDATDSHSNKDENRKEKEKDRDRKHKKRHHDNTIDDGSSDRDDDKERRHRKRHHD 1100
            D E A+D ++ H  KDE  K K KDRDRKH++RHH+N+ +D SSDRDD  E +       
Sbjct: 912  DGETAMDVSEGH--KDE--KRKGKDRDRKHRRRHHNNSDEDVSSDRDDRDESK------- 958

Query: 1101 TTDDVSSDKDDKEDHKKSRRHGSDRKKSRKHEHSPDSDGESKHKRHKRDHRDGSRRTGAH 1140
                           K SR+HG+DRKKSRKH +SP+S+ E++HKR K   ++ SRR+G +
Sbjct: 972  ---------------KSSRKHGNDRKKSRKHANSPESESENRHKRQK---KESSRRSG-N 958

BLAST of Spo27314.1 vs. ExPASy Swiss-Prot
Match: PR40B_ARATH (Pre-mRNA-processing protein 40B OS=Arabidopsis thaliana GN=PRP40B PE=1 SV=1)

HSP 1 Score: 519.2 bits (1336), Expect = 1.100e-145
Identity = 409/1008 (40.58%), Positives = 577/1008 (57.24%), Query Frame = 1

Query: 141  QYRPAVPGQQGQPYLPGSAQQF--LAPGQNIPSGHNQPMQFS-QPMQFSQPMQQLPPRPG 200
            Q+ P +   Q +     S+Q F  +  G  + S    P  ++ Q +Q      + P +  
Sbjct: 34   QFLPTIQAPQSEQVARLSSQNFQCVGRGGTVLSIGYPPQSYAPQLLQSMHHSHERPSQLN 93

Query: 201  MPGVPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPS-GVGGMGIPFSS-SYTFQP 260
               V     G      QPN  +  G   +Q    P+   P  G+ G G P +  SY    
Sbjct: 94   QVQVQHVPLGPPTLISQPNVSIASGTSLHQ----PYVQTPDIGMPGFGGPRALFSYPSAT 153

Query: 261  VSHLSTPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPVPAVNPPDTSQQSSSDW 320
                S     V G    S   Q A+      +++  +     P  A   P  SQ++ +DW
Sbjct: 154  SYEGSRVPPQVTGPSIHSQAQQRASIIHTSAESSIMNPTFEQPKAAFLKPLPSQKALTDW 213

Query: 321  QEHNTPDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTTPEGKKYYFNKVTKE 380
             EH + DGR+Y++NK+TK+S+WEKPVELMT  ERADA T WKE ++P+G+KYY+NK+TK+
Sbjct: 214  VEHTSADGRKYFFNKRTKKSTWEKPVELMTLFERADARTDWKEHSSPDGRKYYYNKITKQ 273

Query: 381  SKWTIPEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSEVAPSTNPVSGSSNGSSS 440
            S WT+PE++K+ REQA  A  QG   +  + +    T   T+  A  T   S +S     
Sbjct: 274  STWTMPEEMKIVREQAEIASVQGPHAEGIIDASEVLTRSDTASTAAPTGLPSQTSTSEGV 333

Query: 441  VSGASSIPLSFPSGV--NPAPVVNDASSELLMEHSAAPTSMAVTNALAMTPLSASISGDD 500
                 +  L  P+ V  + +PV N    ++  + ++     + T+ L++     S +   
Sbjct: 334  EKLTLTSDLKQPASVPGSSSPVENVDRVQMSADETSQLCDTSETDGLSVPVTETSAA--- 393

Query: 501  ALPAALNASSITVNASDKLPSQEISTSTDGGSAHDLEELRKGITAGKVS--LSEKPTNDE 560
                 +    I+V  S          +  G  +   E  +  + + KV     EK  + E
Sbjct: 394  ---TLVEKDEISVGNSGDSDDMSTKNANQGSGSGPKESQKPMVESEKVESQTEEKQIHQE 453

Query: 561  PLVFANKQEAKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGALKTLGERKQAFNEYLG 620
               F NK EA   FKSLL+SA V SDW W+QAMR I+NDKRYGAL+TLGERKQAFNE+L 
Sbjct: 454  SFSFNNKLEAVDVFKSLLKSAKVGSDWTWEQAMREIINDKRYGALRTLGERKQAFNEFLL 513

Query: 621  QRRKQEAEERRLRQKKAKEEFTKMLEESEVLASSMKWSKAVTMFEDDERFKAVEKPKDRQ 680
            Q ++   EER  RQKK  E+F +MLEE   L  S +WSK VTMFEDDERFKA+E+ KDR+
Sbjct: 514  QTKRAAEEERLARQKKLYEDFKRMLEECVELTPSTRWSKTVTMFEDDERFKALEREKDRR 573

Query: 681  ELFDNYLVELQKKEKEKADEEYRRNREDYWTFLESCDFIEANSLWRKVQDRLEDDERCSR 740
             +F++++ EL++K + KA E+ +RN  +Y  FLESC+FI+ NS WRKVQDRLE DERCSR
Sbjct: 574  NIFEDHVSELKEKGRVKALEDRKRNIIEYKRFLESCNFIKPNSQWRKVQDRLEVDERCSR 633

Query: 741  LEKIDRLEIFQDYIRFLEKGEEEQKKLQKEQLRRAERKNRDEFRKVLEADVAAGVLTAKS 800
            LEKID+LEIFQ+Y+R LE+ EEE+KK+QKE+L++ ERK+RDEF  +L+  +A G LTAK+
Sbjct: 634  LEKIDQLEIFQEYLRDLEREEEEKKKIQKEELKKVERKHRDEFHGLLDEHIATGELTAKT 693

Query: 801  IWRDYCAKVKESPAYLAVARNISGSTPKDLFEDAVDELENQYHEDKSRIKDVMKLSKMTM 860
            IWRDY  KVK+ P Y A+A N SG+TPKDLFEDAV++L+ + HE KS+IKDV+KL K+ +
Sbjct: 694  IWRDYLMKVKDLPVYSAIASNSSGATPKDLFEDAVEDLKKRDHELKSQIKDVLKLRKVNL 753

Query: 861  TSTWTFDEFKESVSPDLGSPPISDINFKLVFEDQLERIKEKEEKEAKKRQHLVDDFTDLL 920
            ++  TFDEFK S+S D+G P I D+  KLVF+D LER KEKEEKEA+K+    +   D+L
Sbjct: 754  SAGSTFDEFKVSISEDIGFPLIPDVRLKLVFDDLLERAKEKEEKEARKQTRQTEKLVDML 813

Query: 921  RSLKDITASSTWEDSRQLFEDSEEYREIGNETLAMETFKDCVVYLQEKAKEKERKREDEK 980
            RS KDITASS+WE+ + L E SE+   IG+E+     F+D V  L+E   +  R ++++K
Sbjct: 814  RSFKDITASSSWEELKHLVEGSEKCSTIGDESFRKRCFEDYVSLLKE---QSNRIKQNKK 873

Query: 981  VKKEKEREEKEKRKEKDRKEKDKDRDREKRKERSKKEESDDEAVDATDSHSNKDENRKEK 1040
            V  E  REE +K ++K  +EKD+ R+R          +SDD          N D N +  
Sbjct: 874  V-PEDVREEHDKGRDKYGREKDRVRER----------DSDDHHKKGAAGKYNHDMN-EPH 933

Query: 1041 EKDRDRKHKKRHHDNTIDDGSSDRDDDKERRHRKRHHDTTDDVSSDKDDKEDH-KKSRRH 1100
             K+R R  +  H+                 RHR+RH       +S K++  DH K+S + 
Sbjct: 934  GKERRRSGRDSHN-----------------RHRERH-------TSVKENDTDHFKESHKA 990

Query: 1101 GSDRKKSRKHEHSPDSDGESKHKRHKRDHRDGSRRTGAHDDLEDGELG 1139
            G   KKSR H+    S+ E + K  KR  ++ +R     ++LEDGE G
Sbjct: 994  GGGHKKSR-HQRGWVSEAEVEGK-EKRRRKEEAREHTKEEELEDGECG 990

BLAST of Spo27314.1 vs. ExPASy Swiss-Prot
Match: PR40A_HUMAN (Pre-mRNA-processing factor 40 homolog A OS=Homo sapiens GN=PRPF40A PE=1 SV=2)

HSP 1 Score: 240.0 bits (611), Expect = 1.300e-61
Identity = 267/915 (29.18%), Positives = 437/915 (47.76%), Query Frame = 1

Query: 232  PFGNHPSGVGGMGIPFSSSYTFQPVSHLSTPAASVGGQPWLSSGSQSAAPFVPVPQTADQ 291
            P G HP G      P       Q +  +  P   +G  P + S   S  P + +   +  
Sbjct: 65   PMGMHPMGQRANMPPVPHGMMPQMMPPMGGPP--MGQMPGMMS---SVMPGMMMSHMSQA 124

Query: 292  SSGSAAPVPAVNPPD----TSQQSSSDWQEHNTPDGRRYYYNKKTKQSSWEKPVELMTPT 351
            S   A P P VN  D    T+  + S W EH +PDGR YYYN +TKQS+WEKP +L TP 
Sbjct: 125  SMQPALP-PGVNSMDVAAGTASGAKSMWTEHKSPDGRTYYYNTETKQSTWEKPDDLKTPA 184

Query: 352  ERADASTVWKEFTTPEGKKYYFNKVTKESKWTIPEDLK----LAREQAAKAVSQGAQLDA 411
            E+  +   WKE+ +  GK YY+N  TKES+W  P++L+          A ++   + L A
Sbjct: 185  EQLLSKCPWKEYKSDSGKPYYYNSQTKESRWAKPKELEDLEGYQNTIVAGSLITKSNLHA 244

Query: 412  GMKSHPTTTGGFTSEVAPSTNPVSGSSNGSSSVSGASSIPLSFPSGVNPAPVVNDASSEL 471
             +K+  ++     +  + +  P +      S+++ A                  +A++ +
Sbjct: 245  MIKAEESSKQEECTTTSTAPVPTTEIPTTMSTMAAA------------------EAAAAV 304

Query: 472  LMEHSAAPTSMAVTNALAMTPLSASISGD---------DALPAAL--NASSITVNASDK- 531
            +   +AA  + A  NA A T  S ++SG           ++ A +  N +++T++  ++ 
Sbjct: 305  VAAAAAAAAAAAAANANASTSASNTVSGTVPVVPEPEVTSIVATVVDNENTVTISTEEQA 364

Query: 532  -------LPSQEISTSTDGGSAHDLEELRKGITAGKVSLSEKPTNDEPLVFANKQEAKGA 591
                   +  Q +  S++ G     +E     T  K     +P   +   +  K+EAK A
Sbjct: 365  QLTSTPAIQDQSVEVSSNTGEETSKQETVADFTPKKEEEESQPAK-KTYTWNTKEEAKQA 424

Query: 592  FKSLLESANVHSDWNWDQAMRVIVNDKRYGALKTLGERKQAFNEYLGQRRKQEAEERRLR 651
            FK LL+   V S+ +W+QAM++I+ND RY AL  L E+KQAFN Y  Q  K+E EE R +
Sbjct: 425  FKELLKEKRVPSNASWEQAMKMIINDPRYSALAKLSEKKQAFNAYKVQTEKEEKEEARSK 484

Query: 652  QKKAKEEFTKMLEESEVLASSMKWSKAVTMFEDDERFKAVEKPKDRQELFDNYLVELQKK 711
             K+AKE F + LE  E + S+ ++ KA  MF + E + A+ + +DR E++++ L  L KK
Sbjct: 485  YKEAKESFQRFLENHEKMTSTTRYKKAEQMFGEMEVWNAISE-RDRLEIYEDVLFFLSKK 544

Query: 712  EKEKADEEYRRNREDYWTFLESCDFIEANSLWRKVQDRLED------DERCSRLEKIDRL 771
            EKE+A +  +RN E     L++   +  ++ W + Q  L D      DE    ++K D L
Sbjct: 545  EKEQAKQLRKRNWEALKNILDNMANVTYSTTWSEAQQYLMDNPTFAEDEELQNMDKEDAL 604

Query: 772  EIFQDYIRFLEKGEEEQKKLQKEQLRRAERKNRDEFRKVLEADVAAGVLTAKSIWRDYCA 831
              F+++IR LEK EEE+K+    + RR +RKNR+ F+  L+     G L + S W +   
Sbjct: 605  ICFEEHIRALEKEEEEEKQKSLLRERRRQRKNRESFQIFLDELHEHGQLHSMSSWMELYP 664

Query: 832  KVKESPAYLAVARNISGSTPKDLFEDAVDELENQYHEDKSRIKDVMKLSKMTMTSTWTFD 891
             +     +  +     GST  DLF+  V++L+ +YH++K  IKD++K     +    TF+
Sbjct: 665  TISSDIRFTNMLGQ-PGSTALDLFKFYVEDLKARYHDEKKIIKDILKDKGFVVEVNTTFE 724

Query: 892  EFKESVSPDLGSPPISDINFKLVFEDQLE----RIKEKEEKEAKKRQHLVDDFTDLLR-S 951
            +F   +S    S  +   N KL F   LE    R +E+E++EA+K +     F  +L+ +
Sbjct: 725  DFVAIISSTKRSTTLDAGNIKLAFNSLLEKAEAREREREKEEARKMKRKESAFKSMLKQA 784

Query: 952  LKDITASSTWEDSRQLFEDSEEYREIGNETLAMETFKDCVVYLQEKAKEKERKREDEKVK 1011
               I   + WED R+ F     + +I  E+     FKD +  L+ + +    K +    K
Sbjct: 785  APPIELDAVWEDIRERFVKEPAFEDITLESERKRIFKDFMHVLEHECQHHHSKNKKHS-K 844

Query: 1012 KEKEREEKEKRKEKDRKEKDKDRDREKRKERSKKEESDDEAVDATDSHSNKDENRKEKEK 1071
            K K+   K  R        D D   +K+++RS+   + + +  A    S K      K K
Sbjct: 845  KSKKHHRKRSRSRSGSDSDDDDSHSKKKRQRSESRSASEHSSSAESERSYK------KSK 904

Query: 1072 DRDRKHKKRHHDNTIDDGSSDRDDDKERRHRKRHHDTTDDVSSDKDDKEDHKKSRRHGSD 1109
               +K KKR H +   +  ++R+ DK+ + R+   D T   S  K           H S 
Sbjct: 905  KHKKKSKKRRHKSDSPESDAEREKDKKEKDRESEKDRTRQRSESK-----------HKSP 934

BLAST of Spo27314.1 vs. ExPASy Swiss-Prot
Match: PR40A_MOUSE (Pre-mRNA-processing factor 40 homolog A OS=Mus musculus GN=Prpf40a PE=1 SV=1)

HSP 1 Score: 228.0 bits (580), Expect = 5.100e-58
Identity = 283/987 (28.67%), Positives = 445/987 (45.09%), Query Frame = 1

Query: 189  MQQLPPRPG-------MPGVPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVG 248
            M+  PP  G       + G  + S       G   R  ++G     H AP  G HP G  
Sbjct: 17   MESQPPSRGPGDGERRLSGSNLCSSSWVSADGFLRRRPSMG-HPGMHYAP-MGMHPMG-- 76

Query: 249  GMGIPFSSSYTFQPVSHLSTPAAS--VGGQPW-LSSGSQSAAPFVPVPQTADQSSGSAAP 308
                         PV H   P     +GG P     G  S+     +     Q+S   A 
Sbjct: 77   -------QRANMPPVPHGMMPQMMPPMGGPPMGQMPGMMSSVMSGMMMSHMSQASMQPAL 136

Query: 309  VPAVNPPDTSQQSSSD----WQEHNTPDGRRYYYNKKTKQSSWEKPVELMTPTERADAST 368
             P VN  D +  ++S     W EH +PDGR YYYN +TKQS+WEKP +L TP E+  +  
Sbjct: 137  PPGVNSMDVAAGAASGAKSMWTEHKSPDGRTYYYNTETKQSTWEKPDDLKTPAEQLLSKC 196

Query: 369  VWKEFTTPEGKKYYFNKVTKESKWTIPEDLKLAREQAAKAVSQG----AQLDAGMKSHPT 428
             WKE+ +  GK YY+N  TKES+W  P++L+         V+ G    + L A +K+  +
Sbjct: 197  PWKEYKSDSGKPYYYNSQTKESRWAKPKELEDLEGYQNTIVAGGLITKSNLHAMIKAEES 256

Query: 429  TTGGFTSEVAPSTNPVSGSSNGSSSVSGASSIPLSFPSGVNPAPVVNDASSELLMEHSAA 488
            +     +  + +  P +      S+++ A +      +    A   N            A
Sbjct: 257  SKQEECTTASTAPVPTTEIPTTMSTMAAAEAAAAVVAAAAAAAAAAN------------A 316

Query: 489  PTSMAVTNALAMTPLSASISGDDALPAAL-NASSITVNASDKLPSQEISTSTDGG---SA 548
             TS   TN +   P++        +  A+ N +++TV+  ++      +   D     S+
Sbjct: 317  NTSTTPTNTVGSVPVAPEPEVTSIVATAVDNENTVTVSTEEQAQLANTTAIQDLSGDISS 376

Query: 549  HDLEELRKGITAGKVSLSEKPTNDEPL----VFANKQEAKGAFKSLLESANVHSDWNWDQ 608
            +  EE  K  T    +  ++    +P      +  K+EAK AFK LL+   V S+ +W+Q
Sbjct: 377  NTGEEPAKQETVSDFTPKKEEEESQPAKKTYTWNTKEEAKQAFKELLKEKRVPSNASWEQ 436

Query: 609  AMRVIVNDKRYGALKTLGERKQAFNEYLGQRRKQEAEERRLRQKKAKEEFTKMLEESEVL 668
            AM++I+ND RY AL  L E+KQAFN Y  Q  K+E EE R + K+AKE F + LE  E +
Sbjct: 437  AMKMIINDPRYSALAKLSEKKQAFNAYKVQTEKEEKEEARSKYKEAKESFQRFLENHEKM 496

Query: 669  ASSMKWSKAVTMFEDDERFKAVEKPKDRQELFDNYLVELQKKEKEKADEEYRRNREDYWT 728
             S+ ++ KA  MF + E + A+ + +DR E++++ L  L KKEKE+A +  +RN E    
Sbjct: 497  TSTTRYKKAEQMFGEMEVWNAISE-RDRLEIYEDVLFFLSKKEKEQAKQLRKRNWEALKN 556

Query: 729  FLESCDFIEANSLWRKVQDRLED------DERCSRLEKIDRLEIFQDYIRFLEKGEEEQK 788
             L++   +  ++ W + Q  L D      DE    ++K D L  F+++IR LEK EEE+K
Sbjct: 557  ILDNMANVTYSTTWSEAQQYLMDNPTFAEDEELQNMDKEDALICFEEHIRALEKEEEEEK 616

Query: 789  KLQKEQLRRAERKNRDEFRKVLEADVAAGVLTAKSIWRDYCAKVKESPAYLAVARNISGS 848
            +    + RR +RKNR+ F+  L+     G L + S W +    +     +  +     GS
Sbjct: 617  QKTLLRERRRQRKNRESFQIFLDELHEHGQLHSMSSWMELYPTISSDIRFTNMLGQ-PGS 676

Query: 849  TPKDLFEDAVDELENQYHEDKSRIKDVMKLSKMTMTSTWTFDEFKESVSPDLGSPPISDI 908
            T  DLF+  V++L+ +YH++K  IKD++K     +    TF++F   +S    S  +   
Sbjct: 677  TALDLFKFYVEDLKARYHDEKKIIKDILKDKGFVVEVNTTFEDFVAIISSTKRSTTLDAG 736

Query: 909  NFKLVFEDQLE----RIKEKEEKEAKKRQHLVDDFTDLLR-SLKDITASSTWEDSRQLFE 968
            N KL F   LE    R +E+E++EA+K +     F  +L+ +   I   + WED R+ F 
Sbjct: 737  NIKLAFNSLLEKAEAREREREKEEARKMKRKESAFKSMLKQATPPIELDAVWEDIRERFV 796

Query: 969  DSEEYREIGNETLAMETFKDCVVYLQEKAKEKERKREDEKVKKEKEREEKEKRKEKDRKE 1028
                + +I  E+     FKD +  L+ + +    K                    K   +
Sbjct: 797  KEPAFEDITLESERKRIFKDFMHVLEHECQHHHSKN-------------------KKHSK 856

Query: 1029 KDKDRDREKRKERSKKEESDDEAVDATDSHSNKDENRKEKEKDRDRKHKKRHHDNTIDDG 1088
            K K   R++ + RS  E  DD      DSHS K   R E     +R              
Sbjct: 857  KSKKHHRKRSRSRSGSESDDD------DSHSKKKRQRSESHSASERS----------SSA 916

Query: 1089 SSDRDDDKERRHRKRHHDTTDDVSSDKDDKEDHKKSRRHGSDRKKSRKHEHSPDSDGESK 1138
             S+R   K ++H+K+         S + D E  K  +    D +K R  + S     ESK
Sbjct: 917  ESERSYKKSKKHKKKSKKRRHKSDSPESDTEREKDKKEKDRDSEKDRSRQRS-----ESK 938

BLAST of Spo27314.1 vs. ExPASy Swiss-Prot
Match: PR40B_MOUSE (Pre-mRNA-processing factor 40 homolog B OS=Mus musculus GN=Prpf40b PE=1 SV=2)

HSP 1 Score: 140.2 bits (352), Expect = 1.400e-31
Identity = 177/609 (29.06%), Positives = 299/609 (49.10%), Query Frame = 1

Query: 545  EKPTNDEP----LVFANKQEAKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGALKTLG 604
            E+    EP    L ++N+++AK AFK LL    V S+ +W+QAM+++V D RY AL  L 
Sbjct: 259  EEEAKPEPERSGLSWSNREKAKQAFKELLRDKAVPSNASWEQAMKMVVTDPRYSALPKLS 318

Query: 605  ERKQAFNEYLGQRRKQEAEERRLRQKKAKEEFTKMLEESEVLASSMKWSKAVTMFEDDER 664
            E+KQAFN Y  QR K+E EE RLR K+AK+     LE+ E + S+ ++ +A   F D E 
Sbjct: 319  EKKQAFNAYKAQREKEEKEEARLRAKEAKQTLQHFLEQHERMTSTTRYRRAEQTFGDLEV 378

Query: 665  FKAVEKPKDRQELFDNYLVELQKKEKEKADEEYRRNREDYWTFLESCDFIEANSLWRKVQ 724
            + AV   ++R+E++D+ L  L KKEKE+A +  RRN +   + L+    +   + W + Q
Sbjct: 379  W-AVVPERERKEVYDDVLFFLAKKEKEQAKQLRRRNIQALKSILDGMSSVNFQTTWSQAQ 438

Query: 725  DRLED------DERCSRLEKIDRLEIFQDYIRFLEKGEEEQKKLQKEQLRRAERKNRDEF 784
              L D      D++   ++K D L  F+++IR LE+ EEE+++  + + RR +RKNR+ F
Sbjct: 439  QYLMDNPSFAQDQQLQNMDKEDALICFEEHIRALEREEEEERERARLRERRQQRKNREAF 498

Query: 785  RKVLEADVAAGVLTAKSIWRDYCAKVKESPAYLAVARNISGSTPKDLFEDAVDELENQYH 844
            +  L+     G L + S W +    V     + A      GSTP DLF+  V+EL+ ++H
Sbjct: 499  QSFLDELHETGQLHSMSTWMELYPAVSTDVRF-ANMLGQPGSTPLDLFKFYVEELKARFH 558

Query: 845  EDKSRIKDVMKLSKMTMTSTWTFDEFKESVSPDLGSPPISDINFKLVFEDQLERIK---- 904
            ++K  IKD++K     +     F++F   +S D  +  +   N KL F   LE+ +    
Sbjct: 559  DEKKIIKDILKDRGFCVEVNTAFEDFAHVISFDKRAAALDAGNIKLTFNSLLEKAEARET 618

Query: 905  EKEEKEAKKRQHLVDDFTDLLR-SLKDITASSTWEDSRQLFEDSEEYREIGNETLAMETF 964
            E+E++EA++ +     F  +LR ++  +   + WE+ R+ F     + +I  E+  +  F
Sbjct: 619  EREKEEARRMRRREAAFRSMLRQAVPALELGTAWEEVRERFVCDSAFEQITLESERIRLF 678

Query: 965  KDCVVYLQEKAKE----KERKREDEKVKKEKEREEKEKRKEKDRKEKDKDRDREKRKERS 1024
            ++ +  L++   +    K RK   +  K  ++R       E D +E      R  ++ R 
Sbjct: 679  REFLQVLEQTECQHLHTKGRKHGRKGKKHHRKRSHSPSGSESDEEELPPPSLRPPKRRRR 738

Query: 1025 KKEESDDEAVDATDSHSN-----------------KDENRKEKEKDRDRKHKKRHHDNTI 1084
               ES  E   + DS  +                   ++   K K   +K KKR H +T 
Sbjct: 739  NPSESGSEPSSSLDSVESGGAALGGPGSPSSHLLLGSDHGLRKTKKPKKKTKKRRHKSTS 798

Query: 1085 DDGSSDRDDDKERRHRKRHHDTTDDVSSDKDDKEDHKKSRRHGSDRKKSR---KHEHSPD 1115
             D  +D +D   +    R  +       D++ ++    +R  G   KK +       S  
Sbjct: 799  PDSETDPEDKAGKESEDREQE------QDREPRQAELPNRSPGFGIKKEKTGWDTSESEL 858

BLAST of Spo27314.1 vs. TAIR (Arabidopsis)
Match: AT1G44910.1 (pre-mRNA-processing protein 40A)

HSP 1 Score: 841.6 bits (2173), Expect = 5.500e-244
Identity = 537/1030 (52.14%), Positives = 706/1030 (68.54%), Query Frame = 1

Query: 141  QYRPAVPGQQGQPYLPGSAQQFLAPGQNIPSGHNQPMQFSQPMQFSQPMQQ---LPPRPG 200
            Q+RP VPGQQGQ ++P ++Q F       P GH  P   SQP Q+SQP+QQ    P RPG
Sbjct: 12   QFRPMVPGQQGQHFVPAASQPFH------PYGHVPPNVQSQPPQYSQPIQQQQLFPVRPG 71

Query: 201  MP-GVPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPSGVGGMGIPFSSSYTFQPV 260
             P  +  SSQ +++PY Q N+ +T G+ Q Q +APP     +G    G PFSS YTF P 
Sbjct: 72   QPVHITSSSQAVSVPYIQTNKILTSGSTQPQPNAPPM----TGFATSGPPFSSPYTFVPS 131

Query: 261  SHLSTPAAS---------VGGQP-----WLSSGSQSAAPFVPVPQTADQSSGSAAPVPAV 320
            S+      S         V G P     W    +QS +   PV QT  Q+  + +     
Sbjct: 132  SYPQQQPTSLVQPNSQMHVAGVPPAANTWPVPVNQSTSLVSPVQQTGQQTPVAVS----T 191

Query: 321  NPPDTSQQSSSDWQEHNTPDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTTP 380
            +P + + QS+SDWQEH + DGR+YYYNK+TKQS+WEKP+ELMTP ERADASTVWKEFTTP
Sbjct: 192  DPGNLTPQSASDWQEHTSADGRKYYYNKRTKQSNWEKPLELMTPLERADASTVWKEFTTP 251

Query: 381  EGKKYYFNKVTKESKWTIPEDLKLAREQAAKAVSQGAQLDAG---MKSHPTTTGGFTSEV 440
            EGKKYY+NKVTKESKWTIPEDLKLAREQA  A  + +  +AG   +  H  ++       
Sbjct: 252  EGKKYYYNKVTKESKWTIPEDLKLAREQAQLASEKTSLSEAGSTPLSHHAASSSDLAVST 311

Query: 441  APSTNPVSGSS---NGSSSVSGASSIPLSFPSGVNPAPVVNDASSELLMEHSAAPTSMAV 500
              S  P + S+   + SS +    ++P++ P  V P                  PTS A+
Sbjct: 312  VTSVVPSTSSALTGHSSSPIQAGLAVPVTRPPSVAP----------------VTPTSGAI 371

Query: 501  TNALAMTPLSASISGDDALPAALNASSITVNASDKLPSQEISTSTDGGSAHDLEELRKGI 560
            ++  A T     I GD+                  L S+    S DG +A + E   K +
Sbjct: 372  SDTEATT-----IKGDN------------------LSSRGADDSNDGATAQNNEAENKEM 431

Query: 561  TA-GKVSLS---EKPTNDEPLVFANKQEAKGAFKSLLESANVHSDWNWDQAMRVIVNDKR 620
            +  GK +LS   +K   +EP+V+A KQEAK AFKSLLES NVHSDW W+Q ++ IV+DKR
Sbjct: 432  SVNGKANLSPAGDKANVEEPMVYATKQEAKAAFKSLLESVNVHSDWTWEQTLKEIVHDKR 491

Query: 621  YGALKTLGERKQAFNEYLGQRRKQEAEERRLRQKKAKEEFTKMLEESEVLASSMKWSKAV 680
            YGAL+TLGERKQAFNEYLGQR+K EAEERR RQKKA+EEF KMLEE E L+SS+KWSKA+
Sbjct: 492  YGALRTLGERKQAFNEYLGQRKKVEAEERRRRQKKAREEFVKMLEECEELSSSLKWSKAM 551

Query: 681  TMFEDDERFKAVEKPKDRQELFDNYLVELQKKEKEKADEEYRRNREDYWTFLESCDFIEA 740
            ++FE+D+RFKAV++P+DR++LFDNY+VEL++KE+EKA EE+R+   DY  FLE+CD+I+A
Sbjct: 552  SLFENDQRFKAVDRPRDREDLFDNYIVELERKEREKAAEEHRQYMADYRKFLETCDYIKA 611

Query: 741  NSLWRKVQDRLEDDERCSRLEKIDRLEIFQDYIRFLEKGEEEQKKLQKEQLRRAERKNRD 800
             + WRK+QDRLEDD+RCS LEKIDRL  F++YI  LEK EEE K+++KE +RRAERKNRD
Sbjct: 612  GTQWRKIQDRLEDDDRCSCLEKIDRLIGFEEYILDLEKEEEELKRVEKEHVRRAERKNRD 671

Query: 801  EFRKVLEADVAAGVLTAKSIWRDYCAKVKESPAYLAVARNISGSTPKDLFEDAVDELENQ 860
             FR +LE  VAAG+LTAK+ W DYC ++K+ P Y AVA N SGSTPKDLFED  +ELE Q
Sbjct: 672  AFRTLLEEHVAAGILTAKTYWLDYCIELKDLPQYQAVASNTSGSTPKDLFEDVTEELEKQ 731

Query: 861  YHEDKSRIKDVMKLSKMTMTSTWTFDEFKESVSPDLGSPPISDINFKLVFEDQLERIKEK 920
            YHEDKS +KD MK  K++M S+W F++FK ++S DL +  ISDIN KL+++D + R+KEK
Sbjct: 732  YHEDKSYVKDAMKSRKISMVSSWLFEDFKSAISEDLSTQQISDINLKLIYDDLVGRVKEK 791

Query: 921  EEKEAKKRQHLVDDFTDLLRSLKDITASSTWEDSRQLFEDSEEYREIGNETLAMETFKDC 980
            EEKEA+K Q L ++FT+LL + K+IT +S WEDS+QL E+S+EYR IG+E+++   F++ 
Sbjct: 792  EEKEARKLQRLAEEFTNLLHTFKEITVASNWEDSKQLVEESQEYRSIGDESVSQGLFEEY 851

Query: 981  VVYLQEKAKEKERKREDEKVKKEKEREEKEKRKEKD--RKEKDKDRDREKRKERSKKEES 1040
            +  LQEKAKEKERKR++EKV+KEKER+EKEKRK+KD  R+EK+++R++EK KERSK+EES
Sbjct: 852  ITSLQEKAKEKERKRDEEKVRKEKERDEKEKRKDKDKERREKEREREKEKGKERSKREES 911

Query: 1041 DDE-AVDATDSHSNKDENRKEKEKDRDRKHKKRHHDNTIDDGSSDRDDDKERRHRKRHHD 1100
            D E A+D ++ H  KDE  K K KDRDRKH++RHH+N+ +D SSDRDD  E +       
Sbjct: 912  DGETAMDVSEGH--KDE--KRKGKDRDRKHRRRHHNNSDEDVSSDRDDRDESK------- 958

Query: 1101 TTDDVSSDKDDKEDHKKSRRHGSDRKKSRKHEHSPDSDGESKHKRHKRDHRDGSRRTGAH 1140
                           K SR+HG+DRKKSRKH +SP+S+ E++HKR K   ++ SRR+G +
Sbjct: 972  ---------------KSSRKHGNDRKKSRKHANSPESESENRHKRQK---KESSRRSG-N 958

BLAST of Spo27314.1 vs. TAIR (Arabidopsis)
Match: AT3G19670.1 (pre-mRNA-processing protein 40B)

HSP 1 Score: 521.9 bits (1343), Expect = 9.700e-148
Identity = 409/1009 (40.54%), Positives = 576/1009 (57.09%), Query Frame = 1

Query: 141  QYRPAVPGQQGQPYLPGSAQQF--LAPGQNIPSGHNQPMQFS-QPMQFSQPMQQLPPRPG 200
            Q+ P +   Q +     S+Q F  +  G  + S    P  ++ Q +Q      + P +  
Sbjct: 34   QFLPTIQAPQSEQVARLSSQNFQCVGRGGTVLSIGYPPQSYAPQLLQSMHHSHERPSQLN 93

Query: 201  MPGVPMSSQGMAMPYGQPNRPMTLGAQQNQHSAPPFGNHPS-GVGGMGIPFSS-SYTFQP 260
               V     G      QPN  +  G   +Q    P+   P  G+ G G P +  SY    
Sbjct: 94   QVQVQHVPLGPPTLISQPNVSIASGTSLHQ----PYVQTPDIGMPGFGGPRALFSYPSAT 153

Query: 261  VSHLSTPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPVPAVNPPDTSQQSSSDW 320
                S     V G    S   Q A+      +++  +     P  A   P  SQ++ +DW
Sbjct: 154  SYEGSRVPPQVTGPSIHSQAQQRASIIHTSAESSIMNPTFEQPKAAFLKPLPSQKALTDW 213

Query: 321  QEHNTPDGRRYYYNKKTKQSSWEKPVELMTPTERADASTVWKEFTTPEGKKYYFNKVTKE 380
             EH + DGR+Y++NK+TK+S+WEKPVELMT  ERADA T WKE ++P+G+KYY+NK+TK+
Sbjct: 214  VEHTSADGRKYFFNKRTKKSTWEKPVELMTLFERADARTDWKEHSSPDGRKYYYNKITKQ 273

Query: 381  SKWTIPEDLKLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSEVAPSTNPVSGSSNGSSS 440
            S WT+PE++K+ REQA  A  QG   +  + +    T   T+  A  T   S +S     
Sbjct: 274  STWTMPEEMKIVREQAEIASVQGPHAEGIIDASEVLTRSDTASTAAPTGLPSQTSTSEGV 333

Query: 441  VSGASSIPLSFPSGV--NPAPVVNDASSELLMEHSAAPTSMAVTNALAMTPLSASISGDD 500
                 +  L  P+ V  + +PV N    ++  + ++     + T+ L++     S +   
Sbjct: 334  EKLTLTSDLKQPASVPGSSSPVENVDRVQMSADETSQLCDTSETDGLSVPVTETSAA--- 393

Query: 501  ALPAALNASSITVNASDKLPSQEISTSTDGGSAHDLEELRKGITAGKVS--LSEKPTNDE 560
                 +    I+V  S          +  G  +   E  +  + + KV     EK  + E
Sbjct: 394  ---TLVEKDEISVGNSGDSDDMSTKNANQGSGSGPKESQKPMVESEKVESQTEEKQIHQE 453

Query: 561  PLVFANKQEAKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGALKTLGERKQAFNEYLG 620
               F NK EA   FKSLL+SA V SDW W+QAMR I+NDKRYGAL+TLGERKQAFNE+L 
Sbjct: 454  SFSFNNKLEAVDVFKSLLKSAKVGSDWTWEQAMREIINDKRYGALRTLGERKQAFNEFLL 513

Query: 621  QRRKQEAEERRLRQKKAKEEFTKMLEESEVLASSMKWSKAVTMFEDDERFKAVEKPKDRQ 680
            Q ++   EER  RQKK  E+F +MLEE   L  S +WSK VTMFEDDERFKA+E+ KDR+
Sbjct: 514  QTKRAAEEERLARQKKLYEDFKRMLEECVELTPSTRWSKTVTMFEDDERFKALEREKDRR 573

Query: 681  ELFDNYLVELQKKEKEKADEEYRRNREDYWTFLESCDFIEANSLWRKVQDRLEDDERCSR 740
             +F++++ EL++K + KA E+ +RN  +Y  FLESC+FI+ NS WRKVQDRLE DERCSR
Sbjct: 574  NIFEDHVSELKEKGRVKALEDRKRNIIEYKRFLESCNFIKPNSQWRKVQDRLEVDERCSR 633

Query: 741  LEKIDRLEIFQDYIRFLEKGEEEQKKLQKEQLRRAERKNRDEFRKVLEADVAAGVLTAKS 800
            LEKID+LEIFQ+Y+R LE+ EEE+KK+QKE+L++ ERK+RDEF  +L+  +A G LTAK+
Sbjct: 634  LEKIDQLEIFQEYLRDLEREEEEKKKIQKEELKKVERKHRDEFHGLLDEHIATGELTAKT 693

Query: 801  IWRDYCAKVKESPAYLAVARNISGSTPKDLFEDAVDELENQYHEDKSRIKDVMKLSKMTM 860
            IWRDY  KVK+ P Y A+A N SG+TPKDLFEDAV++L+ + HE KS+IKDV+KL K+ +
Sbjct: 694  IWRDYLMKVKDLPVYSAIASNSSGATPKDLFEDAVEDLKKRDHELKSQIKDVLKLRKVNL 753

Query: 861  TSTWTFDEFKESVSPDLGSPPISDINFKLVFEDQLERIKEKEEKEAKKRQHLVDDFTDLL 920
            ++  TFDEFK S+S D+G P I D+  KLVF+D LER KEKEEKEA+K+    +   D+L
Sbjct: 754  SAGSTFDEFKVSISEDIGFPLIPDVRLKLVFDDLLERAKEKEEKEARKQTRQTEKLVDML 813

Query: 921  RSLKDITASSTWEDSRQLFEDSEEYREIGNETLAMETFKDCVVYLQEKAKE-KERKREDE 980
            RS KDITASS+WE+ + L E SE+   IG+E+     F+D V  L+E++   K+ K+  E
Sbjct: 814  RSFKDITASSSWEELKHLVEGSEKCSTIGDESFRKRCFEDYVSLLKEQSNRIKQNKKVPE 873

Query: 981  KVKKEKEREEKEKRKEKDRKEKDKDRDREKRKERSKKEESDDEAVDATDSHSNKDENRKE 1040
             V     REE +K ++K  +EKD+ R+R          +SDD          N D N + 
Sbjct: 874  DV-----REEHDKGRDKYGREKDRVRER----------DSDDHHKKGAAGKYNHDMN-EP 933

Query: 1041 KEKDRDRKHKKRHHDNTIDDGSSDRDDDKERRHRKRHHDTTDDVSSDKDDKEDH-KKSRR 1100
              K+R R  +  H+                 RHR+RH       +S K++  DH K+S +
Sbjct: 934  HGKERRRSGRDSHN-----------------RHRERH-------TSVKENDTDHFKESHK 990

Query: 1101 HGSDRKKSRKHEHSPDSDGESKHKRHKRDHRDGSRRTGAHDDLEDGELG 1139
             G   KKSR H+    S+ E + K  KR  ++ +R     ++LEDGE G
Sbjct: 994  AGGGHKKSR-HQRGWVSEAEVEGK-EKRRRKEEAREHTKEEELEDGECG 990

BLAST of Spo27314.1 vs. TAIR (Arabidopsis)
Match: AT3G19840.1 (pre-mRNA-processing protein 40C)

HSP 1 Score: 58.9 bits (141), Expect = 2.300e-8
Identity = 186/815 (22.82%), Positives = 316/815 (38.77%), Query Frame = 1

Query: 147 PGQQGQPYLPGSAQQFLAPGQNIPSGHNQPMQFSQPMQFSQPMQ--QLPPRPGMPGVPMS 206
           PG    P L  S   F  PG N  S   +P   + P Q +  +     PP   +PG P  
Sbjct: 97  PGTLAPPGLMTSPPAF--PGSNPFSTTPRPGMSAGPAQMNPGIHPHMYPPYHSLPGTP-- 156

Query: 207 SQGMAMPYGQPNRPMTLG----AQQNQHSAPPFGNHPSGVGGMG--IPFSSSYTF--QPV 266
            QGM +      +P ++G    A    H     G++P  V G+   +P+S S+     P+
Sbjct: 157 -QGMWL------QPPSMGGIPRAPFLSHPTTFPGSYPFPVRGISPNLPYSGSHPLGASPM 216

Query: 267 SHLSTPAASVGGQPWLSSGSQSAAPFVPVPQTADQSSGSAAPVPAVNPPDTSQQSSSDWQ 326
             +    A  G QP +S G ++        +   Q  G+                   W 
Sbjct: 217 GSVGNVHALPGRQPDISPGRKTEELSGIDDRAGSQLVGNRLDA---------------WT 276

Query: 327 EHNTPDGRRYYYNKKTKQSSWEKP-----------VELMTPTERADASTVWKEFTTPEGK 386
            H +  G  YYYN  T QS++EKP           V+ +  +  +   T W   +T +GK
Sbjct: 277 AHKSEAGVLYYYNSVTGQSTYEKPPGFGGEPDKVPVQPIPVSMESLPGTDWALVSTNDGK 336

Query: 387 KYYFNKVTKESKWTIPEDL----KLAREQAAKAVSQGAQLDAGMKSHPTTTGGFTSEVAP 446
           KYY+N  TK S W IP ++    K   E+A ++V+     D   K         TS  AP
Sbjct: 337 KYYYNNKTKVSSWQIPAEVKDFGKKLEERAMESVASVPSADLTEKG-----SDLTSLSAP 396

Query: 447 STNPVSGSSNGSSSVSGASSIPLSFPSGVNPAPVVNDASSELLMEHSAAPTSMAVT-NAL 506
           +       SNG    +   +       G +   +V     +  M  S+  TS A +    
Sbjct: 397 AI------SNGGRDAASLKTTNF----GSSALDLVKKKLHDSGMPVSSTITSEANSGKTT 456

Query: 507 AMTPLSASISGDDALPAALNASSITVNASDKLPSQEISTSTDGGSAHDLEELRKGITAGK 566
            +TP   S +    +  A  A +++ ++SD         S D  S    EE  K     K
Sbjct: 457 EVTPSGESGNSTGKVKDAPGAGALSDSSSD---------SEDEDSGPSKEECSKQF---K 516

Query: 567 VSLSEKPTNDEPLVFANKQEAKGAFKSLLESANVHSDWNWDQAMRVIVNDKRYGALKTLG 626
             L E+     P     K+  K  F    ++   HS                        
Sbjct: 517 EMLKER--GIAPFSKWEKELPKIIFDPRFKAIPSHS------------------------ 576

Query: 627 ERKQAFNEYLGQRRKQEAEERRLRQKKAKEEFTKMLEE--------SEVLASSMKWSKAV 686
            R+  F +Y+  R ++E  E+R   K A E F ++L++        ++  A   KW    
Sbjct: 577 VRRSLFEQYVKTRAEEERREKRAAHKAAIEGFRQLLDDASTDIDQHTDYRAFKKKWG--- 636

Query: 687 TMFEDDERFKAVEKPKDRQELFDNYLVELQKKEKEKADEEYRRNREDYWTFLESCDFIEA 746
               +D RF+A+E+ K+R+ L +  ++ L++  ++KA E       D+ T L   + I  
Sbjct: 637 ----NDLRFEAIER-KEREGLLNERVLSLKRSAEQKAQEIRAAAASDFKTMLRERE-ISI 696

Query: 747 NSLWRKVQDRLEDDERCSRLEKIDRLEIFQDYIRFLE----------KGEEEQKKLQKEQ 806
           NS W KV+D L ++ R   +   DR   + +YI  L+          K  +E+ KL++ +
Sbjct: 697 NSHWSKVKDSLRNEPRYRSVAHEDREVFYYEYIAELKAAQRGDDHEMKARDEEDKLRERE 756

Query: 807 LRRAERKNRD--------------EFRKVLEADVAAGVLTAKSIWRDYCAKVKESPAYLA 866
               +RK R+              E     +A +   +   ++ W +    ++  P   A
Sbjct: 757 RELRKRKEREVQEVERVRQKIRRKEASSSYQALLVEKIRDPEASWTESKPILERDPQKRA 816

Query: 867 VARNISGSTPKDLFEDAVDEL-ENQYHEDKSRIKDVMKLSKMTM------TSTWTFDEFK 897
              ++  +  + LF D V  L E   H+ K+ + + +     T+      T+  ++   K
Sbjct: 817 SNPDLEPADKEKLFRDHVKSLYERCVHDFKALLAEALSSEAATLQTEDGKTALNSWSTAK 823

The following BLAST results are available for this feature:
BLAST of Spo27314.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match Name                        E-value   Identity  Description
gi|902205333|gb|KNA15150.1|       0.0e+0    100.      hypothetical protein SOVF_1008...
gi|731361861|ref|XP_010692586.1|  0.0e+0    80.0      PREDICTED: pre-mRNA-processing...
gi|731361863|ref|XP_010692587.1|  0.0e+0    80.1      PREDICTED: pre-mRNA-processing...
gi|719971225|ref|XP_010273523.1|  0.0e+0    61.1      PREDICTED: pre-mRNA-processing...
gi|731428902|ref|XP_010664484.1|  7.8e-309  61.2      PREDICTED: pre-mRNA-processing...
BLAST of Spo27314.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniprotKB/TrEMBL)
Total hits: 5
Match Name        E-value   Identity  Description
A0A0K9R6L6_SPIOL  0.0e+0    100.      Uncharacterized protein OS=Spi...
A0A0J8BHY3_BETVU  0.0e+0    80.0      Uncharacterized protein OS=Bet...
A0A061FG70_THECC  8.0e-300  59.6      Pre-mRNA-processing protein 40...
A0A061FFL7_THECC  1.3e-297  59.1      Pre-mRNA-processing protein 40...
B9IBN8_POPTR      1.3e-289  58.7      FF domain-containing family pr...
BLAST of Spo27314.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy SwissProt)
Total hits: 5
Match Name   E-value   Identity  Description
PR40A_ARATH  9.8e-243  52.1      Pre-mRNA-processing protein 40...
PR35B_ARATH  1.1e-145  40.5      Pre-mRNA-processing protein 40...
PR40A_HUMAN  1.3e-61   29.1      Pre-mRNA-processing factor 40 ...
PR40A_MOUSE  5.1e-58   28.6      Pre-mRNA-processing factor 40 ...
PR40B_MOUSE  1.4e-31   29.0      Pre-mRNA-processing factor 40 ...
BLAST of Spo27314.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 3
Match Name   E-value   Identity  Description
AT1G44910.1  5.5e-244  52.1      pre-mRNA-processing protein 40...
AT3G19670.1  9.7e-148  40.5      pre-mRNA-processing protein 40...
AT3G19840.1  2.3e-8    22.8      pre-mRNA-processing protein 40...
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR Term   IPR Description   Source   Source Term    Source Description   Alignment
IPR001202  WW domain         PFAM     PF00397        WW
    coord: 356..381  score: 3.7E-7
    coord: 313..340  score: 2.7
IPR001202  WW domain         SMART    SM00456        ww_5
    coord: 351..383  score: 8.1E-7
    coord: 310..342  score: 2.9
IPR001202  WW domain         PROSITE  PS01159        WW_DOMAIN_1
    coord: 315..340  score:
IPR001202  WW domain         PROFILE  PS50020        WW_DOMAIN_2
    coord: 350..383  score: 12.196
    coord: 309..342  score: 13
IPR001202  WW domain         unknown  SSF51045       WW domain
    coord: 355..388  score: 1.48E-9
    coord: 314..348  score: 6.53
IPR002713  FF domain         GENE3D   1.10.10.440
    coord: 555..614  score: 2.1E-20
    coord: 889..953  score: 1.1E-7
    coord: 759..829  score: 0.001
    coord: 683..749  score: 2.5E-7
    coord: 615..682  score: 3.1
IPR002713  FF domain         PFAM     PF01846        FF
    coord: 627..677  score: 2.5E-12
    coord: 560..609  score: 3.1E-14
    coord: 695..744  score: 2.0E-8
    coord: 905..951  score: 4.
IPR002713  FF domain         SMART    SM00441        FF_2
    coord: 693..747  score: 2.0E-4
    coord: 558..612  score: 3.9E-11
    coord: 767..828  score: 51.0
    coord: 625..680  score: 1.5E-9
    coord: 900..955  score:
IPR002713  FF domain         PROFILE  PS51676        FF
    coord: 558..612  score: 11.885
    coord: 693..747  score: 12.12
    coord: 625..680  score: 12.178
    coord: 890..955  score: 8.162
    coord: 765..828  score: 9
IPR002713  FF domain         unknown  SSF81698       FF domain
    coord: 893..962  score: 1.31E-9
    coord: 619..687  score: 5.1E-15
    coord: 550..612  score: 1.7E-17
    coord: 760..835  score: 5.3E-8
    coord: 685..753  score: 6.93
None       No IPR available  unknown  Coil           Coil
    coord: 749..776  score: -
    coord: 958..1000  score: -
    coord: 615..635  score: -
    coord: 679..699  score: -
    coord: 819..846  score:
None       No IPR available  GENE3D   2.20.70.10
    coord: 314..343  score: 1.4E-15
    coord: 356..385  score: 4.0
None       No IPR available  PANTHER  PTHR11864      PRE-MRNA-PROCESSING PROTEIN PRP40
    coord: 1031..1144  score: 0.0
    coord: 465..1011  score: 0.0
    coord: 143..393  score:
None       No IPR available  PANTHER  PTHR11864:SF0  PRP40 PRE-MRNA PROCESSING FACTOR 40 HOMOLOG A (YEAST)
    coord: 465..1011  score: 0.0
    coord: 143..393  score: 0.0
    coord: 1031..1144  score:

GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0016070      RNA metabolic process
biological_process  GO:0010467      gene expression
cellular_component  GO:0044424      intracellular part
molecular_function  GO:0005515      protein binding