Spo18823.1 (mRNA)

Overview
Name: Spo18823.1
Type: mRNA
Organism: Spinacia oleracea (Spinach)
Description: SPOC domain / transcription elongation factor S-II protein
Location: SpoScf_01943 : 66528 .. 82059 (+)
Sequence length: 4510
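
As a rough cross-check of the Location field, the genomic span can be computed from its coordinates. The sketch below assumes the coordinates are 1-based and inclusive (a common annotation convention, but not stated on this page); the function name `span_length` is illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


genomic_span = span_length(66528, 82059)  # coordinates from the Location field
mrna_length = 4510                        # "Sequence length" from the overview

print(genomic_span)
print(genomic_span - mrna_length)  # bases in the span that are not part of the mRNA
```

Under that assumption the genomic span is much longer than the 4510-base mRNA, which is consistent with the gene sequence below including introns.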
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTGTCTTCGCTTTCTGTTCCTCCTCACTTCTCAACTCACTCACTCACCACTTCAACAGTTCAACTCAAACCCCTCTCTCACCCTCTCCAAAAATAAAACCCAGAAATTATCAGTGATTTCTCTGCTACATGCAATTCTGAATTTCGCCGTTTTTTTCATGAGTTTTGCGAATTCAATTCCTACACATTCTCAAGGTTTGTTCTTCTCTGTGTATCGTGCAATTTTTAATGCATTTTTTTGGTTGATTAATTATATTTTTGAAGGCTGGAATTAGGGTTTTGTTTTAATTGGTTGTTAATTTTCTGCGGAATTGTGCTTGTGTTAAAGATTTCTCCTTTATTTGAAGATTTGAGCAACATTGTTTTTTCTTTGAATGGAATTGGAGGTTTTAATTTGAAATTAATCTTATTTCGGCATGGATTTCTTTGTTATTAACCTAATTGCTGGAAATTGTAATTAAGTTGAAGTTTCGTAGGGTTTTACGGAAAATTAATTGGAATGATGGGTAATTGGGGGGAAAGTATATATGCGAGGTCTTTGGTTAATTTACTTGCATTTTCTTTTCAATACAAAGCTGCAATTGCCGAGAATAATCGTCTGTACTTGAGGAGGTTATTTTTCGTTCACTGAAAGCAGTGTGCTTGTGTATCATTTGAGGTTACCAGTGTGTTGAATTTTCTTTTTGGTTTCGGGACTTTTGGCTTTTCTTGTAGTAATTGTCCTTGTATTCACGATTGCATTATTGGGTTGCTGTGTTGAACTTTTTATGGGAATGCAGGGTTGTGGGAAAAGTGCGGTTAAAATTGAATCCGTATCAAGGAGCTGATAAAATAATAGGATGCTAATAGAGATACAGAAGTTATGATACTAGGTTTGCAGGAGTGTTCGTGTGTCGGGGGGTTGGAAAAGTGAAAAGGGTACTTCAAGAATTGCCGTTGGGTTATAGTCTCAACTTATAGACTAATTTACATCGCTTGTGAACTTAGGGTTAGAGTCAGGGTCTTTGGTATTCCCGACTTATAGTGTTTGGCCAAGGCCTTACTAGTGGTGTTTTATGATTTAATACTGGAATGGAATCTGGTGGAGGTTTGCAGGAGTGTTCGTGTGTCGGGGGGTTGGAAAAGTGAAAAGGGTACTTCAAGAATTGCAGTTGGGTTATAGTCTCAACTTATAGACTAATTTACATCGCTTGTGAACTTAGGGTTAGAGTCAGGGTCTTTGGTATTCCCGACTTATAGTGTTTGGCCAAGGCCTTACTAGTGGTGTTTTATGATTTAATACTGGAATGGAATCTGGTGGAGGTTTTTTTCATTTCCTTTATGATGTCCGTTGCTGAGAGCCTATGACTTCAAAATTGAGAGTTCATGCGCGAGAAGTGTCTTTAGTGTTGAGTGTGAACCTTAGTGAAATTTGTTAATTATTAGATATTGACTGATAAGATGTTTTTTGGATCCCCTGGTTGTTTTAAGTTTCAAAATTCATCGTTTTAGAACTTGTTTAGAACGAAATAGCTAGTAGGAAAAAACAACCAATGGATTTACCTCGAGTGATTACGATGGCTTCTCAAGATACTTATCAGACCACGCTTTTGTGGCTTTGCAATAACACCTTGAGGAGACCATCTTTTTCTTCGTCCTTTTCTTATTTCCTCTTTCATCGGATCTTGTGTATGATTTGCACTATTGCGTTTATTTTTTCAAAGGCATATTTAAATCTCTGTAAATTTTACTGATTGTCTGGGCTTGGTGTTTCAATGTCTTAGCTGTTAGTGCCCCTATGGATATTACTGCCTATCTATATAATATTTGTATGTGTACTGCATTGTAAATTATGTTGTGCTGTGAATGGTTTGATTTTCAGAAGGTTGGTGACATTGGCATTTTCCACAATTGGGTAATTTTTGCATGGTGGATATGAAGTTTTTAAAAGGAACCAGCCTGTAGACTAGCGAGAATCATATCGCAAGCACATAAAATGCAACCACCAGGCCCCCCGCGGCC
ATTCCGAGGAGCCCAATATGACAAAGACATGTTTACCTTTCTGCCTGCACGTTTTAACAGGGATGTTTTAGATGAGTCACCAAAAAAAAGAAAATTAACTTCCTCTGATCACCAAATCACCGATCCTGCTCCAAATGTGAGGCAAGGCATTTCTGGAGCTTCGGACTCCAAGCTCAATGATGGCAGCACACAACGTAAAGAATTTCAAAATTCTACCCCTTCTTCGGGACCAAGGAAATCATCAAAGGGTTATTCTGATCATAGTAAATACTGGGACCAGAATGATTATGCGGAAAAATGGCATCGTAATAAGAGGGGTTTTCATGAATCAGGATCTGAAAATACCGGATATACTTCTTTCAGTGGGCCGTTGCAATCTCCAAATCACAAAGTTTCCTCACCTTTATTTCCTCTCCCAGAGGCCCGTACTCATAAAACATCGAATTTGGCACCTTCAGCGACAGAAAACTTCAATCAACATGATAATTATGAATCAAATTTGCATGTAGCAACTCACAACGAACCAAATTGGCCCCCTAAGGTGACTGAAGACTTTAGCCGGTGTGAGAAACATAGATCAAAAGCACATGTAACAACTCATATTGCATTGGATTGGCAACCTCCATTGGTTCAAGACTCTTACCGGCGTGAGACTCGAGAATCAAATGCAATTGTAGTGACTCAAAATACATCAAATTGGGCGCCTTCGATGGCTCGAGATCTTCCACAGCATGGGAATTATGAGCCAATTGCACATGTAGCTGCTCATGAAAGATCAAATGGGATTCCACCAGAGGCTCAAGACCTCAAACACCATGAGAACTGTGATGCAAATGCACATGTGGTAAGGGTAGCACAATTTCCTCATCCCCAAGAGTCTCGGAGCTATGCACATCAAAGGCTTGTTCAGCCGAGGGCTATGGCATCCAATTGTTATGAGCAAGGGCCCAAAATTTTGGGGGAATCACAAAGATTGCAACAAAATGTTTTCGAGCCAGGACCTGTTATCGGGGGTTTAGATGAAAAATGCCGTGAAGAAACCTACAAGATTGACTGTCCTTCTAATGTCAAAGCACAAGGTGATGCTGGTGGGTTTGCCGGATTTGACAGTGCTGAAACGTACAAGGACAGGTCTTTCTCGAATGTCAAGCAAAGTGATACTGACAGTTTTACTGCATTTGACAGTAACAAAGCGCCTGGGAACAATGTCAGTACATCTGTATACAACTTACAGAGTGAAACAAATAATCTCTTAAATGTTTTGAAACTTTTGAGTGCTGCTTCCGGTGCTGGTTTAGCTGATAATAATAATGATGGTAATGCTCAAGGTTCTGGGCATGATGTTAAAATGATGAAATCACCACCAGGTGCTATTGTTCAGAATCTAAACCATCCTCCAGCAGTTACAAATCTGTCTAAAAACCACGAAGATGGTCTTATTCTTAACAACACTTCTGCAAATGGTGAAAAAGGTTATGCTAAATTAGATAATGAATGCAAGTCTGAAAATGGACCTTCAGCATTTGCTGAGAAGTTGTGGGATGGCACCCTTCAACTGAGTACCTCTGTTTCAGCGTCTGTCATTGCCTTCTTCAAAAGGTTATCATATACTTTTGATTATTCCAACTTATTGTGGTTTGAGTTGTTTGAGATGTTTAATGCTTCTATTCTAAGATATAGCAAAAGACCATTGTTCTTATGTTCTGCTATTTCCAATAAGTTTGAAAAAATGACGAACCTTGATAGAAATTGGAATCAGCAAAAATTGTATTGAAGTAGCGAATTCTCTGAGTATTGCCTGGTTTAGGCGAATGTGGTGTAAGTTTAGCTGATTTTTTGTGTGTTGATTAAGAAGAACGACATTAACTGAATTGTATTGTTAAGTCCCTGCGTTTATGCTGGTTTTGTTGCTTACTTCTTTTGTTAATAATTGTGGTGTATACCGGTGTTTAAAAAAGCGCGCTGAAAAGCTCTTCGGGCAAAGGGCTTAATTAGGTCTG
AGCGAAGCATTTGTGGTTAAGCGCACTTTAGCACGCAAAGCGCAAAAGGCGCGCTTTTTGGCGCTTCATTGATTTATTTTAATTATAAATTACTTTACGTTGAAAGCTTTTTTTATATGTTGTTAATGTGTTTTGGACCACACTCTATTAATCTTTAACAATAGAGAGTGGGCCATGAGTCCAAAAGAAAAGGAAAAAGAAAAAGAGAAGAAGACAACAGTCACAACAGTCGGCCCTAGCTTCTTGATAAACACTATGTTATTATTGTTATATGCTTTATTTTTGTTAAAACAAAGAGTAGTATATTTTTGGAAGCAAATCTGATGTTTATATATTTTTGTGGTTATTTTAGGAAGATATATTTGCATAATAGTAACAAAACCTTTAATTCTTAAAAAGTGCGTTTCACTTTGATGAAGCGCGCGCCTTGCGCTTTGCGCTTAGGCTCTAGGACTCATATCGCTTTGGTGCGCTTTGTACTTTTTTATACACTGGTGTATACTTCTGGGATGCACTTTATGCCTTTGCTTCATCTGGAATACATCCGTGATTCATATTGTTAATTGATTGGTCATTGGTGTTGTGTGCTCTCTCCACTTTGCTTCTTTCTGAACTTTTTTGTTAATCTGTTTCTTTATTTGTGCTTTTGGGAGGATATAGATTGTAAATTTATTGTTGTGTACTTGCATCCTTTCGTTGCTCTCTTTTTTGTTTTTAATAGAGTTATGCTTTTATTGCGAGAAACAGACTGCTGCTCTCGCTTTTAGCTGTTCCTGTGATGAGTCGGATGATCATTTTGTTCTAATTGATATTAGAGGGCTCGACTTAGGGCAAGTCTGATTAAGATGAGGTGTCAGTAAATTTAACTGTTAATATTTGTTGATTTTAAATTGTAGCGGAGAGAAGATGCTGGACTTTAGCTGGTCTGAATCTGTGGAAGTTAGAGGAAAAGTAAGATTGGAGGCTTTTGAGAAGTATGTGCAAGATCTTCCCCGTTCCCGCTCTCGTGGACTCATGGTAAGACGAGTTACCTATCACTGTAGTCATATGTCTTTGCTTGTTAGCTGTAGAACATACTTACCTATATTAGATTGTAGATACTCTATTCCTGTTTTTGTCATGAGTTTGAGTGAAAATATGTTCTTTGCCTATCTTCCATGCCAGATTCACACAGTTATCCATATTTGATGGCATACACATGCAGACGTGCGGACAGCCTGAAGCTCCAGCTTCTATGTCTACATCTGTGAAATATGAAATAGATACTTAGCTTTTAGCATGACGTACATTAATCGTCTTAGGTCATTAATGATGTCACATGCTGTCATCGATGTTTGGCCTATAATTCCATATCATATTTAAGGGTGGTGGCTGCACGGTGTGGTGATCTGAAAAATATCTTGGTTCCTATGCCCCCCGCCCCCCATCCCCCAACATTTTTAGCCAGATGCATCTCCATGGCTCCATCCACCTTTTAACTACTTTTCTAAACACATTACTACCTAACCCACTAATTTGCTACCCCAAACCCACTAATTTGCTACACCAAACAGCCATTGGGATACTTACATTGTGTTTATATGAGAGAATTTGAAGGAAAGGGAAGGGTAGGGAGAGTGATGGATTAGCTTATCTCTATTTAGATTAAATTGGGGAAGGCAATGGAAGGATATAGCTTCCATTCTCTTTTAAGAAACTGATTCCTCCCATAACACAATACCATTCTCCTGTCTCCTCCCTCCCTTTCCTCTACCTACGCACACTCTTGAACTGAAGATGTGAGTTTAAGTAGCATGGGCTTGGCACAAGCACTCTTGAACTCGCTATGCGTGTCTGCTGTTTTACACATAAAATGTATTTTAACTTTCCATGGATAGGGATTTGACTGTGCTGCAAAATTCTAGGGCAAAGGAGGATAATGTGAGGTTCTGGTAAGCACAAAGACCACATTTGAGGAAACAAAAACTAAAGAATAAAATAACTCACTACAGAACAGGGGG
CGGGTAAGATCTGCATCAATGGCTTCATTTGCTTGTTTGATAGTTTAGGAAGACCGAGAGGGGCGAGAAGAACTGAAGAAAGAAAATGAAATACAATTTATGTTTAAGTAGTCACTCAGGATGGTGAGAAGGGTACAGACTTTGGTGAGAACAAGTCAAGTGGGATGAGTATTTAGTGGGATAGGGATTAGGGAGACAATGAAAAAGGTAACTCTAAAATTCTTGTTGCCTTTGGTCTTTTGGCCTCGTTTATTTTGGATGAGTGACCATAGGGAAATTCATCTTATATGGAACATGAGCAAAATACTCAGTTCTCTGTAACTTTTCATGTTTAGGGAAAGAAAAAGAATTAGGACGTCAAGGTTTTACCATGATGTTTCTCATTACTTTGTAATATTTTAACCAACACTATTGTAGTTTAAATGCTCTTGCATATTATTAAACTGATTCTAGGTCACACAAAAGGGAAAATATGATGAATATTAGAGGATTGAAGGAGTATGATGTATGATTTGCTAACTCCCGTGCAATTTCAATACTCCACTGTGTACTATCTTCCCAAGTTCCCACGCATCACGTGTTAAGCTGAGTCGATGACCTACTTAAAAATAAGAAACATTTTTGGATTTGAATAAAAAGCCACGGCTGCTTCCGCAGGCTGATGGCCTTCCGTTCCATATTGTTTATATTGCACAAAGTACCTATTTTGTAGTTTGCAGGTGTGATGTTGTCTTTTTTGCTTGTCCCTTTGTGTCATACTTCGTCTTATAATTGTATAATCATAATATACACACCATTGTCTCAAATTTCGCTGCCCCAGAGAGTCAGGTTTGTCATATTACCTTTGCAAGTACAGAATATTGTTTGCATTAGATCCTTGATATGAAAGTTTATAGCTCTCTGCAACTTCACTAAAGTAAGCTCATGAATTTTATTCAACCTTTTGATGTACTCTATAATCAAATCGCTGATTGTAGTAAATAATTCTTTCTAACAAAGCTATCTTTTGAAATTAATTTTTAGGATAAATTGTTTTTTACCAACTAAAAAGTAAAAATTGTATTTTACCACCTAAAAAAATTTAAAACTTGTTTTTTACCACCTGAAAACGGTATGAAACTTTTATGTGGTGCACACTAATTCAATATCCCTCCAACTTTTTACATTCCATGTGTGATTTTCTTTAACTCCTTCACTCTCCTCTCTTTATCGTCATTGAATTTTGTCAATTTTGCCCAAAGTGAGCCATATTGAGACCGTGGCTGGTCAACCGCTAGGCTATTTGTTTTGTCCTCCTGAAATCGTGGAAGAAGAGAACATATCAGAATATTACCGTTACTGTAGCCTCCAACATAACATTTTCATCTATTTTTCCATTTTTCAGGTGGTAAAAAACAAGTTAAATTTTTTTTTTTAGGTGGTAAAATACAAGTTTTAGTTTTTTAGGTGGTAAAAAACAATTTTTTGATTTATTTAGGTGGTAAAAAAGAATTTTACGCATTTCATGTTTCATTGAAATTTTATTTCATGTTCTAGGACTTTGAGAAGTGTAGTTGGTCAATTGGTGTACGTACCAATGGAGCCTTAGAATAAGGAACTATGGAATATTGAGCTTAACCACTGCATGGTCAGCCGTTTTCTCGTGTTTTCTGAATAAAGGTCTATTAACCACATAATGCTTAATTTTGGCCTTTAGAAGGGTCCCTTTATGTTGAAACTTGCAAGGAGTGTTGCTTTGTCATTTCTGCTGAAGGAGTAATATCTTTACTAAGTAGCGTGTTCATTTTTTTCCTAGTTGATGATGGTACTTGCCGGTGATCAAACTTCCCACTTAAAGACCACAGTGCCCTCATCTTGAACAAAGCCCTTTACTAGGCTTCACCTTTTCCATCTTAGATTTGGTAGCATCGCAGGAGTTGACTCCTAGCTTATTATTAGGAGCCTCGACATGTGGACTTGTGAATTTTTCTCTAGTTTAAAGGAGATTCTTCTACAAAAGC
GCAAAGAGTCATCATTATCCTTCAATTATGAATTAGGCTTTTTATAAACGTCTCCAACTTGGAAAAGAAAAGAAACTTATTCCTGCTTCTTATATGAGCTAAATCACCCAAATAATCTCTTGTTTCTTTCACTATCTGATGGTTTACAAAAACAAAGCAATATCAATATTCTGTCCTGAGCAAGGTTTGGTAGGTATGAAAGATGTACACAACATCCGAAGAGAGATTGCTGAGAAAAACCCACATTGGGCTAACAATTGGAAAGCATAATATATGGTTTAAAAAGTGGTATACTTGTCAACAATGAAGGAAAAGGTGCTAATAAGGAACTGATTATTTTATAGGCATTGATTAGGTACTGTCAAAAGTGTCGAATGTCTCTTACTGCAGTCCTCAGGAAATGCCTGTTCCAATTTCTGTTAAGGTAATGACCCTTTTCCCAAAAGGAAATGCCTCTATCCAGCTTCCGTTGGGATAAACCTTTTTTCTTTTTCCGGCAAGGGTTAGAGACTTAGAGGGGAAAACTCTACAGATTATGTGCAGAAATTTGTCAGTAACTTTGTTTGGATCAACAGGAAGGAGACTGCAAAAATCCATCTTCTGTTGGGTAATAGCTATGCTTTTACTGTGTTGGCTTGTTGTTTATGTGCGAGTTTTGGGTAAGAATATTTCTTGGGTATGATCAAAAGTATTGACATTTTAGAAGAAAAATGCATCGTCTTAGTAGGACATTTTGAGCATTGGCTGTTCAGGGTCCTTTAAGGTTTAAGCTTTTGTCTGCTGCCACCAGAATTAGGGGTGGCAAAAACTTACCCATAATTTTACAAACTTACACAAACCTATCATAATTTTACAAAGTTAAATGATTTGGCTTTGACATGAAGCTGAGTGCTTTATGAGTCATACTGACTGATATATTATTATAAGGGGCCTTTTGGTTAAATAAGCTGGAATAGATTGGTATTGAAACTTATACCGGAAGGTATGAATTCAAAACCTCTTACCAACATGAATTGTTTGGTACAATGTTGGACTGGATTCTATATATACATTCTTAATGTTTGGTTTTCTCTTGGAATGAATTCTCATTTTATATTTGTTTAACATTATTAGTTTTAAATAATGCTCCGAAATACAGGAGTAACCATTTTATTTATTTATTTTTCTAGAAGATAATTACCCATTACCTTATTTCTTAATTATACTACCCTGTACTTCGCTTCACATTTTTTTGACTCCAAAAATTCAAAGCTCAAAAGTAGAATGAAAAAGAACAAAAAACAAAGAAAATCAAACCTGAAGCATAAACCGAAATTGCAGAAGAAAGATGGGAGCAAAATTACCCAAATCAAAATACAATTTCGTCACCATATTTATTAAATTACTTAAATGTAATTTTCCAAGAATGAACACATTTGTGTCAATCATTTTTGCAATTAAGCTTAACTGCTTGTTTGTTCTATGAGAAGTTGAGAACTATCGTGCTGGAGAGGAAGTAGGAAGCTCCGGCTTGTGATGAGTCGTCGACTCACCGTATTCCGAAGTGGTGGGCGTTGTTCACCCTTGCACTTCTCTCTCCTCTCTCCTCTCTCTTATGTGTATTAGAGTTTGTACCTATGGTGTAGGGTGGGTATGAGATTCTAGGGGGATGAGGTGGAATTAAAACCCCTTTGGAATGAAATTTCGTACTTTAAAATGGATGGATTCTAACTCCATTCCAAAATGCCCTTCCAAACACCCTCTAAGCTAAGCTAATTGACAGTTGATTAACGTATGACCCAACTTGTAGTCCCTGGTCGAAACTAACCCTCAACTGAATTGAATGGCTTATGATTTGCACCGAACAACTAGTCTTCCACCTCTACACCAGCTGCTTCATAGTTGTTTGATGGTTTAATGTCTGGTAAATTTGCGACTGTCGCTTCTACAGGTTCTTATTTACTCGCGTCATGATTGCCAAAACTATGTCTGTTACTTTAAACAGGAAGTGTAGCTTGCGT
GCTTCATGTTTTTTTTATGAAAACAGCATAAGAAACGCCATAACTTAGCTGACTTCTTTGCTACCTGCTTCTGCTACACATTTTTAACTGTAAGATTAAAAATTTGTTTTACTATTTACGTTGTTCAAGTTTCCCAACTTCAGATTAAAATTCAGAGTAAAAACTTACTGTACTAAATGTATGGTATTGGTAGCTTAGTTCTAGTAGTACTTCCTTTGGTATATATTAAGTTAGCTATGGAACAGAGAGAATAGAGTACAAAAGACTAGAATGGAGAACCTGCAGTTTCTGATGTCACAAATGGTAGCATCCTTACTGGGAAAAAATTCCGAAGGAAGGAAAAGATACTGAGTACATACTTCTTTGCGACATTTAGTTTGTGTGGGTGATTGATGGGATTTTCTTGTATGTAGGCTCTGTTTGGTTCAATTGAATTTGATTGAAATTAAAGCTGAACTTATCTAAACTGATTTTTGGTGAAAATAACTTATCTGTGAGTGTAAATTGTCTGGACTTAACTTGGAACCTATATGAACTTATCTGAATTTATTGTTGCTAAAATAAGTCAAAATAAGTTAAAACTGGAATCATAATACGTAGACTTCTTCGAACTTGTTGGAATTAGTATGCACCTTTTGAGGGAATTGAAGGATGATGTTAGGGGTCATAACAAAATGGGAGAGTTCAAAAGTTGCTTAAGATTGAGTCGGTCAAAACTTGATTTAACACTTAACTTGTCATAAAGAAGAATTAACAAAGGCGTGTTCTCTTCAATCAATAATAATATTCACAAGTAATAAGTTTCAATAAGTGAACAGAACACGGTCTTGAAAAGATCAGTCTATGGATACGAAAAATGGCCATAACATATCGAATCAAACACTTAAATTAACCGATGATGATCCAAACTGTTCGTCTACAGCATAAAAGCAAGTCTATTGAGCAATACGAGCTCTTATGCCAAGAACAGAGTAGCTTTCGTTAAAGTTGTTATCATAATTATACCAATCGAAGAAAGCATATGCCAACTATTTGTAATTGTGAAATCTAATTACAAGTAACAACAACTCAAATCCGACTAAATTTATACTCGATTGGAGCTCATTGAGTTCCATTTGGCAACCCGCATGGCCGCATGTAGACTTGGTTATTACACAGGGTAGCCCATTGGGTAATATAAAAGGACTATTATTGGATAGGATTATTAGTGGATAGGGTTGATACAACCTAATAATTTAGTAAAAATAAATGTATTTATAACTAATTACGTGTTGGTTTAGTGGGATTGGGGCTGAACTTGGTAGGGAGGACTCGTGTTCGATCCCCCGCAACAACAATTGGGAGGGGACTGAACGTATCCACCCAGAACTCGTCCCGAATCCGCAATCTCCTTAAGGGTGAACCGGGTGCTAACACCAAAAAAAAAATGTATTTATTTACATGGACTATTTCTTGGCTCTCTTTTCCATCATTTTTTCTTGATAATTTACTGGTCTATCCACTATGACCCGCTATTGACGCATCGACCCGCTATTGACCCGAACCGCTTTGTTGCTTTTAAAAGCAACCCTCTATTAACCCGACTCGCTAATGACCCGTACCGACCCTGCCCTGCCCGTTAACTGGGTCTACTAGCATTTACTTATTTAGATGCTCAAAGTATTGACGGAAACAGTTATCTTTACCAATCATTTGAAATTAGCATCTCATTAACCAAACAGCTTAGATGCTCAAAACATGCTAAATGTTGCCTAATCTTTACGTTTATCGACCAAGATACATGAAATAGCATAAATCTAACGCATAAACGATTTGTGGTTTAGGATTTCTCCCAAAACTTTGCCGCGTTTGGCAGACTGCACCAATGTGCTATGTTATTGGCATACTTGATGTTCTTCACCTATCTTGAATGAAATGGTCATCTTTGCACGTTTTCATTCTTTTGAACAAGTTATGGGATAATTGAACTTAAAAATATTCGATGTTGCTCACGAGCTTCG
TGTTGAATAAAACTCACGTAACAAACAAGCCGAAACTAGAATGTTGACCTTGCCTCGTTTTCGTATTGGTAGAGCAAAGTTGAGCTTCATCGAGCCTGACTTGCATTGTTTGTGAGCAGCTTGGTTGTATCTAGGGATAACAACTTTTAAAAAAAAGGTTTTAAATTGTAAACTATTCTTTTCTCTGCTCAGGCAGACTTCATACTACAGAACTTCTGAAATCGAAAAAATAGGTGAAATTTCAGGTTTCCCTGTCTGGCAGAAGATTGGTTTTTGACCGTGTTAACTAAAACAAATATAAATGGAAATAGAAACAACCCATTAAATAAAAAATCAAAAAACAAAGCAGTCTTTCTTATGCAGATTGAACAAAGTTTTACGTCCATAGGTTGTCTTATGCAGTCTTTCTTCTATTCTCGGGACCAAAGTCGAAAAAATGACACTTGGACATGCACTTAGAGTAACGTCTAACACGAGTACACAAGTCCGAGGGACACCATGAAGAAACGGATAGGGAGACAAACTTATCCTACCACTAGGGTTGTAGAGGTCTGACAATCTGAGCTTCTGAACTTACATAAACTAATTTGCACTGAAAGAGTGCTGGTATCCATTTCCCTTCTTGATTCAGCATAATCAACATATAATGGGTAGTATTGTTAGGCTGCTAGCAGTTGCGTAGGTTTTGAATCTGGTCGTTCATAACGACACTCCTTGTGTGCCCGTTTATCTGTTCAAAATATCTTTTTCTTTCTTCCTTTGAGAGTTAATATGCATTGATAAGAAAACTCTAGGCTGCCAACATATACTTAACTCCTTGTATATAATGTGACTATGTGCCCTCTGCCCTTCCCTCGGCCTTGTGGATGCTTAACACTCGTTGGTCATATTGTATGCAACAAGAAACTGGAACTTCACTTACATCATGTGCAAGTTTTCGGTTCAGAGGTTTTTTTGTTTCTTTAAGGCTTAAAAGATCTGAAATTTTGAATTTAGGTGATGTCATTTTGCTGGAAGGAAGGTTCATCTGATGCCGGACATAAAGGCATGAAGGAGGTAACAATTTTTGAGTTAGATTTCTAAAGTTAAATTCCCCAAGCTACTAGAGAATTTCTCAGTTTCTCTTTTGGTAACAGGTTGCAAGGAAATATAAAGAAGGAAAGAGAGTTGGGTTTGCAAAGTTGTCTACTGATATTGATCTGTACATTTGTCCTCATAGTGATGCAATAATCACTATTCTTGCAAAACACGGCTTCTTTAAGGGAATGACCGCTGTCAAAGACAAATCGGAGCTGCTTATTGGATGTGCTGTGTGGAAAAAATACTCGTCTTTTGCTCCTGCCAAAAAAACAGCAGAGGTTAACGCCCCACAATTAGGTCAGAATCCGATGGAAGCAGCAAAATCTGATATAGGTCAGGCTCCGACTGAAGGTGTACAAAGTGGCAGAATAGACAAAAGTCCGGAGCTTTTTCCCTCACCTCAAGTTGAGCATGCTAAGCAGTCTATGGTCACTCAAAATAATTTGACCTCCGGTAGTGGTGCCTCTAGCCTGCTATCAGAGATTAAGACTGATGTGGTACTACATAAGCCTATTTTACCCATTCCTTCGGTTTCCTCAAAACAATCTGCAGATTTTTCTGATGATGATGATCTACCAGAATATGATTTTAGAGCTGCAGTTTCTCAGGCTTCAGTGACCAATACTTCAGCTGTTGCGACACCCATTTCTGCTGCAATGAAGTCAGAGGGTTCGACTGCTGCTTCAATACCTGTGCCAAATTCTAATTTTCAACAATGTGTTGAGCAATATTCTGAAGCACAAAAACCAAGCTTCCCAACTCAAGAACAGAACCAAATCGGGCATTTTCAAGGACCGGCCTCGCAAAACACAGGGCAGCAACATTTCCATTTTCAAGGACTAGTACCACAGAGCGGGAAGCAATATTCTGAAGCTCATCAACCAAGCTTCCCAAGACAGGACCAGAACCTAAGTGGGCAT
TTTCAAGGACCAGGCACTCAAAACTTGGATGAGCAGTCTCTGGTGAAAGTCTCAAGTTTGCCTACACAGGAGCAGAAAAAAATAGAGAATAGTTCGAAGCCAAGAAACCTTTTCGATGATGATGATATGCCTGAATGGTGCCCACCAGATTTCTTCAAGTCAAGAGAAGAGGCCACTACAAAGGCAGAAGCCTTGTTGTCTTCCAAAGTTTCTGGGATTGCGAATTTAACTTCTCCACCTCCACCTCCACCTCCACCGCCTCCCCCATTGACCCAGATGCTGCATCCTTCTAAGCCAGTTGCCAATGAAAACCCAGCCGGTACATCATCCCAGATGCAGTTCAAAGAGAGAGATTACGCTTGTACCCATCATCATGTACCCCCACCTCCACCTCCACCAGTTGGTAATCACCCATCTACCAATAATTTCATGCATCATAATAATAATAATCCAGCTGCTATGTCATCCCAGATGCAGTTGAATGAGAGGGAGTACTGTTCTTTCCCTAATGTTCCCCCACCTCCTCCTCCAACCAATAATTTCATGCAGCACAGTCCAGCTTCGACACGTGTGCAATTTGGAGACAATAATCCAGCTGTTTTTTCATCCCAGATGCAGTTCAATGAGAGAGAGTACCGTTCTGTTCCCCCAGCTCCACCTCCAGCTCCACCTCCACCTCCTCCAGTTGGTAATCATAGTTCAGCTGCGATGCAATTCGGAGACAGAAAACCAGCTGCTATATCGTCTCAGATGCAGTTCAATGAGAGAGAGTATCATTTTGCCCCCCACATGCCCCCACCTCCTCCTCCTGCTAATCTCATGCATCATAGTCAGATGCAATTTGGAGACACAAATCCTGCTTCTATACCCTCCCAGATGCATGTTCCCCCACCTCCAGTTAATACTGATAATTTCATGCCTCATAATCCAGCTGCTCGATCAACCCAGATGCAATTTGGAGACACTAATTACTCTAATGCCCCTCATCCTCCCACATCATACACTGACAACCCAATGAGTGATAGGGTCATCCATCAAAATCCAGCATATCCTCCCGGTTTCACACCAAATCCTGCTTTTCGGCCTTGTTTTGATCAGCCTGTCTGTCCTACTCGGAGTACAAGGCCATGACTTTTGGGGCTTTAGCTTTCTAGGATGATACAAGGTCATTATAGACTGCCGCTAGTTTCAACTTGTGTAAATAAGATAGAAATAGTTTTCCAGTTAATCTCTATACTTTGAAAGTCTAGTCTTCCATGCTAGTTTCCTGACATCAGTTCTATTCTGAATTCCAATGTCTTATGGATGCAAAGAAAATCCATGTCTGAATTATACAGAATTCACAGCATAACCCAAAGTGGGAGAAATTATACATAATTTCATGTGAAAAAAAAATTCTGTGAACTGATGGAAATTAACATTCATTAATTCATGTAAAATAATCTCGGACGGCAAAGTTCCTTGGGTAGAGAGGAAGGGCAAGACTTCAGTTTTCTATAATGTGCACAAGTTGAGCATTCATTATTGCTGTG

mRNA sequence

CTGTCTTCGCTTTCTGTTCCTCCTCACTTCTCAACTCACTCACTCACCACTTCAACAGTTCAACTCAAACCCCTCTCTCACCCTCTCCAAAAATAAAACCCAGAAATTATCAGTGATTTCTCTGCTACATGCAATTCTGAATTTCGCCGTTTTTTTCATGAGTTTTGCGAATTCAATTCCTACACATTCTCAAGAAGGTTGGTGACATTGGCATTTTCCACAATTGGGTAATTTTTGCATGGTGGATATGAAGTTTTTAAAAGGAACCAGCCTGTAGACTAGCGAGAATCATATCGCAAGCACATAAAATGCAACCACCAGGCCCCCCGCGGCCATTCCGAGGAGCCCAATATGACAAAGACATGTTTACCTTTCTGCCTGCACGTTTTAACAGGGATGTTTTAGATGAGTCACCAAAAAAAAGAAAATTAACTTCCTCTGATCACCAAATCACCGATCCTGCTCCAAATGTGAGGCAAGGCATTTCTGGAGCTTCGGACTCCAAGCTCAATGATGGCAGCACACAACGTAAAGAATTTCAAAATTCTACCCCTTCTTCGGGACCAAGGAAATCATCAAAGGGTTATTCTGATCATAGTAAATACTGGGACCAGAATGATTATGCGGAAAAATGGCATCGTAATAAGAGGGGTTTTCATGAATCAGGATCTGAAAATACCGGATATACTTCTTTCAGTGGGCCGTTGCAATCTCCAAATCACAAAGTTTCCTCACCTTTATTTCCTCTCCCAGAGGCCCGTACTCATAAAACATCGAATTTGGCACCTTCAGCGACAGAAAACTTCAATCAACATGATAATTATGAATCAAATTTGCATGTAGCAACTCACAACGAACCAAATTGGCCCCCTAAGGTGACTGAAGACTTTAGCCGGTGTGAGAAACATAGATCAAAAGCACATGTAACAACTCATATTGCATTGGATTGGCAACCTCCATTGGTTCAAGACTCTTACCGGCGTGAGACTCGAGAATCAAATGCAATTGTAGTGACTCAAAATACATCAAATTGGGCGCCTTCGATGGCTCGAGATCTTCCACAGCATGGGAATTATGAGCCAATTGCACATGTAGCTGCTCATGAAAGATCAAATGGGATTCCACCAGAGGCTCAAGACCTCAAACACCATGAGAACTGTGATGCAAATGCACATGTGGTAAGGGTAGCACAATTTCCTCATCCCCAAGAGTCTCGGAGCTATGCACATCAAAGGCTTGTTCAGCCGAGGGCTATGGCATCCAATTGTTATGAGCAAGGGCCCAAAATTTTGGGGGAATCACAAAGATTGCAACAAAATGTTTTCGAGCCAGGACCTGTTATCGGGGGTTTAGATGAAAAATGCCGTGAAGAAACCTACAAGATTGACTGTCCTTCTAATGTCAAAGCACAAGGTGATGCTGGTGGGTTTGCCGGATTTGACAGTGCTGAAACGTACAAGGACAGGTCTTTCTCGAATGTCAAGCAAAGTGATACTGACAGTTTTACTGCATTTGACAGTAACAAAGCGCCTGGGAACAATGTCAGTACATCTGTATACAACTTACAGAGTGAAACAAATAATCTCTTAAATGTTTTGAAACTTTTGAGTGCTGCTTCCGGTGCTGGTTTAGCTGATAATAATAATGATGGTAATGCTCAAGGTTCTGGGCATGATGTTAAAATGATGAAATCACCACCAGGTGCTATTGTTCAGAATCTAAACCATCCTCCAGCAGTTACAAATCTGTCTAAAAACCACGAAGATGGTCTTATTCTTAACAACACTTCTGCAAATGGTGAAAAAGGTTATGCTAAATTAGATAATGAATGCAAGTCTGAAAATGGACCTTCAGCATTTGCTGAGAAGTTGTGGGATGGCACCCTTCAACTGAGTACCTCTGTTTCAGCGTCTGTCATTGCCTTCTTCAAAAGCGGAGAGAAGATGCTGGACTTTAGCTGGTCTGAATCTGTGGAAGTTAGAGGAAAAGTAAGATTGGAG
GCTTTTGAGAAGTATGTGCAAGATCTTCCCCGTTCCCGCTCTCGTGGACTCATGGTGATGTCATTTTGCTGGAAGGAAGGTTCATCTGATGCCGGACATAAAGGCATGAAGGAGGTTGCAAGGAAATATAAAGAAGGAAAGAGAGTTGGGTTTGCAAAGTTGTCTACTGATATTGATCTGTACATTTGTCCTCATAGTGATGCAATAATCACTATTCTTGCAAAACACGGCTTCTTTAAGGGAATGACCGCTGTCAAAGACAAATCGGAGCTGCTTATTGGATGTGCTGTGTGGAAAAAATACTCGTCTTTTGCTCCTGCCAAAAAAACAGCAGAGGTTAACGCCCCACAATTAGGTCAGAATCCGATGGAAGCAGCAAAATCTGATATAGGTCAGGCTCCGACTGAAGGTGTACAAAGTGGCAGAATAGACAAAAGTCCGGAGCTTTTTCCCTCACCTCAAGTTGAGCATGCTAAGCAGTCTATGGTCACTCAAAATAATTTGACCTCCGGTAGTGGTGCCTCTAGCCTGCTATCAGAGATTAAGACTGATGTGGTACTACATAAGCCTATTTTACCCATTCCTTCGGTTTCCTCAAAACAATCTGCAGATTTTTCTGATGATGATGATCTACCAGAATATGATTTTAGAGCTGCAGTTTCTCAGGCTTCAGTGACCAATACTTCAGCTGTTGCGACACCCATTTCTGCTGCAATGAAGTCAGAGGGTTCGACTGCTGCTTCAATACCTGTGCCAAATTCTAATTTTCAACAATGTGTTGAGCAATATTCTGAAGCACAAAAACCAAGCTTCCCAACTCAAGAACAGAACCAAATCGGGCATTTTCAAGGACCGGCCTCGCAAAACACAGGGCAGCAACATTTCCATTTTCAAGGACTAGTACCACAGAGCGGGAAGCAATATTCTGAAGCTCATCAACCAAGCTTCCCAAGACAGGACCAGAACCTAAGTGGGCATTTTCAAGGACCAGGCACTCAAAACTTGGATGAGCAGTCTCTGGTGAAAGTCTCAAGTTTGCCTACACAGGAGCAGAAAAAAATAGAGAATAGTTCGAAGCCAAGAAACCTTTTCGATGATGATGATATGCCTGAATGGTGCCCACCAGATTTCTTCAAGTCAAGAGAAGAGGCCACTACAAAGGCAGAAGCCTTGTTGTCTTCCAAAGTTTCTGGGATTGCGAATTTAACTTCTCCACCTCCACCTCCACCTCCACCGCCTCCCCCATTGACCCAGATGCTGCATCCTTCTAAGCCAGTTGCCAATGAAAACCCAGCCGGTACATCATCCCAGATGCAGTTCAAAGAGAGAGATTACGCTTGTACCCATCATCATGTACCCCCACCTCCACCTCCACCAGTTGGTAATCACCCATCTACCAATAATTTCATGCATCATAATAATAATAATCCAGCTGCTATGTCATCCCAGATGCAGTTGAATGAGAGGGAGTACTGTTCTTTCCCTAATGTTCCCCCACCTCCTCCTCCAACCAATAATTTCATGCAGCACAGTCCAGCTTCGACACGTGTGCAATTTGGAGACAATAATCCAGCTGTTTTTTCATCCCAGATGCAGTTCAATGAGAGAGAGTACCGTTCTGTTCCCCCAGCTCCACCTCCAGCTCCACCTCCACCTCCTCCAGTTGGTAATCATAGTTCAGCTGCGATGCAATTCGGAGACAGAAAACCAGCTGCTATATCGTCTCAGATGCAGTTCAATGAGAGAGAGTATCATTTTGCCCCCCACATGCCCCCACCTCCTCCTCCTGCTAATCTCATGCATCATAGTCAGATGCAATTTGGAGACACAAATCCTGCTTCTATACCCTCCCAGATGCATGTTCCCCCACCTCCAGTTAATACTGATAATTTCATGCCTCATAATCCAGCTGCTCGATCAACCCAGATGCAATTTGGAGACACTAATTACTCTAATGCCCCTCATCCTCCCACATCATACACTGACAACCCAATGAGTGA
TAGGGTCATCCATCAAAATCCAGCATATCCTCCCGGTTTCACACCAAATCCTGCTTTTCGGCCTTGTTTTGATCAGCCTGTCTGTCCTACTCGGAGTACAAGGCCATGACTTTTGGGGCTTTAGCTTTCTAGGATGATACAAGGTCATTATAGACTGCCGCTAGTTTCAACTTGTGTAAATAAGATAGAAATAGTTTTCCAGTTAATCTCTATACTTTGAAAGTCTAGTCTTCCATGCTAGTTTCCTGACATCAGTTCTATTCTGAATTCCAATGTCTTATGGATGCAAAGAAAATCCATGTCTGAATTATACAGAATTCACAGCATAACCCAAAGTGGGAGAAATTATACATAATTTCATGTGAAAAAAAAATTCTGTGAACTGATGGAAATTAACATTCATTAATTCATGTAAAATAATCTCGGACGGCAAAGTTCCTTGGGTAGAGAGGAAGGGCAAGACTTCAGTTTTCTATAATGTGCACAAGTTGAGCATTCATTATTGCTGTG

Coding sequence (CDS)

ATGCAACCACCAGGCCCCCCGCGGCCATTCCGAGGAGCCCAATATGACAAAGACATGTTTACCTTTCTGCCTGCACGTTTTAACAGGGATGTTTTAGATGAGTCACCAAAAAAAAGAAAATTAACTTCCTCTGATCACCAAATCACCGATCCTGCTCCAAATGTGAGGCAAGGCATTTCTGGAGCTTCGGACTCCAAGCTCAATGATGGCAGCACACAACGTAAAGAATTTCAAAATTCTACCCCTTCTTCGGGACCAAGGAAATCATCAAAGGGTTATTCTGATCATAGTAAATACTGGGACCAGAATGATTATGCGGAAAAATGGCATCGTAATAAGAGGGGTTTTCATGAATCAGGATCTGAAAATACCGGATATACTTCTTTCAGTGGGCCGTTGCAATCTCCAAATCACAAAGTTTCCTCACCTTTATTTCCTCTCCCAGAGGCCCGTACTCATAAAACATCGAATTTGGCACCTTCAGCGACAGAAAACTTCAATCAACATGATAATTATGAATCAAATTTGCATGTAGCAACTCACAACGAACCAAATTGGCCCCCTAAGGTGACTGAAGACTTTAGCCGGTGTGAGAAACATAGATCAAAAGCACATGTAACAACTCATATTGCATTGGATTGGCAACCTCCATTGGTTCAAGACTCTTACCGGCGTGAGACTCGAGAATCAAATGCAATTGTAGTGACTCAAAATACATCAAATTGGGCGCCTTCGATGGCTCGAGATCTTCCACAGCATGGGAATTATGAGCCAATTGCACATGTAGCTGCTCATGAAAGATCAAATGGGATTCCACCAGAGGCTCAAGACCTCAAACACCATGAGAACTGTGATGCAAATGCACATGTGGTAAGGGTAGCACAATTTCCTCATCCCCAAGAGTCTCGGAGCTATGCACATCAAAGGCTTGTTCAGCCGAGGGCTATGGCATCCAATTGTTATGAGCAAGGGCCCAAAATTTTGGGGGAATCACAAAGATTGCAACAAAATGTTTTCGAGCCAGGACCTGTTATCGGGGGTTTAGATGAAAAATGCCGTGAAGAAACCTACAAGATTGACTGTCCTTCTAATGTCAAAGCACAAGGTGATGCTGGTGGGTTTGCCGGATTTGACAGTGCTGAAACGTACAAGGACAGGTCTTTCTCGAATGTCAAGCAAAGTGATACTGACAGTTTTACTGCATTTGACAGTAACAAAGCGCCTGGGAACAATGTCAGTACATCTGTATACAACTTACAGAGTGAAACAAATAATCTCTTAAATGTTTTGAAACTTTTGAGTGCTGCTTCCGGTGCTGGTTTAGCTGATAATAATAATGATGGTAATGCTCAAGGTTCTGGGCATGATGTTAAAATGATGAAATCACCACCAGGTGCTATTGTTCAGAATCTAAACCATCCTCCAGCAGTTACAAATCTGTCTAAAAACCACGAAGATGGTCTTATTCTTAACAACACTTCTGCAAATGGTGAAAAAGGTTATGCTAAATTAGATAATGAATGCAAGTCTGAAAATGGACCTTCAGCATTTGCTGAGAAGTTGTGGGATGGCACCCTTCAACTGAGTACCTCTGTTTCAGCGTCTGTCATTGCCTTCTTCAAAAGCGGAGAGAAGATGCTGGACTTTAGCTGGTCTGAATCTGTGGAAGTTAGAGGAAAAGTAAGATTGGAGGCTTTTGAGAAGTATGTGCAAGATCTTCCCCGTTCCCGCTCTCGTGGACTCATGGTGATGTCATTTTGCTGGAAGGAAGGTTCATCTGATGCCGGACATAAAGGCATGAAGGAGGTTGCAAGGAAATATAAAGAAGGAAAGAGAGTTGGGTTTGCAAAGTTGTCTACTGATATTGATCTGTACATTTGTCCTCATAGTGATGCAATAATCACTATTCTTGCAAAACACGGCTTCTTTAAGGGAATGACCGCTGTCAAAGACAAATCGGAGCTGCTTATTGGATGTGCTGTGTGGAAAAAATACTCGTC
TTTTGCTCCTGCCAAAAAAACAGCAGAGGTTAACGCCCCACAATTAGGTCAGAATCCGATGGAAGCAGCAAAATCTGATATAGGTCAGGCTCCGACTGAAGGTGTACAAAGTGGCAGAATAGACAAAAGTCCGGAGCTTTTTCCCTCACCTCAAGTTGAGCATGCTAAGCAGTCTATGGTCACTCAAAATAATTTGACCTCCGGTAGTGGTGCCTCTAGCCTGCTATCAGAGATTAAGACTGATGTGGTACTACATAAGCCTATTTTACCCATTCCTTCGGTTTCCTCAAAACAATCTGCAGATTTTTCTGATGATGATGATCTACCAGAATATGATTTTAGAGCTGCAGTTTCTCAGGCTTCAGTGACCAATACTTCAGCTGTTGCGACACCCATTTCTGCTGCAATGAAGTCAGAGGGTTCGACTGCTGCTTCAATACCTGTGCCAAATTCTAATTTTCAACAATGTGTTGAGCAATATTCTGAAGCACAAAAACCAAGCTTCCCAACTCAAGAACAGAACCAAATCGGGCATTTTCAAGGACCGGCCTCGCAAAACACAGGGCAGCAACATTTCCATTTTCAAGGACTAGTACCACAGAGCGGGAAGCAATATTCTGAAGCTCATCAACCAAGCTTCCCAAGACAGGACCAGAACCTAAGTGGGCATTTTCAAGGACCAGGCACTCAAAACTTGGATGAGCAGTCTCTGGTGAAAGTCTCAAGTTTGCCTACACAGGAGCAGAAAAAAATAGAGAATAGTTCGAAGCCAAGAAACCTTTTCGATGATGATGATATGCCTGAATGGTGCCCACCAGATTTCTTCAAGTCAAGAGAAGAGGCCACTACAAAGGCAGAAGCCTTGTTGTCTTCCAAAGTTTCTGGGATTGCGAATTTAACTTCTCCACCTCCACCTCCACCTCCACCGCCTCCCCCATTGACCCAGATGCTGCATCCTTCTAAGCCAGTTGCCAATGAAAACCCAGCCGGTACATCATCCCAGATGCAGTTCAAAGAGAGAGATTACGCTTGTACCCATCATCATGTACCCCCACCTCCACCTCCACCAGTTGGTAATCACCCATCTACCAATAATTTCATGCATCATAATAATAATAATCCAGCTGCTATGTCATCCCAGATGCAGTTGAATGAGAGGGAGTACTGTTCTTTCCCTAATGTTCCCCCACCTCCTCCTCCAACCAATAATTTCATGCAGCACAGTCCAGCTTCGACACGTGTGCAATTTGGAGACAATAATCCAGCTGTTTTTTCATCCCAGATGCAGTTCAATGAGAGAGAGTACCGTTCTGTTCCCCCAGCTCCACCTCCAGCTCCACCTCCACCTCCTCCAGTTGGTAATCATAGTTCAGCTGCGATGCAATTCGGAGACAGAAAACCAGCTGCTATATCGTCTCAGATGCAGTTCAATGAGAGAGAGTATCATTTTGCCCCCCACATGCCCCCACCTCCTCCTCCTGCTAATCTCATGCATCATAGTCAGATGCAATTTGGAGACACAAATCCTGCTTCTATACCCTCCCAGATGCATGTTCCCCCACCTCCAGTTAATACTGATAATTTCATGCCTCATAATCCAGCTGCTCGATCAACCCAGATGCAATTTGGAGACACTAATTACTCTAATGCCCCTCATCCTCCCACATCATACACTGACAACCCAATGAGTGATAGGGTCATCCATCAAAATCCAGCATATCCTCCCGGTTTCACACCAAATCCTGCTTTTCGGCCTTGTTTTGATCAGCCTGTCTGTCCTACTCGGAGTACAAGGCCATGA

Protein sequence

MQPPGPPRPFRGAQYDKDMFTFLPARFNRDVLDESPKKRKLTSSDHQITDPAPNVRQGISGASDSKLNDGSTQRKEFQNSTPSSGPRKSSKGYSDHSKYWDQNDYAEKWHRNKRGFHESGSENTGYTSFSGPLQSPNHKVSSPLFPLPEARTHKTSNLAPSATENFNQHDNYESNLHVATHNEPNWPPKVTEDFSRCEKHRSKAHVTTHIALDWQPPLVQDSYRRETRESNAIVVTQNTSNWAPSMARDLPQHGNYEPIAHVAAHERSNGIPPEAQDLKHHENCDANAHVVRVAQFPHPQESRSYAHQRLVQPRAMASNCYEQGPKILGESQRLQQNVFEPGPVIGGLDEKCREETYKIDCPSNVKAQGDAGGFAGFDSAETYKDRSFSNVKQSDTDSFTAFDSNKAPGNNVSTSVYNLQSETNNLLNVLKLLSAASGAGLADNNNDGNAQGSGHDVKMMKSPPGAIVQNLNHPPAVTNLSKNHEDGLILNNTSANGEKGYAKLDNECKSENGPSAFAEKLWDGTLQLSTSVSASVIAFFKSGEKMLDFSWSESVEVRGKVRLEAFEKYVQDLPRSRSRGLMVMSFCWKEGSSDAGHKGMKEVARKYKEGKRVGFAKLSTDIDLYICPHSDAIITILAKHGFFKGMTAVKDKSELLIGCAVWKKYSSFAPAKKTAEVNAPQLGQNPMEAAKSDIGQAPTEGVQSGRIDKSPELFPSPQVEHAKQSMVTQNNLTSGSGASSLLSEIKTDVVLHKPILPIPSVSSKQSADFSDDDDLPEYDFRAAVSQASVTNTSAVATPISAAMKSEGSTAASIPVPNSNFQQCVEQYSEAQKPSFPTQEQNQIGHFQGPASQNTGQQHFHFQGLVPQSGKQYSEAHQPSFPRQDQNLSGHFQGPGTQNLDEQSLVKVSSLPTQEQKKIENSSKPRNLFDDDDMPEWCPPDFFKSREEATTKAEALLSSKVSGIANLTSPPPPPPPPPPPLTQMLHPSKPVANENPAGTSSQMQFKERDYACTHHHVPPPPPPPVGNHPSTNNFMHHNNNNPAAMSSQMQLNEREYCSFPNVPPPPPPTNNFMQHSPASTRVQFGDNNPAVFSSQMQFNEREYRSVPPAPPPAPPPPPPVGNHSSAAMQFGDRKPAAISSQMQFNEREYHFAPHMPPPPPPANLMHHSQMQFGDTNPASIPSQMHVPPPPVNTDNFMPHNPAARSTQMQFGDTNYSNAPHPPTSYTDNPMSDRVIHQNPAYPPGFTPNPAFRPCFDQPVCPTRSTRP
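
The protein sequence above should correspond to a frame-1 translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used, and the helper name `translate` is hypothetical. Only the first ten and last seven codons of the CDS are used here to keep the example short.

```python
# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]  # KeyError on non-ACGT input
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons and last seven codons of the CDS listed above.
print(translate("ATGCAACCACCAGGCCCCCCGCGGCCATTC"))
print(translate("ACTCGGAGTACAAGGCCATGA"))
```

The two fragments translate to the first ten and last six residues of the protein sequence shown above, with the final TGA read as the stop codon.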
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name    Type
Spo18823        Spo18823       gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name    Unique Name           Type
Spo18823.1      Spo18823.1-protein    polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name         Unique Name          Type
Spo18823.1.exon.1    Spo18823.1.exon.1    exon
Spo18823.1.exon.2    Spo18823.1.exon.2    exon
Spo18823.1.exon.3    Spo18823.1.exon.3    exon
Spo18823.1.exon.4    Spo18823.1.exon.4    exon
Spo18823.1.exon.5    Spo18823.1.exon.5    exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Spo18823.1.utr5p.1    Spo18823.1.utr5p.1    five_prime_UTR
Spo18823.1.utr5p.2    Spo18823.1.utr5p.2    five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name        Unique Name         Type
Spo18823.1.CDS.1    Spo18823.1.CDS.1    CDS
Spo18823.1.CDS.2    Spo18823.1.CDS.2    CDS
Spo18823.1.CDS.3    Spo18823.1.CDS.3    CDS
Spo18823.1.CDS.4    Spo18823.1.CDS.4    CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Spo18823.1.utr3p.1    Spo18823.1.utr3p.1    three_prime_UTR


Homology
BLAST of Spo18823.1 vs. NCBI nr
Match: gi|902233130|gb|KNA22934.1| (hypothetical protein SOVF_029320 [Spinacia oleracea])

HSP 1 Score: 2539.6 bits (6581), Expect = 0.000e+0
Identity = 1266/1266 (100.00%), Positives = 1266/1266 (100.00%), Query Frame = 1

		  

Query: 1    MQPPGPPRPFRGAQYDKDMFTFLPARFNRDVLDESPKKRKLTSSDHQITDPAPNVRQGIS 60
            MQPPGPPRPFRGAQYDKDMFTFLPARFNRDVLDESPKKRKLTSSDHQITDPAPNVRQGIS
Sbjct: 1    MQPPGPPRPFRGAQYDKDMFTFLPARFNRDVLDESPKKRKLTSSDHQITDPAPNVRQGIS 60

Query: 61   GASDSKLNDGSTQRKEFQNSTPSSGPRKSSKGYSDHSKYWDQNDYAEKWHRNKRGFHESG 120
            GASDSKLNDGSTQRKEFQNSTPSSGPRKSSKGYSDHSKYWDQNDYAEKWHRNKRGFHESG
Sbjct: 61   GASDSKLNDGSTQRKEFQNSTPSSGPRKSSKGYSDHSKYWDQNDYAEKWHRNKRGFHESG 120

Query: 121  SENTGYTSFSGPLQSPNHKVSSPLFPLPEARTHKTSNLAPSATENFNQHDNYESNLHVAT 180
            SENTGYTSFSGPLQSPNHKVSSPLFPLPEARTHKTSNLAPSATENFNQHDNYESNLHVAT
Sbjct: 121  SENTGYTSFSGPLQSPNHKVSSPLFPLPEARTHKTSNLAPSATENFNQHDNYESNLHVAT 180

Query: 181  HNEPNWPPKVTEDFSRCEKHRSKAHVTTHIALDWQPPLVQDSYRRETRESNAIVVTQNTS 240
            HNEPNWPPKVTEDFSRCEKHRSKAHVTTHIALDWQPPLVQDSYRRETRESNAIVVTQNTS
Sbjct: 181  HNEPNWPPKVTEDFSRCEKHRSKAHVTTHIALDWQPPLVQDSYRRETRESNAIVVTQNTS 240

Query: 241  NWAPSMARDLPQHGNYEPIAHVAAHERSNGIPPEAQDLKHHENCDANAHVVRVAQFPHPQ 300
            NWAPSMARDLPQHGNYEPIAHVAAHERSNGIPPEAQDLKHHENCDANAHVVRVAQFPHPQ
Sbjct: 241  NWAPSMARDLPQHGNYEPIAHVAAHERSNGIPPEAQDLKHHENCDANAHVVRVAQFPHPQ 300

Query: 301  ESRSYAHQRLVQPRAMASNCYEQGPKILGESQRLQQNVFEPGPVIGGLDEKCREETYKID 360
            ESRSYAHQRLVQPRAMASNCYEQGPKILGESQRLQQNVFEPGPVIGGLDEKCREETYKID
Sbjct: 301  ESRSYAHQRLVQPRAMASNCYEQGPKILGESQRLQQNVFEPGPVIGGLDEKCREETYKID 360

Query: 361  CPSNVKAQGDAGGFAGFDSAETYKDRSFSNVKQSDTDSFTAFDSNKAPGNNVSTSVYNLQ 420
            CPSNVKAQGDAGGFAGFDSAETYKDRSFSNVKQSDTDSFTAFDSNKAPGNNVSTSVYNLQ
Sbjct: 361  CPSNVKAQGDAGGFAGFDSAETYKDRSFSNVKQSDTDSFTAFDSNKAPGNNVSTSVYNLQ 420

Query: 421  SETNNLLNVLKLLSAASGAGLADNNNDGNAQGSGHDVKMMKSPPGAIVQNLNHPPAVTNL 480
            SETNNLLNVLKLLSAASGAGLADNNNDGNAQGSGHDVKMMKSPPGAIVQNLNHPPAVTNL
Sbjct: 421  SETNNLLNVLKLLSAASGAGLADNNNDGNAQGSGHDVKMMKSPPGAIVQNLNHPPAVTNL 480

Query: 481  SKNHEDGLILNNTSANGEKGYAKLDNECKSENGPSAFAEKLWDGTLQLSTSVSASVIAFF 540
            SKNHEDGLILNNTSANGEKGYAKLDNECKSENGPSAFAEKLWDGTLQLSTSVSASVIAFF
Sbjct: 481  SKNHEDGLILNNTSANGEKGYAKLDNECKSENGPSAFAEKLWDGTLQLSTSVSASVIAFF 540

Query: 541  KSGEKMLDFSWSESVEVRGKVRLEAFEKYVQDLPRSRSRGLMVMSFCWKEGSSDAGHKGM 600
            KSGEKMLDFSWSESVEVRGKVRLEAFEKYVQDLPRSRSRGLMVMSFCWKEGSSDAGHKGM
Sbjct: 541  KSGEKMLDFSWSESVEVRGKVRLEAFEKYVQDLPRSRSRGLMVMSFCWKEGSSDAGHKGM 600

Query: 601  KEVARKYKEGKRVGFAKLSTDIDLYICPHSDAIITILAKHGFFKGMTAVKDKSELLIGCA 660
            KEVARKYKEGKRVGFAKLSTDIDLYICPHSDAIITILAKHGFFKGMTAVKDKSELLIGCA
Sbjct: 601  KEVARKYKEGKRVGFAKLSTDIDLYICPHSDAIITILAKHGFFKGMTAVKDKSELLIGCA 660

Query: 661  VWKKYSSFAPAKKTAEVNAPQLGQNPMEAAKSDIGQAPTEGVQSGRIDKSPELFPSPQVE 720
            VWKKYSSFAPAKKTAEVNAPQLGQNPMEAAKSDIGQAPTEGVQSGRIDKSPELFPSPQVE
Sbjct: 661  VWKKYSSFAPAKKTAEVNAPQLGQNPMEAAKSDIGQAPTEGVQSGRIDKSPELFPSPQVE 720

Query: 721  HAKQSMVTQNNLTSGSGASSLLSEIKTDVVLHKPILPIPSVSSKQSADFSDDDDLPEYDF 780
            HAKQSMVTQNNLTSGSGASSLLSEIKTDVVLHKPILPIPSVSSKQSADFSDDDDLPEYDF
Sbjct: 721  HAKQSMVTQNNLTSGSGASSLLSEIKTDVVLHKPILPIPSVSSKQSADFSDDDDLPEYDF 780

Query: 781  RAAVSQASVTNTSAVATPISAAMKSEGSTAASIPVPNSNFQQCVEQYSEAQKPSFPTQEQ 840
            RAAVSQASVTNTSAVATPISAAMKSEGSTAASIPVPNSNFQQCVEQYSEAQKPSFPTQEQ
Sbjct: 781  RAAVSQASVTNTSAVATPISAAMKSEGSTAASIPVPNSNFQQCVEQYSEAQKPSFPTQEQ 840

Query: 841  NQIGHFQGPASQNTGQQHFHFQGLVPQSGKQYSEAHQPSFPRQDQNLSGHFQGPGTQNLD 900
            NQIGHFQGPASQNTGQQHFHFQGLVPQSGKQYSEAHQPSFPRQDQNLSGHFQGPGTQNLD
Sbjct: 841  NQIGHFQGPASQNTGQQHFHFQGLVPQSGKQYSEAHQPSFPRQDQNLSGHFQGPGTQNLD 900

Query: 901  EQSLVKVSSLPTQEQKKIENSSKPRNLFDDDDMPEWCPPDFFKSREEATTKAEALLSSKV 960
            EQSLVKVSSLPTQEQKKIENSSKPRNLFDDDDMPEWCPPDFFKSREEATTKAEALLSSKV
Sbjct: 901  EQSLVKVSSLPTQEQKKIENSSKPRNLFDDDDMPEWCPPDFFKSREEATTKAEALLSSKV 960

Query: 961  SGIANLTSPPPPPPPPPPPLTQMLHPSKPVANENPAGTSSQMQFKERDYACTHHHVPPPP 1020
            SGIANLTSPPPPPPPPPPPLTQMLHPSKPVANENPAGTSSQMQFKERDYACTHHHVPPPP
Sbjct: 961  SGIANLTSPPPPPPPPPPPLTQMLHPSKPVANENPAGTSSQMQFKERDYACTHHHVPPPP 1020

Query: 1021 PPPVGNHPSTNNFMHHNNNNPAAMSSQMQLNEREYCSFPNVPPPPPPTNNFMQHSPASTR 1080
            PPPVGNHPSTNNFMHHNNNNPAAMSSQMQLNEREYCSFPNVPPPPPPTNNFMQHSPASTR
Sbjct: 1021 PPPVGNHPSTNNFMHHNNNNPAAMSSQMQLNEREYCSFPNVPPPPPPTNNFMQHSPASTR 1080

Query: 1081 VQFGDNNPAVFSSQMQFNEREYRSVPPAPPPAPPPPPPVGNHSSAAMQFGDRKPAAISSQ 1140
            VQFGDNNPAVFSSQMQFNEREYRSVPPAPPPAPPPPPPVGNHSSAAMQFGDRKPAAISSQ
Sbjct: 1081 VQFGDNNPAVFSSQMQFNEREYRSVPPAPPPAPPPPPPVGNHSSAAMQFGDRKPAAISSQ 1140

Query: 1141 MQFNEREYHFAPHMPPPPPPANLMHHSQMQFGDTNPASIPSQMHVPPPPVNTDNFMPHNP 1200
            MQFNEREYHFAPHMPPPPPPANLMHHSQMQFGDTNPASIPSQMHVPPPPVNTDNFMPHNP
Sbjct: 1141 MQFNEREYHFAPHMPPPPPPANLMHHSQMQFGDTNPASIPSQMHVPPPPVNTDNFMPHNP 1200

Query: 1201 AARSTQMQFGDTNYSNAPHPPTSYTDNPMSDRVIHQNPAYPPGFTPNPAFRPCFDQPVCP 1260
            AARSTQMQFGDTNYSNAPHPPTSYTDNPMSDRVIHQNPAYPPGFTPNPAFRPCFDQPVCP
Sbjct: 1201 AARSTQMQFGDTNYSNAPHPPTSYTDNPMSDRVIHQNPAYPPGFTPNPAFRPCFDQPVCP 1260

Query: 1261 TRSTRP 1267
            TRSTRP
Sbjct: 1261 TRSTRP 1266
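
The percentages in the HSP headers on this page are the match counts divided by the alignment length. The sketch below reproduces them from the counts reported for the two hits; the helper name `percent` is illustrative, and rounding to two decimals is an assumption based on how the values are displayed.

```python
def percent(matches: int, aligned_length: int) -> str:
    """Format a match count as a percentage of the alignment length, two decimals."""
    return f"{100 * matches / aligned_length:.2f}"


# Counts taken from the two HSP headers on this page.
print(percent(1266, 1266))  # identity, Spinacia oleracea hit
print(percent(643, 1125))   # identity, Beta vulgaris hit
print(percent(727, 1125))   # positives, Beta vulgaris hit
```

Note that the denominator is the alignment length including gaps (1125 for the Beta vulgaris hit), not the 1266-residue query length.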

BLAST of Spo18823.1 vs. NCBI nr
Match: gi|731353158|ref|XP_010687913.1| (PREDICTED: uncharacterized protein LOC104901975 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 1051.6 bits (2718), Expect = 1.100e-303
Identity = 643/1125 (57.16%), Positives = 727/1125 (64.62%), Query Frame = 1

Query: 1    MQPPGPPRPFRGAQYDKDMFTFLPARFNRDVLDESPKKRKLTSSDHQITDPAPNVRQGIS 60
            MQPPGPPRPFRGAQYDKDMFTFLP   NRDVL ES +KRKL SSDHQ T  A N R GIS
Sbjct: 1    MQPPGPPRPFRGAQYDKDMFTFLPPHSNRDVLAESSRKRKLNSSDHQNTVSAANARHGIS 60

Query: 61   GASDSKLNDGSTQRKEFQNSTPSSGPRKSS--KGYSDHSKYWDQNDYAEKWHRNKRGFHE 120
                 KLN  + Q KE  +S P+S PR+S   KGYSD S+ WDQNDYA   HR +R  ++
Sbjct: 61   VIPYPKLNAENLQGKEIHSSAPASQPRESLNFKGYSDQSQQWDQNDYAVNGHRKERLLND 120

Query: 121  SGSENTGYTSFSGPLQSPNHKVSSPLFPLPEARTHKTSNLAPSATENFNQHDNYESNLHV 180
            S SE TG+TSFSG LQSP+HK SS ++ L E  T KT NL P  TE+F+ H+N + N H 
Sbjct: 121  SRSEYTGFTSFSGQLQSPSHKSSSSVYSLSEIHTQKTLNLPPPVTEDFSLHENRKPNSHE 180

Query: 181  ATHNEPNWPPKVTEDFSRCEKHRSKAHVTTHIALDWQPPLVQDSYRRETRESNAIVVTQN 240
            AT    +  P +T DFSRCE   S AHV TH AL WQPP VQ+SY  E RESNA  V Q 
Sbjct: 181  ATLKGLSRSPLMTTDFSRCENRESNAHVATHKALGWQPPAVQNSYPHEIRESNANAVAQK 240

Query: 241  TSNWAPSMARDLPQHGNYEPIAHVAAHERSNGIPPEAQDLKHHENCDANAHVVRVAQFPH 300
             +N  PSM +D PQ+GN E   HVA    SNG     QDLKHHEN + NA V RV   PH
Sbjct: 241  RTNRLPSMTQDFPQNGNCESHPHVATQRTSNGPLLVTQDLKHHENREFNARVTRVENGPH 300

Query: 301  PQESRSYAHQRLVQPRAMASNCYEQGPKILGESQRLQQNVFEPGPVIGGLDEK--CREET 360
             Q+  S+AH++   P+ M S+ Y Q    LG S R  QN  +PG  IGGLDEK  C EET
Sbjct: 301  SQDFWSFAHEK---PKVMQSDPYAQASTSLGGSPRSPQNTIKPGAFIGGLDEKGRCEEET 360

Query: 361  YKIDCPSNVKAQGDAGGFAGFDSAETYKDRSFSNVKQSDTDSFTAFDSNKAPGNNVSTSV 420
            YK++CPSN+K  GDAG F+GFD A+  K  S SNV + DTD+FT FDSNKAPGNN S SV
Sbjct: 361  YKVNCPSNIKEYGDAGRFSGFDGADICKVGSSSNVTKRDTDNFTGFDSNKAPGNNDSASV 420

Query: 421  YNLQSETNNLLNVLKLLSAASGAGLADNNNDGNAQGSGHDVKMMKSPPGAIVQNLNHPPA 480
            YNLQSETNNLLNVL+LLSAASG GLADNN DGN QG  HDVK++KSPPG+ V NLNHPPA
Sbjct: 421  YNLQSETNNLLNVLRLLSAASGIGLADNN-DGNVQGCCHDVKLIKSPPGSTVHNLNHPPA 480

Query: 481  VTNLSKNHEDGLILNNTSANGEKGYAKLDNECKSENGPSAFAEKLWDGTLQLSTSVSASV 540
             TNLSK  EDG IL+  ++ GEKG  KLD++CK+ENG SAFAEKLWDGTLQLSTSVS S 
Sbjct: 481  ETNLSKKCEDGFILDKPASTGEKGSVKLDDQCKTENGASAFAEKLWDGTLQLSTSVSVSA 540

Query: 541  IAFFKSGEKMLDFSWSESVEVRGKVRLEAFEKYVQDLPRSRSRGLMVMSFCWKEGSSDAG 600
            IAFFKSGEKM DFSWSESVEV+GKVRLEAFEKYVQDLPRSRSRGLMVMS CWK+GSSDA 
Sbjct: 541  IAFFKSGEKMQDFSWSESVEVKGKVRLEAFEKYVQDLPRSRSRGLMVMSLCWKDGSSDAE 600

Query: 601  HKGMKEVARKYKEGKRVGFAKLSTDIDLYICPHSDAIITILAKHGFFKGMTAVKDKSELL 660
            HKGMKEVARKYKEGKRVGFAKLSTD+DLYICPHSDAIITILAKHGFFKGM A+K+KS+ L
Sbjct: 601  HKGMKEVARKYKEGKRVGFAKLSTDVDLYICPHSDAIITILAKHGFFKGMIAIKEKSDPL 660

Query: 661  IGCAVWKKYSSFAPAKKTAEVNAPQLGQNPMEAAKSDIGQAPTEGVQSGRIDKSPELFPS 720
            IGC VWKK SS AP  K+      QL   P E  KS   Q   + V++   +K PE+   
Sbjct: 661  IGCVVWKKTSSSAPPTKSGSSLCSQLESLP-EQGKS---QTQVDSVENNVKEKIPEILSC 720

Query: 721  PQVEHAKQSMVTQNNLTSGSGASSLLSEIKTDVVLHKPILPIPSVSSKQSADFSDDDDLP 780
            PQV+  KQSM                        L KPIL IPS+ SKQSADFSDDDDLP
Sbjct: 721  PQVDQPKQSM-----------------------ELTKPILSIPSIPSKQSADFSDDDDLP 780

Query: 781  EYDFRAAVSQASVTNTSA--------VATPISAAMKSEGSTAASIPVPNSNFQQCVEQYS 840
            EYDF+ A+SQ SV+ TS         V TP+SA  KS  S+A SI V          QYS
Sbjct: 781  EYDFKVAISQTSVSRTSEMQQIRSSHVLTPVSAEPKSV-SSAVSISV----------QYS 840

Query: 841  EAQKPSFPTQEQNQIGHFQGPASQNTGQQHFHFQGLVPQSGKQYSEAHQPSFPRQDQNLS 900
            E  KPSFPTQEQ Q GH QGPA   T  Q+F  Q          SE   PS P       
Sbjct: 841  EEHKPSFPTQEQKQTGHSQGPA---TAPQNFVLQ----------SEGKIPSLPM------ 900

Query: 901  GHFQGPGTQNLDEQSLVKVSSLPTQEQKKIENSS---KPRNLFDDDDMPEWCPPDFFKSR 960
                                    QE K IE  +   KP+NLFDDDDMPEWCPPDF K R
Sbjct: 901  ------------------------QEPKPIEKGASHLKPKNLFDDDDMPEWCPPDFMKIR 960

Query: 961  EEATTKAEALLSSKV----SGIAN----LTSPPPPPPPPPPPLTQ-----MLHPS----K 1020
            EEA  +     SS      SG A+    L  PPPPPPPPPPPLTQ     M HPS    K
Sbjct: 961  EEAAKRPGMSFSSSSEVPGSGPADSGYLLPLPPPPPPPPPPPLTQIQLRSMNHPSSHLPK 1020

Query: 1021 PVAN----ENPAGTSSQMQFKERDYACTHHHVPPPPPPPVGNHPSTNNFMHHN------- 1076
            P AN    ENP   SSQ+QF ER+Y+ T    P  PPPP+GN P+TNNF+H         
Sbjct: 1021 PAANSLVHENPGTLSSQVQFDEREYSPT----PRVPPPPIGN-PATNNFIHQRPAAPSTQ 1033

BLAST of Spo18823.1 vs. NCBI nr
Match: gi|698476695|ref|XP_009785631.1| (PREDICTED: uncharacterized protein LOC104233870 [Nicotiana sylvestris])

HSP 1 Score: 238.8 bits (608), Expect = 5.100e-59
Identity = 167/402 (41.54%), Positives = 225/402 (55.97%), Query Frame = 1

Query: 437 SGAGLADNNNDG--NAQGSGHDVKMMKSPPGAIVQNLNHPPAVTNLSKNHEDGLILNN-- 496
           S  G  DN  D    ++G  H    MK     ++   N   ++T+   + +DG   +N  
Sbjct: 278 SSIGSLDNGVDNLLESRGGSHMEGAMKMKDLEVIDKENIKSSLTD-DLSSKDGHRPSNGV 337

Query: 497 ---TSANGEKGYAKLDNECK-SENGPSAFAEKLWDGTLQLSTSVSASVIAFFKSGEKMLD 556
               S        +LD + + S N     AEKLWDG+LQL++SV+ SV+AFFKSGEK+LD
Sbjct: 338 QHVESKKSNHSSQRLDEKSRLSSNKMPLAAEKLWDGSLQLNSSVTVSVVAFFKSGEKLLD 397

Query: 557 FSWSESVEVRGKVRLEAFEKYVQDLPRSRSRGLMVMSFCWKEGSSDAGHKGMKEVARKYK 616
            SWSE VEV+GKVRLEAFEKY+QDLPRSR+RGLMV+S C+KEGSS  G KGMKEVA+ Y 
Sbjct: 398 ISWSEFVEVKGKVRLEAFEKYIQDLPRSRNRGLMVISLCFKEGSSGKGLKGMKEVAKGYI 457

Query: 617 EGKRVGFAKLSTDIDLYICPHSDAIITILAKHGFFKGMTAVKDKSELLIGCAVWKKY--- 676
           +G+RVGFA+LS  +DLY+CP SDAIITILAK+GFFKGM AV+  SEL+IGC VW+K    
Sbjct: 458 KGERVGFAQLSPGVDLYLCPRSDAIITILAKYGFFKGMAAVEGNSELMIGCVVWRKNRTA 517

Query: 677 -------------SSFAPAKKTAEVNAPQLG-----------QNPMEAAK----SDIGQA 736
                        SS    +K+   ++   G           +N   +A     S + QA
Sbjct: 518 LTSVAKKSEGKVNSSQEQLQKSPSDSSTLQGGGQGSLPVPSVENSKPSAPVSSFSSLEQA 577

Query: 737 PTEGVQSGRIDKSPEL------FPSPQVEHAKQSMVTQNNLTSGSGASSLLSEIKTDVVL 794
            T   ++  ID S           SP  +  K+S ++ ++L    G    L  +K    L
Sbjct: 578 NTTDDKNVGIDSSSRTTLTTSGVKSPTFQQ-KESELSSSHLWGSKG--HFLEPLKESTDL 637

BLAST of Spo18823.1 vs. NCBI nr
Match: gi|697121633|ref|XP_009614795.1| (PREDICTED: uncharacterized protein LOC104107643 [Nicotiana tomentosiformis])

HSP 1 Score: 234.6 bits (597), Expect = 9.700e-58
Identity = 133/239 (55.65%), Positives = 169/239 (70.71%), Query Frame = 1

Query: 503 KLDNECK-SENGPSAFAEKLWDGTLQLSTSVSASVIAFFKSGEKMLDFSWSESVEVRGKV 562
           +LD + + S N     AEKLWDG+LQL+ SV+ SV+AFFKSGEK+LD SWSE VEV+GKV
Sbjct: 320 RLDEKSRLSSNKVPLAAEKLWDGSLQLNASVTVSVVAFFKSGEKLLDLSWSEFVEVKGKV 379

Query: 563 RLEAFEKYVQDLPRSRSRGLMVMSFCWKEGSSDAGHKGMKEVARKYKEGKRVGFAKLSTD 622
           RLEAFEKY+QDLPRSR+RGLMV+S C+KEGSS  G KGMKEVA+ Y +G+RVGFA+LS  
Sbjct: 380 RLEAFEKYIQDLPRSRNRGLMVISLCFKEGSSGKGLKGMKEVAKGYIKGERVGFAQLSPG 439

Query: 623 IDLYICPHSDAIITILAKHGFFKGMTAVKDKSELLIGCAVWKK-YSSFAPAKKTAEVNAP 682
           +DLY+CP SDAIITILAK+GFFKGM AV+  SEL+IGC VW+K  ++     K +E  A 
Sbjct: 440 VDLYLCPRSDAIITILAKYGFFKGMAAVEGNSELMIGCVVWRKNRTALTSVAKKSEGKAN 499

Query: 683 QLGQNPMEAAKSDIGQAPTEGVQSGRIDKSPELFPSPQVEHAKQ------SMVTQNNLT 734
            L Q  ++ + SD   +  +G   G +       P P VE++K       SM+ Q N+T
Sbjct: 500 SL-QEQLQKSPSD--SSTLQGGGQGSL-------PVPSVENSKPSPLSSFSMLEQANVT 548

BLAST of Spo18823.1 vs. NCBI nr
Match: gi|590578607|ref|XP_007013557.1| (SPOC domain / Transcription elongation factor S-II protein, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 229.9 bits (585), Expect = 2.400e-56
Identity = 161/411 (39.17%), Positives = 214/411 (52.07%), Query Frame = 1

Query: 518 AEKLWDGTLQLSTSVSASVIAFFKSGEKMLDFSWSESVEVRGKVRLEAFEKYVQDLPRSR 577
           AEKLWDG+LQL++SV+ SV+AFFKSGE+M    WS  VEV+GKVRLEAFEKY+QDL RSR
Sbjct: 20  AEKLWDGSLQLNSSVTVSVVAFFKSGERMPCVQWSGLVEVKGKVRLEAFEKYIQDLARSR 79

Query: 578 SRGLMVMSFCWKEGSSDAGHKGMKEVARKYKEGKRVGFAKLSTDIDLYICPHSDAIITIL 637
           +RGLMV+S CWKEGSS++G  G+KEVA+ YK+G+RVGFAKLS  IDLYICP SDAIITIL
Sbjct: 80  NRGLMVVSLCWKEGSSESGLAGLKEVAKGYKKGERVGFAKLSPGIDLYICPRSDAIITIL 139

Query: 638 AKHGFFKGMTAVKDKSELLIGCAVW---------------KKYSSFAPAKKTAEVNAPQL 697
           AKHGFFKGM AV+DK   LIGC VW               +K+SS      ++      L
Sbjct: 140 AKHGFFKGMAAVEDKQNSLIGCVVWRRNHGPSNSVKKELERKHSSSTEQPLSSHSEQKVL 199

Query: 698 GQNP----MEAAKSDIGQAPTE---GVQSGRIDKSP-ELFPSPQVEHAKQSMVTQNNLTS 757
           G+      M+ A+  +   P     G+ S  I+++  E   S  ++ A  +  +  NL  
Sbjct: 200 GKKNDMACMQPAQESLPLTPIADCIGIGSAIINRNEGENVESSDIQLALHNSPSSANLLF 259

Query: 758 GSGASSLLSEIKT----DVVLH------------------------------KPILPIPS 817
            + A S L  ++T    D V H                               P+L +PS
Sbjct: 260 ATSALSNLVGLQTSSFSDSVCHFGPKGQSSEREMSLTANSESGKPKSSLGLQNPVLSLPS 319

Query: 818 VSSKQSADFSDDDDLPEYDFRAA--VSQA---------------------SVTNTSAVAT 843
           V +K+    +DDDDLPE+DF  A  +SQ                       +  +  + +
Sbjct: 320 VITKEHIPAADDDDLPEFDFGTACDISQTPRNKVLDNAEFHKNVLVEGLKKIVGSLPLTS 379

BLAST of Spo18823.1 vs. UniProtKB/TrEMBL
Match: A0A0K9RVJ3_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_029320 PE=4 SV=1)

HSP 1 Score: 2539.6 bits (6581), Expect = 0.000e+0
Identity = 1266/1266 (100.00%), Positives = 1266/1266 (100.00%), Query Frame = 1

Query: 1    MQPPGPPRPFRGAQYDKDMFTFLPARFNRDVLDESPKKRKLTSSDHQITDPAPNVRQGIS 60
            MQPPGPPRPFRGAQYDKDMFTFLPARFNRDVLDESPKKRKLTSSDHQITDPAPNVRQGIS
Sbjct: 1    MQPPGPPRPFRGAQYDKDMFTFLPARFNRDVLDESPKKRKLTSSDHQITDPAPNVRQGIS 60

Query: 61   GASDSKLNDGSTQRKEFQNSTPSSGPRKSSKGYSDHSKYWDQNDYAEKWHRNKRGFHESG 120
            GASDSKLNDGSTQRKEFQNSTPSSGPRKSSKGYSDHSKYWDQNDYAEKWHRNKRGFHESG
Sbjct: 61   GASDSKLNDGSTQRKEFQNSTPSSGPRKSSKGYSDHSKYWDQNDYAEKWHRNKRGFHESG 120

Query: 121  SENTGYTSFSGPLQSPNHKVSSPLFPLPEARTHKTSNLAPSATENFNQHDNYESNLHVAT 180
            SENTGYTSFSGPLQSPNHKVSSPLFPLPEARTHKTSNLAPSATENFNQHDNYESNLHVAT
Sbjct: 121  SENTGYTSFSGPLQSPNHKVSSPLFPLPEARTHKTSNLAPSATENFNQHDNYESNLHVAT 180

Query: 181  HNEPNWPPKVTEDFSRCEKHRSKAHVTTHIALDWQPPLVQDSYRRETRESNAIVVTQNTS 240
            HNEPNWPPKVTEDFSRCEKHRSKAHVTTHIALDWQPPLVQDSYRRETRESNAIVVTQNTS
Sbjct: 181  HNEPNWPPKVTEDFSRCEKHRSKAHVTTHIALDWQPPLVQDSYRRETRESNAIVVTQNTS 240

Query: 241  NWAPSMARDLPQHGNYEPIAHVAAHERSNGIPPEAQDLKHHENCDANAHVVRVAQFPHPQ 300
            NWAPSMARDLPQHGNYEPIAHVAAHERSNGIPPEAQDLKHHENCDANAHVVRVAQFPHPQ
Sbjct: 241  NWAPSMARDLPQHGNYEPIAHVAAHERSNGIPPEAQDLKHHENCDANAHVVRVAQFPHPQ 300

Query: 301  ESRSYAHQRLVQPRAMASNCYEQGPKILGESQRLQQNVFEPGPVIGGLDEKCREETYKID 360
            ESRSYAHQRLVQPRAMASNCYEQGPKILGESQRLQQNVFEPGPVIGGLDEKCREETYKID
Sbjct: 301  ESRSYAHQRLVQPRAMASNCYEQGPKILGESQRLQQNVFEPGPVIGGLDEKCREETYKID 360

Query: 361  CPSNVKAQGDAGGFAGFDSAETYKDRSFSNVKQSDTDSFTAFDSNKAPGNNVSTSVYNLQ 420
            CPSNVKAQGDAGGFAGFDSAETYKDRSFSNVKQSDTDSFTAFDSNKAPGNNVSTSVYNLQ
Sbjct: 361  CPSNVKAQGDAGGFAGFDSAETYKDRSFSNVKQSDTDSFTAFDSNKAPGNNVSTSVYNLQ 420

Query: 421  SETNNLLNVLKLLSAASGAGLADNNNDGNAQGSGHDVKMMKSPPGAIVQNLNHPPAVTNL 480
            SETNNLLNVLKLLSAASGAGLADNNNDGNAQGSGHDVKMMKSPPGAIVQNLNHPPAVTNL
Sbjct: 421  SETNNLLNVLKLLSAASGAGLADNNNDGNAQGSGHDVKMMKSPPGAIVQNLNHPPAVTNL 480

Query: 481  SKNHEDGLILNNTSANGEKGYAKLDNECKSENGPSAFAEKLWDGTLQLSTSVSASVIAFF 540
            SKNHEDGLILNNTSANGEKGYAKLDNECKSENGPSAFAEKLWDGTLQLSTSVSASVIAFF
Sbjct: 481  SKNHEDGLILNNTSANGEKGYAKLDNECKSENGPSAFAEKLWDGTLQLSTSVSASVIAFF 540

Query: 541  KSGEKMLDFSWSESVEVRGKVRLEAFEKYVQDLPRSRSRGLMVMSFCWKEGSSDAGHKGM 600
            KSGEKMLDFSWSESVEVRGKVRLEAFEKYVQDLPRSRSRGLMVMSFCWKEGSSDAGHKGM
Sbjct: 541  KSGEKMLDFSWSESVEVRGKVRLEAFEKYVQDLPRSRSRGLMVMSFCWKEGSSDAGHKGM 600

Query: 601  KEVARKYKEGKRVGFAKLSTDIDLYICPHSDAIITILAKHGFFKGMTAVKDKSELLIGCA 660
            KEVARKYKEGKRVGFAKLSTDIDLYICPHSDAIITILAKHGFFKGMTAVKDKSELLIGCA
Sbjct: 601  KEVARKYKEGKRVGFAKLSTDIDLYICPHSDAIITILAKHGFFKGMTAVKDKSELLIGCA 660

Query: 661  VWKKYSSFAPAKKTAEVNAPQLGQNPMEAAKSDIGQAPTEGVQSGRIDKSPELFPSPQVE 720
            VWKKYSSFAPAKKTAEVNAPQLGQNPMEAAKSDIGQAPTEGVQSGRIDKSPELFPSPQVE
Sbjct: 661  VWKKYSSFAPAKKTAEVNAPQLGQNPMEAAKSDIGQAPTEGVQSGRIDKSPELFPSPQVE 720

Query: 721  HAKQSMVTQNNLTSGSGASSLLSEIKTDVVLHKPILPIPSVSSKQSADFSDDDDLPEYDF 780
            HAKQSMVTQNNLTSGSGASSLLSEIKTDVVLHKPILPIPSVSSKQSADFSDDDDLPEYDF
Sbjct: 721  HAKQSMVTQNNLTSGSGASSLLSEIKTDVVLHKPILPIPSVSSKQSADFSDDDDLPEYDF 780

Query: 781  RAAVSQASVTNTSAVATPISAAMKSEGSTAASIPVPNSNFQQCVEQYSEAQKPSFPTQEQ 840
            RAAVSQASVTNTSAVATPISAAMKSEGSTAASIPVPNSNFQQCVEQYSEAQKPSFPTQEQ
Sbjct: 781  RAAVSQASVTNTSAVATPISAAMKSEGSTAASIPVPNSNFQQCVEQYSEAQKPSFPTQEQ 840

Query: 841  NQIGHFQGPASQNTGQQHFHFQGLVPQSGKQYSEAHQPSFPRQDQNLSGHFQGPGTQNLD 900
            NQIGHFQGPASQNTGQQHFHFQGLVPQSGKQYSEAHQPSFPRQDQNLSGHFQGPGTQNLD
Sbjct: 841  NQIGHFQGPASQNTGQQHFHFQGLVPQSGKQYSEAHQPSFPRQDQNLSGHFQGPGTQNLD 900

Query: 901  EQSLVKVSSLPTQEQKKIENSSKPRNLFDDDDMPEWCPPDFFKSREEATTKAEALLSSKV 960
            EQSLVKVSSLPTQEQKKIENSSKPRNLFDDDDMPEWCPPDFFKSREEATTKAEALLSSKV
Sbjct: 901  EQSLVKVSSLPTQEQKKIENSSKPRNLFDDDDMPEWCPPDFFKSREEATTKAEALLSSKV 960

Query: 961  SGIANLTSPPPPPPPPPPPLTQMLHPSKPVANENPAGTSSQMQFKERDYACTHHHVPPPP 1020
            SGIANLTSPPPPPPPPPPPLTQMLHPSKPVANENPAGTSSQMQFKERDYACTHHHVPPPP
Sbjct: 961  SGIANLTSPPPPPPPPPPPLTQMLHPSKPVANENPAGTSSQMQFKERDYACTHHHVPPPP 1020

Query: 1021 PPPVGNHPSTNNFMHHNNNNPAAMSSQMQLNEREYCSFPNVPPPPPPTNNFMQHSPASTR 1080
            PPPVGNHPSTNNFMHHNNNNPAAMSSQMQLNEREYCSFPNVPPPPPPTNNFMQHSPASTR
Sbjct: 1021 PPPVGNHPSTNNFMHHNNNNPAAMSSQMQLNEREYCSFPNVPPPPPPTNNFMQHSPASTR 1080

Query: 1081 VQFGDNNPAVFSSQMQFNEREYRSVPPAPPPAPPPPPPVGNHSSAAMQFGDRKPAAISSQ 1140
            VQFGDNNPAVFSSQMQFNEREYRSVPPAPPPAPPPPPPVGNHSSAAMQFGDRKPAAISSQ
Sbjct: 1081 VQFGDNNPAVFSSQMQFNEREYRSVPPAPPPAPPPPPPVGNHSSAAMQFGDRKPAAISSQ 1140

Query: 1141 MQFNEREYHFAPHMPPPPPPANLMHHSQMQFGDTNPASIPSQMHVPPPPVNTDNFMPHNP 1200
            MQFNEREYHFAPHMPPPPPPANLMHHSQMQFGDTNPASIPSQMHVPPPPVNTDNFMPHNP
Sbjct: 1141 MQFNEREYHFAPHMPPPPPPANLMHHSQMQFGDTNPASIPSQMHVPPPPVNTDNFMPHNP 1200

Query: 1201 AARSTQMQFGDTNYSNAPHPPTSYTDNPMSDRVIHQNPAYPPGFTPNPAFRPCFDQPVCP 1260
            AARSTQMQFGDTNYSNAPHPPTSYTDNPMSDRVIHQNPAYPPGFTPNPAFRPCFDQPVCP
Sbjct: 1201 AARSTQMQFGDTNYSNAPHPPTSYTDNPMSDRVIHQNPAYPPGFTPNPAFRPCFDQPVCP 1260

Query: 1261 TRSTRP 1267
            TRSTRP
Sbjct: 1261 TRSTRP 1266

BLAST of Spo18823.1 vs. UniProtKB/TrEMBL
Match: A0A0J8BS05_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_8g193690 PE=4 SV=1)

HSP 1 Score: 1051.6 bits (2718), Expect = 7.700e-304
Identity = 643/1125 (57.16%), Positives = 727/1125 (64.62%), Query Frame = 1

Query: 1    MQPPGPPRPFRGAQYDKDMFTFLPARFNRDVLDESPKKRKLTSSDHQITDPAPNVRQGIS 60
            MQPPGPPRPFRGAQYDKDMFTFLP   NRDVL ES +KRKL SSDHQ T  A N R GIS
Sbjct: 1    MQPPGPPRPFRGAQYDKDMFTFLPPHSNRDVLAESSRKRKLNSSDHQNTVSAANARHGIS 60

Query: 61   GASDSKLNDGSTQRKEFQNSTPSSGPRKSS--KGYSDHSKYWDQNDYAEKWHRNKRGFHE 120
                 KLN  + Q KE  +S P+S PR+S   KGYSD S+ WDQNDYA   HR +R  ++
Sbjct: 61   VIPYPKLNAENLQGKEIHSSAPASQPRESLNFKGYSDQSQQWDQNDYAVNGHRKERLLND 120

Query: 121  SGSENTGYTSFSGPLQSPNHKVSSPLFPLPEARTHKTSNLAPSATENFNQHDNYESNLHV 180
            S SE TG+TSFSG LQSP+HK SS ++ L E  T KT NL P  TE+F+ H+N + N H 
Sbjct: 121  SRSEYTGFTSFSGQLQSPSHKSSSSVYSLSEIHTQKTLNLPPPVTEDFSLHENRKPNSHE 180

Query: 181  ATHNEPNWPPKVTEDFSRCEKHRSKAHVTTHIALDWQPPLVQDSYRRETRESNAIVVTQN 240
            AT    +  P +T DFSRCE   S AHV TH AL WQPP VQ+SY  E RESNA  V Q 
Sbjct: 181  ATLKGLSRSPLMTTDFSRCENRESNAHVATHKALGWQPPAVQNSYPHEIRESNANAVAQK 240

Query: 241  TSNWAPSMARDLPQHGNYEPIAHVAAHERSNGIPPEAQDLKHHENCDANAHVVRVAQFPH 300
             +N  PSM +D PQ+GN E   HVA    SNG     QDLKHHEN + NA V RV   PH
Sbjct: 241  RTNRLPSMTQDFPQNGNCESHPHVATQRTSNGPLLVTQDLKHHENREFNARVTRVENGPH 300

Query: 301  PQESRSYAHQRLVQPRAMASNCYEQGPKILGESQRLQQNVFEPGPVIGGLDEK--CREET 360
             Q+  S+AH++   P+ M S+ Y Q    LG S R  QN  +PG  IGGLDEK  C EET
Sbjct: 301  SQDFWSFAHEK---PKVMQSDPYAQASTSLGGSPRSPQNTIKPGAFIGGLDEKGRCEEET 360

Query: 361  YKIDCPSNVKAQGDAGGFAGFDSAETYKDRSFSNVKQSDTDSFTAFDSNKAPGNNVSTSV 420
            YK++CPSN+K  GDAG F+GFD A+  K  S SNV + DTD+FT FDSNKAPGNN S SV
Sbjct: 361  YKVNCPSNIKEYGDAGRFSGFDGADICKVGSSSNVTKRDTDNFTGFDSNKAPGNNDSASV 420

Query: 421  YNLQSETNNLLNVLKLLSAASGAGLADNNNDGNAQGSGHDVKMMKSPPGAIVQNLNHPPA 480
            YNLQSETNNLLNVL+LLSAASG GLADNN DGN QG  HDVK++KSPPG+ V NLNHPPA
Sbjct: 421  YNLQSETNNLLNVLRLLSAASGIGLADNN-DGNVQGCCHDVKLIKSPPGSTVHNLNHPPA 480

Query: 481  VTNLSKNHEDGLILNNTSANGEKGYAKLDNECKSENGPSAFAEKLWDGTLQLSTSVSASV 540
             TNLSK  EDG IL+  ++ GEKG  KLD++CK+ENG SAFAEKLWDGTLQLSTSVS S 
Sbjct: 481  ETNLSKKCEDGFILDKPASTGEKGSVKLDDQCKTENGASAFAEKLWDGTLQLSTSVSVSA 540

Query: 541  IAFFKSGEKMLDFSWSESVEVRGKVRLEAFEKYVQDLPRSRSRGLMVMSFCWKEGSSDAG 600
            IAFFKSGEKM DFSWSESVEV+GKVRLEAFEKYVQDLPRSRSRGLMVMS CWK+GSSDA 
Sbjct: 541  IAFFKSGEKMQDFSWSESVEVKGKVRLEAFEKYVQDLPRSRSRGLMVMSLCWKDGSSDAE 600

Query: 601  HKGMKEVARKYKEGKRVGFAKLSTDIDLYICPHSDAIITILAKHGFFKGMTAVKDKSELL 660
            HKGMKEVARKYKEGKRVGFAKLSTD+DLYICPHSDAIITILAKHGFFKGM A+K+KS+ L
Sbjct: 601  HKGMKEVARKYKEGKRVGFAKLSTDVDLYICPHSDAIITILAKHGFFKGMIAIKEKSDPL 660

Query: 661  IGCAVWKKYSSFAPAKKTAEVNAPQLGQNPMEAAKSDIGQAPTEGVQSGRIDKSPELFPS 720
            IGC VWKK SS AP  K+      QL   P E  KS   Q   + V++   +K PE+   
Sbjct: 661  IGCVVWKKTSSSAPPTKSGSSLCSQLESLP-EQGKS---QTQVDSVENNVKEKIPEILSC 720

Query: 721  PQVEHAKQSMVTQNNLTSGSGASSLLSEIKTDVVLHKPILPIPSVSSKQSADFSDDDDLP 780
            PQV+  KQSM                        L KPIL IPS+ SKQSADFSDDDDLP
Sbjct: 721  PQVDQPKQSM-----------------------ELTKPILSIPSIPSKQSADFSDDDDLP 780

Query: 781  EYDFRAAVSQASVTNTSA--------VATPISAAMKSEGSTAASIPVPNSNFQQCVEQYS 840
            EYDF+ A+SQ SV+ TS         V TP+SA  KS  S+A SI V          QYS
Sbjct: 781  EYDFKVAISQTSVSRTSEMQQIRSSHVLTPVSAEPKSV-SSAVSISV----------QYS 840

Query: 841  EAQKPSFPTQEQNQIGHFQGPASQNTGQQHFHFQGLVPQSGKQYSEAHQPSFPRQDQNLS 900
            E  KPSFPTQEQ Q GH QGPA   T  Q+F  Q          SE   PS P       
Sbjct: 841  EEHKPSFPTQEQKQTGHSQGPA---TAPQNFVLQ----------SEGKIPSLPM------ 900

Query: 901  GHFQGPGTQNLDEQSLVKVSSLPTQEQKKIENSS---KPRNLFDDDDMPEWCPPDFFKSR 960
                                    QE K IE  +   KP+NLFDDDDMPEWCPPDF K R
Sbjct: 901  ------------------------QEPKPIEKGASHLKPKNLFDDDDMPEWCPPDFMKIR 960

Query: 961  EEATTKAEALLSSKV----SGIAN----LTSPPPPPPPPPPPLTQ-----MLHPS----K 1020
            EEA  +     SS      SG A+    L  PPPPPPPPPPPLTQ     M HPS    K
Sbjct: 961  EEAAKRPGMSFSSSSEVPGSGPADSGYLLPLPPPPPPPPPPPLTQIQLRSMNHPSSHLPK 1020

Query: 1021 PVAN----ENPAGTSSQMQFKERDYACTHHHVPPPPPPPVGNHPSTNNFMHHN------- 1076
            P AN    ENP   SSQ+QF ER+Y+ T    P  PPPP+GN P+TNNF+H         
Sbjct: 1021 PAANSLVHENPGTLSSQVQFDEREYSPT----PRVPPPPIGN-PATNNFIHQRPAAPSTQ 1033

BLAST of Spo18823.1 vs. UniProtKB/TrEMBL
Match: A0A061GNW5_THECC (SPOC domain / Transcription elongation factor S-II protein, putative isoform 1 OS=Theobroma cacao GN=TCM_038158 PE=4 SV=1)

HSP 1 Score: 229.9 bits (585), Expect = 1.700e-56
Identity = 161/411 (39.17%), Positives = 214/411 (52.07%), Query Frame = 1

Query: 518 AEKLWDGTLQLSTSVSASVIAFFKSGEKMLDFSWSESVEVRGKVRLEAFEKYVQDLPRSR 577
           AEKLWDG+LQL++SV+ SV+AFFKSGE+M    WS  VEV+GKVRLEAFEKY+QDL RSR
Sbjct: 20  AEKLWDGSLQLNSSVTVSVVAFFKSGERMPCVQWSGLVEVKGKVRLEAFEKYIQDLARSR 79

Query: 578 SRGLMVMSFCWKEGSSDAGHKGMKEVARKYKEGKRVGFAKLSTDIDLYICPHSDAIITIL 637
           +RGLMV+S CWKEGSS++G  G+KEVA+ YK+G+RVGFAKLS  IDLYICP SDAIITIL
Sbjct: 80  NRGLMVVSLCWKEGSSESGLAGLKEVAKGYKKGERVGFAKLSPGIDLYICPRSDAIITIL 139

Query: 638 AKHGFFKGMTAVKDKSELLIGCAVW---------------KKYSSFAPAKKTAEVNAPQL 697
           AKHGFFKGM AV+DK   LIGC VW               +K+SS      ++      L
Sbjct: 140 AKHGFFKGMAAVEDKQNSLIGCVVWRRNHGPSNSVKKELERKHSSSTEQPLSSHSEQKVL 199

Query: 698 GQNP----MEAAKSDIGQAPTE---GVQSGRIDKSP-ELFPSPQVEHAKQSMVTQNNLTS 757
           G+      M+ A+  +   P     G+ S  I+++  E   S  ++ A  +  +  NL  
Sbjct: 200 GKKNDMACMQPAQESLPLTPIADCIGIGSAIINRNEGENVESSDIQLALHNSPSSANLLF 259

Query: 758 GSGASSLLSEIKT----DVVLH------------------------------KPILPIPS 817
            + A S L  ++T    D V H                               P+L +PS
Sbjct: 260 ATSALSNLVGLQTSSFSDSVCHFGPKGQSSEREMSLTANSESGKPKSSLGLQNPVLSLPS 319

Query: 818 VSSKQSADFSDDDDLPEYDFRAA--VSQA---------------------SVTNTSAVAT 843
           V +K+    +DDDDLPE+DF  A  +SQ                       +  +  + +
Sbjct: 320 VITKEHIPAADDDDLPEFDFGTACDISQTPRNKVLDNAEFHKNVLVEGLKKIVGSLPLTS 379

BLAST of Spo18823.1 vs. UniProtKB/TrEMBL
Match: A0A067G4W9_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g006058mg PE=4 SV=1)

HSP 1 Score: 223.8 bits (569), Expect = 1.200e-54
Identity = 143/331 (43.20%), Positives = 193/331 (58.31%), Query Frame = 1

Query: 505 DNECKSENGPSAFAEKLWDGTLQLSTSVSASVIAFFKSGEKMLDFSWSESVEVRGKVRLE 564
           +NE  S+  P+A AEKLWDG+LQLS+SV  S +AFFKSGEKM D  WS+  EV+GKVRL+
Sbjct: 50  ENEISSKRAPAA-AEKLWDGSLQLSSSVHVSAVAFFKSGEKMFDVQWSDIAEVKGKVRLD 109

Query: 565 AFEKYVQDLPRSRSRGLMVMSFCWKEGSSDAGHKGMKEVARKYKEGKRVGFAKLSTDIDL 624
           AFEKY+QDL RSR+R LMV+S CWKEGSS +G  GM++VA  YKE +RVGF KLS  +DL
Sbjct: 110 AFEKYIQDLSRSRNRRLMVVSLCWKEGSSKSGLVGMQKVAESYKEWERVGFVKLSPGVDL 169

Query: 625 YICPHSDAIITILAKHGFFKGMTAVKDKSELLIGCAVWKK-YSSFAPAKKTAEVNAPQLG 684
           Y+C  SDAIITILAKHGFFKGM A+ D  + LIGC V +K  +S + A K        L 
Sbjct: 170 YVCTRSDAIITILAKHGFFKGMAAILDYKDSLIGCVVRRKLQASTSSATKKLYCKNCSLS 229

Query: 685 QNPMEAAKSDIGQAPTEGVQSGRIDKSPELFPSPQVEHAKQSMVTQN------------- 744
           + P+    S  G +  + V+     + P +  S ++   K+S   ++             
Sbjct: 230 EKPL---NSSFGSSNHKSVEKDSSREQP-IQKSIRIAREKESSTLESTGNWGNEVKHLKT 289

Query: 745 NLTSGSGASSLLSEI-------------KTDVVLHKPILPIPSVSSKQSADFSDDDDLPE 804
           + TS  G+ S  SE+             K  +V  +P+L +PS  +KQ    S  +DLPE
Sbjct: 290 DCTSLKGSMSQSSEVEAPPIHKSSMERTKLSLVAQEPVLSLPSDVTKQPT--STLEDLPE 349

Query: 805 YDFRAAVSQASVTNTSAVATPISAAMKSEGS 809
           +DF        +T  S  A  +   + ++GS
Sbjct: 350 FDFGTGCG-IFLTPKSVHAATVDTKLLTQGS 372

BLAST of Spo18823.1 vs. UniProtKB/TrEMBL
Match: V4U0G9_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10007677mg PE=4 SV=1)

HSP 1 Score: 223.8 bits (569), Expect = 1.200e-54
Identity = 143/331 (43.20%), Positives = 193/331 (58.31%), Query Frame = 1

Query: 505 DNECKSENGPSAFAEKLWDGTLQLSTSVSASVIAFFKSGEKMLDFSWSESVEVRGKVRLE 564
           +NE  S+  P+A AEKLWDG+LQLS+SV  S +AFFKSGEKM D  WS+  EV+GKVRL+
Sbjct: 50  ENEISSKRAPAA-AEKLWDGSLQLSSSVHVSAVAFFKSGEKMFDVQWSDIAEVKGKVRLD 109

Query: 565 AFEKYVQDLPRSRSRGLMVMSFCWKEGSSDAGHKGMKEVARKYKEGKRVGFAKLSTDIDL 624
           AFEKY+QDL RSR+R LMV+S CWKEGSS +G  GM++VA  YKE +RVGF KLS  +DL
Sbjct: 110 AFEKYIQDLSRSRNRRLMVVSLCWKEGSSKSGLVGMQKVAESYKEWERVGFVKLSPGVDL 169

Query: 625 YICPHSDAIITILAKHGFFKGMTAVKDKSELLIGCAVWKK-YSSFAPAKKTAEVNAPQLG 684
           Y+C  SDAIITILAKHGFFKGM A+ D  + LIGC V +K  +S + A K        L 
Sbjct: 170 YVCTRSDAIITILAKHGFFKGMAAILDYKDSLIGCVVRRKLQASTSSATKKLYCKNCSLS 229

Query: 685 QNPMEAAKSDIGQAPTEGVQSGRIDKSPELFPSPQVEHAKQSMVTQN------------- 744
           + P+    S  G +  + V+     + P +  S ++   K+S   ++             
Sbjct: 230 EKPL---NSSFGSSNHKSVEKDSSREQP-IQKSIRIAREKESSTLESTGNWGNEVKHLKT 289

Query: 745 NLTSGSGASSLLSEI-------------KTDVVLHKPILPIPSVSSKQSADFSDDDDLPE 804
           + TS  G+ S  SE+             K  +V  +P+L +PS  +KQ    S  +DLPE
Sbjct: 290 DCTSLKGSMSQSSEVEAPPIHKSSMERTKLSLVAQEPVLSLPSDVTKQPT--STLEDLPE 349

Query: 805 YDFRAAVSQASVTNTSAVATPISAAMKSEGS 809
           +DF        +T  S  A  +   + ++GS
Sbjct: 350 FDFGTGCG-IFLTPKSVHAATVDTKLLTQGS 372

BLAST of Spo18823.1 vs. TAIR (Arabidopsis)
Match: AT5G11430.1 (SPOC domain / Transcription elongation factor S-II protein)

HSP 1 Score: 117.1 bits (292), Expect = 8.000e-26
Identity = 85/271 (31.37%), Positives = 128/271 (47.23%), Query Frame = 1

Query: 519 EKLWDGTLQLSTSVSASVIAFFKSGEKMLDFSWSESVEVRGKVRLEAFEKYVQDLPRSRS 578
           E+ WDG LQLS S    V   FKSGEK     W   VEV+G+VRL  F K++Q+LP+SR+
Sbjct: 541 ERAWDGILQLSMSSVVPVAGIFKSGEKAETSEWPAMVEVKGRVRLSGFGKFIQELPKSRT 600

Query: 579 RGLMVMSFCWKEGSSDAGHKGMKEVARKYKEGKRVGFAKLSTDIDLYICPHSDAIITILA 638
           R LMVM   +K+G S++    + EV   Y   +RVG+A+ ++ ++LY+CP     + +L 
Sbjct: 601 RALMVMYLAYKDGISESQRGSLIEVIDSYVADQRVGYAEPASGVELYLCPTRGETLDLLN 660

Query: 639 KHGFFKGMTAVKDKSELLIGCAVWKKYSSFAPAKKTAEVNAPQLGQNPMEAAKSDIGQAP 698
           K    + +  VK     L+G  VW++          A V  P  G     +  S IG   
Sbjct: 661 KVISQEQLDEVKSLDIGLVGVVVWRR----------AVVPKPGSGSKRQHSFSSSIG--- 720

Query: 699 TEGVQSGRIDKSPELFPSPQVEHAKQSMVTQNNLTSGSGASSLLSEIKTDVVLHKPILP- 758
                         + P   V   ++  VT+  L   S  +     +K D      + P 
Sbjct: 721 ----------SKTSVLP---VNKKQRVHVTEKPLVVASMRNHHHGYVKHDTAADDDVPPG 779

Query: 759 IPSVSSKQSADFSDDDDLPEYDFRAAVSQAS 789
              V+S+      D+DDLPE++F ++V   S
Sbjct: 781 FGPVASR------DEDDLPEFNFNSSVVPVS 779

BLAST of Spo18823.1 vs. TAIR (Arabidopsis)
Match: AT2G25640.1 (SPOC domain / Transcription elongation factor S-II protein)

HSP 1 Score: 115.2 bits (287), Expect = 3.000e-25
Identity = 60/149 (40.27%), Positives = 91/149 (61.07%), Query Frame = 1

Query: 519 EKLWDGTLQLSTSVSASVIAFFKSGEKMLDFSWSESVEVRGKVRLEAFEKYVQDLPRSRS 578
           E+LW+G LQLS S  +SVI   +SGEK     W   +E++G+VRL+AFEK+V++LP SRS
Sbjct: 596 ERLWEGVLQLSPSTVSSVIGILRSGEKTTTKEWPILLEIKGRVRLDAFEKFVRELPNSRS 655

Query: 579 RGLMVMSFCWKEGSSDAGHKGMKEVARKYKEGKRVGFAKLSTDIDLYICPHSDAIITILA 638
           R +MVM F  KE  S    + + EV   Y +  RVG+A+ ++ ++LY+CP     + IL 
Sbjct: 656 RAVMVMCFVCKEECSKTEQENISEVVDSYAKDGRVGYAEPASGVELYLCPTRGRTVEILN 715

Query: 639 KHGFFKGMTAVKD-KSELLIGCAVWKKYS 667
           K      +  +K    + LIG  VW++++
Sbjct: 716 KIVPRNQLDFLKSINDDGLIGVVVWRRHT 744

BLAST of Spo18823.1 vs. TAIR (Arabidopsis)
Match: AT5G25520.2 (SPOC domain / Transcription elongation factor S-II protein)

HSP 1 Score: 115.2 bits (287), Expect = 3.000e-25
Identity = 62/156 (39.74%), Positives = 94/156 (60.26%), Query Frame = 1

Query: 519 EKLWDGTLQLSTSVSASVIAFFKSGEKMLDFSWSESVEVRGKVRLEAFEKYVQDLPRSRS 578
           +++WDG LQLS++   SV   FKSGEK     W   VEV+G+VRL AF K+V++LP SRS
Sbjct: 653 DRIWDGILQLSSASVVSVTGIFKSGEKAKTSEWPTMVEVKGRVRLSAFGKFVKELPLSRS 712

Query: 579 RGLMVMSFCWKEGSSDAGHKGMKEVARKYKEGKRVGFAKLSTDIDLYICPHSDAIITILA 638
           R LMVM+   K G S +    + EVA+ Y   +RVG+A+ ++ ++LY+CP     + +L+
Sbjct: 713 RVLMVMNVVCKNGISQSQRDSLIEVAKSYVADQRVGYAEPTSGVELYLCPTLGETLDLLS 772

Query: 639 KHGFFKGMTAVKDKSEL-LIGCAVWKKYSSFAPAKK 674
           K      +  VK   ++ LIG  VW++    +P  +
Sbjct: 773 KIISKDYLDEVKCSEDIGLIGVVVWRRAVVASPGSR 808

BLAST of Spo18823.1 vs. TAIR (Arabidopsis)
Match: AT3G29639.1 (BEST Arabidopsis thaliana protein match is: SPOC domain / Transcription elongation factor S-II protein (TAIR:AT5G11430.1))

HSP 1 Score: 60.1 bits (144), Expect = 1.200e-8
Identity = 32/65 (49.23%), Positives = 40/65 (61.54%), Query Frame = 1

Query: 526 LQLSTSVSASVIAFFKSGEKMLDFSWSESVEVRGKVRLEAFEKYVQDLPRSRSRGLMVMS 585
           LQLS S    V   FKSGEK     W   VEV+ +VRL  F K++Q+LP+SR+R LMV  
Sbjct: 5   LQLSMSSVVPVAGIFKSGEKAETSEWPAMVEVKRRVRLSGFGKFIQELPKSRTRALMVYK 64

Query: 586 --FCW 589
             FC+
Sbjct: 65  AFFCY 69

The following BLAST results are available for this feature:
BLAST of Spo18823.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match Name                        E-value   Identity  Description
gi|902233130|gb|KNA22934.1|       0.0e+0    100.0     hypothetical protein SOVF_0293...
gi|731353158|ref|XP_010687913.1|  1.1e-303  57.1      PREDICTED: uncharacterized pro...
gi|698476695|ref|XP_009785631.1|  5.1e-59   41.5      PREDICTED: uncharacterized pro...
gi|697121633|ref|XP_009614795.1|  9.7e-58   55.6      PREDICTED: uncharacterized pro...
gi|590578607|ref|XP_007013557.1|  2.4e-56   39.1      SPOC domain / Transcription el...
BLAST of Spo18823.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniProtKB/TrEMBL)
Total hits: 5
Match Name        E-value   Identity  Description
A0A0K9RVJ3_SPIOL  0.0e+0    100.0     Uncharacterized protein OS=Spi...
A0A0J8BS05_BETVU  7.7e-304  57.1      Uncharacterized protein OS=Bet...
A0A061GNW5_THECC  1.7e-56   39.1      SPOC domain / Transcription el...
A0A067G4W9_CITSI  1.2e-54   43.2      Uncharacterized protein OS=Cit...
V4U0G9_9ROSI      1.2e-54   43.2      Uncharacterized protein OS=Cit...
BLAST of Spo18823.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy SwissProt)
Total hits: 0
Match Name  E-value  Identity  Description
BLAST of Spo18823.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 4
Match Name   E-value  Identity  Description
AT5G11430.1  8.0e-26  31.3      SPOC domain / Transcription el...
AT2G25640.1  3.0e-25  40.2      SPOC domain / Transcription el...
AT5G25520.2  3.0e-25  39.7      SPOC domain / Transcription el...
AT3G29639.1  1.2e-8   49.2      BEST Arabidopsis thaliana prot...
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR Term   IPR Description                                 Source   Source Term     Source Description                    Alignment
IPR012921  Spen paralogue and orthologue SPOC, C-terminal  PFAM     PF07744         SPOC                                  coord: 522..628, score: 1.6
None       No IPR available                                PANTHER  PTHR11477       TRANSCRIPTION ELONGATION FACTOR S-II  coord: 504..759, score: 6.0E-51; coord: 1130..1209, score: 6.0
None       No IPR available                                PANTHER  PTHR11477:SF15  SUBFAMILY NOT NAMED                   coord: 504..759, score: 6.0E-51; coord: 1130..1209, score: 6.0

GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006448      regulation of translational elongation
cellular_component  GO:0044434      chloroplast part
cellular_component  GO:0005840      ribosome
molecular_function  GO:0003824      catalytic activity
molecular_function  GO:0003746      translation elongation factor activity
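
The HSP summary lines in the BLAST sections above (bit score, E-value, identity and positives counts) can be read programmatically. The following is a minimal sketch, not part of the database's own tooling: the function name parse_hsp_summary and the dictionary keys are hypothetical, and the regular expressions assume the two-line layout shown on this page. It recomputes each percentage from the matched/length counts so it can be compared with the reported value.

```python
import re

# Pattern for lines such as:
#   "HSP 1 Score: 1051.6 bits (2718), Expect = 1.100e-303"
HSP_RE = re.compile(
    r"HSP\s+(\d+)\s+Score:\s+([\d.]+)\s+bits\s+\((\d+)\),\s+Expect\s+=\s+(\S+)"
)

# Pattern for the "Identity = a/b (p%)" and "Positives = a/b (p%)" fields.
# "Posi?tives" also accepts the misspelling "Postives" seen in some exports.
FRACTION_RE = re.compile(r"(Identity|Posi?tives)\s+=\s+(\d+)/(\d+)\s+\(([\d.]+)%\)")


def parse_hsp_summary(score_line, identity_line):
    """Parse one HSP's two summary lines into a dictionary (hypothetical helper)."""
    m = HSP_RE.search(score_line)
    if m is None:
        raise ValueError("not an HSP score line: %r" % score_line)
    result = {
        "hsp": int(m.group(1)),
        "bit_score": float(m.group(2)),
        "raw_score": int(m.group(3)),
        "evalue": float(m.group(4)),
    }
    for label, matched, length, pct in FRACTION_RE.findall(identity_line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = {
            "matched": int(matched),
            "length": int(length),
            "reported_pct": float(pct),
            # Recompute the percentage from the counts as a consistency check.
            "computed_pct": round(100.0 * int(matched) / int(length), 2),
        }
    return result


if __name__ == "__main__":
    hsp = parse_hsp_summary(
        "HSP 1 Score: 1051.6 bits (2718), Expect = 1.100e-303",
        "Identity = 643/1125 (57.16%), Positives = 727/1125 (64.62%), Query Frame = 1",
    )
    print(hsp["bit_score"], hsp["evalue"], hsp["identity"]["computed_pct"])
```

For the Beta vulgaris hit, the recomputed identity (643/1125) and positives (727/1125) agree with the reported 57.16% and 64.62%.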