Spo13795 (gene)

Overview
Name: Spo13795
Type: gene
Organism: Spinacia oleracea (Spinach)
Description: Endonuclease/exonuclease/phosphatase family protein
Location: SpoScf_01089 : 72991 .. 94699 (+)
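The Location field packs scaffold, coordinates, and strand into one string. The following is a minimal sketch of how it could be split into parts; the function name `parse_location` and the assumption that coordinates are 1-based and inclusive are illustrative and are not stated by the record itself.

```python
import re

# Assumed layout (inferred from this record only):
#   "<scaffold> : <start> .. <end> (<strand>)"
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location string into scaffold, start, end, strand and span length."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    scaffold, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "scaffold": scaffold,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }


location = parse_location("SpoScf_01089 : 72991 .. 94699 (+)")
print(location["scaffold"], location["strand"], location["length"])
```

Under the inclusive-coordinate assumption, the gene spans 21,709 bp on the forward strand of SpoScf_01089.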
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CACCCATGTCGCATTTCCTTCTTTCAATAAAAAAAATGTGATTTCCAATTAAAAAAACTTTTCAAAATTCAAATGTTCTCTCTCCTTCAACCCAAACTCCCTTCTTCCTTCCCTCGTCTTCTTCAACTTAAACCCACCAAAAACCCTAACAAAATTCTTCAAATCCCTCATTTCCAATTCCAATCTCTCTCCTCATTTTCAACAACAACAACAATGTCTTCTTCTTGGTCTTGCTCTAAATGCACTTTCCTCAACCCGGGTTCTCAAAAATCAACCTGCCAAGTATGCCAAACCCCATTTTCTCTCTCCTCCACGTCATCGTCTCCTTCCTCGCAACCAAAATGGGCCTGCAAAGCCTGCACATTTCTCAATTCGTATAGTCTTTCCAATTGCGAAATTTGCGACACTCGTGCTTCACTATCGAATTTTCAAGACTTGGATTCGAATGACCCGGATGATGATCTTTCCGGGCCTTCTTCTGTGGGTTCGGTTTTCTTCCCATTGCGCCGCTGTAATACGGGTCGGGTTCATGAATTGGATGATTTGGGTGTGAAAGAAAAGGTTGAAACTCAAATGGGCACTGATCAATTTGTTAATCAAGGTGTTCAATTGAGTAGGCTGCAAAATTCTGGTTTTAACAAGCGCAAGATAGAGGACTTACTCTCTGGTTAGTAACTTAATACCAAGTTTTGTGTTACGTTCTTATTTTAATTTTAATTTTAATTATTTTATGATTAGGATGTTATTTTTGGTGTGCAATGAGCCATAAGCAATAACAAATTAAAAAAATTAGGAAGGGAATTTGAACGTGTGACCTATCGATTAAGATCTTTGTGCTAGATGAGTAGATGGTGTGTGGAGTGTTATTCAGTTCATGTTCTGTTCACCTTATTTCCTAGGAAAAATAAGTTCAGATAAGTTAAGTCAAATTCATATAAGTTCACATAAAACAAGGGTTGATCAAGTACAGTTTATAACTAAAATAAGGGTTGATCAAGTACAGTTTATAACTAAAATAAATTTTGATAAATTTTAATAAGTTCAGTTAAATTCAGATAAGATAAGTTTAGAAAACATTTATATATTCATAATGATACCATGAGTGCTTCAGGGCGTCCTCAAGATATGTTTTTCAAAAGCACCAGTTTGACTTGGGGAGGTAGTGATTTAGTAGCAGTGGGTGGAAAAGTAGCCTTGTTACCCATTTCACTAGGGACCGCAGATTTTTTAGTCCACCAAATTCTTGCTTTTACAATTCATGTGACGGTACTAATACTTCTAAAAGGCGTTCTATTTGCTCGTAGCTCTCGTTTGATAAAATTAGGTGAATAAAACGCAACTAGTATGTCACTACGTGAAATTGTCATTGACTCATAACTCATTGCTCAATGTTGTGTGTATTTTTGGTTTAATGTAAGGTAACAAGGTGAACAAGTTTATCACTTTTAGGTAGCAAAATTATTTTCGTATTAGCATTTAACCTAATTCTTTAAACCCTAAACTTTGGTACCGCCAACAATGTCAAAGAAAAAGGGGGGAAAACAGTTAGACAATTCCACGCGGTTCATGTGGGTACAATATTAAAAAGTAAAAACCCTAACTCCCCCATGTTTTTCTATAAATCCAACCTTTCAATATTTGATTTTCTCTCTCTAAAATCCAGATGATCTTTCTTTTTCTTTTCCTTTCACCATTATAGTTACAGAGATTGAAGATTAGAAAAGAGAAGTGGTGGGAAAGGAGATGAGTCTACAGGAAGTGGTAGGGATGGTGTGTTTGGGAAGGACTATGGAGGAATCCAATTCCCCTTTTTGAGGGTTGTTGGAATGGAATTCCTTCCCAAGACCCCTCATCCCAATCATTTTCTGTGGGTTTTCCCTTTCCCAGACTTTTCCTATTGTTTTGTTTTCTAAATTGTTCCCAAAGGTATTAACTTTTTATTGTTTAATCAATCACAAGATGAATGAATATATGTTTTTCTTTGATTTATTTGCCAAAGT
ACAAATCTTTTTGGTTGATTTTGATGTTTGGGTAGTTTCTTGATGGTTTTTTCATGTGATTGATCTCTGTTTTATTTATTTATGTGATGTACGTATCCAATGTTGTAATTAATCTTGTTATAAGATGTGTATTAAGTTGTATTACCATTTGAATTAATTTCGAGGCGAATTCTTAGTTTTTAAACGAGATTTCTTGCCAAATTGTCTCCTAACATGTGTTTAGAGTAACTATAGTGTCTAAGGGAATTGAGGGGTTGATTCGGCTGCAATCCTGCAATGTTAGTTTTCAAGTATATGAATTCTCTGAAGTTTGCAGTATTGACTATTAAACTTACACTAAACTTGTAAAATGGATTCCTTTGACTACTTGTAAAATGGATTCCTTTGACTTCTTTATGTTTCTGTTAAGATAGAATATAGGAGATATTATATTGTGAATATTCTGTAGAGTCCTAAATATGGTATGTTACCTAATATGAGTCCTACATATGGTATGTTACCTTATATGTAATCTAGGGTTTAGGTAATGAGTTGTACTATATATATACGTGTGTTATTAATGAGAAGGATACGAGATTACACAATATTTGACATGGTATCAAGAGCTTAGGGAGATCCTACGACCCGTCTACAAAAAAAATTATCCCCTGCCGGTGGGTTAGAAGAATAAATCCACCGGCGGGATTCGTAATTAATATAATAATTCCGCTGCAAATTGGTAGAAGATGAGGTTGGTGTTGGAGAAAAGACTCGTACTTGGAATCGCGTCCAATTCGAGTATTCATATTGAGATAATTGGTGAGTTCGAAAAGAAGGGCGTTCTCATCAAGAGAGAGAAAAAAAAAGTGCAGGTTCGAAAAGAATGGCGTTCTTGCTCGATTTTTTTTTATTGAAGAATGGTGTTTCCAATCGCAATGTTCGTGAAACTGCCGTTACGATTGAGGGAGGGAGTTAAGGATAACGTAGTTATCCTAGAGGATAGAATATCATATGAGATAGAGTTATGTTATATGATATGAGTGATACTGAGGGAGTGTCGTCACCTACGGGTGATACTGAGGGAGTGTTGTCGCATACAGGTGACAATGAGGACGTGTCGTTGACTAGTGGCGTCGACGAAGGGGTGTCGTCGCCTGTGAGGGAGGAACAACAACATTATACACCCAGTCGAGTGGAATCAGACGTCGTCTCAGCAACGACAGGTGTCGATGAGACGACGTGTAGTGATGAGAGTGAGACAAGGCAGTCATCAGCCCGTCGTATGTGTCGACGAAGGTTCAGTTAGCAGATATCGTCACTAAAGCGTTGGGGAAGAAGCAATTTGAGTTTTTTTTTGAACAAGATGGGCATTCGAGATCTTCATGCTCCACCTTGAGGGGGGATGTTAAGATAGAATATAGGAGATATTATATTGTGAATATTCTGTAGAGTCCTAAATATGGTATGTTACCTAATATGAGTCCTACATATGGTATGTTACCTTATATGTAATCTAGGGTTTAGGTAATGAGTTGTACTATATATATACGTGTGTTATTAATGAGAAGGATACGAGATTACACAATATTTGACATGGTATCAAGAGCTTAGGGAGATCCTACGACCCGTCAAAAAAAAAAAAAAATTATCCCCTGCCGGTGGGTTAGAAAAATAAATCCACCGGCGGGATTCGTAATTAATATAATAATTCCGCTGCAAATTGGTAGAAGATGAGGTTGGTGTTGGAGAAAAGACTCGTACTTGGAATCGCGTCCAATTCGAGTATTCATATTGAGATAATTGGTGAGTTCGAAAAGAAGGGCGTTCTCATCAAGAGAGAAAAAAAAAAAGTGCAGGTTCGAAAAGAATGGCGTTCTTGCTCGATTTTTTTTTATTGAAGAATGGTGTTTCCAATCGCAATGTTCGTGAAACTGCCGTTACGATTGAGGGAGGGAGTTAAGGATAACGTAGTTATCCTAGAGGATAGAATATCATATGAGATAGAGTTATGTTATATGATA
TGAGTGATACTGAGGGAGTGTCGTCACCTACGGGTGATACTGAGGGAGTGTTGTCGCATACAGGTGACAATGAGGACGTGTCGTTGACTAGTGGCGTCGACGAAGGGGTGTCGTCGCCTGTGAGGGAGGAACAACAACATTATACACCCAGTCGAGTGGAATCAGACGTCGTCTCAGCAACGACAGGTGTCGATGAGACGACGTGTAGTGATGAGAGTGAGACAAGGCAGTCATCAGCCCGTCGTATGTGTCGACGAAGGTTCAGTTAGCAGATATCGTCACTAAAGCGTTGGGGAAGAAGCAATTTGAGTTTTTTTTTGAACAAGATGGGCATTCGAGATCTTCATGCTCCACCTTGAGGGGGATGTTAAGATAGAATATAGGAGATATTATATTGTGAATATTCTGTAGAGTCCTAAATATGGTATGTTACCTAATATGAGTCCTACATATGGTATGTTACCTTATATGTAATCTAGGGTTTAGGTAATGAGTTGTACTATATATATACGTGTGTTATTAATGAGAAGGATACGAGATTACACAATATTTGACAGTTTCCCTTAACTCTGTTTTCTGGCCAAATTTTCAACAGAAAGGCACAATCTTAGCCTGCAAAATAATGTTGTAATCATAGTATAACTAGTTGCTTAAGTGCATTTATCTATATTGCTCGTACTCGAGAACTTTTGATATAACTCGGCTATTTGATGAATATATTCCCTGATTTAGGTCAAAATGGGTGTTGAAGTGTCCATATTTCCATACCCATTGTGGAGTGTCAATAATAATCTATCTTAAATGGAGAGTTTGAGTAGCTTAGGTAGTAAACTAGTAATTGATATGTTATAGTACTCTGTCTGACTGATGTATGTGATTCATTCATATGTCTTTGAAGGTCTAATACTGGTCCTGGATTGGTATAAAGCTGGTTGTTGTGTTTATTGTCTATCAATTGAAGTAGTGGAGAGAAAGAAAGACTTTTTTTTTTTTTGGCAATTAGACTTGAGATTTATTACCTAGGGATGAGAAACGAACTGTACCTTAAAAACACCAAAACACAAAAAAGCTTCAAGTTAATACCCCACATTCTAGGAAGGGTTGAAGACAAACAATCATAAGCAAGTATACAATAAACACAACTGCCTCTCTACACCTCTCCTACAAAAAGTCTCTCAATTTATCATAACACCTACAAGCTACTTTAAACATAATGCTATTTACCACGACTCAGTTCCTTCACAGTCTCCATTAATTTTTTTTATTATTTTGTTGCATTCAAATGCATAGACAGCCTCAACAAAACACATGCCATGTAGCTTCTGTCTACTCTGTTTCCCTCGGCAAGATCTAGTAGCGTAAAGAATCTCAGTCTCAAAAGGAAGAGAGACTTCAATGTGCTAAAACGTTTTTTTTCCTCTTTTTGTTCCAATAATGTCTTTATGGACTTGATAAAACACCTTTAAACTACAATTAGCATGATCGTGCAACTTTAGACTACTTACTTTGTATGCCATGAATTTTAGCAACAGAAGGTCTTGTATGGCTCTCAAACATCATAGTTTCAAGCAAATGACTATGGGTTGATTTTGAAGTTGCATTATTCTTGTATTTGATGTTTTACTTGTTGAAAGAAATATTTTACTTCACCCAAGCAAAGTGTGCTACTTAAATTTAGTAAATGAGTTCAATTTACATGTATATTGAACAATTTGTTTAATCTATTTTCTGAATAGACAAGGAGAAAAAAGTTTCTGCAATTATGCACTCACTGGATTTCACACTGGCCTTTAACATGTACTGTTAGATCACAGACCGTGGTTTCTTATCTCATCTTGATGTAATGATGTACACTGTCACTCTTATGGCTTCACTCATATGCATTAGGCAAACGTCCAGCTTTAACCCCGCACGATACACAGTCACGTATGGCAACCTTCATTCTTATAGATATACTGCACGTAACTGCTTTGATATGACTGCCTGGGCGGAAGTTTGTT
CTTTGCTGAACACCTCTTACCCTAAAGTAATTGCTAGTTGCTGGAAAGTCATCTTTATTAAAACCACTCTTGGTCTCCAAAACCAAATCAAAAAAGATAAATCAATATCATATGAACATGAAAGAAAGTTCTCGCCTGTGTTGTGATGTATTTTAGGTAATTAAGTCTTATTTGCACACTGATTCAGTTCTCATGCAGTAATGATGTTCGGATGGAAGTAGAAGTTTAAGTTGTAGTTCCTTCTGGATGTTTCCTCTTTCCTGCAGTATTAGGCAAGAAATTTCCATGCCACACATTAGATTTTCCTGTATTTTTTGTCAACTAAACATTTCTTCTTAATCAAATTACCACCAAGCCTGTCATCTCACTCACTAATTTTCATTTACAAAGGACTGAACAACAAAATTTTAGGCCCCTGCGTTGTAAAAGTACTGAACCTTTTGTAATCAATAGTTCAATACTAAACAAGGTAAATGTATGACTTATGTAGTTATGTACACTATGTGAGTATTACCCTTATATTATCTAAACTTAGGTTCTGTGACCATACTGGGTGAGAATGCTGACGCTGTTAGCTGTTTTACTACGAAGAGAGAGAAAAATGATGTAGTATTGAAAGGTTTAGCATCAGCAAAGATTGCTGATAATGTTGAGGACAGTGTCACATTTAAGAGGAAACTCCGTGATTTAAGAGGAAACTCTGTGACTTAGGTGATTTTATTTAGAAACCCATCAAAATAGATGGCTTTGGCGAGGTTAAATCTTTCAGGAAGGCTGTTTCATTTGTAGGTTAGTATGAACTTTGTAGGTTGCTTAAAACTCATGAAAATTGCTTCTTTTCACTTCTAATCATCGCGGTCAGTTTCTCTTATCTACATCTGAAATTTGAATATTTCAATAGTCACAAATATTTCGATGGAAGTCATGGTGTTAAAAAGTGTGTTGTCTTACATTGGTGCAAGGGTAATTCTTACCGGCCACTTACTTGCTTTTCCTATGCTCCCCTCAGTTCATACAATGCAACCATGCAAATCCTAGCGAATATACTTCATTTTGATACCAACTGTAACTGTGGCGTGGATTCGGTATAAAATTGATATATGATATTGATTTTTGTTTCAGATTCTGTATATAGCTTAGACCAAGAGAAGGTTGGAAAGGCCAAAATAGATAGTGAGTCGTTTTCCGAGGAATTCACTGAAGATGCTCAACTTGAAGAAGTTTCAGGTGTGATAAAGATCCTTTCTTACAATGTCTGGTTCAGAGAAGACTTAGAGGTGCATAAGAGGATGCAAGCAATTGGTGACCTAATTAAGCTGCACTCTCCAGATATTATTTGTTTTCAGGTTTCATTTTGTGCACTTCACTATTTCATAACAAGGCTGTGTTGTTCATTTCTTGCATCTGTAATTTGTTTCTCTTTGGATTTTTAGGAGGTTACTCCTGGTATCTATGACATTTTCCGGAGATCTACGTGGTGGAAGAAATATTGCTGTTCTGTCTCGTACGAAATGGCAGCCTCTGGGGCGTACTTTTGCATGCAGGTGATTTATTTTTCACCTTTCCAACAGAAATGGTTTTACGTTACAAGTTTACGGTTATGCTAGCAGAATAGAACTCGGAAATGGTTTGCCATCTTTTTTTCTTCACAAGTTCCATTTTTATCTTACCATATGCTGGCATGCTGCTTTGGATTGACCTTACATTAACTTTTATTTGATGCTCCTTTATTCAGCATTGTTTGTTTCAATACTCGTGACCCAAAAGTCCAAAACCTGCTAATGACCCCCTCCCTCCCAGAATTTAAGGTCTAGAGCACAAAATTCTACCCGAAACCTATAATTCCTTGTTAAATGTCGCTGATTAATGGTTTGACCCTAAACGATTAGAAAAATGACTTATACTGTGCCTGAATATGCCCGACCCAAAACCGACCTGATTGACTGATTTGAGTCTTGACTGGTAGAATTTATGCGGTTTCCATGTGTTGTGACATTGTA
ACCATCTGATTCGTAAGCTTTTCTGGATCTTGTATCGTTTAGTAGCTTATTTTTTTCTTTGGTAAGTAAGATGAATTTTTATTAGCATTAAGCACAAAACAGAGTACTTGTGGGGAACAACCTTCTAGGAGGGACTCTTCCATCCCCAAGTCTCAAAATACAACAATAAGACCCTCTTCACACAAAACTAACACATGACCAACAATACAAACCATGCCTGGTCAGACTCGGACATTTTCTTGCTGACCATACTCCTAATTCTAACTGTCTTGTCTTTAGTAACAGCACCAACACTAGTCACCTTATGTTCCCACACAGCCTTGTTCCTGACTGTAATGGACCATACCAACACTGACCCACCTATGTTGGTCTATTACAGGCGCAACAGAAGAGGGGAGAGGCACCACAACCACAACAACAACACAGCTACAGCAGGGACAGCAGTTGGCACAAACTGACAAAGAAAGCAGAGGAAGCATTAATGTTTTGTACTTGTTGTTTACTTCTTAGTTTGTATAAATAAGAGAGGAGAGGAAGGGAAAGGGATGGAGAATTTATTGGTGAATTGGAGAGTAGTAACCTCAAGTTACTTACCTGAGAGTTGTAGCCTAAAGCTACTGCATTTTGTAATCCTTGACCGTTGAGTCCTAGTAATCAACAACTCCTTTCTCTATTCCTGTTTGTGTGTTGTGTGTGTGTGGTAAAGTCAGTCCAGGGCTTTACACTTGGTATCAGAGCAGCCACGTCCTGAGGAAGAAGGGTAGCCATGGTGCTTTCAAATTCGCTGAGATTTGAGGCCTTAGAAGGAGGGCTCACAGAAGTAGGGCAACGGATGAACACCTTGGAAGGGAGGGTGCAGCAGCTCTTAACAGAGAGTATGGAGACCTTCCAAAAGAATCTTGCTGAGCAATTCATGGCCAAGAGCGCGGAGGACACACAGAAAAATCAAGAAACGGTAGATGCAGCTACCATCCGCCTAGAAGGCAGAATTGATAGATTTAGAGAGGACCAAGAGACGCAACTGGCGTTGATGAAAAGGAACCAGGAGAAATTTCAAGAAGAGGTGAAGGCTTTATTGGCTAACAGACCCCCTGCTCAGACCTTGGAGGAGGAGGAGTACTCGGAAAATCAAATGAACCCACCGAGGCGGATTGGCAGGAACTTTGGGAGAGGTGGGTTCGACGATGGAGGTGGGGGGAACGGCGGTCACTGGAAGTATCGGAAATTGGATATGCCTCTGTTTGAAGGTAGCGATCCAGACGGCTGGATTCTTCGTGGTGAAAAATACTTTGACTTTTACAAGTTAACAGAGACTGAGAAAATGGATGCCGCGGTGGTGTCAATGGAGGGGGATGCGCTGAGGTGGTTCACGTTTGAATCACGCCGGAGACCGATGAGAAGCTGGCCGGAGTTGAAAGCTCGAGTTTTGGCAAAGTTTCGCCCCACTAATGCTGGAACTTTGCACGAACAATGGTTGGCGACGGCTCAAACCACCACGGTGGCTGACTACGTGAGGCAGTTCATTGACCTAGCATCTCCTCTAGATGACGTTCCAGAGAGTATAATGCTTGGGCAATTTGTGAATGGGCTTAAAGATGATATTAAAGCTGAAATTCGGGTTCTAAACCCGTATACACTGGATGAAGCGATGGATCTAGCTACTCGGGTTGAAGAACGTAACCGGGTACGAAGAAGTGGGTATGGGAATAACAGGAGTGGACAATTCTCATATTTCAGTAAAAGCCCAATAGCCACTAACCCAACACCAACAACCTCAGGGGCCCAGAACCATGTTTTAAACCCATCCCAGTATAACCCACTGTCAAAAAACACGTACCTCAAACCAGTTAACTCAAGTGGGTCCTACGGATCCTCCCCAAACACCAACCCACCTCGTTCTGCTACAGTAAACTCGCGATTTTCCGGCATTGGAGAAACAAGACGATTGACTGAGCGTGAATTACAGGAAAAAAAGTCTAAAGGCTTATGTTTTAGGTGTG
ATGAAAGGTGGAGCATCGGCCACCAGTGTAAGAAGAGGGAGATGAATGTTTTGTTGGTGGATGAGGAAGACGGTGGAGAGGAGCTGACCGGAGAGGAAGTCGGTGGTGAGTTTTATTCGCTGGAGGAGCAGAGAAAGGAGGAGATACCGACAACACTCTCAATTAATTCGATTGTAGGCCTTACCAACCCGAAGACACTTAAATTGGTCGGGCGCATCGGAGACGGCGAGGTGGTGGTGATGGTGGATCCCGGAGCCACCCACAACTTCATTTCGCTCCGGGCGGTGGAAAAATTAAAGGTACCTGTTATGGAGTCAATGGGGTTTGGAGTGTCACTTGGTAATGGTGAGGCAGTGAAAGGTAAAGGAATTTGTAAGCAGGTGAAACTGAAATTGAACTCAGAAATTGAAATAGAAGAAGATTTCTTGCCTCTGGAGTTGGGAAATTCGGATGTAATACTGGGCATTCAGTGGTTAGAGAAGCTAGGACCGGTGATCACCAACTGGAAAACACAGGTTATGAAGTATCAGGATAAGGGTGTGACTGTAACATTGAAAGGGGATCCGTCACTAGCTAGCTCAAGGGGAGGAGAGCCCATGATCCTATATAAACCCCACATCAGCCACCCATATGACCGATGTGGGACTTAAGTCCCATACCTTACAATGGACCGTAACATTAATCTCCTTGAAGGCAATGCTGAAAGTAATCAGGAGAGAAGGGGGAGGGATTTTACTGGAGTTTAACCAAGTGGAGGAAATCACCAGTGAAGGGCCAACTGTACCCTTGTTCTTGCACGAGGTTCTGGGAAAATTTCAAGGCATCTTTGGGCTTCCGAAGGGGTTACCGCCGAGCCGAGGGCATGAACACTCGATTGTCATGAAGGAAGGTAGTGATCCAGTTGGTGTTCGACCATATAGATACCCTCAAGTGCAGAAGGATGAGATAGAGCGGCTAATCCGAGAGATGTTAGAAGCCGGAATCATCAAGCCATCCACCAGTCCCTTCTCGAGTCCCGTGTTGTTAGTTAAAAAAAAAGACGGCTCTTGGCGATTCTGTGTGGATTACAGGGCCCTCAATAAAGAGACCGTTCCTGATAAATATCCAATCCCCGTAATTGATGAACTCCTTGATGAGTTGCACGGTGCTAAGGTGTTCTCAAAGCTAGACTTGCGAGCCGGTTACCATCAGATTCTTGTGAAACCGGAAGATACACACAAGACGGCCTTCCGCACCCACGAGGGTCACTATGAATTCTTGGTGATGCCGTTCGGCCTCACCAACGCTCCGGCGACGTTCCAGTCACTTATGAATGAGGTATTCCGACCATACTTGAGGAAATTTGTTCTTGTCTTTTTTGATGATATATTGGTATATAGTTCTTGTGAAACTGACCATGTGAAGCATATGCAATTAGTGTTGTCATTACTGGAAAAAAATCAGTTGTTTGCGAATTTCAAAAAGTGTGAGTTCGGGAAGGAAGAGGTAGCCTATCTCGGGCATGTAATATCCAGCCATGGTGTGTCTGTAGACCAAGAAAAAATCAGAGCCATGAAAGAATGGAGCACACCGAAAAATTTGAGGGAGCTTCGGGGTTTCCTAGGTCTCACGGGTTATTACCGAAAGTTTATTGCCGGTTACGCCCACATTGCACATCCCCTCACTGAACAATTGAGGAAGGACAATTTCGGGTGGAGTGAAGCAGCCACACAGGCTTTTGAAGAGCTGAAGAGAGCCATGACTACCGCCCCTGTACTAGCTATGCCAAACTTTAGTAAGGAGTTCGTGGTGGAGACAGATGCGTCGGGCTACGGATTGGGGGCTGTGATGATGCAGGAAGGCCGCCCAATCGCTTATTATAGTCGTGTGCTCGGGCCAAGAGCGAGAGGGAGGTCCATATACGAAAAAGAACTAATGGCAATCTGTTTGGCCGTGGAAAAATGGAAGCATTACCTCCTTGGGAGACACTTTGTGATCAGAACCGACCAGCAAAGTCTACG
GCACATCACCCAACAGCGAGAAATCGGGGCTGATTATCAAAAATGGGTTCGGAAGTTGATTGGCTTTGATTTTGACATACAATATAAACCCGGATCATCTAACCGAGTGGCTGATGCACTCTCGAGGAAAGGAGAAGGAGAGCTTGAAATTGGCACACTGGAATTGGGTGCATTGCTGACTTCACAGGGGATAAATTGGGATACATTAGAGGAAGAAATCAAGGCAGATGAAGGGCTGGGAAGGATTAAACAGGAGCTTAGGGAGGGAAAACAGTTGGCGGGGTTCACGTTGGTGGGTGAGAAGTTGTTGTATAAAGGGCGAACAGTGTTGTCACACCTCTCCGCTTTTATTCCCCTACTGCTTCGTGAGTATCATGACTCACCCGTTGGAGGGCACTCCGGTGAGGTTAAGACTTACCTCCGTCTCACCTCGGAGTGGTTTTGGCGGGGGATGAGAAAAAGTGTAGCTAGGTATGTACGGGAGTGCGGAGTATGCCAACAAAACAAACACTCGCAACAGCAACCCGCTGGCTTGTTGCAACCATTACCTGTACCCGTGGCGGTATGGGAAGACATTAGCATGGACTTTATTGAGGGGTTGCCTCTCTCCAAAGGGGTTGATACAATATTGGTTGTGGTTGATAGATTGTCAAAGTATGCTCACTTCATTGTGCTTAGGCACCCATTCACCGCATTGTCTGTGGCAGCAGCTTTCATTCGTGATATTGTGAAGTTGCACGGCTTCCCTGCTTCAATTATCTCAGACAGGGATAGAGTGTTTCTCAGCATTTTTTGGCGTGAGTTGTTCAAATTACATGGCACCGATTTGAAGAGGAGTACTTCGTACCATCCCCAAACGGATGGACAATCGGAAAACGTTAACAAGGGGCTGGAAACATATTTGCGTTGTTTTGCGGGTGAGAAACCGAAGGAATGGGCTAAGTGGTTATCGTGGGCTGAGTACTCTTACAATACCTCACCTCACATATCCACTAGCATGACCCCTTTTAGAATTGTATATGGCAGGGATCCACCACGTCTAATGAGGATGGAGGCAAGGCAGACACCAGTTGATTCCTTGGATGAGTTAATACGTGAAAGAGATGGAGTTTTAGATGAACTCAAACTCAACCTGCTGCGTGCCCAACAAATCATGAAAGATAATGCTGACAAAAGAAGAAGGAATGAGAGTTTTGAAGTGGGGGACAAAGTCTTTCTGAAATTACAACCATATCGGCAACGTTCGTTAGCCAAGAGGCCCAATGAGAAATTGTCAGCCCGCTTTTATGGCCCATTCGAGATTATACAGAAGATTGGTGTGGTCGCCTATAAACTCAGGTTACCTCCCACCAGTAAAATTCACCCGGTATTCCATGTTTCACAGCTGAAACGAGCAGTTGGAGAAGTGGTGACTGCATCATCCTTGCCTGATCAACTTACTTCTGAGTTGGAGTTGATGGTGGAGCCAGAGGACCTGCTGGATGTGCGTGCAAGTGTGGGGAAAGAAACCATGGTCTTGATCAAATGGAAAGGCTTGCCCTCATTTGATGCTACGTGGGAAGTGGCAGGCTTGATCAATGACCGTTTTCCGGACTTCCACCTTGAGGACAAGGTGTTGGTTTGGGGGCGGGGTGTTGTAATGGACCATACCAACACTGACCCACCTATGTTGGTCTATTACAGGCGCAACAGAAGAGGGGAGAGGCACCACAACCACAACAACAACACAGCTACAGCAGGGACAGCAGTTGGCACAAACTGACAAAGAAAGCAGAGGAAGCATTAATGTTTTGTACTTGTTGTTTACTTCTTAGTTTGTATAAATAAGAGAGGAGAGGAAGGGAAAGGGATTGAGAATTTATTGGTGAATTGGAGAGTAGTAACCTCAAGTTACTTACCTGAGAGTTGTAGCCTAAAGCTACTGCATTTTGTAATCCTTGACCGTTGAGTCCTAGTAATCAACAACTCCTTTCTCTATTCCTGTTTGTGTGTTGTGTGTG
TGTGGTAAAGTCAGTCCAGGGCTTTACACTTAGTATCAGAGCAGCCACGTCCTGAGGAAGAAGGGTAGCCATGGTGCTTTCAAATTCGCTGAGATTTGAGGCCTTAGAAGGAGGGCTCACAGAAGTAGGGCAACGGATGAACACCTTGGAAGGGAGGGTGCAGCAGCTCTTAACAGAGAGTATGGAGACCTTCCAAAAGAATCTTGCTGAGCAATTCATGGCCAAGAGCGCGGAGGACACACAGAAAAATCAAGAAACGGTAGATGCAGCTACCATCCGCCTAGAAGGCAGAATTGATAGATTTAGAGAGGACCAAGAGACGCAACTGGCGTTGATGAAAAGGAACCAGGAGAAATTTCAAGAAGAGGTGAAGGCTTTATTGGCTAACAGACCCCCTGCTCAGACCTTGGAGGAGGAGGAGTACTCGGAAAATCAAATGAACCCACCGAGGCGGATTGGCAGGAACTTTGGGAGAGGTGGGTTCGACGATGGAGGTGGGGGGAACGGCGGTCACTGGAAGTATCGGAAATTGGATATGCCTCTGTTTGAAGGTAGCGATCCAGACGGCTGGATTCTTCGTGGTGAAAAATACTTTGACTTTTACAAGTTAACAGAGACTGAGAAAATGGATGCCGCGGTGGTGTCAATGGAGGGGGATGCGCTGAGGTGGTTCACGTTTGAATCACGCCGGAGACCGATGAGAAGCTGGCCGGAGTTGAAAGCTCGAGTTTTGGCAAAGTTTCGCCCCACTAATGCTGGAACTTTGCACGAACAATGGTTGGCGACGGCTCAAACCACCACGGTGGCTGACTACGTGAGGCAGTTCATTGACCTAGCATCTCCTCTAGATGACGTTCCAGAGAGTATAATGCTTGGGCAATTTGTGAATGGGCTTAAAGATGATATTAAAGCTGAAATTCGGGTTCTAAACCCGTATACACTGGATGAAGCGATGGATCTAGCTACTCGGGTTGAAGAACGTAACCGGGTACGAAGAAGTGGGTATGGGAATAACAGGAGTGGACAATTCTCATATTTCAGTAAAAGCCCAATAGCCACTAACCCAACACCAACAACCTCAGGGGCCCAGAACCATGTTTTAAACCCATCCCAGTATAACCCACTGTCAAAAAACACGTACCTCAAACCAGTTAACTCAAGTGGGTCCTACGGATCCTCCCCAAACACCAACCCACCTCGTTCTGCTACAGTAAACTCGCGATTTTCCGGCATTGGAGAAACAAGACGATTGACTGAGCGTGAATTACAGGAAAAAAAGTCTAAAGGCTTATGTTTTAGGTGTGATGAAAGGTGGAGCATCGGCCACCAGTGTAAGAAGAGGGAGATGAATGTTTTGTTGGTGGATGAGGAAGACGGTGGAGAGGAGCTGACCGGAGAGGAAGTCGGTGGTGAGTTTTATTCGCTGGAGGAGCAGAGAAAGGAGGAGATACCGACAACACTCTCAATTAATTCGATTGTAGGCCTTACCAACCCGAAGACACTTAAATTGGTCGGGCGCATCGGAGACGGCGAGGTGGTGGTGATGGTGGATCCCGGAGCCACCCACAACTTCATTTCGCTCCGGGCGGTGGAAAAATTAAAGGTACCTGTTATGGAGTCAATGGGGTTTGGAGTGTCACTTGGTAATGGTGAGGCAGTGAAAGGTAAAGGAATTTGTAAGCAGGTGAAACTGAAATTGAACTCAGAAATTGAAATAGAAGAAGATTTCTTGCCTCTGGAGTTGGGAAATTCGGATGTAATACTGGGCATTCAGTGGTTAGAGAAGCTAGGACCGGTGATCACCAACTGGAAAACACAGGTTATGAAGTATCAGGATAAGGGTGTGACTGTAACATTGAAAGGGGATCCGTCACTAGCTAGCTCAAGGGGAGGAGAGCCCATGATCCTATATAAACCCCACATCAGCCACCCATATGACCGATGTGGGACTTAAGTCCCATACCTTACAATGGACCGTAACATTAATCTCCTTGAAGGCA
ATGCTGAAAGTAATCAGGAGAGAAGGGGGAGGGATTTTACTGGAGTTTAACCAAGTGGAGGAAATCACCAGTGAAGGGCCAACTGTACCCTTGTTCTTGCACGAGGTTCTGGGAAAATTTCAAGGCATCTTTGGGCTTCCGAAGGGGTTACCGCCGAGCCGAGGGCATGAACACTCGATTGTCATGAAGGAAGGTAGTGATCCAGTTGGTGTTCGACCATATAGATACCCTCAAGTGCAGAAGGATGAGATAGAGCGGCTAATCCGAGAGATGTTAGAAGCCGGAATCATCAAGCCATCCACAAGTCCCTTCTCGAGTCCCGTGTTGTTAGTTAAAAAAAAAGACGGCTCTTGGCGATTCTGTGTGGATTACAGGGCCCTCAATAAAGAGACCGTTCCTGATAAATATCCAATCCCCGTAATTGATGAACTCCTTGATGAGTTGCACGGTGCTAAGGTGTTCTCAAAGCTAGACTTGCGAGCCGGTTACCATCAGATTCTTATGAAACCGGAAGATACACACAAGACGGCCTTCCGCACCCACGAGGGTCACTATGAATTCTTGGTGATGCCGTTCGGCCTCACCAACGCTCCGGCGACATTCCAGTCACTTATGAATGAGGTATTCCGACCATACTTGAGGAAATTTGTTCTTGTCTTTTTTGATGATATATTGGTATATAGTTCTTGTGAAACTGACCATGTAAAGCATATGCAATTAGTGTTGTCATTACTGGAAAAAAAATCAGTTGTTTGCGAATTTCAAAAAGTGTGAGTTCGGGAAGGAAGAGGTAGCCTATCTCGGGCATGTAATATCCAGCCATGGTGTGTCTGTAGACCAAGAAAAAATCAGAGCCATGAAAGAATGGAGCACACCGAAAAATTTGAGGGAGCTTCGGGGTTTCCTAGGTCTCACGGGTTATTACCGAAAGTTTATTGCCGGTTACGCCCACATTGCACATCCCCTCACTGAACAATTGAGGAAGGACAATTTCGGGTGGAGTGAAGCAGCCACACAGGCTTTTGAAGAGCTGAAGAGAGCCATGACTACCGCCCCTGTACTAGCTATGCCAAACTTTAGTAAGGAGTTCGTGGTGGAGACAGATGCGTCGGGCTACGGATTGGGGGCTGTGATGATGCAGGAAGGCCGCCCAATCGCTTATTATAGTCGTGTGCTCGGGCCAAGAGCGAGAGGGAGGTCCATATACGAAAAAGAACTAATGGCAATCTGTTTGGCCGTGGAAAAATGGAAGCATTACCTCCTTGGGAGACACTTTGTGATCAGAACCGACCAGCAAAGTCTACGGCACATCACCCAACAGCGAGAAATCGGGGCTGATTATCAAAAATGGGTTCGGAAGTTGATTGGCTTTGATTTTGACATACAATATAAACCCGGATCATCTAACCGAGTGGCTGATGCACTCTCGAGGAAAGGAGAAGGAGAGCTTGAAATTGGCACACTGGAATTGGGTGCATTGCTGACTTCACAGGGGATAAATTGGGATACATTAGAGGAAGAAATCAAGGCAGATGAAGGGCTGGGAAGGATTAAACAGGAGCTTAGGGAGGGAAAACAGTTGGCGGGGTTCACGTTGGTGGGTGAGAAGTTGTTGTATAAAGGGCGAACAGTGTTGTCACACCTCTCCGCTTTTATTCCCCTACTGCTTCGTGAGTATCATGACTCACCCGTTGGAGGGCACTCCGGTGAGGTTAAGACTTACCTCCGTCTCACCTCGGAGTGGTTTTGGCGGGGGATGAGAAAAAGTGTAGCTAGGTATGTACGGGAGTGCGGAGTATGCCAACAAAACAAACACTCGCAACAGCAACCCGCTGGCTTGTTGCAACCATTACCTGTACCCGTGGCGGTATGGGAAGACATTAGCATGGACTTTATTGAGGGGTTGCCTCTCTCCAAAGGGGTTGATACAATATTGGTTGTGGTTGATAGATTGTCAAAGTATGCTCACTTCATTGTGCTTAGGCACCCATTCACCGCAT
TGTCTGTGGCAGCAGCTTTCATTCGTGATATTGTGAAGTTGCACGGCTTCCCTGCTTCAATTATCTCAGACAGGGATAGAGTGTTTCTCAGCATTTTTTGGCGTGAGTTGTTCAAATTACATGGCACCGATTTGAAGAGGAGTACTTCGTACCATCCCCAAACGGATGGACAATCGGAAAACGTTAACAAGGGGCTGGAAACATATTTGCGTTGTTTTGCGGGTGAGAAACCGAAGGAATGGGCTAAGTGGTTATCGTGGGCTGAGTACTCTTACAATACCTCACCTCACATATCCACTAGCATGACCCCTTTTAGAATTGTATATGGCAGGGATCCACCACGTCTAATGAGGATGGAGGCAAGGCAGACACCAGTTGATTCCTTGGATGAGTTAATACGTGAAAGAGATGGAGTTTTAGATGAACTCAAACTCAACCTGCTGCGTGCCCAACAAATCATGAAAGATAATGCTGACAAAAGAAGAAGGAATGAGAGTTTTGAAGTGGGGGACAAAGTCTTTCTGAAATTACAACCATATCGGCAACGTTCGTTAGCCAAGAGGCCCAATGAGAAATTGTCAGCCCGCTTTTATGGCCCATTCGAGATTATACAGAAGATTGGTGTGGTCGCCTATAAACTCAGGTTACCTCCCACCAGTAAAATTCACCCGGTATTCCATGTTTCACAGCTGAAACGAGCAGTTGGAGAAGTGGTGACTGCATCATCCTTGCCTGATCAACTTACTTCTGAGTTGGAGTTGATGGTGGAGCCAGAGGACCTGCTGGATGTGCGTGCAAGTGTGGGGAAAGAAACCATGGTCTTGATCAAATGGAAAGGCTTGCCCTCATTTGATGCTACGTGGGAAGTGGCAGGCTTGATCAATGACCGTTTTCCGGACTTCCACCTTGAGGACAAGGTGTTGGTTTGGGGGCGGGGTGTTGTAATGGACCATACCAACACTGACCCACCTATGTTGGTCTATTACAGGCGCAACAGAAGAGGGGAGAGGCACCACAACCACAACAACAACACAGCTACAGCAGGGACAGCAGTTGGCACAAACTGACAAAGAAAGCAGAGGAAGCATTAATGTTTTGTACTTGTTGTTTACTTCTTAGTTTGTATAAATAAGAGAGGAGAGGAAGGGAAAGGGATGGAGAATTTATTGGTGAATTGGAGAGTAGTAACCTCAAGTTACTTACCTGAGAGTTGTAGCCTAAAGCTACTGCATTTTGTAATCCTTGACCGTTGAGTCCTAGTAATCAACAACTCCTTTCTCTATTCCTGTTTGTGTGTTGTGTGTGTGTGGTAAAGTCAGTCCAGGGCTTTACACTGACATTCCAATCATATAAACAGCAGCAACAATTGTTGCACTACAAACTTGCTTCCAAAACCGAGATTTTTTTGATCTCTTACTTATCCAACGAAGGAGAGCGAAGAAATCAGTGGTCTCAGCTCTAATCTCAACCATTCCTTGGTATGTTGTATAACCTGTTTGCTATAATCACAGTTAAAGAACAAGTGGCTGTGAGATTCCACCTCTTTGTTGCAAATGGTACACAAATCAGTATTGCTTATCTGAAGGCGTTGCAATCGTGGCGTTGTCTGTATAGCTAACCATAGGATGTACCTATTCTTTAGGCATAGAAAGCCTACACCATACCATTTTACTCCGGTGCACTTTACGACCTTGCCCAACCAAAATCAGAATATATGCTACCGATGCTATACTTAGGCCCTACTCTCCAACCATCACCAGATAAAGCCAAGCTCAAATCATGCTTAGCCTTGTTTAGTTACTTATTGCTACTCTGTAGTTTGTTACAACTTCCTGGCTTATCTATTGCCACTTGACCGTTAATTCATGGTCTTTGCATGTTGTACACTTTTGCCCAATGAACATGGTTGGTCAGAGTTACGACTACTAACCAATGATTTCGTTATAAACTAAATAATCCGATGATTTTCTCTGCATTTTCTGGATCTTGTATCGTTTAGT
TGCTTATTGCTACTCTGTAGTCTGTTACAACTTTCTGGCTTATCTATTGCCAATTTGCCACTTGACCTGTTAATTCATTGTCTCTGCTTGACTGCTTGTTGTACACTTTTGCCCGATGAACATGGTTGGTCAGAGTTACGGCTACTAACCAATGATTTCTTTACAAAATAAATAATCCGATGATTTTCTCTTCATTTTATAATCTACAATCCAGTAATTCTCTCTGCAGTTAACCAAACTTCCAGTAAAATCATTCAGATGTGAGAAATTCAGCTACTCAGCAATGGGCAGAGAACTGAGTGTAGGTGAAATCCAAGTGAATCAGAACAAGACGCTCGTGATTGCTACTAGCCATCTCGAGAGTCCTACACCGGGACCCCCAACATGGGATCAAATGAACAGCAAAGAAAGAGTCAACCAGGCAAAAGAGGCCATAAATATTCTGAAAAGATACCCAAATGTCATATTCTGTGGTGATATGAACTGGGATGATAAACTCGATGGCCATTTCCCGTACGTTGAAGGATGGTTCGATGCTTGGGAGAGACTTAGACCTGCCGAGATAGGATGGACTTACGATACTAAGTCAAACAAAATGTTGACTGGAAACCGGTTGTTGCAGAAACGGCTTGACAGGTTTATTTGCAGTCTGACTGATTTCAAGATTACCAGTATAGATATGATTGGAACTGATGCAATACCTGGTGTAACATTCAGCAAGCAGAAGAAACTGAAGAACAAGATTCAAGAATTGATACTTCCTGTTTTTCCTAGTGATCATTTTGGTTTGCTTTTGACCATTACCAGTAATTTTTAAGTAGCGCAAATGTGTGCAATGACCGTTTTTGTACAGGGTAATACGCCGATGTCTGTTCCTTTCTTCTGCTCTGGAATTACTTTGTTGAAGTTGCTATGCAGGTTTTTAGATTAGTGTTGGAATGGATGTGTTTTTGGTCAGAGAACTGAGCTCTGAGTTTTAGTATATGGATGTTTTAGATAGATTTAAATGAATGGTTTTCTTTAACTCTTACTTCATTTGAATCTTCATTGTAAATTCTTTTGACAGGAAAATCTATTAGCTGTAGTGACTTCTTCGTGTTAACAAGGTTAGGGATGTCTAAAATCAATTTATTAATAAAAAGATACTTGATCAGTCCCTAAAGTTTCGCACATTGTCCTTTTTGGTTGTCTCTTTATATTTGCCACTTACCATAAATAGTTATAGGTTCCCGCTATTATATTTTTGTATGTACTGTACCAAGTATCACTTTGCTCCTACCTAAAGTTTTATATGTCACCAGTCACACCATTAACTTGATGGGTGTGACATGTTCACACTTATAAACCAGCAATTTTATAGTTTTTTAGGCCAAATATTTTAATAGTTTACTTCAAATACTTCTATAATTTTATAGTTTCATGGAAGATATTTTTATAGTTTTTGGGAGGTAGTTTTATAGTGTTAGTCAAGGTACTTCTATAGTTTTATAGATTTAGGGGAAGTACTTTTATAGTTTTAGTCAAGGTACTTCTATAGTTCTATAGTTTCATGGAAGGTACTTTTATAGTTTTAGTGATGTAGTTTTATAGTTTTAGGTGATGTAGTTTTATAGTTTTAGTCATGGTATTTCTATAGTTTATAGATTAAGAGTAAGTACTTTTATAATTTTAGTCAAGGTACTTCTATAGTTTTATAGTTTCAGGGTTAG

mRNA sequence

CACCCATGTCGCATTTCCTTCTTTCAATAAAAAAAATGTGATTTCCAATTAAAAAAACTTTTCAAAATTCAAATGTTCTCTCTCCTTCAACCCAAACTCCCTTCTTCCTTCCCTCGTCTTCTTCAACTTAAACCCACCAAAAACCCTAACAAAATTCTTCAAATCCCTCATTTCCAATTCCAATCTCTCTCCTCATTTTCAACAACAACAACAATGTCTTCTTCTTGGTCTTGCTCTAAATGCACTTTCCTCAACCCGGGTTCTCAAAAATCAACCTGCCAAGTATGCCAAACCCCATTTTCTCTCTCCTCCACGTCATCGTCTCCTTCCTCGCAACCAAAATGGGCCTGCAAAGCCTGCACATTTCTCAATTCGTATAGTCTTTCCAATTGCGAAATTTGCGACACTCGTGCTTCACTATCGAATTTTCAAGACTTGGATTCGAATGACCCGGATGATGATCTTTCCGGGCCTTCTTCTGTGGGTTCGGTTTTCTTCCCATTGCGCCGCTGTAATACGGGTCGGGTTCATGAATTGGATGATTTGGGTGTGAAAGAAAAGGTTGAAACTCAAATGGGCACTGATCAATTTGTTAATCAAGGTGTTCAATTGAGTAGGCTGCAAAATTCTGGTTTTAACAAGCGCAAGATAGAGGACTTACTCTCTGGTTCTGTGACCATACTGGGTGAGAATGCTGACGCTGTTAGCTGTTTTACTACGAAGAGAGAGAAAAATGATGTAGTATTGAAAGGTTTAGCATCAGCAAAGATTGCTGATAATGTTGAGGACAGTGTCACATTTAAGAGGAAACTCCGTGATTTAAGAGGAAACTCTAAACCCATCAAAATAGATGGCTTTGGCGAGGTTAAATCTTTCAGGAAGGCTGTTTCATTTGTAGATTCTGTATATAGCTTAGACCAAGAGAAGGTTGGAAAGGCCAAAATAGATAGTGAGTCGTTTTCCGAGGAATTCACTGAAGATGCTCAACTTGAAGAAGTTTCAGGTGTGATAAAGATCCTTTCTTACAATGTCTGGTTCAGAGAAGACTTAGAGGTGCATAAGAGGATGCAAGCAATTGGTGACCTAATTAAGCTGCACTCTCCAGATATTATTTGTTTTCAGGAGGTTACTCCTGGTATCTATGACATTTTCCGGAGATCTACGTGGTGGAAGAAATATTGCTGTTCTGTCTCGTACGAAATGGCAGCCTCTGGGGCGTACTTTTGCATGCAGTTAACCAAACTTCCAGTAAAATCATTCAGATGTGAGAAATTCAGCTACTCAGCAATGGGCAGAGAACTGAGTGTAGGTGAAATCCAAGTGAATCAGAACAAGACGCTCGTGATTGCTACTAGCCATCTCGAGAGTCCTACACCGGGACCCCCAACATGGGATCAAATGAACAGCAAAGAAAGAGTCAACCAGGCAAAAGAGGCCATAAATATTCTGAAAAGATACCCAAATGTCATATTCTGTGGTGATATGAACTGGGATGATAAACTCGATGGCCATTTCCCGTACGTTGAAGGATGGTTCGATGCTTGGGAGAGACTTAGACCTGCCGAGATAGGATGGACTTACGATACTAAGTCAAACAAAATGTTGACTGGAAACCGGTTGTTGCAGAAACGGCTTGACAGGTTTATTTGCAGTCTGACTGATTTCAAGATTACCAGTATAGATATGATTGGAACTGATGCAATACCTGGTGTAACATTCAGCAAGCAGAAGAAACTGAAGAACAAGATTCAAGAATTGATACTTCCTGGTTAG

Coding sequence (CDS)

ATGTTCTCTCTCCTTCAACCCAAACTCCCTTCTTCCTTCCCTCGTCTTCTTCAACTTAAACCCACCAAAAACCCTAACAAAATTCTTCAAATCCCTCATTTCCAATTCCAATCTCTCTCCTCATTTTCAACAACAACAACAATGTCTTCTTCTTGGTCTTGCTCTAAATGCACTTTCCTCAACCCGGGTTCTCAAAAATCAACCTGCCAAGTATGCCAAACCCCATTTTCTCTCTCCTCCACGTCATCGTCTCCTTCCTCGCAACCAAAATGGGCCTGCAAAGCCTGCACATTTCTCAATTCGTATAGTCTTTCCAATTGCGAAATTTGCGACACTCGTGCTTCACTATCGAATTTTCAAGACTTGGATTCGAATGACCCGGATGATGATCTTTCCGGGCCTTCTTCTGTGGGTTCGGTTTTCTTCCCATTGCGCCGCTGTAATACGGGTCGGGTTCATGAATTGGATGATTTGGGTGTGAAAGAAAAGGTTGAAACTCAAATGGGCACTGATCAATTTGTTAATCAAGGTGTTCAATTGAGTAGGCTGCAAAATTCTGGTTTTAACAAGCGCAAGATAGAGGACTTACTCTCTGGTTCTGTGACCATACTGGGTGAGAATGCTGACGCTGTTAGCTGTTTTACTACGAAGAGAGAGAAAAATGATGTAGTATTGAAAGGTTTAGCATCAGCAAAGATTGCTGATAATGTTGAGGACAGTGTCACATTTAAGAGGAAACTCCGTGATTTAAGAGGAAACTCTAAACCCATCAAAATAGATGGCTTTGGCGAGGTTAAATCTTTCAGGAAGGCTGTTTCATTTGTAGATTCTGTATATAGCTTAGACCAAGAGAAGGTTGGAAAGGCCAAAATAGATAGTGAGTCGTTTTCCGAGGAATTCACTGAAGATGCTCAACTTGAAGAAGTTTCAGGTGTGATAAAGATCCTTTCTTACAATGTCTGGTTCAGAGAAGACTTAGAGGTGCATAAGAGGATGCAAGCAATTGGTGACCTAATTAAGCTGCACTCTCCAGATATTATTTGTTTTCAGGAGGTTACTCCTGGTATCTATGACATTTTCCGGAGATCTACGTGGTGGAAGAAATATTGCTGTTCTGTCTCGTACGAAATGGCAGCCTCTGGGGCGTACTTTTGCATGCAGTTAACCAAACTTCCAGTAAAATCATTCAGATGTGAGAAATTCAGCTACTCAGCAATGGGCAGAGAACTGAGTGTAGGTGAAATCCAAGTGAATCAGAACAAGACGCTCGTGATTGCTACTAGCCATCTCGAGAGTCCTACACCGGGACCCCCAACATGGGATCAAATGAACAGCAAAGAAAGAGTCAACCAGGCAAAAGAGGCCATAAATATTCTGAAAAGATACCCAAATGTCATATTCTGTGGTGATATGAACTGGGATGATAAACTCGATGGCCATTTCCCGTACGTTGAAGGATGGTTCGATGCTTGGGAGAGACTTAGACCTGCCGAGATAGGATGGACTTACGATACTAAGTCAAACAAAATGTTGACTGGAAACCGGTTGTTGCAGAAACGGCTTGACAGGTTTATTTGCAGTCTGACTGATTTCAAGATTACCAGTATAGATATGATTGGAACTGATGCAATACCTGGTGTAACATTCAGCAAGCAGAAGAAACTGAAGAACAAGATTCAAGAATTGATACTTCCTGGTTAG

Protein sequence

MFSLLQPKLPSSFPRLLQLKPTKNPNKILQIPHFQFQSLSSFSTTTTMSSSWSCSKCTFLNPGSQKSTCQVCQTPFSLSSTSSSPSSQPKWACKACTFLNSYSLSNCEICDTRASLSNFQDLDSNDPDDDLSGPSSVGSVFFPLRRCNTGRVHELDDLGVKEKVETQMGTDQFVNQGVQLSRLQNSGFNKRKIEDLLSGSVTILGENADAVSCFTTKREKNDVVLKGLASAKIADNVEDSVTFKRKLRDLRGNSKPIKIDGFGEVKSFRKAVSFVDSVYSLDQEKVGKAKIDSESFSEEFTEDAQLEEVSGVIKILSYNVWFREDLEVHKRMQAIGDLIKLHSPDIICFQEVTPGIYDIFRRSTWWKKYCCSVSYEMAASGAYFCMQLTKLPVKSFRCEKFSYSAMGRELSVGEIQVNQNKTLVIATSHLESPTPGPPTWDQMNSKERVNQAKEAINILKRYPNVIFCGDMNWDDKLDGHFPYVEGWFDAWERLRPAEIGWTYDTKSNKMLTGNRLLQKRLDRFICSLTDFKITSIDMIGTDAIPGVTFSKQKKLKNKIQELILPG
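The protein sequence above should follow from the CDS by codon-wise translation. As a rough check, the sketch below translates the first 30 and last 21 nucleotides of the CDS, assuming the standard genetic code; the helper name `translate` and the choice of excerpts are illustrative only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a nucleotide string codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Excerpts copied from the start and end of the CDS above.
cds_start = "ATGTTCTCTCTCCTTCAACCCAAACTCCCT"
cds_end = "GAATTGATACTTCCTGGTTAG"

print(translate(cds_start))  # MFSLLQPKLP
print(translate(cds_end))    # ELILPG*
```

Both excerpts agree with the ends of the listed protein (MFSLLQPKLP... and ...ELILPG), and the final TAG codon is the stop.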
Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
Spo13795.1      Spo13795.1     mRNA


Homology
BLAST of Spo13795.1 vs. NCBI nr
Match: gi|731351024|ref|XP_010686817.1| (PREDICTED: uncharacterized protein LOC104900976 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 778.9 bits (2010), Expect = 6.200e-222
Identity = 400/570 (70.18%), Positives = 456/570 (80.00%), Query Frame = 1

Query: 2   FSLLQPKLPSSFPRLLQLKPTKNPNKILQIPHFQFQSLSSFSTTTTMSSSWSCSKCTFLN 61
           FS  Q KL SS   LLQL P KNPNK L   H +FQ LSS +T ++ SSSW+CSKCTFLN
Sbjct: 4   FSPFQTKLSSSL-HLLQLHPIKNPNKTLH--HLKFQFLSSSTTMSSSSSSWACSKCTFLN 63

Query: 62  PGSQKSTCQVCQTPFSLSSTSS-----SPSSQPKWACKACTFLNSYSLSNCEICDTRASL 121
           P +QKSTCQ+CQTPFSLSS+SS     S SS  KW+CKACTFLNSYS SNCE+CDTRASL
Sbjct: 64  PPTQKSTCQICQTPFSLSSSSSILPSSSSSSPSKWSCKACTFLNSYSRSNCEVCDTRASL 123

Query: 122 SNFQDLDSNDPDDDLSGPSSVGSVFFPLRRCNTGRVHELDDLGVKEKVETQMGTDQFVNQ 181
           S  QDLDSNDP+DD  G SSVGS+F+PLRRCN+G+V E+DD G+  KV+TQ+GT Q V+ 
Sbjct: 124 SYIQDLDSNDPNDDFDGSSSVGSIFYPLRRCNSGKVQEIDDFGLNNKVQTQLGTHQIVDH 183

Query: 182 GVQLSRLQNSGFNKRKIEDLLSGSVTILGENADAVSCFTTKREKNDVVLKGLASAKIADN 241
            V+L  L +S  NKRK++DL+S       EN  AVSCF+ K+E+   V+KGLA AKIADN
Sbjct: 184 SVKLGGLWSSASNKRKLQDLVS-------ENDAAVSCFSAKKEEPAAVMKGLAPAKIADN 243

Query: 242 VEDSVTFKRKLRDLRGN-SKPIKIDGFGEVKSFRKAVSFVDSVYSLDQEKVGKAKIDSES 301
            E SVT KRKL DL     KPIK+DGF EVKS RK VS +DSV     +  G  K +  S
Sbjct: 244 AEGSVTLKRKLCDLGDFIEKPIKLDGFCEVKSSRKTVSVLDSV-----DDFGIEKANGNS 303

Query: 302 FSEEFTEDAQLEEVSGVIKILSYNVWFREDLEVHKRMQAIGDLIKLHSPDIICFQEVTPG 361
            S E TE+A+L+EVSGVIKILSYNVWFRED+EVH RMQAIGDLI+LHSPD+ICFQEVTP 
Sbjct: 304 ASMEATENAELKEVSGVIKILSYNVWFREDIEVHNRMQAIGDLIQLHSPDVICFQEVTPM 363

Query: 362 IYDIFRRSTWWKKYCCSVSYEMAASGAYFCMQLTKLPVKSFRCEKFSYSAMGRELSVGEI 421
           IYDI R+S WWKKY CS+SYE A S  Y+CMQL KLPVKSF C++FSYSAMGREL V EI
Sbjct: 364 IYDILRKSLWWKKYKCSLSYEEAGSRGYYCMQLAKLPVKSFSCKEFSYSAMGRELCVAEI 423

Query: 422 QVNQNKTLVIATSHLESPTPGPPTWDQMNSKERVNQAKEAINILKRYPNVIFCGDMNWDD 481
           +VN N  LVIATSHLESP+PGPPTWDQM SKERVNQAKEAIN L++YPNVIFCGDMNWDD
Sbjct: 424 EVNPNNLLVIATSHLESPSPGPPTWDQMYSKERVNQAKEAINFLQKYPNVIFCGDMNWDD 483

Query: 482 KLDGHFPYVEGWFDAWERLRPAEIGWTYDTKSNKMLTGNRLLQKRLDRFICSLTDFKITS 541
           K DG+FP  EGWFDAWE+LRPAE+GWTYDTKSNKMLTGNR LQKRLDRFIC L+ F+I+ 
Sbjct: 484 KRDGNFPVSEGWFDAWEKLRPAEVGWTYDTKSNKMLTGNRALQKRLDRFICRLSGFEISC 543

Query: 542 IDMIGTDAIPGVTFSKQKKLKNKIQELILP 566
           IDMIGT+AIPGVT+SKQKKLKNKIQEL LP
Sbjct: 544 IDMIGTEAIPGVTYSKQKKLKNKIQELTLP 558
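
The Identity and Positives percentages in an HSP summary line are ratios of matching positions to alignment length. A minimal sketch of recomputing them from the counts follows; the function name `hsp_percentages` is hypothetical, and the example line is the summary of the hit above.

```python
import re

# Matches fields such as "Identity = 400/570" or "Positives = 456/570".
STAT_RE = re.compile(r"(Identity|Positives)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(line):
    """Recompute each percentage as count / alignment length, rounded to 2 places."""
    return {
        name: round(100 * int(count) / int(length), 2)
        for name, count, length in STAT_RE.findall(line)
    }


line = "Identity = 400/570 (70.18%), Positives = 456/570 (80.00%), Query Frame = 1"
stats = hsp_percentages(line)
print(stats)  # {'Identity': 70.18, 'Positives': 80.0}
```

The recomputed values match the percentages printed in the report, which suggests the alignment length of 570 includes gap columns.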

BLAST of Spo13795.1 vs. NCBI nr
Match: gi|870852204|gb|KMT04148.1| (hypothetical protein BVRB_8g185970 isoform B [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 749.6 bits (1934), Expect = 4.000e-213
Identity = 376/533 (70.54%), Positives = 432/533 (81.05%), Query Frame = 1

Query: 46  TTMSSSWSCSKCTFLNPGSQKSTCQVCQTPFSLSSTSS-----SPSSQPKWACKACTFLN 105
           ++ SSSW+CSKCTFLNP +QKSTCQ+CQTPFSLSS+SS     S SS  KW+CKACTFLN
Sbjct: 2   SSSSSSWACSKCTFLNPPTQKSTCQICQTPFSLSSSSSILPSSSSSSPSKWSCKACTFLN 61

Query: 106 SYSLSNCEICDTRASLSNFQDLDSNDPDDDLSGPSSVGSVFFPLRRCNTGRVHELDDLGV 165
           SYS SNCE+CDTRASLS  QDLDSNDP+DD  G SSVGS+F+PLRRCN+G+V E+DD G+
Sbjct: 62  SYSRSNCEVCDTRASLSYIQDLDSNDPNDDFDGSSSVGSIFYPLRRCNSGKVQEIDDFGL 121

Query: 166 KEKVETQMGTDQFVNQGVQLSRLQNSGFNKRKIEDLLSGSVT-------ILGENADAVSC 225
             KV+TQ+GT Q V+  V+L  L +S  NKRK++DL+SG +        +  EN  AVSC
Sbjct: 122 NNKVQTQLGTHQIVDHSVKLGGLWSSASNKRKLQDLVSGQLFPCSDLGFVSKENDAAVSC 181

Query: 226 FTTKREKNDVVLKGLASAKIADNVEDSVTFKRKLRDLRGN-SKPIKIDGFGEVKSFRKAV 285
           F+ K+E+   V+KGLA AKIADN E SVT KRKL DL     KPIK+DGF EVKS RK V
Sbjct: 182 FSAKKEEPAAVMKGLAPAKIADNAEGSVTLKRKLCDLGDFIEKPIKLDGFCEVKSSRKTV 241

Query: 286 SFVDSVYSLDQEKVGKAKIDSESFSEEFTEDAQLEEVSGVIKILSYNVWFREDLEVHKRM 345
           S +DSV     +  G  K +  S S E TE+A+L+EVSGVIKILSYNVWFRED+EVH RM
Sbjct: 242 SVLDSV-----DDFGIEKANGNSASMEATENAELKEVSGVIKILSYNVWFREDIEVHNRM 301

Query: 346 QAIGDLIKLHSPDIICFQEVTPGIYDIFRRSTWWKKYCCSVSYEMAASGAYFCMQLTKLP 405
           QAIGDLI+LHSPD+ICFQEVTP IYDI R+S WWKKY CS+SYE A S  Y+CMQL KLP
Sbjct: 302 QAIGDLIQLHSPDVICFQEVTPMIYDILRKSLWWKKYKCSLSYEEAGSRGYYCMQLAKLP 361

Query: 406 VKSFRCEKFSYSAMGRELSVGEIQVNQNKTLVIATSHLESPTPGPPTWDQMNSKERVNQA 465
           VKSF C++FSYSAMGREL V EI+VN N  LVIATSHLESP+PGPPTWDQM SKERVNQA
Sbjct: 362 VKSFSCKEFSYSAMGRELCVAEIEVNPNNLLVIATSHLESPSPGPPTWDQMYSKERVNQA 421

Query: 466 KEAINILKRYPNVIFCGDMNWDDKLDGHFPYVEGWFDAWERLRPAEIGWTYDTKSNKMLT 525
           KEAIN L++YPNVIFCGDMNWDDK DG+FP  EGWFDAWE+LRPAE+GWTYDTKSNKMLT
Sbjct: 422 KEAINFLQKYPNVIFCGDMNWDDKRDGNFPVSEGWFDAWEKLRPAEVGWTYDTKSNKMLT 481

Query: 526 GNRLLQKRLDRFICSLTDFKITSIDMIGTDAIPGVTFSKQKKLKNKIQELILP 566
           GNR LQKRLDRFIC L+ F+I+ IDMIGT+AIPGVT+SKQKKLKNKIQEL LP
Sbjct: 482 GNRALQKRLDRFICRLSGFEISCIDMIGTEAIPGVTYSKQKKLKNKIQELTLP 529

BLAST of Spo13795.1 vs. NCBI nr
Match: gi|870852203|gb|KMT04147.1| (hypothetical protein BVRB_8g185970 isoform A [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 747.7 bits (1929), Expect = 1.500e-212
Identity = 375/526 (71.29%), Positives = 429/526 (81.56%), Query Frame = 1

Query: 46  TTMSSSWSCSKCTFLNPGSQKSTCQVCQTPFSLSSTSS-----SPSSQPKWACKACTFLN 105
           ++ SSSW+CSKCTFLNP +QKSTCQ+CQTPFSLSS+SS     S SS  KW+CKACTFLN
Sbjct: 2   SSSSSSWACSKCTFLNPPTQKSTCQICQTPFSLSSSSSILPSSSSSSPSKWSCKACTFLN 61

Query: 106 SYSLSNCEICDTRASLSNFQDLDSNDPDDDLSGPSSVGSVFFPLRRCNTGRVHELDDLGV 165
           SYS SNCE+CDTRASLS  QDLDSNDP+DD  G SSVGS+F+PLRRCN+G+V E+DD G+
Sbjct: 62  SYSRSNCEVCDTRASLSYIQDLDSNDPNDDFDGSSSVGSIFYPLRRCNSGKVQEIDDFGL 121

Query: 166 KEKVETQMGTDQFVNQGVQLSRLQNSGFNKRKIEDLLSGSVTILGENADAVSCFTTKREK 225
             KV+TQ+GT Q V+  V+L  L +S  NKRK++DL+S       EN  AVSCF+ K+E+
Sbjct: 122 NNKVQTQLGTHQIVDHSVKLGGLWSSASNKRKLQDLVS-------ENDAAVSCFSAKKEE 181

Query: 226 NDVVLKGLASAKIADNVEDSVTFKRKLRDLRGN-SKPIKIDGFGEVKSFRKAVSFVDSVY 285
              V+KGLA AKIADN E SVT KRKL DL     KPIK+DGF EVKS RK VS +DSV 
Sbjct: 182 PAAVMKGLAPAKIADNAEGSVTLKRKLCDLGDFIEKPIKLDGFCEVKSSRKTVSVLDSV- 241

Query: 286 SLDQEKVGKAKIDSESFSEEFTEDAQLEEVSGVIKILSYNVWFREDLEVHKRMQAIGDLI 345
               +  G  K +  S S E TE+A+L+EVSGVIKILSYNVWFRED+EVH RMQAIGDLI
Sbjct: 242 ----DDFGIEKANGNSASMEATENAELKEVSGVIKILSYNVWFREDIEVHNRMQAIGDLI 301

Query: 346 KLHSPDIICFQEVTPGIYDIFRRSTWWKKYCCSVSYEMAASGAYFCMQLTKLPVKSFRCE 405
           +LHSPD+ICFQEVTP IYDI R+S WWKKY CS+SYE A S  Y+CMQL KLPVKSF C+
Sbjct: 302 QLHSPDVICFQEVTPMIYDILRKSLWWKKYKCSLSYEEAGSRGYYCMQLAKLPVKSFSCK 361

Query: 406 KFSYSAMGRELSVGEIQVNQNKTLVIATSHLESPTPGPPTWDQMNSKERVNQAKEAINIL 465
           +FSYSAMGREL V EI+VN N  LVIATSHLESP+PGPPTWDQM SKERVNQAKEAIN L
Sbjct: 362 EFSYSAMGRELCVAEIEVNPNNLLVIATSHLESPSPGPPTWDQMYSKERVNQAKEAINFL 421

Query: 466 KRYPNVIFCGDMNWDDKLDGHFPYVEGWFDAWERLRPAEIGWTYDTKSNKMLTGNRLLQK 525
           ++YPNVIFCGDMNWDDK DG+FP  EGWFDAWE+LRPAE+GWTYDTKSNKMLTGNR LQK
Sbjct: 422 QKYPNVIFCGDMNWDDKRDGNFPVSEGWFDAWEKLRPAEVGWTYDTKSNKMLTGNRALQK 481

Query: 526 RLDRFICSLTDFKITSIDMIGTDAIPGVTFSKQKKLKNKIQELILP 566
           RLDRFIC L+ F+I+ IDMIGT+AIPGVT+SKQKKLKNKIQEL LP
Sbjct: 482 RLDRFICRLSGFEISCIDMIGTEAIPGVTYSKQKKLKNKIQELTLP 515
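
The identity and positives percentages in an HSP statistics line are ratios of matching columns to the alignment length. The following is a minimal Python sketch, assuming the line layout used on this page; the names `HSP_STATS` and `parse_hsp_stats` are hypothetical helpers introduced for illustration and are not part of any BLAST tool. It recomputes both percentages from the raw counts so they can be compared with the reported values.

```python
import re

# Pattern for an HSP statistics line as printed on this page.
# The optional "i" also accepts the "Postives" spelling seen in some renderings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract the counts from one statistics line and recompute the percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, length, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(length),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 375/526 (71.29%), Positives = 429/526 (81.56%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported values is a quick way to spot a statistics line that was damaged during extraction.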

BLAST of Spo13795.1 vs. NCBI nr
Match: gi|902205245|gb|KNA15127.1| (hypothetical protein SOVF_101000 [Spinacia oleracea])

HSP 1 Score: 603.2 bits (1554), Expect = 4.600e-169
Identity = 290/290 (100.00%), Positives = 290/290 (100.00%), Query Frame = 1

		  

Query: 276 DSVYSLDQEKVGKAKIDSESFSEEFTEDAQLEEVSGVIKILSYNVWFREDLEVHKRMQAI 335
           DSVYSLDQEKVGKAKIDSESFSEEFTEDAQLEEVSGVIKILSYNVWFREDLEVHKRMQAI
Sbjct: 199 DSVYSLDQEKVGKAKIDSESFSEEFTEDAQLEEVSGVIKILSYNVWFREDLEVHKRMQAI 258

Query: 336 GDLIKLHSPDIICFQEVTPGIYDIFRRSTWWKKYCCSVSYEMAASGAYFCMQLTKLPVKS 395
           GDLIKLHSPDIICFQEVTPGIYDIFRRSTWWKKYCCSVSYEMAASGAYFCMQLTKLPVKS
Sbjct: 259 GDLIKLHSPDIICFQEVTPGIYDIFRRSTWWKKYCCSVSYEMAASGAYFCMQLTKLPVKS 318

Query: 396 FRCEKFSYSAMGRELSVGEIQVNQNKTLVIATSHLESPTPGPPTWDQMNSKERVNQAKEA 455
           FRCEKFSYSAMGRELSVGEIQVNQNKTLVIATSHLESPTPGPPTWDQMNSKERVNQAKEA
Sbjct: 319 FRCEKFSYSAMGRELSVGEIQVNQNKTLVIATSHLESPTPGPPTWDQMNSKERVNQAKEA 378

Query: 456 INILKRYPNVIFCGDMNWDDKLDGHFPYVEGWFDAWERLRPAEIGWTYDTKSNKMLTGNR 515
           INILKRYPNVIFCGDMNWDDKLDGHFPYVEGWFDAWERLRPAEIGWTYDTKSNKMLTGNR
Sbjct: 379 INILKRYPNVIFCGDMNWDDKLDGHFPYVEGWFDAWERLRPAEIGWTYDTKSNKMLTGNR 438

Query: 516 LLQKRLDRFICSLTDFKITSIDMIGTDAIPGVTFSKQKKLKNKIQELILP 566
           LLQKRLDRFICSLTDFKITSIDMIGTDAIPGVTFSKQKKLKNKIQELILP
Sbjct: 439 LLQKRLDRFICSLTDFKITSIDMIGTDAIPGVTFSKQKKLKNKIQELILP 488

BLAST of Spo13795.1 vs. NCBI nr
Match: gi|743907026|ref|XP_011046939.1| (PREDICTED: uncharacterized protein LOC105141426 isoform X1 [Populus euphratica])

HSP 1 Score: 469.5 bits (1207), Expect = 8.000e-129
Identity = 269/575 (46.78%), Positives = 343/575 (59.65%), Query Frame = 1

		  

Query: 1   MFSLLQPKLPSSFPRLLQLKPTKNPNKILQIPHFQFQSLSSFSTTTTMSSSWSCSKCTFL 60
           M SLLQ KL  +F  ++ + P   P+K   +           S   +   SWSC KCTF+
Sbjct: 1   MQSLLQ-KL--TFAPIVIISPLSKPSKSCNL-----------SVHLSNLMSWSCKKCTFI 60

Query: 61  NPGSQKSTCQVCQTPFSLSSTSSSPSSQ--PKWACKACTFLNSYSLSNCEICDTRASLSN 120
           N  S K TCQ+C +P S     SS S+Q  PKW+CKACTFLN Y  S+CE+C TR S+ +
Sbjct: 61  NSPSPKPTCQICLSPPSPPPLPSSSSNQESPKWSCKACTFLNPYKNSSCEVCGTRGSVFS 120

Query: 121 FQDLDSNDPDDDLSG--PSSVGSVFFPLRRCNTGRVHELDD------LGVKEKVETQMGT 180
              LD       L G   SSVGSVF PLR C       +DD       GVK     ++G 
Sbjct: 121 LSSLDDLTDTSGLDGDVDSSVGSVFMPLRHCKRKVRDSVDDHQEVGFRGVKSLNNVEVGG 180

Query: 181 DQFVNQGVQLSRLQNSGFNKRKIEDLLSGSVTILGENADAVSCFTTKREKNDVVLKGLAS 240
           D       +L   Q    + +         VT+L +  D           + V L G   
Sbjct: 181 D---GDSAKLGAFQGVKASNK--------GVTVLKDGVDG----------HSVRLGGFQG 240

Query: 241 AKIADNVEDSVTFKRKLRDLRGNSKPIKIDGFGEVKSFRKAVSFVDSVYSLDQEKVGKAK 300
            K ++  +  +  K +      +   +K+  F   ++  K V+ +               
Sbjct: 241 VKSSN--KGVIVLKDE-----SDGASVKLGAFLGARASNKGVAVL--------------- 300

Query: 301 IDSESFSEEFTEDAQLEEVSGVIKILSYNVWFREDLEVHKRMQAIGDLIKLHSPDIICFQ 360
                     TED     V G  KILSYNVWFREDLE+H+RM+A+G+LI+LHSPD+IC Q
Sbjct: 301 ----------TEDTNSVAVLGSFKILSYNVWFREDLEMHRRMKALGELIQLHSPDVICLQ 360

Query: 361 EVTPGIYDIFRRSTWWKKYCCSVSYEMAASGAYFCMQLTKLPVKSFRCEKFSYSAMGREL 420
           EV P IYDIF+RS+WWK Y CSVS E+A+S  YFCMQL+KLPVKSF  + F  S MGREL
Sbjct: 361 EVIPDIYDIFQRSSWWKAYQCSVSSEIASSRGYFCMQLSKLPVKSFSTKPFMNSIMGREL 420

Query: 421 SVGEIQVNQNKTLVIATSHLESPTPGPPTWDQMNSKERVNQAKEAINILKRYPNVIFCGD 480
            + E++V   K+LV+ATSHLESP P PP WDQM SKERV+QAKEAIN+LK+  NVIFCGD
Sbjct: 421 CIAELEVPGKKSLVVATSHLESPCPAPPKWDQMFSKERVDQAKEAINLLKKNSNVIFCGD 480

Query: 481 MNWDDKLDGHFPYVEGWFDAWERLRPAEIGWTYDTKSNKMLTGNRLLQKRLDRFICSLTD 540
           MNWDDKLDG FP+ +GW DAW  L+P + GWTYDTKSN+ML+GNR LQKRLDRFICSL D
Sbjct: 481 MNWDDKLDGQFPFPDGWVDAWVELKPGDNGWTYDTKSNQMLSGNRALQKRLDRFICSLCD 508

Query: 541 FKITSIDMIGTDAIPGVTFSKQKKLKNKIQELILP 566
           FKI+ IDMIG DAIPG+++ K+KK++ +++ L LP
Sbjct: 541 FKISKIDMIGKDAIPGLSYMKEKKVRKEVKMLELP 508

BLAST of Spo13795.1 vs. UniProtKB/TrEMBL
Match: A0A0J8BW46_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_8g185970 PE=4 SV=1)

HSP 1 Score: 749.6 bits (1934), Expect = 2.800e-213
Identity = 376/533 (70.54%), Positives = 432/533 (81.05%), Query Frame = 1

		  

Query: 46  TTMSSSWSCSKCTFLNPGSQKSTCQVCQTPFSLSSTSS-----SPSSQPKWACKACTFLN 105
           ++ SSSW+CSKCTFLNP +QKSTCQ+CQTPFSLSS+SS     S SS  KW+CKACTFLN
Sbjct: 2   SSSSSSWACSKCTFLNPPTQKSTCQICQTPFSLSSSSSILPSSSSSSPSKWSCKACTFLN 61

Query: 106 SYSLSNCEICDTRASLSNFQDLDSNDPDDDLSGPSSVGSVFFPLRRCNTGRVHELDDLGV 165
           SYS SNCE+CDTRASLS  QDLDSNDP+DD  G SSVGS+F+PLRRCN+G+V E+DD G+
Sbjct: 62  SYSRSNCEVCDTRASLSYIQDLDSNDPNDDFDGSSSVGSIFYPLRRCNSGKVQEIDDFGL 121

Query: 166 KEKVETQMGTDQFVNQGVQLSRLQNSGFNKRKIEDLLSGSVT-------ILGENADAVSC 225
             KV+TQ+GT Q V+  V+L  L +S  NKRK++DL+SG +        +  EN  AVSC
Sbjct: 122 NNKVQTQLGTHQIVDHSVKLGGLWSSASNKRKLQDLVSGQLFPCSDLGFVSKENDAAVSC 181

Query: 226 FTTKREKNDVVLKGLASAKIADNVEDSVTFKRKLRDLRGN-SKPIKIDGFGEVKSFRKAV 285
           F+ K+E+   V+KGLA AKIADN E SVT KRKL DL     KPIK+DGF EVKS RK V
Sbjct: 182 FSAKKEEPAAVMKGLAPAKIADNAEGSVTLKRKLCDLGDFIEKPIKLDGFCEVKSSRKTV 241

Query: 286 SFVDSVYSLDQEKVGKAKIDSESFSEEFTEDAQLEEVSGVIKILSYNVWFREDLEVHKRM 345
           S +DSV     +  G  K +  S S E TE+A+L+EVSGVIKILSYNVWFRED+EVH RM
Sbjct: 242 SVLDSV-----DDFGIEKANGNSASMEATENAELKEVSGVIKILSYNVWFREDIEVHNRM 301

Query: 346 QAIGDLIKLHSPDIICFQEVTPGIYDIFRRSTWWKKYCCSVSYEMAASGAYFCMQLTKLP 405
           QAIGDLI+LHSPD+ICFQEVTP IYDI R+S WWKKY CS+SYE A S  Y+CMQL KLP
Sbjct: 302 QAIGDLIQLHSPDVICFQEVTPMIYDILRKSLWWKKYKCSLSYEEAGSRGYYCMQLAKLP 361

Query: 406 VKSFRCEKFSYSAMGRELSVGEIQVNQNKTLVIATSHLESPTPGPPTWDQMNSKERVNQA 465
           VKSF C++FSYSAMGREL V EI+VN N  LVIATSHLESP+PGPPTWDQM SKERVNQA
Sbjct: 362 VKSFSCKEFSYSAMGRELCVAEIEVNPNNLLVIATSHLESPSPGPPTWDQMYSKERVNQA 421

Query: 466 KEAINILKRYPNVIFCGDMNWDDKLDGHFPYVEGWFDAWERLRPAEIGWTYDTKSNKMLT 525
           KEAIN L++YPNVIFCGDMNWDDK DG+FP  EGWFDAWE+LRPAE+GWTYDTKSNKMLT
Sbjct: 422 KEAINFLQKYPNVIFCGDMNWDDKRDGNFPVSEGWFDAWEKLRPAEVGWTYDTKSNKMLT 481

Query: 526 GNRLLQKRLDRFICSLTDFKITSIDMIGTDAIPGVTFSKQKKLKNKIQELILP 566
           GNR LQKRLDRFIC L+ F+I+ IDMIGT+AIPGVT+SKQKKLKNKIQEL LP
Sbjct: 482 GNRALQKRLDRFICRLSGFEISCIDMIGTEAIPGVTYSKQKKLKNKIQELTLP 529

BLAST of Spo13795.1 vs. UniProtKB/TrEMBL
Match: A0A0J8BSK5_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_8g185970 PE=4 SV=1)

HSP 1 Score: 747.7 bits (1929), Expect = 1.100e-212
Identity = 375/526 (71.29%), Positives = 429/526 (81.56%), Query Frame = 1

		  

Query: 46  TTMSSSWSCSKCTFLNPGSQKSTCQVCQTPFSLSSTSS-----SPSSQPKWACKACTFLN 105
           ++ SSSW+CSKCTFLNP +QKSTCQ+CQTPFSLSS+SS     S SS  KW+CKACTFLN
Sbjct: 2   SSSSSSWACSKCTFLNPPTQKSTCQICQTPFSLSSSSSILPSSSSSSPSKWSCKACTFLN 61

Query: 106 SYSLSNCEICDTRASLSNFQDLDSNDPDDDLSGPSSVGSVFFPLRRCNTGRVHELDDLGV 165
           SYS SNCE+CDTRASLS  QDLDSNDP+DD  G SSVGS+F+PLRRCN+G+V E+DD G+
Sbjct: 62  SYSRSNCEVCDTRASLSYIQDLDSNDPNDDFDGSSSVGSIFYPLRRCNSGKVQEIDDFGL 121

Query: 166 KEKVETQMGTDQFVNQGVQLSRLQNSGFNKRKIEDLLSGSVTILGENADAVSCFTTKREK 225
             KV+TQ+GT Q V+  V+L  L +S  NKRK++DL+S       EN  AVSCF+ K+E+
Sbjct: 122 NNKVQTQLGTHQIVDHSVKLGGLWSSASNKRKLQDLVS-------ENDAAVSCFSAKKEE 181

Query: 226 NDVVLKGLASAKIADNVEDSVTFKRKLRDLRGN-SKPIKIDGFGEVKSFRKAVSFVDSVY 285
              V+KGLA AKIADN E SVT KRKL DL     KPIK+DGF EVKS RK VS +DSV 
Sbjct: 182 PAAVMKGLAPAKIADNAEGSVTLKRKLCDLGDFIEKPIKLDGFCEVKSSRKTVSVLDSV- 241

Query: 286 SLDQEKVGKAKIDSESFSEEFTEDAQLEEVSGVIKILSYNVWFREDLEVHKRMQAIGDLI 345
               +  G  K +  S S E TE+A+L+EVSGVIKILSYNVWFRED+EVH RMQAIGDLI
Sbjct: 242 ----DDFGIEKANGNSASMEATENAELKEVSGVIKILSYNVWFREDIEVHNRMQAIGDLI 301

Query: 346 KLHSPDIICFQEVTPGIYDIFRRSTWWKKYCCSVSYEMAASGAYFCMQLTKLPVKSFRCE 405
           +LHSPD+ICFQEVTP IYDI R+S WWKKY CS+SYE A S  Y+CMQL KLPVKSF C+
Sbjct: 302 QLHSPDVICFQEVTPMIYDILRKSLWWKKYKCSLSYEEAGSRGYYCMQLAKLPVKSFSCK 361

Query: 406 KFSYSAMGRELSVGEIQVNQNKTLVIATSHLESPTPGPPTWDQMNSKERVNQAKEAINIL 465
           +FSYSAMGREL V EI+VN N  LVIATSHLESP+PGPPTWDQM SKERVNQAKEAIN L
Sbjct: 362 EFSYSAMGRELCVAEIEVNPNNLLVIATSHLESPSPGPPTWDQMYSKERVNQAKEAINFL 421

Query: 466 KRYPNVIFCGDMNWDDKLDGHFPYVEGWFDAWERLRPAEIGWTYDTKSNKMLTGNRLLQK 525
           ++YPNVIFCGDMNWDDK DG+FP  EGWFDAWE+LRPAE+GWTYDTKSNKMLTGNR LQK
Sbjct: 422 QKYPNVIFCGDMNWDDKRDGNFPVSEGWFDAWEKLRPAEVGWTYDTKSNKMLTGNRALQK 481

Query: 526 RLDRFICSLTDFKITSIDMIGTDAIPGVTFSKQKKLKNKIQELILP 566
           RLDRFIC L+ F+I+ IDMIGT+AIPGVT+SKQKKLKNKIQEL LP
Sbjct: 482 RLDRFICRLSGFEISCIDMIGTEAIPGVTYSKQKKLKNKIQELTLP 515

BLAST of Spo13795.1 vs. UniProtKB/TrEMBL
Match: A0A0K9R8R3_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_101000 PE=4 SV=1)

HSP 1 Score: 603.2 bits (1554), Expect = 3.200e-169
Identity = 290/290 (100.00%), Positives = 290/290 (100.00%), Query Frame = 1

		  

Query: 276 DSVYSLDQEKVGKAKIDSESFSEEFTEDAQLEEVSGVIKILSYNVWFREDLEVHKRMQAI 335
           DSVYSLDQEKVGKAKIDSESFSEEFTEDAQLEEVSGVIKILSYNVWFREDLEVHKRMQAI
Sbjct: 199 DSVYSLDQEKVGKAKIDSESFSEEFTEDAQLEEVSGVIKILSYNVWFREDLEVHKRMQAI 258

Query: 336 GDLIKLHSPDIICFQEVTPGIYDIFRRSTWWKKYCCSVSYEMAASGAYFCMQLTKLPVKS 395
           GDLIKLHSPDIICFQEVTPGIYDIFRRSTWWKKYCCSVSYEMAASGAYFCMQLTKLPVKS
Sbjct: 259 GDLIKLHSPDIICFQEVTPGIYDIFRRSTWWKKYCCSVSYEMAASGAYFCMQLTKLPVKS 318

Query: 396 FRCEKFSYSAMGRELSVGEIQVNQNKTLVIATSHLESPTPGPPTWDQMNSKERVNQAKEA 455
           FRCEKFSYSAMGRELSVGEIQVNQNKTLVIATSHLESPTPGPPTWDQMNSKERVNQAKEA
Sbjct: 319 FRCEKFSYSAMGRELSVGEIQVNQNKTLVIATSHLESPTPGPPTWDQMNSKERVNQAKEA 378

Query: 456 INILKRYPNVIFCGDMNWDDKLDGHFPYVEGWFDAWERLRPAEIGWTYDTKSNKMLTGNR 515
           INILKRYPNVIFCGDMNWDDKLDGHFPYVEGWFDAWERLRPAEIGWTYDTKSNKMLTGNR
Sbjct: 379 INILKRYPNVIFCGDMNWDDKLDGHFPYVEGWFDAWERLRPAEIGWTYDTKSNKMLTGNR 438

Query: 516 LLQKRLDRFICSLTDFKITSIDMIGTDAIPGVTFSKQKKLKNKIQELILP 566
           LLQKRLDRFICSLTDFKITSIDMIGTDAIPGVTFSKQKKLKNKIQELILP
Sbjct: 439 LLQKRLDRFICSLTDFKITSIDMIGTDAIPGVTFSKQKKLKNKIQELILP 488

BLAST of Spo13795.1 vs. UniProtKB/TrEMBL
Match: V4UKW6_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10014440mg PE=3 SV=1)

HSP 1 Score: 404.4 bits (1038), Expect = 2.200e-109
Identity = 187/264 (70.83%), Positives = 216/264 (81.82%), Query Frame = 1

		  

Query: 302 EDAQLEEVSGVIKILSYNVWFREDLEVHKRMQAIGDLIKLHSPDIICFQEVTPGIYDIFR 361
           ++++   VSG +KILSYNVWFREDLE+H RM+ IGDLI+LHSPDIICFQE+TP IYDI  
Sbjct: 430 DNSESGAVSGSLKILSYNVWFREDLEMHPRMKTIGDLIQLHSPDIICFQEITPNIYDILC 489

Query: 362 RSTWWKKYCCSVSYEMAASGAYFCMQLTKLPVKSFRCEKFSYSAMGRELSVGEIQVNQNK 421
           +S+WWK Y CSV  EMA S  YFCMQL+KLPVKSF CE F  S MGREL V E++V + K
Sbjct: 490 KSSWWKGYRCSVPNEMADSRGYFCMQLSKLPVKSFTCEPFKNSIMGRELCVAEVEVQEKK 549

Query: 422 TLVIATSHLESPTPGPPTWDQMNSKERVNQAKEAINILKRYPNVIFCGDMNWDDKLDGHF 481
            LV+ATSHLESP PGPPTWDQM SKERV QAKEAIN+LK+ PNVIFCGDMNWDDKLD  F
Sbjct: 550 PLVVATSHLESPCPGPPTWDQMFSKERVEQAKEAINLLKKNPNVIFCGDMNWDDKLDSKF 609

Query: 482 PYVEGWFDAWERLRPAEIGWTYDTKSNKMLTGNRLLQKRLDRFICSLTDFKITSIDMIGT 541
           P  +GW DAW  LRP E GWTYDTKSNKML+GNR LQKRLDRFICSL DFKI  IDMIG 
Sbjct: 610 PLPDGWVDAWTELRPGENGWTYDTKSNKMLSGNRTLQKRLDRFICSLRDFKIVRIDMIGV 669

Query: 542 DAIPGVTFSKQKKLKNKIQELILP 566
           +AIPG+ + K+KK++ ++Q+L LP
Sbjct: 670 EAIPGLLYVKEKKVRKEMQKLELP 693

BLAST of Spo13795.1 vs. UniProtKB/TrEMBL
Match: A0A067GEZ3_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g012811mg PE=4 SV=1)

HSP 1 Score: 401.4 bits (1030), Expect = 1.900e-108
Identity = 203/324 (62.65%), Positives = 233/324 (71.91%), Query Frame = 1

		  

Query: 242 TFKRKLRDLRGNSKPIKIDGFGEVKSFRKAVSFVDSVYSLDQEKVGKAKIDSESFSEEFT 301
           T KRK+RD          D  G+   FR      +SV   D    G +  +SES +    
Sbjct: 139 TGKRKIRDQ---------DCDGDFDGFR----VTNSVSIKDDTTSGPSADNSESGA---- 198

Query: 302 EDAQLEEVSGVIKILSYNVWFREDLEVHKRMQAIGDLIKLHSPDIICFQEVTPGIYDIFR 361
                  VSG +KILSYNVWFREDLE+H RM+ IGDLI+LHSPDIICFQE+TP IYDI  
Sbjct: 199 -------VSGSLKILSYNVWFREDLEMHPRMKTIGDLIQLHSPDIICFQEITPNIYDILC 258

Query: 362 RSTWWKKYCCSVSYEMAASGAYFCMQLTKLPVKSFRCEKFSYSAMGRELSVGEIQVNQNK 421
           +S+WWK Y CSV  EMA S  YFCMQL+KL  KSF CE F  S MGREL V E++V   K
Sbjct: 259 KSSWWKGYRCSVPNEMADSRGYFCMQLSKLQAKSFTCEPFRNSIMGRELCVAEVEVQGKK 318

Query: 422 TLVIATSHLESPTPGPPTWDQMNSKERVNQAKEAINILKRYPNVIFCGDMNWDDKLDGHF 481
            LV+ATSHLESP PGPPTWDQM SKERV QAKEAIN+LK+ PNVIFCGDMNWDDKLDG F
Sbjct: 319 PLVVATSHLESPCPGPPTWDQMFSKERVEQAKEAINLLKKNPNVIFCGDMNWDDKLDGKF 378

Query: 482 PYVEGWFDAWERLRPAEIGWTYDTKSNKMLTGNRLLQKRLDRFICSLTDFKITSIDMIGT 541
           P  +GW DAW  LRP E GWTYDTKSNKML+GNR LQKRLDRFICSL DFKI  IDMIG 
Sbjct: 379 PLPDGWVDAWTELRPGENGWTYDTKSNKMLSGNRTLQKRLDRFICSLRDFKIIRIDMIGV 438

Query: 542 DAIPGVTFSKQKKLKNKIQELILP 566
           +AIPG+ + K+KK++ ++Q+L LP
Sbjct: 439 EAIPGLLYVKEKKVRKEMQKLELP 438

BLAST of Spo13795.1 vs. TAIR (Arabidopsis)
Match: AT1G11800.1 (endonuclease/exonuclease/phosphatase family protein)

HSP 1 Score: 346.3 bits (887), Expect = 3.600e-95
Identity = 160/253 (63.24%), Positives = 198/253 (78.26%), Query Frame = 1

		  

Query: 313 IKILSYNVWFREDLEVHKRMQAIGDLIKLHSPDIICFQEVTPGIYDIFRRSTWWKKYCCS 372
           +KILSYNVWFREDLE++ RM+AIG LI+LHSP +ICFQEVTP IYDIFR+S WWK Y CS
Sbjct: 173 LKILSYNVWFREDLELNLRMRAIGHLIQLHSPHLICFQEVTPEIYDIFRKSNWWKAYSCS 232

Query: 373 VSYEMAASGAYFCMQLTKLPVKSFRCEKFSYSAMGRELSVGEIQVNQNKTLVIATSHLES 432
           VS ++A S  Y+CM L+KL VKSF  + F  S MGRELS+ E++V   K LV ATSHLES
Sbjct: 233 VSVDVAVSRGYYCMLLSKLGVKSFSSKSFGNSIMGRELSIAEVEVPGRKPLVFATSHLES 292

Query: 433 PTPGPPTWDQMNSKERVNQAKEAINILKRYPNVIFCGDMNWDDKLDGHFPYVEGWFDAWE 492
           P PGPP WDQM S+ERV QAKEAI IL+   NVIF GDMNW DKLDG FP  + W D WE
Sbjct: 293 PCPGPPKWDQMFSRERVEQAKEAIEILRPNANVIFGGDMNWCDKLDGKFPLPDKWVDVWE 352

Query: 493 RLRPAEIGWTYDTKSNKMLTGNRLLQKRLDRFICSLTDFKITSIDMIGTDAIPGVTFSKQ 552
            L+P ++G+TYDTK+N ML+GNR LQKRLDR +C L D+K+  I+M+G +AIPG+++ K+
Sbjct: 353 VLKPGDLGFTYDTKANPMLSGNRALQKRLDRILCRLDDYKLGGIEMVGKEAIPGLSYVKE 412

Query: 553 KKLKNKIQELILP 566
           KK++  I++L LP
Sbjct: 413 KKVRGDIKKLELP 425

The following BLAST results are available for this feature:
BLAST of Spo13795.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match Name                         E-value   Identity  Description
gi|731351024|ref|XP_010686817.1|   6.2e-222  70.1      PREDICTED: uncharacterized pro...
gi|870852204|gb|KMT04148.1|        4.0e-213  70.5      hypothetical protein BVRB_8g18...
gi|870852203|gb|KMT04147.1|        1.5e-212  71.2      hypothetical protein BVRB_8g18...
gi|902205245|gb|KNA15127.1|        4.6e-169  100       hypothetical protein SOVF_1010...
gi|743907026|ref|XP_011046939.1|   8.0e-129  46.7      PREDICTED: uncharacterized pro...
BLAST of Spo13795.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniprotKB/TrEMBL)
Total hits: 5
Match Name        E-value   Identity  Description
A0A0J8BW46_BETVU  2.8e-213  70.5      Uncharacterized protein OS=Bet...
A0A0J8BSK5_BETVU  1.1e-212  71.2      Uncharacterized protein OS=Bet...
A0A0K9R8R3_SPIOL  3.2e-169  100       Uncharacterized protein OS=Spi...
V4UKW6_9ROSI      2.2e-109  70.8      Uncharacterized protein OS=Cit...
A0A067GEZ3_CITSI  1.9e-108  62.6      Uncharacterized protein OS=Cit...
BLAST of Spo13795.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy SwissProt)
Total hits: 0
Match Name  E-value  Identity  Description
BLAST of Spo13795.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 1
Match Name   E-value  Identity  Description
AT1G11800.1  3.6e-95  63.2      endonuclease/exonuclease/phosp...
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR Term   IPR Description                       Source   Source Term    Source Description                         Alignment
IPR001876  Zinc finger, RanBP2-type              SMART    SM00547        zf_4                                       coord: 50..75, score: 2.4E-5; coord: 89..113, score: 1.
IPR001876  Zinc finger, RanBP2-type              PROSITE  PS01358        ZF_RANBP2_1                                coord: 91..110, scor
IPR001876  Zinc finger, RanBP2-type              unknown  SSF90209       Ran binding protein zinc finger-like       coord: 87..112, score: 8.1
IPR005135  Endonuclease/exonuclease/phosphatase  GENE3D   3.60.10.10                                                coord: 313..527, score: 1.5
IPR005135  Endonuclease/exonuclease/phosphatase  PFAM     PF03372        Exo_endo_phos                              coord: 316..486, score: 2.7
IPR005135  Endonuclease/exonuclease/phosphatase  unknown  SSF56219       DNase I-like                               coord: 313..545, score: 9.95
None       No IPR available                      PANTHER  PTHR15822      TRAF AND TNF RECEPTOR-ASSOCIATED PROTEIN   coord: 311..557, score: 6.2E-111; coord: 51..256, score: 6.2E
None       No IPR available                      PANTHER  PTHR15822:SF4  TYROSYL-DNA PHOSPHODIESTERASE 2            coord: 51..256, score: 6.2E-111; coord: 311..557, score: 6.2E

GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0004527 exonuclease activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
molecular_function GO:0008270 zinc ion binding
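
The GO table above can be regrouped by category for downstream use. The following is a minimal Python sketch, assuming each row consists of a category, a GO accession, and a free-text term name separated by single spaces; `GO_ROWS` and `group_go_terms` are hypothetical names introduced here for illustration.

```python
from collections import defaultdict

# Rows copied from the GO table above: category, accession, term name.
GO_ROWS = """\
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0004527 exonuclease activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
molecular_function GO:0008270 zinc ion binding
"""


def group_go_terms(text):
    """Group (accession, term name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in text.strip().splitlines():
        # Split on the first two spaces only; the term name may contain spaces.
        category, accession, name = line.split(" ", 2)
        grouped[category].append((accession, name))
    return dict(grouped)


go_by_category = group_go_terms(GO_ROWS)
for category, terms in go_by_category.items():
    print(category, [accession for accession, _ in terms])
```

Limiting the split to two separators keeps multi-word term names, including those containing commas, intact.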
Co-expression
Gene      r value
Spo02208  0.66