Spo14921.1 (mRNA)

Overview
NameSpo14921.1
TypemRNA
OrganismSpinacia oleracea (Spinach)
DescriptionTyrosyl-DNA phosphodiesterase 1
LocationSpoScf_01888 : 35367 .. 58162 (-)
Sequence length2209
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
AATAAACCGGCCTGCTGCTGAAAGAAAGGCGCGGAAATAACGTAACTGAAGAATTTTCCTTGTAAATGCGATTAAATTATTAAAAGCTTGCCACTGCACCGGCAAAGCGGCAACTGAGTAATTTCAAGTTCTGAGGCTGTACTATTCTCTCTCTTAGGGTTCCATTATCTCTTCTCTCCATTCGCATGCTTTCGGAGCTTCTTCAATGGCGTCTTCTCAGGTATCTCTCATTCGTTACCCTTTCTCTCTCTCCTCGTAATTTTGAATTTAATGGGACGTTTTTGGTGTTTGAAGATAGGAATTCTGGTTCCGTTGAACAAGAACCTGGAGAAGGAGAAATCGTTGCCGGAATTTCCGATTTCTGAAGGTGATAATGTGATAGGGCGTAATAATGTTCCGGTGACTGATAAAAGACTCAGTCGCAAGCACCTCATTTTGTCCGCTTCATCTGGTGGCTCTACTGATTTGCGTGTGGTATAATTTTTGAACTTCTCTCATTTACGTGTTTCATTTGGTGCATTTTGTCTGAATTTTGCTTGTGATTGGAAATTGGAATTATCTACTTATGATGAGTCATTATTTAGAAAATTGATAGTACTTGTATGAGTTGTATCGCTATTGATGCAATGTTAAAATACTGAAGTTGAAGCTTACTTTTCAATTTTTCATTTAATTTAAATGAATATTTGACCTTCATTCGGATTTGGACAAATTGAAGTTGTGGCGATTGGCGAATATTTAAGCTAATAATTATATTAGCTTATACTTCGTAACATGTAAGTTTGAAGCAAATAGCTACGCATGATACATTTCGTTTAGAAGTACAAAACAGCTGTAGTTTTTTTACGTGAAGTGTAAGCTTTTAATTTCAAGAGGGAAGGTCCGTTTGCAATGTGCAATCTCACACTACATTATGTGCATTATTAAACATTAGAAAAGAAAGTAAAAAGGAGAAATTGGACAGTTGCGAAACTATGATTAGCATTCTCAAGTGTTCTCTTATTTGGAATGCAAGAGATGGTATCAGTAATGTTCAGCGAAAGCTGCTAATGTGTTGTTTGAAGTTTTGAGGACAATGTGTTTTCCTTCTAGGCTGAAGTTTGCAAGTGGTAATGTAAGTCCTTGAAAGTTGGCTAATCAGAATTGGTTAAGCTGTGCCCTATTCATTGAGATATTGGGAAGTTATTGAAGGGTCAATTTAAGTTATGTTGAAGGACTTTCTCTATACACTACTCCGTATTTGTTTAATAGGGCACTTGTATTCCGAGGATTAGACACCATCACTTTTGGCCAACTTTTCAATTGAAAATAATATGTCCAAATAACATTGTTGCTCATTCTTATCTTGGATCTTTAATGAATGACCTGGCAGGAAGGAATGAATCCTGTGGTGATTAATTCTGGTGGTCAGAGGAAGTTACTGAATTCTGGGGAAAAAGTTACATTTAGAAATGGTGATATCTTGGAGTTGATTCCAGGGAGCCACTATTTCAAGTATGAAATTTTAAGTAATCAGAGAAACCCCGGCGTTGTTAGCAAAAGGAGTGAAGATGTTAGTGAATGTACAAATGTTGGAGATGGCAGGAAGAGAGCACGAGAAGACTTGAATGCTGGAGCTTCTACAGAACAGTCAACGGTACTCTCTGCTGTTTGAATCCTCACATCATATATTGCTCACTGTATTCGTCAACTTCCTTTTTAGTTCCCAACTCCAATTCATAGATGTCAGCTGTGTTCTCCCTTGAGCATAGTAATATGACATAGATTTCTCCCCTTCCTCTTATGGCATTTCTCCTCAGATCGCCCTCTACTGACGGGAAGGGAACTTATTCAGAAGACTGAATCGTCCTCTTCCTGTAATCTGACCTTACCATAGCTTTCTTAAAGCTTTTGGCTTATTGATCTCCAACTGTATTTGTATTCGTATCTGGTAATTCCCTTTGTAGTGGATCAAGATATTCTAGGCTAATCATTTCTTCCTTGAAAATGGAAATCGCCCTTAGTAATTTAACTAGAAGGAAAGTTCTAATAATCTTGATACAACTCAAAATTATTGACATTTTTTTTGAGGAAGTAAAATAAAATATAGACATTTATGGATTCAATGCGAAAATTGTTATGCATTAAATTATAGGAAATTGCTGAAGTAAAAAATGGGTATTCGCTTAAGTCGAAGCTTTTGGTATGAGGTTTTGTTTGAAATTCCAGTATGAATCTGGGGGTTCAAGCGATTCATTGGGTCTTAACAAGATAAATTCCTATAAGTGAATTATTTTAAAGAAAACTCACATTATAATTGTACACAAATAAAAAAAGAATATTCAAGAGGACTATAGTCAAGAGAAAGACGATTGAGCTGACTTGATATTTTTTATCAACTTGTTATTGAATAGCTGGATATTAGTGGCTTCTGTCATTTTCACAGAAAAAGAGAAATTGCCTTTGCAAAGAGAGACAGAAGTTCACTTCTTTATGTAAAGTATAGTGGTTGACATTAGTGATTCCTACCCTCTCTTTTCCTTACTCTTTTAAATTACTTTTATAAATATGTTTAAGAAAACTTTACCTGCAAACTGCATAGTAATTTGCTAGTTGTCTCTTACCTGCTCTTTTGCTTTGTGTATTTGTACAGCTTCCACATGGAATTGGGAGAAAATTAAAAGAAAATATAGATAATGTAAGTGTTGAATCAAAAAAACAAAATCTTTCAGTCTCTGGGAATAGCGAGGAGGCGATACGTCATTTCCATGTATCCGATGATGGATTGCCTTTAACTTTTCGTCTTATGAAAGTACAAGGGTTACCAGAATGGGCGAACACCTCTTGTGTATCTATTGACAATGTGATTGAGGTAATGTGTACTATGAACGTTGAGCTGCACGTGTTATGGTTTGCATGAAAGACATTTTTGACATGAATAGCTAGAATGCACTTGATCAAGGTTGACATTTAATTATGAGTAATTGGATATTCATTTTTGCCTCGTGTTTGAGGTGATCGATCTTAGGAAAAGGTCCTTGTAGATTGAATATTTATTTTTGCCTCGTGTTTGAGGAATAAGTGATCGATCTTAGGCAAAGGTCCTTGTAGATTAATGAGATTATGAATTTATGAATTTATGATGTTGTTGACAACCTGGATTGAAAATGTAATTGCCTTCAACATTATTTGACTGATCTAGTGAGTCTCTGTTCAATTATTTATTATGAGCTTTAGGAGAAAATACTATCAAATCATTTGAAAATATATTTAGGCACATCCATTCATGTTGTTTTTTTATTTCACTTTTGTGCTTGACATCTGTCTTAGCATGCAAAGTGAACTAGGACGTATATTAGTTGGATGAATTTCCTGAATAGATAAATGTGCTTCGTGATTCATGAAGTTATCAGACTGACTGGTTGTTGATCTTTTATTTAGGGGGATGTGCTTGTTGCTGTACTTTCAAATTACATGGTGGACATAGACTGGTTGTTATCTGGTAATAACCTTGATAAATTCACTCCTTTGAATGTGCCGTTTGTCAGATCAACTATTATGGAGCGTAAGGTGTTGACAGGAAGCTCGCTTTTAAGTTGTTATAGACTTATAGTAGAATCAGGTCATATAGTGCATAATGCTGTTGTGTGGGGAATAGTATTTGAGTTGTCGTTGTTGTCTCTTTGCCTCTATGCTGTTTTATGTTGGAATAATTATTGGGGTGTCGAAAATTCAATTCATACACAATTACACAGTTACATCTAAATATATAATCCCCTGCAAAAAAATCTTTTTGTGGATTTGAAAAAGGTTTGATTTTTAATCGTTCAGCTTCCCTGTTCTTCAAGTAATCCTTGCTGCATTAGAGTCTGCAGTTGCTACTTTACAGGGAAAGATATATTCTTGAAGGTGCTTGTGGGTACGCGATTATGTAGAAGGTTGTTTAGTTACATATAAAGGCAGTTCTGTAAAGGAGTTTCGACTTTCCTTGTTGCTAAAACCTATCCCATAACATCTGTCGTACAATGACACATCAATGTTAATTATAGGCTATGTTACCACATTAGCTTACATGATTGACACTTTTGAATGGATACATTAAGTTCCGACTCAAAATATAGTCACTATAAATGTACAGCCACTTATTTTTTAGCATGCTGGAAAGGATTTTATCAAGTATTCGTATAATCGTATGTTATAAAAATATATTTTAATATTTATATTACTCGTATGATGTTGAAGTGAACGTTTCCAACTTTTCAATTCAGGCCATGGCAACCATGACGCAAAATTTAACACTGTTAATGCCGTCTCTGTAACTCATTGACTATTATTGCAGCCTTGATTACAACTGTTGGTTTTTCTGTGCTTGTGGGTTCTGGTGATTTTTAGTCTTAGATCTTTTGAGCATTTCATCGTACGAGATAGCTAATGTTTATTTATTTGTTTTCATAGAATCCAAATGTTTGGTTTTGTCCGTTTTCAAGTGAAATCCTAGTTGACTTCTGGTTGGGATTTAATATGGAAATCTTGTTATTCATCCTTGGGTACCCCCAACTGTTGAATGTGTTCTATTTTGTAGTACTACTTGCTACGTCACTCTTTTCAGCTCAACCCTTCCTGACATTTATTGTCTTCATCTTCTGTTTTATTCTGTAGCATGCCCAATGCTTAAAAAAGTTCCTCAGGTGCTAGTTGTTCATGGAGAAAGTGACGGTACCGCTGATTACATGAAGGTATGTTGCGTGGACTAATATTTGGACTAATATTTGGACTAATATCTTATAATGCTCCAGATCTGGAATCTGAGTATCTAGTTGACGTACTTACTCAGCATCAGATACATTTTCTTTGTAGAGGAACAAACCCAGCAATTGGATTTTACACAAACCCCCATTACCCATCTCGTACGGGACACACCATTCGAAGGCTATGCTTCTTGTGTATCCTCGAGGAGTGAGGGTCATTGTGCATACAGCAAACTTGATTAATGTCGATTGGAATAATAAAAGTCAAGGCTTGTGGATGCAAGATTTCCCTTGGAAAGATCAGAATGAGGCAAGCAAGGGGTGTCCATTTGAAAGTGATCTTATTGATTATCTACAAGCTCTAAAGGTTTATATTTTCTTAATTTTGTGTGCTAAGTGGAAGGTTTAGTGCATATAATATAATCCCTTGCTATTATATTATATTATTTGTTTTATTATTGGTCATGCTTAAGTGTTAAACTATAAAATTGTCTAACAGCGGGAGGGTTTTCTTCCCTCTTCCCTCTTTTCTCTTTGTCCCCTTCTCCTTTTATTAAAAATACATGTCTTATGTCCTTTAGTTTTCTTTTATATTTTCCTAATCTACATTGGAGTTGGAAGTATTAATTTGCTTCTTTATTACATTTGATCACAATTGATAGGATCAATACAACAATAAATAAAAGTTCCAAAGAATTTATTACCCATTTAGTGAAAAAAATCTTTTTTATTGGAATTTAATTTTACATGCTTTACCGAAGTCTACACACCAAAAATAATCGAGATTGTAGTTAAGGGTTTTGAAGCAGATTAGTTGAGCTTATGTTGGATCAGAAGTCCATTTTTGCAACTTTTTGATTTTATGATATTGCTAAAGATTGTGTGATGATTGGGTTTCGGATTTGAGCCCCAAATCTCATGAAGCCATTGTCTGGACTTTGGAGTGCCGATTTGAGGACTTTATGTGGAGGAGATTAGAGAAAGTGTCTTGTAAAGGACGAAAGTCCCTGATCCTAATGGTTTTGTTACCTTAGGATACCGTTTGAAGGAACCTATGTATGATGTTCATACTTCCAAAAACAAAAAATATTTATCTTCTTTTTCTTAAAAAATATCTCGTTTGGTGAGTTTTTGAGACTGATATTGTCGGAAAGAGACATACTGACATCCTTTTTCCCCCAATTTTGTCCATAAGGGATAGGTCCAAGAAATCGAGATGGCTTTCGGCTGATTAGGTTACGACTTAACTCACTAATGTATAAACGATTACGACTAAGGTGTTGCAAATCAGTTTGGGATGGTGTTTTCTAGTACGGTAACTAAGGTGCCTAGTTCATTGGTTGACAGTTGTTATATTTTGGGCACTACCTTGGTAATGGGATGGGGGTAGTTGTTAGTTGTTGATGTGAGAAAGGCTTAATTTTGAGAAAGCGTATGATAAGGTTAGTAAAAAGTTGCTTGATAGAGTTTTGGCATACAAGGGGTTTGGGACTTGGGAGTTGTTGGAGGCCTTGAATTTTGGGAAGTCTCATTACTGTTCTTTACCCTATGTGGTAATCAATGGAACGCCTACAAGTTGGTTTGGTGTGTCGCGGGTCCTTTGGCAATATGACCCCTTGTCTCCTTTTATATTTTAGTGTAGTCGTGGATGTCCTTGATAGTTGTTTTTTTCCAGAGGTGTTGCTTGGTAAAATTAAGGGCCTAAATGTTTTGCGATAAGGGTAGTTGCTATTTCCCGCTTGCTGTGACACACACACTTCTTGGATGGCTTTTTAAGTTCAAGAATGTCGTTACTCAACTTTAGGGGTTGAAGCCTGCAATTGGTCTTTAGGCTTAATCTTATTAGGTCATTAATTTGCGGTATTGGATTTGTTTAATCTGGCTTTGAGGTTTATGTGGTTGGGACTTGGGAGTTTTTCTATTAGGTATCTTGGGGTTCCTTTTTTGCTAATCTTAGGATTCTTTCTTTTTGTGATGATATGATTTTCTAGTTGAGAAGTCTCTATGACATTAGATGTGTGGAAGAGGGCCTATATTTTTCTTCGAAAGAGGATACCTCTTACCAGGTCTTTTCTTATATCCTCTTTTTAACCTTTTCCTCTCAAAGGAGGGACGCCGAAAAAGCTAGAGATATGCAGGATATTGTTGTGGGGGGGGGGGGGGGNGGGGGGGCGAGGGTGACGAGGAAATGAGAAATCGTCTTGTTGCTTGTACTTCGAGTTTCTCGTAAGAACATAGTTTGGGTTTTGGGAAGGTGGTATCCAATTTGTTTAAGATGTTAGTGCTTATGTGTGAAGTCTGACTCCCTTTAATGCAGGGTGATATAGAGTAAGCTTGGCTTCCATTGACATCTTAGCCACTAAGGTAGGAAATAGGAAATTAGAAGACATGGACGTCCTGTTGGGGTTTATGGATTCCCTCTTTCTATGATCATCCTATTCGCAAGCCGTTTATTTTGGCAACTTGTCTCATGATCCTTTTACAAACAATTGAAAACTTCCAATCCAATAAGCTTCCTTATTTCCGATCTTAGTTTTAAAAGGCGTGAAGCGCACTTAAGCGCAAAGGATCCTTGGAGCCTAGGCGCTGGACGCAAAGCGAAGCGCACGCTTTTCATAGGCGAGGCACACCTTCAATTATTTTTAAAAAAAATCTTATATGATATAAACTTTAATTAAAAAGAACAGTTTTTGCAAATTGTTAAAAGATCCCTTTTTAATTAAAGGAAAAAGGGTCACATGCCCACAAGTTACACAAGTAGAGTAGACAACAATGGGTTATTTTATTCTTTTAGGGGAAGGACAACAAAAAGTTGAAAAAAAAAGAAAAATGGTGTAAAGTAGAAATCCAATGCTAATCTAAAAAGAGAAAAAACAAAGATAAGAAAGGGAAGGGTCATCTGAAAGTACATAAAAAGGGCAAATTGTTGACAACTAGCAATGTTAAAAGAAGAAAATGACAAAAGAAACCAAAAACTGTCACTTTCAACCGGCCTTTTTTCTTTTTTACTTTTATCCTTCCTTTTTTGTCCAAGCTAAGGGATTGGGGTGGGGCCAGGGTCACATGCCCCCTTATTTGGTGTGCAACGGTCACTATTCAAACAGAAAGCACACTCTCACACCTAAGGGTTGCCGCTTTTTCGCCTGAAGCTTATGTAAAAGCGCATAGAAGCGCGCGCTTCTTGTAGTTGCTCCGCTTGTGCAGGGTAGAGCGTTTTTGTTGCGCCTAGAGCCGCTTCTTGCTTTTAGGCGCGATTTTAAAACAAAGTTTCTGATTCATTTGAAAGGCAATGGTCCTTTTACGAAGGTTAAGGCTTGGGAAATAACCTTTCAGTAAGAAAAATAATCTGATATTTTGTTATTTCGTAGTTGCAGTGGCCAAGTAAAAATGTGTTCTCTCTAGTTTGTGCTTATGCTCAGTGGTCATGATCTTAAAGTTGGGGTGTCGTCAATTAGCAAAATTATACTCTCTCCGCCCCGAAATACTCGCCCCGCTTCCGTTATGAAGCCGTCTGATACGTGTCGTTTAGCGTATTAAATTAAAACCTATCTTATACCTTCAAAAATATTAACAAGTAGTAAAAGGGTAAGAAAGGATCGATCCCACGAGGAGACTTATTAATCTACTGAACATCAAAGCTTAACCAGTGGAACTAAATTGAACATTATTTACATAAACTATATACAAATACTTACCTATTCACATAAAGAGGGGGGGGGTTGTGAAATTCCTACGAACCACTACTTGGTAATGGCTAAAAAAATGACTACTACTTACAATTATACAGAAAGTGACGACACAACTTAACCAATTCAGTTTATGGTTTTTCTACCAGATTAGATTCTTTAATCTTTGAAAATAATCAAACTTGAGGAATTAACCAATTTAATCCACTAGACTAACTGTAGAGACTTAAGTAACAAAGACTCCCGGTACTCCCTTATTTGTAATCAATTATGAACTAAGCCCAATTGATCCTAAACCACTCTAACTTACACAAATATGGTCGCAATCTGCATAATTTAGAAATTGGCAGCAATATAAACTTGGAGACATCATTAGTCCTAATTAATCTAATCACATAAATGTTTAGGCTAATTAGATTGAGTTTACTTTCATAATCTATACTATCACCATGATCGCAGGCTCCATTATAAATTACGTGCTACTCATTATACGATCTAAACTAAGCCAAATCATATAATTAGCAATCAAATACAATCATACATAATGAAAGATAACAATCAAACATATAATTGTAATTTAATCAAAGTAAAAATAAGGAAACCCAATTCATAATTAAAACAGAATAATCACTCAAGTAGGTTGTAAAACCTCAACCAAAACTTCATACTAGAACTTAATCACTTAATATGACAAAGCAACATGGTAATTAAAGATGAGAAATTGGAACAATTGAGTAGAAGAAGAACTTATTATATCTTCAGATTAAGGACACCCAAGCTAGCTCCGGCTTCGACCCCGTAACTCGCCGTCGTCCTTCTTGACCTTGATTCGAGACTAAGCTCTCTCCAATGGCTGTCTTCTCCTTTTCCAATTACCCAATTTCGAACCCGATAATAACCCAATTACTCACGATATTTTTTTGTGGCTCTCGTATATGCCCTCTCTCTCCGTATGTGTATTTGAGGGGTTGGGCTGCAATTTTATATGAGATAAACCCTAGTTTTTGACCGGAGATATCGGAACTAGAAGATCGGTAGTGTGATTCTGTTGTTGTTGCGTCGAGAAGGAAGGCAAAGAAACTGCTGCTCGTAAGAAGATTTCGTTCTCCAACAACCCACATATTTGATTTTTAGGTATTGGGCTTTCTCCACTAATTTTGGAATGTAGTGGGATTATTCACTTACATCCCACGTGTTGCTGTAGTTGGGTTTTCTTCATTTGGATGAAAGCTAGAGTTAGTGGGCTCAATTCCCTTCACAATGTTCCTCCTTATATCTTTGTATTAGTTTTGTGTCTCGTATGAATCCACGGGCTTCGGTTTTCTTCTCGAGCAATTACTACCTGTAGCATTAACGAGTACTAATTAAGCTAATTTAACAATAAAATAGAAACTAAGAAAAATATATTTATAATTATTAAAAGTAATTCAAGACCCAAAAAATAATAAATAAAAGATGATTGTATGGTTTAAATAAGGGAATAAATATATGGTAAATAATTCACTTATCACCGTCTCGGATGTTTATGTAAATATCATATGTTTTAAGGTCCCACTTGTATTCCTAATATCTTAAAAATCATTAAAAAAGTCACCCCCACCCTCTACCACCCTCAAAAAATTAACACATTTCCCACCAACTATATTAAAAAAATACCCAACTATCAACTAACACCTAATTAAATAGTAAGCCAATTAAATTGCCTTAAACTCTGTGCTTGTCAAACTGGTGTGAGTATTCTGGGCGGAGGGAGTAGTGTAAAGTGAGGGAAAACCGACAAGACCAGTTTTCTTTTTTGTTTTTATTTGGGGTGTCGGTGACCACCCATGATCCAATGTGGCTTCGCCACTGTTTGTGCTATTTGATGTTAAGGTGCATATATAGGTGCACTGTGTAGTATACTAGTATCATCTTTTCGTTCAATGCAAGGTGAATAGGTGGTGGTTGTTGTATGGTTTAGGATATTTTGTTTCAAAGGGATTTTTGGAAGTTTATATAATATTTAATCTTCTTATAAAAATGTTGTCCGCTCATCTACATGATGGGCATGTGTATTTCTGGACGTGGTTTGGTTGCGTTGGTTAATAAACAATGATCATATCTTCTATAGTATTAGCCATATGGCTAGTGCTTAGTAGAGTGGAATGTTATAGGGGTTTGGTGTTTGGAGCTCTTCCTATTTACCTTTGATTTACAGCTTGACAGGCATGTTATTTTTTAACCCTTAGAGGAGATAAGGTTAAACTCGTATTGAATTGTTGTATGGTTTTCCACGTTCTATTTACCATTGTTTTTCTACCTAAAATAAAAGCTTTCTAGAAGGTAAGGGGTTCAAAACGGTGAATGTTAATGGTTTAGGGTGTCGTTCAAAGCAAACCTTGATTAAGGAGGTAATGATTTAGACCTCCCATAATATATTTTACTTGATGACACAAAAGTTGATTCTTTTGATCCTTCATTGCTTAAGAGTTTATGGGAGGAGTAAGATGAGGAATGGGGTTTTCTACCTTTTCTCGGAACTCCAGGAGGATTCTTTGTGGGATGGGATACACCATTTTCATCCAAGGAAAAGGTGTTATGGATGTTTCTCGGCTTGGTGTGTCTAGAGAACTTGGACGTTTGGGGTAGGTGAGATGGTGGATTTCATTAGTTTTTTGTCCATGTCGTCATTGAGAATAGGGTGTTGTAAGAGTTAACTGCACTTGCATTGTTTTGTAAGAGTTAAGTGCACTTGCATTGCTCTTCTTGGTAGATTATTGGTGGAGATTTATATAGTGCGAATGGGGAAGAAGATTGGGTTGTTCAGGAGGTATGACTTATATGGTTGAGTTTAATTCTTTCATTAGGGAATAGAGAGCGTAAAATTCAAGACATTTCTTTAAGTAATCGCAAAATGAGCAATGCACAATGCACAATACACAAGATCAATAACTTTTTTGCTCACTGATGAGTTTGAAGGTGAACCCTGTTCTTATTCAAGAGGTATTTCCTTGGCTAGTTTCCCTTGCTATGATAATTTCCTATTTCCTTTAATCTTCTCTGGTCAAATGGGACGTCGCCTTTTTATTTTGAACACGTTTCTAAGTCCAAGTTGGAGAATAGTGGTAGACTTCTCATGTTTAAGGATTGAGGGCTTTTGTTTATGAAAAAGTTAAAGTTCCTCAATAGAATTTGAAAGGGTGGAATAAGGAGGTTTTTTTTTGGAGATGTGAGGCAGGGAAATGATGTCTTTTAAAGGAGATTGCTTTAACTGATATACGAGTTCGTATTGGCTATGAATAGGTAACACAAACTAGTTTAAACCTAAAGGCAAAGTTGAAAGAGGTGGTTTTAAGGGAGAAACAATGTCAATATTAGAGGAAAAGATGTCAAACTACTTTACAAAGTTGTGATTGGGAGGAGGCAAAATTCCATTAATTAAATGGGGAGTGACTTTTGAAACACCTTCTACGGAGAGACTCCCCAGTTATGTAAGACTCGATTCTCATCGGAGTGGATTTAACAACCATGAAGGTCTTGATAGGGATCCTATTCTTCCTTATGTAGGTGAGTGGCTAGAAGTACTTTGGGGAGGAGGAAGTTAGAAGAGCAGTGTTTGAATGTACAAGGATAAGGCTCCATGATCTTTCAATGGCTTTCTACAAGGTTTGTTGGGACGAGTTTAAAGATGACGAGTTGGTTTGTAGAAAAATTGTATAAATCCAAATTCTAGTCTTGGTACCTAGAAAGGATCGATGAGTAAAAGTCAAGGATTACCTTTCTTAAATTTACTCAGCATATAAAATGTTCACATAAGTGTTGTTGTCTTGTTTGTGAGAGGTTCGTTCCTATAATGTCTCTTAGTCTGAATGTGTTAGGAATTAAGCGCAACTTAATTAAATCGACAATAACTCATGTCGGAGATTTGAGTCAAAATCGGACTCTTGGTTACATGCCAAGCAAAGGTTATTGTCAAACTAAGATATCTCGTAGAAATTGGTATTAGAGTATAAATATCTCACATGGGAGAAAAAGTGATTTTCGCACTCCGCAAGACTCTGTGTTTGTTAAGCTTCTTGTTTGTTACTCCCTCCGTCCCGGAATACTTGACCTGTTTTCCTTATCGGGCCGTCCCTTAATACTTGACCTGTTTCTAAAAATGGAAATATTCTAACAATATTATATTATTTCTCACTCCACCCCTATTAACCCACCTACCCCCTACTCCATACAAAAAATAATTAAAAATCCAACCCCTACTCTCCCCCAACCCCACCCCTTTACACATTTCCCACTTACTACATTAAAATAATACCCCACTATCAACTACTACCTATTAAATTAAATAAGTCAATTCAAGTCCCTTAAACTCTGTGCCGGTCAAACCGGGTCGAGTATTCCGGGACGGAGGGAGTAATATTATGTTTCCTAGTTTTATATAGAATTTTATTTAGGAAACTAATAGTATTTTGGTATGTAATAACCTCTAGACTTATTTAGTATTGTGGACCAAGTATAATTCTAGTAGACGAGGGTTTCTAGTTTGGCCTATAAAAGCGGGTTTAGGCCTAGCTAGAAAATAAGAGGGAAAATACATTGGTGAGAGGAATCATGTATTGGGGGATTGTTATACTATTTTGCAGGAACTTAATAATAGGGAAATTTGTTTTTTACCACATAAAAAAAATCGAAAAATTGTTTTTTACCACCAAAAAAAAATAAAACTTGTATTTTACCACCTAAAAATAATAATTTAACATTTGTTTTTGACCACCTAAAAATGGAAAAAATAGATAAAACCGTTATGTTGAGGGGCACAATAACGGTAATATATCTAATATTTTTCTCCTTTTCCACGATTTCATGTATTTAACTCCCTTCTTTTCCCCTACTTTATTCTTTTTATGAATTTACATGATTTTTTCTCTCATTAATTCACAAAATTAAAAAAATTATATGGAAATGAAGAGAGGAGGGTAAAGGAGTTAAAGAAAATAGCAGAGGGAGTGTATAAAATTGGAGGAAAATTGAATTAGTGAGCCCTACATAAAGACTTCCATCTTTTTTTTTGCCAGTGGTAAAAACAAGTTTTAACTTTTGTTTAGGTGGTAAAATATAAGTTTTAATGTTTTAGGTGGTAAAAAATAATTTTTTGATTTTTTTAGGTGGTTAAAAAACTATTTGTCTTTAATAATAAGATATTAGTTGAGGGGAGGAAATATAAATTTGTCATAAACACTTGTGTCCTTTATTATTGTTTTTGCTATTTGTTTTACATTGTATTTGTAGGGTTTGTTCCTAATTGTTCGTGGCACGCGTTTGGTTGTAACAGAATGAATGAGATTTTAGAAGGTAGACAATTCTAGATGTGATTTTTATGGCCAATAAATATGTGGATGGTGCTCGTTGTCAAGGCTATGAAGGGGTGGATTTTAAGTTTGATTTTGAGAAGGCCTATGACTGGTTGAGTGGGGAATTCTTTGATGTTATTTTTGGAAGGAAAGGGTTTGGGGAGTGGTGGAGACATTGGATAAAAGGTTGCATGTCCATTGTCCACTACCTTCATTTTTGGTGAAGGGAGTTATTCCGCTCTTATAGGGGATTATATCAGGGTCCTTTATCACCATTTCTTCACCCTTGTGGTTGATGTCTTTGGGAGAATTTCAAAGATATTAAAAAGGTTAGATGGGTGGAAGAAGATGTATTTCTCTCTCGGAGGCATTATCTAATTCAATCATGTTTGTCAAGTATAGCCTCTTATCATCTCTCTTCGTTCAAAATTCCTTTTGTTATTACACATGAGATTGAGAGGCTTATTACGGACTTCTTATGGTCGGAGGGGAGGGGACTGTGGAGGTGGGAAAAGATCCTTTTGTGAAGTGGGAATAGGTGTGTCGTCCAAAGGAAGGAAGGTTGGGTTTGGGGAAGTGATCACATAAAATATTCTCTTGAAGGGAAGAGGTGGTGGTGGTTTCTTCAAGAATCTCAATCTCTTTGATACAATGATTTCTACCGTACTTATAGATGTGTCAAAATGGTTCAAGTGCTGACAGATGACTCATAAGTGTCCTTAGAAGTTCATTTCGCAAGTATCAACTTTGTTCCCTTATTTGACAAAGTTAAAAGCTATGGGTGGTCAACACGTTTGATTTTTGGAGGAAATTTGGATAGGAGAACAACGTATTCCCTCGTTTGGATTTGCTTATCTACTGTTCATAATGCAATGCTCCAGTATCAGCTTCGTGAATGCGTAGGATTTACATGCTAGGAGTCCACCCCATAACCGAGAGGTAGAGGAGCTAGCTTATATATGTGATCTCTTGCCTATTCATTTGTTAGAACGAGGTACTCTAGAGTTTGGATGAGAGATTTTTCGGGTGTCTTTTCATGAAAAACCTTCTTTACTTCTTACTAAGCCCCCTAGGTTTGCAGACTAAGGTGAAAATACCTCAAGTGCAGGCTTTCATGTGGACGCTAGGTTTTGGGAGGATGAATAAGAATGACATCCAGAATTGGACGCCAGAACTCTTCGAAATCTCACTAGACATATGCATGTAGCGCCCTATCAAACAAAGATTCTTTTTCACAGTTTTCGAACAGTAAAGTACTAAGGTGTCTTTTTATCTATGGAGCAGTTGCAAAATTAGACTTGGGTTTGTCCTTTGAATATTAACTCCGTATCATTGGTTGCATATTCAGAATGTGGGTTTAGGAAGTCTATGGAGAAAAAGAAGCTGTGGAAATGTAGAGTATATGGTTGATTTCAACGACATGGCTGTGTTTTATCCGGAGCTTTGTCTCGTGTGATTTGCTGTGCGACAAAGTAAATTTGTTTTCATCTTTCCGGGTAAAGATAGCAGGGTTTTAAAAGGATTATTCAATTGGGGATACTCAAAGGTCTTGGATTAATCAGTAATTTGTCTTTTCGTTACCCTCTTCATGGTTGAATAATATATTATTTTTCATAAAAAATAGGTATACAAAAGCCACTCGAAATAACCAGCATGCCACTCTTATCATTTTAAACCTTGCTGGGGTCTTTTCTTATTGAAATGGAAGCCGTTACAAGGGTAGCAAAGGGAATGCAAGGATGCTGTCAGAAGTTTGCTTCGTTCTGTGCACAAGAAGTTAGTGGAAGAGTAGAATAAATAGGGAAGGATTGTAAAGCCAACAGTATCTTTGTTTTGGTGACATTACTTTTGGGAGGCCATACACTGCTCGACCCGGGGGTATTTGTAATTGTTTACTTACTCTTGATTAAGGAATACAAATTCTGCCGTCTTCCCTAATCTTCCCCTTCCTTATTTCGGTTTTCCTTAATCTGCTGTAATTGTCACTAACCTTGGGATTAGTAACCTGATCCGTAACAGTTGGTAATGGGTAGGGGTAATGTTGTCTATTAGCTTTCTCTTCCACCCCTCCTTTAAGTGTGTGCCAAAATTAAATGTGTCAACTAATTTTGAAAGGAGGAAGTATGTTCATTTAATTTTTAATAGTAAAATATTGGACGCATTAAACTGTGTACAAGATGGTGCTTAGTTTTCTACCTTGAAGCAGTAGCGTCTACAAGATGGAATCACATTGGACGCATTAAACTGTTTCTTTGTTACTGGGCATAAAGGCTTAAATCTAACATCAGAACAATCACAATAGTGTAACATATGTTTTTTTTCATAGTAAAATGGTAATTAGTTTTCTACCTTAAAAGCAGTAGCAGTCTACAAGATGAGAACGTTTGAATTTATTTTTTCTGAACTACAGACCTGCAGCCACTTTTAAGGTTTATAGACTTAATTTTCTGTTTAATTATACATCACCACATGACTTGCTGAGAGGTAAAATATCTGCAGCTATTTCCATGATGTAAATACAACTTCAATTTTGAATTTGAAGATTTGTTTATATGTTTAGTTGTTAAATTACAGATTTTTAGTGCATGCAAAACTGTAAGAAATGAAAGGACAGCAAACTTCCCTTCTTTCTCCTCTTCCACTTTTCTTTTCCAATCATTGTAAATACATATGTAGACTGTTGTTTTATTCTAAACCATGTGTTGTATTTTTTCAGTTGCCTGAATTTACTGCCAATTTTCCAGCCCTTGGCAGACTGAAAATTGATGCTTCCTTCTTTAAGAAGTTCAATTATGGAAATGCCGCGGTATAGTTGATTCCCTTTACATGGCTATTGACTTCTATTTTCCTAGTTATTATTTTATTATAGTTTCTGAAAGTTTATTCCTATTCGGAGCTGCTAAAATTGCTTTATTTATTTTTGCTTCTGTTAGTCATCAAAGTCATTACAGTTTCAATAATTTGGAATTTTTCTAAATAATGTGGTTGCGTAGAAGTGGTACTCAACTATTGGTATAGACAATGTCATATCTTCAATGTGTTGTAGACTCTTCAATTATCCAGGCAGTTGATCTCTTGTATCCTGCTGATAGTATGTTTGTTAGTGGGAATTGCAGACTGGAATTGGGAATGGGAGATTAAAAATTTAAAATTGCGAATTTGTTTCGTGGGAGGTGGGTGGGCTGCGGGCATGGAAGTGGCATCAGATGAAATTCTTCTCCCATGTTTCTATTTACCTTTTGGGGCCCAGGAGTTCACTTTAACAATGTTTAAGAGCCCGGGGAACTTATTTTCACGCGCAGCTTTTCATTTCTTGTACACTTATTCTCTTGTGACTTCTTTCAGTTGACACTAAATGTGTGTGTATTGTATTTTTTTTTTTTTTTGACTAGTTTGTATGTGTTATTCCAGGTCAGATTAATTGCATCTGTTCCAGGATATCATTCTGGGTCTAATTTAAAGAAGTGGGGTCACATGAAGCTACGCTCTATCCTAGAGCAATGTACTTTCGATGATGAGTTTAAGAAATCGCCACTAATTTATCAGGTAATGGGAGAATAGCTACTCCCTCAGTTTTCCTAATTACTTGTAGCATCAAAAACCGGGGTAAAGGAGTTAGTGGAAAGTGGTTTCCCTTTTGACTTATTTCTGGTGGGACAGTGGAAGTTGTTTAGAGATATGGGTTTTTAAAAATAATATAAGTGAACCTATATGATGGGGTAAGGGAGGGTCAAAGAATTATTTTGTCACAAAAAGAATAAAGAAAAAGTAGGCAGATCAATGAAGAACACACCTAAACGGAAAGTAGGAAGATCAATGGGGAACATGTGGACTATTCAACTTACTTATTTGAAGCTTTAGTTCAATTATCATTTTTTGATTAGCAAGAGAAATCCAGGCGGACAACCTTAAACATCCTTTGGATGAATTACAAAACGCTTTGAGATTCGAAAAAAAGAAGATAAAAGAGCACAACAAAACCCTAGCTCAGACATTGGCCAAACCCCAGCCCCTGTTCAGAGTGAACTAGGTGAATAAACATTCAAACGGCGAACGGGTGACAAATAACTAAGCTAAAAAAAAGCTCTTCCATCTCACTAATACAACAGATCTCAAAAGTAAAGAATAACCTGAATCACACGAAACTAATAACGAAATAAAAAGAAAATTGACAATCATAATCTACAAAGTTTACATCAAGAGTAGTGAAGTTTTAGTTCATGCTGGTGCATTAAACTCCTTGGTTTAGGTATATGATGTCCTTAGTTTTAAAATCAATTACCCTCCTATGCTAAAATCATTTTCCCGTTTTTGGGAGAAGATGAAGTTGGTTTAGGTATGTGCTGTCCTTAGTTTTAAAAAGCGCGCTTTTTTGCGCTTAAAGCCCTGGAAGCTCGAAAGCTTACAGCAAAAGGCTTCTGCCAGCCGAAGCCTAGCGACATACAGGAAGCGCGCACTTCCTAGCGCTTGGGCACTTATGCGCTTCCTGTAGTCTGGGCACATGAGCACATCAAGCCCTTCTTTTTTATAAAAAATAAAGCAACATGAGATCTTCTTTTTTAATGTATCACGTGATTTCTTTTTTTAAAGAAGCGAATATTTTGGGAAGTTTTTGAAAACCCTAGTGCAATTTTCTCCCGTGGCCGGATTTCTCTAGCCTTCTCGTCGGATTTCTCCCGTGGGATTCTGAATTTTTCTTTCATCGTTTTGATTTTTTTTTTGTCATCTTTCCTTTCTTCTCCATATTTTCTGAACTTTTGAAATTTTATTATTTCTTTCCGGGTTTTTTTAAGCACGAAACGAAGTCGCTTAAGCGAAGCGATTCGCCTATGGAAAGTGTGCGCTTTCGCTTTGCGCTTAAGCTCCAGGGCCCTTTTGCGCTTCAGTGAGTTTTGCGCTTTTTAAAACTAAGGTGCTGTCGTCGTAAAATTGGAGAAGACGGGGTGATGTCAGTTGCCTACCTACAGTTTCTGGTTGCCATGTTGCATATTTGAAATTTGGCCTTTTAGACAATGGCTGCTTTAAAGTGTCTCCCGTTCTCATTAAAGTTGTGTTTTGGTTCAGTTTTCCTCTCTTGGTTCACTGGATGAGAAGTGGATGACTGAACTAAGAACTTCATTGTCATCTGGCTTGTCAGCTGATAAATCCGCCCTAGGTTTGGGCGAACCACGGATCATATGGCCTACCGTGGAAGATGTGAGATGGTCACTGGAGGTAACATTTAATTGCTTTACCTGTTTTGGGTGCTTGTTGCATAATTTGTATTACTCCCTCCGTCTCAATTTAATTGTCTCAATTTTCGACCATCTTAATTTAATTGTCAGACTTCCTTTTATGGCATGGACCAATAATATTTTATTACATATTTTTTCCCACCTGCTTCCTCTCTCATTCAATTAAATATAATAGATAAGCTAATATCTACTAATCGCTAAATACTCTGTGAAAAGCTTAATGCGACAATTAAATAGGGACGGGGGGTTTATGTTTTTCTCTCTTGCTGACAACTTGTTTTTGACAGCTTGTTATAATTTAACTGGTTTCGTGGTTCATGTAGACTACTAGTTTACTTTGTCTCTCCAACCAAGTAACAGTAGAAATAGGGAGAAATTCCCTAAATTATAGCGTAAGCTGATCTGTAAAGAGTAGTATTCTTCAAATTTTTGCCCTTCAAAAGTTTGTTCAATCTTAAAACAAATCAATTTCCATGTGATGCCGTTGTCTGGGGGATCTATTGTTAAGCATTGACATACAATTAGTGGTATTGTTTTGTATTTAGGTTAGGGTTTCTCCGGACAGTCCTTAAACTTTCAGGCACATCAGGGAACCACTTGATGTTTTTTGGGCCTCCCAATGTCATACACTTATATCTCATATCTATACCCTTTCCTTAAAGACGAATTACTAGTTCGTCTATATGACTTGGTTTCATTTTAGTGACTGATGATTTATTGATGGTTCTGTAAGAAAATTATGAGTGCAACATGCTTATAGAGGAGCTTGTCACGACTCACGTGTACATTTTTATGTCTGTTTCTTCTGAGTATTAAAAAAACCCTGTTACACTTCATAATTACTAGTAATGTTGTTGAATGCGTTAATTGGTGTGACAGAAAATGAGGAAAGTTCAGATGGTTGTTCCTAGCTAAAAATTTACGTGCTTTCTCATGAAGTCTTTAAAATAATACGTTGTACTTCTCTATTGAAGACTAGGTGAATGCCTGAAAAATATTTGTCGGGAATGATAGTCTGGTAAATTTAAGTCTCTACCTCCTTCCCCACCCCCCCACCCCCCGACTTTGCTACTTATTGCATCGTGATACCGGTTATTTTAGAGACTTGTAAACTGCTGCAGATCGGTTCTGAATGGTTTATAGGAAAAAAAATCTTTTATGAAGATGTTACAAAACTTACTTTTTTCTTTCTGGTTCAGGGTTATGCAGCTGGAAATGCTGTCCCAAGTCCAATTAAAAATGTGGAGAAGGAATTTCTGAAGAAATACTGGGCAAAGTGGAAGGCTACTCACACGGGTCGCTGGTACCATCTTTTATTTTATCCACTTGACTAAATTTGAAGCTGAAAATTTTACTTTATTTTTTTTTATTCTCTAATGAAATGACAGTCAGCTGTATTAGTTGTCAAAACTGTAGTAGTGACTGGTAGTTCTTACTCCCTAATACGGTTGGTAAGTCGCCAGTTTTTTTTTTGTTTTTTGTTTATCGGAAAGATGAAGGACCCTACTAAACCAACACCTCTTAGATCAGCCAACCTATCTACACAGGTCAACGCTTGAGAGAGGCGGGATGATAAGGGAATCCTACTCCCAAGCATAACGTAGTTAATGGGCCTCGAACTTCACCAACCAAGCTTGGTTAACATTACAGTACTAGGGATGGTTTTTTTTATACAACAACAACCAACCCTTAGTCCCAAAATGATTGGGGTCGGCCGTCGGTTAACATGAAATGTCAAAATAATTGGGACGGTTCTTTTATATAAAAAAGTTTTTATCTGTGTCATTTAGCTCATCAGACTTGAACTGGCATAGGGGAACTTTTGTCTATTCGTGCTGGGAGTCTCTCTTGAATAGTACCCTGTGTCTTAATCTCTTACTCATTATTGTTGCTGTGGATTAGGATTGGAAAACGGACATATTGACAAGTTCATCAACTACAGTAACCAATGGGATTGTTAAAATTACCTAGTTGTCCTTGCCAAACATTTTAGTTTCTTATTCCATGGGTATAATTTTAAGTTGAATTGCGACCCTATGATAGAAGATCTGTCTAATAGATTTGTGTGCTTTACATATTGGTTGTTTTGTTCTTCATATTCGTATGTTGAGCTCTGTGACTTACAGTTCTAGCTTGCTCAGCGGTTTGTCCTCCTGACTATGCCTTTTTTCTGATCTTTCTTCTCATTTAAAGCAATAGTTTGTCAGTTATCAACAATGTCAAAAGATCTTGAAAAATCAATTCAAATTTGTGGAGTTCCCATGTTACTCATCAGTCTGATGTAATTAATAGTGCCTCCTGTCAGATCCCCTCTCCATGACAGTAAAGGTATATCACAGTTGCTTGATCGATAGATGCACACACACTATAAATCTTGATGACACATTTAGTGTGACCAAGATTGTTCCAGAGTTTCCAAATTGAGAAGCGTTTTCTAGGTTCAGAAGGGAAAATGGATACAACTTTTGTTTCCTCCGCCCCTTTAGTTATTTTTTTAGGGTGCAAAACTAGGGGAAGAACCCCGACAAAAAATTAAACCCTCCGTCCCTTTAGTTGATTGCCTGTTTATTAGTTATGGTTGAAGACACTGATGTATTATGTTACTTGTTGACAGTGATGGTGACATTCTTTTGTTTACAGCCGTGCAATGCCCCATATAAAGACATTTGTTCGTTATAATGGTCAAAATATCGCGTAAGAACATTTATCCTTATCTCTGATTCTTTCTAGTTTTTTATTTCCTTTCAACAAATAATTGTACCATAGGGTTCTGACCAGTGCCACTCATTTCAAAACAGTTGGTTCTTGCTAACTTCATCAAACCTCAGCAAAGCTGCTTGGGGAGCCCTTCAAAAGAATAACTCTCAATTGATGATACGTTCCTATGAGGTATTTGCAATTTCGTCGAGTGCTAAGGAGATGTTTATTTAGACATAACATATGATACTGTAACCGATACATGATTACCTGCATAACTGCATTATGTATTTGGGCTTATACCCTGAATGAACTCTTAATGCAAGTTGTTGTCAATCTATTTTTCTGACCTTTTTTGAATCCCGTACCCAGTATTGATGGTTTGACATGATTAAGGAAAGGAGGGAGTATTTAAGTTTTGAAAAAACAACAAAAAAACAACACAGTTTAAAAGAAATATGAGGTCTGACTCATCATGGGCTTTACAAGATTATGTATCTCATTCTTGCTGTTTATTCTGTCTGAATGACCCAGCTTGGCGTGCTGTTTTTGCCTACTGTTGTCAAAAATGGTTTTGGATTTTCTTGCACGAAGGATAAGAGCTCTTGTAAGGTATTTTTTTTCTTTCTATTCTGTTCTCGGGAGCTCTTGCAACATTGTTATAGTTGTTGTATTTTCTGATTTTGGGATGTTAATGCAGGATACAAGTGGATCAATGGCAAATTCTAGAAATCGGAAGATTAAACTAGTTTCACTGACTTGGCCAGGAAGAGATGATCATGATGATTCTAACTCTGAAGTAGTTCCTCTTCCTGTGCCTTACGAACTTCCTCCAAAGTTGTACTCTTCTCAGGGTCAGTTTTTTTTTCTTTCCTTTGCCCTTTGCCCTTTGCCCTTTTCCCTTTTTCTGTTTCGTATTTTGTTTATTGTTGAAGGTCACTTCTCGTGTAATTTCTTATCAAGAATGATGGCTTGTGCCCTTGTGGCGTCTAGGCGACCGGACCTTTGTTAGTTTCAGGCATGTCTCTAAATTCCTTTTCAACTGCAGATGTTCCGTGGTCGTGGGAACGTCAATACAGACAAAAGGATGTCTATGGTCAAGTTTGGCCCAGGCAGGTGTAG

mRNA sequence

AATAAACCGGCCTGCTGCTGAAAGAAAGGCGCGGAAATAACGTAACTGAAGAATTTTCCTTGTAAATGCGATTAAATTATTAAAAGCTTGCCACTGCACCGGCAAAGCGGCAACTGAGTAATTTCAAGTTCTGAGGCTGTACTATTCTCTCTCTTAGGGTTCCATTATCTCTTCTCTCCATTCGCATGCTTTCGGAGCTTCTTCAATGGCGTCTTCTCAGATAGGAATTCTGGTTCCGTTGAACAAGAACCTGGAGAAGGAGAAATCGTTGCCGGAATTTCCGATTTCTGAAGGTGATAATGTGATAGGGCGTAATAATGTTCCGGTGACTGATAAAAGACTCAGTCGCAAGCACCTCATTTTGTCCGCTTCATCTGGTGGCTCTACTGATTTGCGTGTGGAAGGAATGAATCCTGTGGTGATTAATTCTGGTGGTCAGAGGAAGTTACTGAATTCTGGGGAAAAAGTTACATTTAGAAATGGTGATATCTTGGAGTTGATTCCAGGGAGCCACTATTTCAAGTATGAAATTTTAAGTAATCAGAGAAACCCCGGCGTTGTTAGCAAAAGGAGTGAAGATGTTAGTGAATGTACAAATGTTGGAGATGGCAGGAAGAGAGCACGAGAAGACTTGAATGCTGGAGCTTCTACAGAACAGTCAACGCTTCCACATGGAATTGGGAGAAAATTAAAAGAAAATATAGATAATGTAAGTGTTGAATCAAAAAAACAAAATCTTTCAGTCTCTGGGAATAGCGAGGAGGCGATACGTCATTTCCATGTATCCGATGATGGATTGCCTTTAACTTTTCGTCTTATGAAAGTACAAGGGTTACCAGAATGGGCGAACACCTCTTGTGTATCTATTGACAATGTGATTGAGGGGGATGTGCTTGTTGCTGTACTTTCAAATTACATGGTGGACATAGACTGGTTGTTATCTGCATGCCCAATGCTTAAAAAAGTTCCTCAGGTGCTAGTTGTTCATGGAGAAAGTGACGGTACCGCTGATTACATGAAGAGGAACAAACCCAGCAATTGGATTTTACACAAACCCCCATTACCCATCTCGTACGGGACACACCATTCGAAGGCTATGCTTCTTGTGTATCCTCGAGGAGTGAGGGTCATTGTGCATACAGCAAACTTGATTAATGTCGATTGGAATAATAAAAGTCAAGGCTTGTGGATGCAAGATTTCCCTTGGAAAGATCAGAATGAGGCAAGCAAGGGGTGTCCATTTGAAAGTGATCTTATTGATTATCTACAAGCTCTAAAGTTGCCTGAATTTACTGCCAATTTTCCAGCCCTTGGCAGACTGAAAATTGATGCTTCCTTCTTTAAGAAGTTCAATTATGGAAATGCCGCGGTCAGATTAATTGCATCTGTTCCAGGATATCATTCTGGGTCTAATTTAAAGAAGTGGGGTCACATGAAGCTACGCTCTATCCTAGAGCAATGTACTTTCGATGATGAGTTTAAGAAATCGCCACTAATTTATCAGTTTTCCTCTCTTGGTTCACTGGATGAGAAGTGGATGACTGAACTAAGAACTTCATTGTCATCTGGCTTGTCAGCTGATAAATCCGCCCTAGGTTTGGGCGAACCACGGATCATATGGCCTACCGTGGAAGATGTGAGATGGTCACTGGAGGGTTATGCAGCTGGAAATGCTGTCCCAAGTCCAATTAAAAATGTGGAGAAGGAATTTCTGAAGAAATACTGGGCAAAGTGGAAGGCTACTCACACGGGTCGCTGCCGTGCAATGCCCCATATAAAGACATTTGTTCGTTATAATGGTCAAAATATCGCTTGGTTCTTGCTAACTTCATCAAACCTCAGCAAAGCTGCTTGGGGAGCCCTTCAAAAGAATAACTCTCAATTGATGATACGTTCCTATGAGCTTGGCGTGCTGTTTTTGCCTACTGTTGTCAAAAATGGTTTTGGATTTTCTTGCACGAAGGATAAGAGCTCTTGTAAGGATACAAGTGGATCAATGGCAAATTCTAGAAATCGGAAGATTAAACTAGTTTCACTGACTTGGCCAGGAAGAGATGATCATGATGATTCTAACTCTGAAGTAGTTCCTCTTCCTGTGCCTTACGAACTTCCTCCAAAGTTGTACTCTTCTCAGGATGTTCCGTGGTCGTGGGAACGTCAATACAGACAAAAGGATGTCTATGGTCAAGTTTGGCCCAGGCAGGTGTAG

Coding sequence (CDS)

ATGGCGTCTTCTCAGATAGGAATTCTGGTTCCGTTGAACAAGAACCTGGAGAAGGAGAAATCGTTGCCGGAATTTCCGATTTCTGAAGGTGATAATGTGATAGGGCGTAATAATGTTCCGGTGACTGATAAAAGACTCAGTCGCAAGCACCTCATTTTGTCCGCTTCATCTGGTGGCTCTACTGATTTGCGTGTGGAAGGAATGAATCCTGTGGTGATTAATTCTGGTGGTCAGAGGAAGTTACTGAATTCTGGGGAAAAAGTTACATTTAGAAATGGTGATATCTTGGAGTTGATTCCAGGGAGCCACTATTTCAAGTATGAAATTTTAAGTAATCAGAGAAACCCCGGCGTTGTTAGCAAAAGGAGTGAAGATGTTAGTGAATGTACAAATGTTGGAGATGGCAGGAAGAGAGCACGAGAAGACTTGAATGCTGGAGCTTCTACAGAACAGTCAACGCTTCCACATGGAATTGGGAGAAAATTAAAAGAAAATATAGATAATGTAAGTGTTGAATCAAAAAAACAAAATCTTTCAGTCTCTGGGAATAGCGAGGAGGCGATACGTCATTTCCATGTATCCGATGATGGATTGCCTTTAACTTTTCGTCTTATGAAAGTACAAGGGTTACCAGAATGGGCGAACACCTCTTGTGTATCTATTGACAATGTGATTGAGGGGGATGTGCTTGTTGCTGTACTTTCAAATTACATGGTGGACATAGACTGGTTGTTATCTGCATGCCCAATGCTTAAAAAAGTTCCTCAGGTGCTAGTTGTTCATGGAGAAAGTGACGGTACCGCTGATTACATGAAGAGGAACAAACCCAGCAATTGGATTTTACACAAACCCCCATTACCCATCTCGTACGGGACACACCATTCGAAGGCTATGCTTCTTGTGTATCCTCGAGGAGTGAGGGTCATTGTGCATACAGCAAACTTGATTAATGTCGATTGGAATAATAAAAGTCAAGGCTTGTGGATGCAAGATTTCCCTTGGAAAGATCAGAATGAGGCAAGCAAGGGGTGTCCATTTGAAAGTGATCTTATTGATTATCTACAAGCTCTAAAGTTGCCTGAATTTACTGCCAATTTTCCAGCCCTTGGCAGACTGAAAATTGATGCTTCCTTCTTTAAGAAGTTCAATTATGGAAATGCCGCGGTCAGATTAATTGCATCTGTTCCAGGATATCATTCTGGGTCTAATTTAAAGAAGTGGGGTCACATGAAGCTACGCTCTATCCTAGAGCAATGTACTTTCGATGATGAGTTTAAGAAATCGCCACTAATTTATCAGTTTTCCTCTCTTGGTTCACTGGATGAGAAGTGGATGACTGAACTAAGAACTTCATTGTCATCTGGCTTGTCAGCTGATAAATCCGCCCTAGGTTTGGGCGAACCACGGATCATATGGCCTACCGTGGAAGATGTGAGATGGTCACTGGAGGGTTATGCAGCTGGAAATGCTGTCCCAAGTCCAATTAAAAATGTGGAGAAGGAATTTCTGAAGAAATACTGGGCAAAGTGGAAGGCTACTCACACGGGTCGCTGCCGTGCAATGCCCCATATAAAGACATTTGTTCGTTATAATGGTCAAAATATCGCTTGGTTCTTGCTAACTTCATCAAACCTCAGCAAAGCTGCTTGGGGAGCCCTTCAAAAGAATAACTCTCAATTGATGATACGTTCCTATGAGCTTGGCGTGCTGTTTTTGCCTACTGTTGTCAAAAATGGTTTTGGATTTTCTTGCACGAAGGATAAGAGCTCTTGTAAGGATACAAGTGGATCAATGGCAAATTCTAGAAATCGGAAGATTAAACTAGTTTCACTGACTTGGCCAGGAAGAGATGATCATGATGATTCTAACTCTGAAGTAGTTCCTCTTCCTGTGCCTTACGAACTTCCTCCAAAGTTGTACTCTTCTCAGGATGTTCCGTGGTCGTGGGAACGTCAATACAGACAAAAGGATGTCTATGGTCAAGTTTGGCCCAGGCAGGTGTAG

Protein sequence

MASSQIGILVPLNKNLEKEKSLPEFPISEGDNVIGRNNVPVTDKRLSRKHLILSASSGGSTDLRVEGMNPVVINSGGQRKLLNSGEKVTFRNGDILELIPGSHYFKYEILSNQRNPGVVSKRSEDVSECTNVGDGRKRAREDLNAGASTEQSTLPHGIGRKLKENIDNVSVESKKQNLSVSGNSEEAIRHFHVSDDGLPLTFRLMKVQGLPEWANTSCVSIDNVIEGDVLVAVLSNYMVDIDWLLSACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAMLLVYPRGVRVIVHTANLINVDWNNKSQGLWMQDFPWKDQNEASKGCPFESDLIDYLQALKLPEFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQCTFDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVRWSLEGYAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDKSSCKDTSGSMANSRNRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVYGQVWPRQV
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Spo14921Spo14921gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Spo14921.1Spo14921.1-proteinpolypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo14921.1.utr5p.1Spo14921.1.utr5p.1five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo14921.1.CDS.16Spo14921.1.CDS.16CDS
Spo14921.1.CDS.15Spo14921.1.CDS.15CDS
Spo14921.1.CDS.14Spo14921.1.CDS.14CDS
Spo14921.1.CDS.13Spo14921.1.CDS.13CDS
Spo14921.1.CDS.12Spo14921.1.CDS.12CDS
Spo14921.1.CDS.11Spo14921.1.CDS.11CDS
Spo14921.1.CDS.10Spo14921.1.CDS.10CDS
Spo14921.1.CDS.9Spo14921.1.CDS.9CDS
Spo14921.1.CDS.8Spo14921.1.CDS.8CDS
Spo14921.1.CDS.7Spo14921.1.CDS.7CDS
Spo14921.1.CDS.6Spo14921.1.CDS.6CDS
Spo14921.1.CDS.5Spo14921.1.CDS.5CDS
Spo14921.1.CDS.4Spo14921.1.CDS.4CDS
Spo14921.1.CDS.3Spo14921.1.CDS.3CDS
Spo14921.1.CDS.2Spo14921.1.CDS.2CDS
Spo14921.1.CDS.1Spo14921.1.CDS.1CDS


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo14921.1.exon.16Spo14921.1.exon.16exon
Spo14921.1.exon.15Spo14921.1.exon.15exon
Spo14921.1.exon.14Spo14921.1.exon.14exon
Spo14921.1.exon.13Spo14921.1.exon.13exon
Spo14921.1.exon.12Spo14921.1.exon.12exon
Spo14921.1.exon.11Spo14921.1.exon.11exon
Spo14921.1.exon.10Spo14921.1.exon.10exon
Spo14921.1.exon.9Spo14921.1.exon.9exon
Spo14921.1.exon.8Spo14921.1.exon.8exon
Spo14921.1.exon.7Spo14921.1.exon.7exon
Spo14921.1.exon.6Spo14921.1.exon.6exon
Spo14921.1.exon.5Spo14921.1.exon.5exon
Spo14921.1.exon.4Spo14921.1.exon.4exon
Spo14921.1.exon.3Spo14921.1.exon.3exon
Spo14921.1.exon.2Spo14921.1.exon.2exon
Spo14921.1.exon.1Spo14921.1.exon.1exon


Homology
BLAST of Spo14921.1 vs. NCBI nr
Match: gi|902233940|gb|KNA23261.1| (hypothetical protein SOVF_026200 [Spinacia oleracea])

HSP 1 Score: 1350.9 bits (3495), Expect = 0.000e+0
Identity = 658/667 (98.65%), Postives = 661/667 (99.10%), Query Frame = 1

		  

Query: 1   MASSQIGILVPLNKNLEKEKSLPEFPISEGDNVIGRNNVPVTDKRLSRKHLILSASSGGS 60
           MASSQIGILVPLNK LEKEKSLPE PISEGDNVIGRNNVPVTDKRLSRKHLILSASSGGS
Sbjct: 1   MASSQIGILVPLNKILEKEKSLPELPISEGDNVIGRNNVPVTDKRLSRKHLILSASSGGS 60

Query: 61  TDLRVEGMNPVVINSGGQRKLLNSGEKVTFRNGDILELIPGSHYFKYEILSNQRNPGVVS 120
           TDLRVEGMNPVVINSGGQRKLLNSGEKVTFRNGDILELIPGSHYFKYEILSNQRNPG VS
Sbjct: 61  TDLRVEGMNPVVINSGGQRKLLNSGEKVTFRNGDILELIPGSHYFKYEILSNQRNPGAVS 120

Query: 121 KRSEDVSECTNVGDGRKRAREDLNAGASTEQSTLPHGIGRKLKENIDNVSVESKKQNLSV 180
            RSEDVSECTN+GDGRKRAREDLNAGASTEQSTLPHGIGRKLKENIDNVSVESKKQNLSV
Sbjct: 121 NRSEDVSECTNLGDGRKRAREDLNAGASTEQSTLPHGIGRKLKENIDNVSVESKKQNLSV 180

Query: 181 SGNSEEAIRHFHVSDDGLPLTFRLMKVQGLPEWANTSCVSIDNVIEGDVLVAVLSNYMVD 240
           SGNSEEAIRHFHVSDDGLPLTFRLMKV+GLPEWANTSCVSIDNVIEGDVLVAVLSNYMVD
Sbjct: 181 SGNSEEAIRHFHVSDDGLPLTFRLMKVRGLPEWANTSCVSIDNVIEGDVLVAVLSNYMVD 240

Query: 241 IDWLLSACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAMLL 300
           IDWLLSACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAMLL
Sbjct: 241 IDWLLSACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAMLL 300

Query: 301 VYPRGVRVIVHTANLINVDWNNKSQGLWMQDFPWKDQNEASKGCPFESDLIDYLQALKLP 360
           VYPRGVRVIVHTANLIN+DWNNKSQGLWMQDFPWKDQNEASKGCPFESDLIDYLQALKLP
Sbjct: 301 VYPRGVRVIVHTANLINIDWNNKSQGLWMQDFPWKDQNEASKGCPFESDLIDYLQALKLP 360

Query: 361 EFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQCT 420
           EFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQCT
Sbjct: 361 EFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQCT 420

Query: 421 FDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVRW 480
           FDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVRW
Sbjct: 421 FDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVRW 480

Query: 481 SLEGYAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFLL 540
           SLEGYAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFLL
Sbjct: 481 SLEGYAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFLL 540

Query: 541 TSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDKSSCKDTSGSMAN 600
           TSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDKSSCKDTSGSM N
Sbjct: 541 TSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDKSSCKDTSGSMVN 600

Query: 601 SRNRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVYG 660
           S NRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVYG
Sbjct: 601 SGNRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVYG 660

Query: 661 QVWPRQV 668
           QVWPRQV
Sbjct: 661 QVWPRQV 667

BLAST of Spo14921.1 vs. NCBI nr
Match: gi|731370342|ref|XP_010665690.1| (PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform X1 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 1137.9 bits (2942), Expect = 0.000e+0
Identity = 553/668 (82.78%), Postives = 602/668 (90.12%), Query Frame = 1

		  

Query: 1   MASSQIGILVPLNKNLEKEKSLPEFPISEGDNVIGRNNVPVTDKRLSRKHLILSASSGGS 60
           MASSQ+GILVPLNK+LEKEKSLPEFP+SEGDNVIGRNN+PVTDKRLSRKHLILSASS   
Sbjct: 52  MASSQVGILVPLNKSLEKEKSLPEFPVSEGDNVIGRNNIPVTDKRLSRKHLILSASSSDC 111

Query: 61  -TDLRVEGMNPVVINSGGQRKLLNSGEKVTFRNGDILELIPGSHYFKYEILSNQRNPGVV 120
              LRVEGMNPVVINSG QRK+LNSG+K + +NGDILELIPGSHYFKY   +NQR+  V 
Sbjct: 112 YAALRVEGMNPVVINSGDQRKILNSGDKASIKNGDILELIPGSHYFKYLTTNNQRDCDVF 171

Query: 121 SKRSEDVSECTNVGDGRKRAREDLNAGASTEQSTLPHGIGRKLKENIDNVSVESKKQNLS 180
           SK +E  S  TN    RKRARE+LN G+S E   + HG GRKLKE  +NV+ ES KQNLS
Sbjct: 172 SKVNEGGSVHTNFAVVRKRARENLNPGSSEEHMVVQHGSGRKLKETPNNVNFESNKQNLS 231

Query: 181 VSGNSEEAIRHFHVSDDGLPLTFRLMKVQGLPEWANTSCVSIDNVIEGDVLVAVLSNYMV 240
           +S + EEA+RHFHVSDD   LTFRLMKV+GLPEWANTSCVSID VIEGDVLVAVLSNYMV
Sbjct: 232 ISRSKEEALRHFHVSDDRSALTFRLMKVRGLPEWANTSCVSIDTVIEGDVLVAVLSNYMV 291

Query: 241 DIDWLLSACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAML 300
           DIDWLLSACPMLKKVPQVLVVHGESDGT +YMKRNKPSNWILHKPPLPISYGTHHSKAML
Sbjct: 292 DIDWLLSACPMLKKVPQVLVVHGESDGTVEYMKRNKPSNWILHKPPLPISYGTHHSKAML 351

Query: 301 LVYPRGVRVIVHTANLINVDWNNKSQGLWMQDFPWKDQNEASKGCPFESDLIDYLQALKL 360
           L+YPRGVRVIVHTANLI+VDWNNKSQGLWMQDFPWKDQNEA+KGCPFE+DLIDYLQALKL
Sbjct: 352 LLYPRGVRVIVHTANLIHVDWNNKSQGLWMQDFPWKDQNEANKGCPFETDLIDYLQALKL 411

Query: 361 PEFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQC 420
           P FTAN PALGR+ ID  FFKKFNYGNA+VRLIASVPGYH GSNLKKWGHMKLRSILEQC
Sbjct: 412 PVFTANLPALGRVTIDPFFFKKFNYGNASVRLIASVPGYHLGSNLKKWGHMKLRSILEQC 471

Query: 421 TFDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVR 480
           TFD+EFK+SPLIYQFSSLGSLDEKWMTELRTS+SSG+ ADKS LGLGEP IIWP+VEDVR
Sbjct: 472 TFDEEFKRSPLIYQFSSLGSLDEKWMTELRTSMSSGVLADKSPLGLGEPLIIWPSVEDVR 531

Query: 481 WSLEGYAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFL 540
            SLEGYAAGNA+PSP+KNVEK FL KYWAKWKATHTG CR+MPHIKTFVRYNGQN+AWFL
Sbjct: 532 CSLEGYAAGNAIPSPLKNVEKGFLNKYWAKWKATHTGHCRSMPHIKTFVRYNGQNLAWFL 591

Query: 541 LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDKSSCKDTSGSMA 600
           LTS+NLSKAAWG LQKNNSQLMIRSYELGVLFLP+VVKNG GFSCT +KSS +DT GS +
Sbjct: 592 LTSANLSKAAWGTLQKNNSQLMIRSYELGVLFLPSVVKNGCGFSCTGNKSSLEDTRGSTS 651

Query: 601 NSRNRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVY 660
           N RNRKIKLV+LTW GRD+ DDS+SE+V LPVPYELPPKLYSS+DVPWSW+R+Y QKDVY
Sbjct: 652 NCRNRKIKLVTLTWQGRDNDDDSDSEIVCLPVPYELPPKLYSSEDVPWSWDRKYMQKDVY 711

Query: 661 GQVWPRQV 668
           GQVWPR+V
Sbjct: 712 GQVWPRRV 719

BLAST of Spo14921.1 vs. NCBI nr
Match: gi|870843494|gb|KMS96656.1| (hypothetical protein BVRB_8g201170 isoform B [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 1134.0 bits (2932), Expect = 0.000e+0
Identity = 553/668 (82.78%), Postives = 603/668 (90.27%), Query Frame = 1

		  

Query: 1   MASSQIGILVPLNKNLEKEKSLPEFPISEGDNVIGRNNVPVTDKRLSRKHLILSASSGGS 60
           MASSQ+GILVPLNK+LEKEKSLPEFP+SEGDNVIGRNN+PVTDKRLSRKHLILSASS   
Sbjct: 1   MASSQVGILVPLNKSLEKEKSLPEFPVSEGDNVIGRNNIPVTDKRLSRKHLILSASSSDC 60

Query: 61  -TDLRVEGMNPVVINSGGQRKLLNSGEKVTFRNGDILELIPGSHYFKYEILSNQRNPGVV 120
              LRVEGMNPVVINSG QRK+LNSG+K + +NGDILELIPGSHYFKY   +NQR+  V 
Sbjct: 61  YAALRVEGMNPVVINSGDQRKILNSGDKASIKNGDILELIPGSHYFKYLTTNNQRDCDVF 120

Query: 121 SKRSEDVSECTNVGDGRKRAREDLNAGASTEQSTLPHGIGRKLKENIDNVSVESKKQNLS 180
           SK +E  S  TN    RKRARE+LN G+S E+  + HG GRKLKE  +NV+ ES KQNLS
Sbjct: 121 SKVNEGGSVHTNFAVVRKRARENLNPGSS-EEHMVQHGSGRKLKETPNNVNFESNKQNLS 180

Query: 181 VSGNSEEAIRHFHVSDDGLPLTFRLMKVQGLPEWANTSCVSIDNVIEGDVLVAVLSNYMV 240
           +S + EEA+RHFHVSDD   LTFRLMKV+GLPEWANTSCVSID VIEGDVLVAVLSNYMV
Sbjct: 181 ISRSKEEALRHFHVSDDRSALTFRLMKVRGLPEWANTSCVSIDTVIEGDVLVAVLSNYMV 240

Query: 241 DIDWLLSACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAML 300
           DIDWLLSACPMLKKVPQVLVVHGESDGT +YMKRNKPSNWILHKPPLPISYGTHHSKAML
Sbjct: 241 DIDWLLSACPMLKKVPQVLVVHGESDGTVEYMKRNKPSNWILHKPPLPISYGTHHSKAML 300

Query: 301 LVYPRGVRVIVHTANLINVDWNNKSQGLWMQDFPWKDQNEASKGCPFESDLIDYLQALKL 360
           L+YPRGVRVIVHTANLI+VDWNNKSQGLWMQDFPWKDQNEA+KGCPFE+DLIDYLQALKL
Sbjct: 301 LLYPRGVRVIVHTANLIHVDWNNKSQGLWMQDFPWKDQNEANKGCPFETDLIDYLQALKL 360

Query: 361 PEFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQC 420
           P FTAN PALGR+ ID  FFKKFNYGNA+VRLIASVPGYH GSNLKKWGHMKLRSILEQC
Sbjct: 361 PVFTANLPALGRVTIDPFFFKKFNYGNASVRLIASVPGYHLGSNLKKWGHMKLRSILEQC 420

Query: 421 TFDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVR 480
           TFD+EFK+SPLIYQFSSLGSLDEKWMTELRTS+SSG+ ADKS LGLGEP IIWP+VEDVR
Sbjct: 421 TFDEEFKRSPLIYQFSSLGSLDEKWMTELRTSMSSGVLADKSPLGLGEPLIIWPSVEDVR 480

Query: 481 WSLEGYAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFL 540
            SLEGYAAGNA+PSP+KNVEK FL KYWAKWKATHTG CR+MPHIKTFVRYNGQN+AWFL
Sbjct: 481 CSLEGYAAGNAIPSPLKNVEKGFLNKYWAKWKATHTGHCRSMPHIKTFVRYNGQNLAWFL 540

Query: 541 LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDKSSCKDTSGSMA 600
           LTS+NLSKAAWG LQKNNSQLMIRSYELGVLFLP+VVKNG GFSCT +KSS +DT GS +
Sbjct: 541 LTSANLSKAAWGTLQKNNSQLMIRSYELGVLFLPSVVKNGCGFSCTGNKSSLEDTRGSTS 600

Query: 601 NSRNRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVY 660
           N RNRKIKLV+LTW GRD+ DDS+SE+V LPVPYELPPKLYSS+DVPWSW+R+Y QKDVY
Sbjct: 601 NCRNRKIKLVTLTWQGRDNDDDSDSEIVCLPVPYELPPKLYSSEDVPWSWDRKYMQKDVY 660

Query: 661 GQVWPRQV 668
           GQVWPR+V
Sbjct: 661 GQVWPRRV 667

BLAST of Spo14921.1 vs. NCBI nr
Match: gi|731370345|ref|XP_010665691.1| (PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform X2 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 1134.0 bits (2932), Expect = 0.000e+0
Identity = 553/668 (82.78%), Postives = 603/668 (90.27%), Query Frame = 1

		  

Query: 1   MASSQIGILVPLNKNLEKEKSLPEFPISEGDNVIGRNNVPVTDKRLSRKHLILSASSGGS 60
           MASSQ+GILVPLNK+LEKEKSLPEFP+SEGDNVIGRNN+PVTDKRLSRKHLILSASS   
Sbjct: 52  MASSQVGILVPLNKSLEKEKSLPEFPVSEGDNVIGRNNIPVTDKRLSRKHLILSASSSDC 111

Query: 61  -TDLRVEGMNPVVINSGGQRKLLNSGEKVTFRNGDILELIPGSHYFKYEILSNQRNPGVV 120
              LRVEGMNPVVINSG QRK+LNSG+K + +NGDILELIPGSHYFKY   +NQR+  V 
Sbjct: 112 YAALRVEGMNPVVINSGDQRKILNSGDKASIKNGDILELIPGSHYFKYLTTNNQRDCDVF 171

Query: 121 SKRSEDVSECTNVGDGRKRAREDLNAGASTEQSTLPHGIGRKLKENIDNVSVESKKQNLS 180
           SK +E  S  TN    RKRARE+LN G+S E+  + HG GRKLKE  +NV+ ES KQNLS
Sbjct: 172 SKVNEGGSVHTNFAVVRKRARENLNPGSS-EEHMVQHGSGRKLKETPNNVNFESNKQNLS 231

Query: 181 VSGNSEEAIRHFHVSDDGLPLTFRLMKVQGLPEWANTSCVSIDNVIEGDVLVAVLSNYMV 240
           +S + EEA+RHFHVSDD   LTFRLMKV+GLPEWANTSCVSID VIEGDVLVAVLSNYMV
Sbjct: 232 ISRSKEEALRHFHVSDDRSALTFRLMKVRGLPEWANTSCVSIDTVIEGDVLVAVLSNYMV 291

Query: 241 DIDWLLSACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAML 300
           DIDWLLSACPMLKKVPQVLVVHGESDGT +YMKRNKPSNWILHKPPLPISYGTHHSKAML
Sbjct: 292 DIDWLLSACPMLKKVPQVLVVHGESDGTVEYMKRNKPSNWILHKPPLPISYGTHHSKAML 351

Query: 301 LVYPRGVRVIVHTANLINVDWNNKSQGLWMQDFPWKDQNEASKGCPFESDLIDYLQALKL 360
           L+YPRGVRVIVHTANLI+VDWNNKSQGLWMQDFPWKDQNEA+KGCPFE+DLIDYLQALKL
Sbjct: 352 LLYPRGVRVIVHTANLIHVDWNNKSQGLWMQDFPWKDQNEANKGCPFETDLIDYLQALKL 411

Query: 361 PEFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQC 420
           P FTAN PALGR+ ID  FFKKFNYGNA+VRLIASVPGYH GSNLKKWGHMKLRSILEQC
Sbjct: 412 PVFTANLPALGRVTIDPFFFKKFNYGNASVRLIASVPGYHLGSNLKKWGHMKLRSILEQC 471

Query: 421 TFDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVR 480
           TFD+EFK+SPLIYQFSSLGSLDEKWMTELRTS+SSG+ ADKS LGLGEP IIWP+VEDVR
Sbjct: 472 TFDEEFKRSPLIYQFSSLGSLDEKWMTELRTSMSSGVLADKSPLGLGEPLIIWPSVEDVR 531

Query: 481 WSLEGYAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFL 540
            SLEGYAAGNA+PSP+KNVEK FL KYWAKWKATHTG CR+MPHIKTFVRYNGQN+AWFL
Sbjct: 532 CSLEGYAAGNAIPSPLKNVEKGFLNKYWAKWKATHTGHCRSMPHIKTFVRYNGQNLAWFL 591

Query: 541 LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDKSSCKDTSGSMA 600
           LTS+NLSKAAWG LQKNNSQLMIRSYELGVLFLP+VVKNG GFSCT +KSS +DT GS +
Sbjct: 592 LTSANLSKAAWGTLQKNNSQLMIRSYELGVLFLPSVVKNGCGFSCTGNKSSLEDTRGSTS 651

Query: 601 NSRNRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVY 660
           N RNRKIKLV+LTW GRD+ DDS+SE+V LPVPYELPPKLYSS+DVPWSW+R+Y QKDVY
Sbjct: 652 NCRNRKIKLVTLTWQGRDNDDDSDSEIVCLPVPYELPPKLYSSEDVPWSWDRKYMQKDVY 711

Query: 661 GQVWPRQV 668
           GQVWPR+V
Sbjct: 712 GQVWPRRV 718

BLAST of Spo14921.1 vs. NCBI nr
Match: gi|870843493|gb|KMS96655.1| (hypothetical protein BVRB_8g201170 isoform A [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 1134.0 bits (2932), Expect = 0.000e+0
Identity = 553/668 (82.78%), Postives = 603/668 (90.27%), Query Frame = 1

		  

Query: 1   MASSQIGILVPLNKNLEKEKSLPEFPISEGDNVIGRNNVPVTDKRLSRKHLILSASSGGS 60
           MASSQ+GILVPLNK+LEKEKSLPEFP+SEGDNVIGRNN+PVTDKRLSRKHLILSASS   
Sbjct: 1   MASSQVGILVPLNKSLEKEKSLPEFPVSEGDNVIGRNNIPVTDKRLSRKHLILSASSSDC 60

Query: 61  -TDLRVEGMNPVVINSGGQRKLLNSGEKVTFRNGDILELIPGSHYFKYEILSNQRNPGVV 120
              LRVEGMNPVVINSG QRK+LNSG+K + +NGDILELIPGSHYFKY   +NQR+  V 
Sbjct: 61  YAALRVEGMNPVVINSGDQRKILNSGDKASIKNGDILELIPGSHYFKYLTTNNQRDCDVF 120

Query: 121 SKRSEDVSECTNVGDGRKRAREDLNAGASTEQSTLPHGIGRKLKENIDNVSVESKKQNLS 180
           SK +E  S  TN    RKRARE+LN G+S E+  + HG GRKLKE  +NV+ ES KQNLS
Sbjct: 121 SKVNEGGSVHTNFAVVRKRARENLNPGSS-EEHMVQHGSGRKLKETPNNVNFESNKQNLS 180

Query: 181 VSGNSEEAIRHFHVSDDGLPLTFRLMKVQGLPEWANTSCVSIDNVIEGDVLVAVLSNYMV 240
           +S + EEA+RHFHVSDD   LTFRLMKV+GLPEWANTSCVSID VIEGDVLVAVLSNYMV
Sbjct: 181 ISRSKEEALRHFHVSDDRSALTFRLMKVRGLPEWANTSCVSIDTVIEGDVLVAVLSNYMV 240

Query: 241 DIDWLLSACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAML 300
           DIDWLLSACPMLKKVPQVLVVHGESDGT +YMKRNKPSNWILHKPPLPISYGTHHSKAML
Sbjct: 241 DIDWLLSACPMLKKVPQVLVVHGESDGTVEYMKRNKPSNWILHKPPLPISYGTHHSKAML 300

Query: 301 LVYPRGVRVIVHTANLINVDWNNKSQGLWMQDFPWKDQNEASKGCPFESDLIDYLQALKL 360
           L+YPRGVRVIVHTANLI+VDWNNKSQGLWMQDFPWKDQNEA+KGCPFE+DLIDYLQALKL
Sbjct: 301 LLYPRGVRVIVHTANLIHVDWNNKSQGLWMQDFPWKDQNEANKGCPFETDLIDYLQALKL 360

Query: 361 PEFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQC 420
           P FTAN PALGR+ ID  FFKKFNYGNA+VRLIASVPGYH GSNLKKWGHMKLRSILEQC
Sbjct: 361 PVFTANLPALGRVTIDPFFFKKFNYGNASVRLIASVPGYHLGSNLKKWGHMKLRSILEQC 420

Query: 421 TFDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVR 480
           TFD+EFK+SPLIYQFSSLGSLDEKWMTELRTS+SSG+ ADKS LGLGEP IIWP+VEDVR
Sbjct: 421 TFDEEFKRSPLIYQFSSLGSLDEKWMTELRTSMSSGVLADKSPLGLGEPLIIWPSVEDVR 480

Query: 481 WSLEGYAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFL 540
            SLEGYAAGNA+PSP+KNVEK FL KYWAKWKATHTG CR+MPHIKTFVRYNGQN+AWFL
Sbjct: 481 CSLEGYAAGNAIPSPLKNVEKGFLNKYWAKWKATHTGHCRSMPHIKTFVRYNGQNLAWFL 540

Query: 541 LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDKSSCKDTSGSMA 600
           LTS+NLSKAAWG LQKNNSQLMIRSYELGVLFLP+VVKNG GFSCT +KSS +DT GS +
Sbjct: 541 LTSANLSKAAWGTLQKNNSQLMIRSYELGVLFLPSVVKNGCGFSCTGNKSSLEDTRGSTS 600

Query: 601 NSRNRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVY 660
           N RNRKIKLV+LTW GRD+ DDS+SE+V LPVPYELPPKLYSS+DVPWSW+R+Y QKDVY
Sbjct: 601 NCRNRKIKLVTLTWQGRDNDDDSDSEIVCLPVPYELPPKLYSSEDVPWSWDRKYMQKDVY 660

Query: 661 GQVWPRQV 668
           GQVWPR+V
Sbjct: 661 GQVWPRRV 667

BLAST of Spo14921.1 vs. UniProtKB/TrEMBL
Match: A0A0K9RUQ3_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_026200 PE=4 SV=1)

HSP 1 Score: 1350.9 bits (3495), Expect = 0.000e+0
Identity = 658/667 (98.65%), Postives = 661/667 (99.10%), Query Frame = 1

		  

Query: 1   MASSQIGILVPLNKNLEKEKSLPEFPISEGDNVIGRNNVPVTDKRLSRKHLILSASSGGS 60
           MASSQIGILVPLNK LEKEKSLPE PISEGDNVIGRNNVPVTDKRLSRKHLILSASSGGS
Sbjct: 1   MASSQIGILVPLNKILEKEKSLPELPISEGDNVIGRNNVPVTDKRLSRKHLILSASSGGS 60

Query: 61  TDLRVEGMNPVVINSGGQRKLLNSGEKVTFRNGDILELIPGSHYFKYEILSNQRNPGVVS 120
           TDLRVEGMNPVVINSGGQRKLLNSGEKVTFRNGDILELIPGSHYFKYEILSNQRNPG VS
Sbjct: 61  TDLRVEGMNPVVINSGGQRKLLNSGEKVTFRNGDILELIPGSHYFKYEILSNQRNPGAVS 120

Query: 121 KRSEDVSECTNVGDGRKRAREDLNAGASTEQSTLPHGIGRKLKENIDNVSVESKKQNLSV 180
            RSEDVSECTN+GDGRKRAREDLNAGASTEQSTLPHGIGRKLKENIDNVSVESKKQNLSV
Sbjct: 121 NRSEDVSECTNLGDGRKRAREDLNAGASTEQSTLPHGIGRKLKENIDNVSVESKKQNLSV 180

Query: 181 SGNSEEAIRHFHVSDDGLPLTFRLMKVQGLPEWANTSCVSIDNVIEGDVLVAVLSNYMVD 240
           SGNSEEAIRHFHVSDDGLPLTFRLMKV+GLPEWANTSCVSIDNVIEGDVLVAVLSNYMVD
Sbjct: 181 SGNSEEAIRHFHVSDDGLPLTFRLMKVRGLPEWANTSCVSIDNVIEGDVLVAVLSNYMVD 240

Query: 241 IDWLLSACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAMLL 300
           IDWLLSACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAMLL
Sbjct: 241 IDWLLSACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAMLL 300

Query: 301 VYPRGVRVIVHTANLINVDWNNKSQGLWMQDFPWKDQNEASKGCPFESDLIDYLQALKLP 360
           VYPRGVRVIVHTANLIN+DWNNKSQGLWMQDFPWKDQNEASKGCPFESDLIDYLQALKLP
Sbjct: 301 VYPRGVRVIVHTANLINIDWNNKSQGLWMQDFPWKDQNEASKGCPFESDLIDYLQALKLP 360

Query: 361 EFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQCT 420
           EFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQCT
Sbjct: 361 EFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQCT 420

Query: 421 FDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVRW 480
           FDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVRW
Sbjct: 421 FDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVRW 480

Query: 481 SLEGYAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFLL 540
           SLEGYAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFLL
Sbjct: 481 SLEGYAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFLL 540

Query: 541 TSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDKSSCKDTSGSMAN 600
           TSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDKSSCKDTSGSM N
Sbjct: 541 TSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDKSSCKDTSGSMVN 600

Query: 601 SRNRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVYG 660
           S NRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVYG
Sbjct: 601 SGNRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVYG 660

Query: 661 QVWPRQV 668
           QVWPRQV
Sbjct: 661 QVWPRQV 667

BLAST of Spo14921.1 vs. UniProtKB/TrEMBL
Match: A0A0J8B6P0_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_8g201170 PE=4 SV=1)

HSP 1 Score: 1134.0 bits (2932), Expect = 0.000e+0
Identity = 553/668 (82.78%), Postives = 603/668 (90.27%), Query Frame = 1

		  

Query: 1   MASSQIGILVPLNKNLEKEKSLPEFPISEGDNVIGRNNVPVTDKRLSRKHLILSASSGGS 60
           MASSQ+GILVPLNK+LEKEKSLPEFP+SEGDNVIGRNN+PVTDKRLSRKHLILSASS   
Sbjct: 1   MASSQVGILVPLNKSLEKEKSLPEFPVSEGDNVIGRNNIPVTDKRLSRKHLILSASSSDC 60

Query: 61  -TDLRVEGMNPVVINSGGQRKLLNSGEKVTFRNGDILELIPGSHYFKYEILSNQRNPGVV 120
              LRVEGMNPVVINSG QRK+LNSG+K + +NGDILELIPGSHYFKY   +NQR+  V 
Sbjct: 61  YAALRVEGMNPVVINSGDQRKILNSGDKASIKNGDILELIPGSHYFKYLTTNNQRDCDVF 120

Query: 121 SKRSEDVSECTNVGDGRKRAREDLNAGASTEQSTLPHGIGRKLKENIDNVSVESKKQNLS 180
           SK +E  S  TN    RKRARE+LN G+S E+  + HG GRKLKE  +NV+ ES KQNLS
Sbjct: 121 SKVNEGGSVHTNFAVVRKRARENLNPGSS-EEHMVQHGSGRKLKETPNNVNFESNKQNLS 180

Query: 181 VSGNSEEAIRHFHVSDDGLPLTFRLMKVQGLPEWANTSCVSIDNVIEGDVLVAVLSNYMV 240
           +S + EEA+RHFHVSDD   LTFRLMKV+GLPEWANTSCVSID VIEGDVLVAVLSNYMV
Sbjct: 181 ISRSKEEALRHFHVSDDRSALTFRLMKVRGLPEWANTSCVSIDTVIEGDVLVAVLSNYMV 240

Query: 241 DIDWLLSACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAML 300
           DIDWLLSACPMLKKVPQVLVVHGESDGT +YMKRNKPSNWILHKPPLPISYGTHHSKAML
Sbjct: 241 DIDWLLSACPMLKKVPQVLVVHGESDGTVEYMKRNKPSNWILHKPPLPISYGTHHSKAML 300

Query: 301 LVYPRGVRVIVHTANLINVDWNNKSQGLWMQDFPWKDQNEASKGCPFESDLIDYLQALKL 360
           L+YPRGVRVIVHTANLI+VDWNNKSQGLWMQDFPWKDQNEA+KGCPFE+DLIDYLQALKL
Sbjct: 301 LLYPRGVRVIVHTANLIHVDWNNKSQGLWMQDFPWKDQNEANKGCPFETDLIDYLQALKL 360

Query: 361 PEFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQC 420
           P FTAN PALGR+ ID  FFKKFNYGNA+VRLIASVPGYH GSNLKKWGHMKLRSILEQC
Sbjct: 361 PVFTANLPALGRVTIDPFFFKKFNYGNASVRLIASVPGYHLGSNLKKWGHMKLRSILEQC 420

Query: 421 TFDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVR 480
           TFD+EFK+SPLIYQFSSLGSLDEKWMTELRTS+SSG+ ADKS LGLGEP IIWP+VEDVR
Sbjct: 421 TFDEEFKRSPLIYQFSSLGSLDEKWMTELRTSMSSGVLADKSPLGLGEPLIIWPSVEDVR 480

Query: 481 WSLEGYAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFL 540
            SLEGYAAGNA+PSP+KNVEK FL KYWAKWKATHTG CR+MPHIKTFVRYNGQN+AWFL
Sbjct: 481 CSLEGYAAGNAIPSPLKNVEKGFLNKYWAKWKATHTGHCRSMPHIKTFVRYNGQNLAWFL 540

Query: 541 LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDKSSCKDTSGSMA 600
           LTS+NLSKAAWG LQKNNSQLMIRSYELGVLFLP+VVKNG GFSCT +KSS +DT GS +
Sbjct: 541 LTSANLSKAAWGTLQKNNSQLMIRSYELGVLFLPSVVKNGCGFSCTGNKSSLEDTRGSTS 600

Query: 601 NSRNRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVY 660
           N RNRKIKLV+LTW GRD+ DDS+SE+V LPVPYELPPKLYSS+DVPWSW+R+Y QKDVY
Sbjct: 601 NCRNRKIKLVTLTWQGRDNDDDSDSEIVCLPVPYELPPKLYSSEDVPWSWDRKYMQKDVY 660

Query: 661 GQVWPRQV 668
           GQVWPR+V
Sbjct: 661 GQVWPRRV 667

BLAST of Spo14921.1 vs. UniProtKB/TrEMBL
Match: A0A0J8E0L9_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_8g201170 PE=4 SV=1)

HSP 1 Score: 1134.0 bits (2932), Expect = 0.000e+0
Identity = 553/668 (82.78%), Postives = 603/668 (90.27%), Query Frame = 1

		  

Query: 1   MASSQIGILVPLNKNLEKEKSLPEFPISEGDNVIGRNNVPVTDKRLSRKHLILSASSGGS 60
           MASSQ+GILVPLNK+LEKEKSLPEFP+SEGDNVIGRNN+PVTDKRLSRKHLILSASS   
Sbjct: 1   MASSQVGILVPLNKSLEKEKSLPEFPVSEGDNVIGRNNIPVTDKRLSRKHLILSASSSDC 60

Query: 61  -TDLRVEGMNPVVINSGGQRKLLNSGEKVTFRNGDILELIPGSHYFKYEILSNQRNPGVV 120
              LRVEGMNPVVINSG QRK+LNSG+K + +NGDILELIPGSHYFKY   +NQR+  V 
Sbjct: 61  YAALRVEGMNPVVINSGDQRKILNSGDKASIKNGDILELIPGSHYFKYLTTNNQRDCDVF 120

Query: 121 SKRSEDVSECTNVGDGRKRAREDLNAGASTEQSTLPHGIGRKLKENIDNVSVESKKQNLS 180
           SK +E  S  TN    RKRARE+LN G+S E+  + HG GRKLKE  +NV+ ES KQNLS
Sbjct: 121 SKVNEGGSVHTNFAVVRKRARENLNPGSS-EEHMVQHGSGRKLKETPNNVNFESNKQNLS 180

Query: 181 VSGNSEEAIRHFHVSDDGLPLTFRLMKVQGLPEWANTSCVSIDNVIEGDVLVAVLSNYMV 240
           +S + EEA+RHFHVSDD   LTFRLMKV+GLPEWANTSCVSID VIEGDVLVAVLSNYMV
Sbjct: 181 ISRSKEEALRHFHVSDDRSALTFRLMKVRGLPEWANTSCVSIDTVIEGDVLVAVLSNYMV 240

Query: 241 DIDWLLSACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAML 300
           DIDWLLSACPMLKKVPQVLVVHGESDGT +YMKRNKPSNWILHKPPLPISYGTHHSKAML
Sbjct: 241 DIDWLLSACPMLKKVPQVLVVHGESDGTVEYMKRNKPSNWILHKPPLPISYGTHHSKAML 300

Query: 301 LVYPRGVRVIVHTANLINVDWNNKSQGLWMQDFPWKDQNEASKGCPFESDLIDYLQALKL 360
           L+YPRGVRVIVHTANLI+VDWNNKSQGLWMQDFPWKDQNEA+KGCPFE+DLIDYLQALKL
Sbjct: 301 LLYPRGVRVIVHTANLIHVDWNNKSQGLWMQDFPWKDQNEANKGCPFETDLIDYLQALKL 360

Query: 361 PEFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQC 420
           P FTAN PALGR+ ID  FFKKFNYGNA+VRLIASVPGYH GSNLKKWGHMKLRSILEQC
Sbjct: 361 PVFTANLPALGRVTIDPFFFKKFNYGNASVRLIASVPGYHLGSNLKKWGHMKLRSILEQC 420

Query: 421 TFDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVR 480
           TFD+EFK+SPLIYQFSSLGSLDEKWMTELRTS+SSG+ ADKS LGLGEP IIWP+VEDVR
Sbjct: 421 TFDEEFKRSPLIYQFSSLGSLDEKWMTELRTSMSSGVLADKSPLGLGEPLIIWPSVEDVR 480

Query: 481 WSLEGYAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFL 540
            SLEGYAAGNA+PSP+KNVEK FL KYWAKWKATHTG CR+MPHIKTFVRYNGQN+AWFL
Sbjct: 481 CSLEGYAAGNAIPSPLKNVEKGFLNKYWAKWKATHTGHCRSMPHIKTFVRYNGQNLAWFL 540

Query: 541 LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDKSSCKDTSGSMA 600
           LTS+NLSKAAWG LQKNNSQLMIRSYELGVLFLP+VVKNG GFSCT +KSS +DT GS +
Sbjct: 541 LTSANLSKAAWGTLQKNNSQLMIRSYELGVLFLPSVVKNGCGFSCTGNKSSLEDTRGSTS 600

Query: 601 NSRNRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVY 660
           N RNRKIKLV+LTW GRD+ DDS+SE+V LPVPYELPPKLYSS+DVPWSW+R+Y QKDVY
Sbjct: 601 NCRNRKIKLVTLTWQGRDNDDDSDSEIVCLPVPYELPPKLYSSEDVPWSWDRKYMQKDVY 660

Query: 661 GQVWPRQV 668
           GQVWPR+V
Sbjct: 661 GQVWPRRV 667

BLAST of Spo14921.1 vs. UniProtKB/TrEMBL
Match: E0CVK9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0116g00450 PE=4 SV=1)

HSP 1 Score: 916.8 bits (2368), Expect = 1.600e-263
Identity = 449/671 (66.92%), Postives = 528/671 (78.69%), Query Frame = 1

		  

Query: 1   MASSQIGILVPLNKNLEKEKSLPEFPISEGDNVIGRNNVPVTDKRLSRKHLILSASSGGS 60
           M+ SQIG LVPLN+NLE++ S P+ PI  G NVIGRN++ V+DKRLSRKHL L AS  GS
Sbjct: 1   MSLSQIGFLVPLNRNLEEDTSTPKLPIPTGANVIGRNSISVSDKRLSRKHLTLIASGNGS 60

Query: 61  TDLRVEGMNPVVINSGGQRKLLNSGEKVTFRNGDILELIPGSHYFKYEILSNQRNPGVVS 120
            D  VEG NPVV+ SG QRK L +GEK    N DI+ELIPG ++FKY  ++ ++     +
Sbjct: 61  VDAVVEGTNPVVVASGNQRKKLRTGEKAVITNDDIIELIPGHYFFKYVTVAGEKCEKKGN 120

Query: 121 KRSEDVSECTNVGDGRKRARE--DLNAGASTEQSTLPHGIGRKLKENIDNVSVESKKQNL 180
                  E   V   RKR R+  +  A A   Q+ + + +  + +  +   S  S+    
Sbjct: 121 SMDAQNMESNEVSLSRKRMRQVSEDEAFARKLQAEMENDVLVQERSLVTGKSGYSQASTA 180

Query: 181 SVSGN--SEEAIRHFHVSDDGLPLTFRLMKVQGLPEWANTSCVSIDNVIEGDVLVAVLSN 240
           S+  +  + EAIRHF +  D LPLT+RL++V+ LP WANTS VSI +VI+GDVL+AVLSN
Sbjct: 181 SIPSSHMNSEAIRHFSIPKDNLPLTYRLLRVKDLPAWANTSSVSIRDVIQGDVLIAVLSN 240

Query: 241 YMVDIDWLLSACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSK 300
           YMVDIDWLLS+CP L K+P VLV+HGE DGT D+MK+NKP NWILHKPPLPIS+GTHHSK
Sbjct: 241 YMVDIDWLLSSCPTLAKIPHVLVIHGEGDGTLDHMKKNKPPNWILHKPPLPISFGTHHSK 300

Query: 301 AMLLVYPRGVRVIVHTANLINVDWNNKSQGLWMQDFPWKDQNEASKGCPFESDLIDYLQA 360
           AMLLVYPRGVRVIVHTANLI VDWNNKSQGLWMQDFPWK Q E SKGC FE+DLIDYL  
Sbjct: 301 AMLLVYPRGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKVQKELSKGCAFENDLIDYLSV 360

Query: 361 LKLPEFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSIL 420
           LK PEFTAN PALG   I++SFFKKF+Y NA VRLIASVPGYH+GSNLKKWGHMKL S+L
Sbjct: 361 LKWPEFTANLPALGSFNINSSFFKKFDYSNAVVRLIASVPGYHTGSNLKKWGHMKLCSVL 420

Query: 421 EQCTFDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVE 480
           ++C FD EF+KSPL YQFSSLGSLDEKWMTEL +S+SSG   DK+ LGLG+P IIWPTVE
Sbjct: 421 QECIFDKEFQKSPLAYQFSSLGSLDEKWMTELASSMSSGSCDDKTPLGLGKPLIIWPTVE 480

Query: 481 DVRWSLEGYAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIA 540
           DVR SLEGYAAGNA+PSP KNVEKEFLKKYWAKWKATHTGRCRAMPHIKT+ RYNGQN+A
Sbjct: 481 DVRCSLEGYAAGNAIPSPQKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTYTRYNGQNLA 540

Query: 541 WFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDKSSCKDTSG 600
           WFLLTS+NLSKAAWGALQKNNSQLMIRSYELGVLFLP+ +  G GFSCT + S  K+  G
Sbjct: 541 WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLFLPSPINRGQGFSCTDNGSPSKNKCG 600

Query: 601 SMANSRNRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQK 660
              N+++++ KLV+LTW G +   DS+SEV+PLPVPYELPPK YSS+DVPWSW+R+Y +K
Sbjct: 601 LSENTKSQRTKLVTLTWEG-NRSSDSSSEVIPLPVPYELPPKQYSSEDVPWSWDRRYYKK 660

Query: 661 DVYGQVWPRQV 668
           DV GQVWPR V
Sbjct: 661 DVCGQVWPRHV 670

BLAST of Spo14921.1 vs. UniProtKB/TrEMBL
Match: A0A067JLC5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21271 PE=4 SV=1)

HSP 1 Score: 899.4 bits (2323), Expect = 2.600e-258
Identity = 434/668 (64.97%), Postives = 522/668 (78.14%), Query Frame = 1

		  

Query: 1   MASSQIGILVPLNKNLEKEKSLPEFPISEGDNVIGRNNVPVTDKRLSRKHLILSASSGGS 60
           M+ SQIG LVPL  NLE++ SLP+ P+S+G N IGR++V V DKRLSR HL ++AS  GS
Sbjct: 1   MSRSQIGYLVPLKVNLEEDTSLPKLPLSKGSNTIGRSHVSVPDKRLSRNHLTVTASIDGS 60

Query: 61  TDLRVEGMNPVVINSGGQRKLLNSGEKVTFRNGDILELIPGSHYFKYEILSNQRNPGVVS 120
            +L  EG NPVV+ SG  R+ LN GE+++  +GDI+ELIPG H+FKY   S+  +    S
Sbjct: 61  LNLTPEGTNPVVVKSGNLRRKLNPGEQLSIASGDIIELIPGHHFFKYVASSSHSSNSNSS 120

Query: 121 KRSEDVSECTNVGDGRKRAREDLNAGASTEQSTLPHGIGRKLKENIDNVSVESKKQNLSV 180
             +     C N  DG +  + D  +              +K++E  +  +    +   + 
Sbjct: 121 SPNTPKRHCNN--DGGEIIKSDPPS-------------RKKMREAPEVANRSKLETENNG 180

Query: 181 SGNSEEAIRHFHVSDDGLPLTFRLMKVQGLPEWANTSCVSIDNVIEGDVLVAVLSNYMVD 240
            GN EEAI  FHV DD LPL FRL+KVQGLP WANTSCVSI++V++GD+LVA+LSNYMVD
Sbjct: 181 QGNCEEAICKFHVPDDKLPLIFRLLKVQGLPAWANTSCVSINDVVQGDILVAILSNYMVD 240

Query: 241 IDWLLSACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAMLL 300
           +DWL+SACP L K+P VLV+HGE D T ++MKR+KP+NWILHKPPLPIS+GTHHSKAMLL
Sbjct: 241 VDWLISACPTLAKIPHVLVIHGEGDATLEHMKRSKPANWILHKPPLPISFGTHHSKAMLL 300

Query: 301 VYPRGVRVIVHTANLINVDWNNKSQGLWMQDFPWKDQNEASKGCPFESDLIDYLQALKLP 360
           +YPRGVR+IVHTANLI VDWNNKSQGLWMQDFPWKD+   SKGC FE+DL+DYL ALK P
Sbjct: 301 IYPRGVRIIVHTANLIYVDWNNKSQGLWMQDFPWKDEKSQSKGCGFENDLVDYLHALKWP 360

Query: 361 EFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQCT 420
           EF    PALG   ++ SF KKF+Y +AAVRLIASVPGYH+G+NLKKWGHMKLRS+L++C 
Sbjct: 361 EFPVKLPALGNFTMNPSFLKKFDYSSAAVRLIASVPGYHAGANLKKWGHMKLRSVLQECI 420

Query: 421 FDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVRW 480
           F  EFK SPL+YQFSSLGSLDEKWMTEL TS+SSGLS DK+ LGLGE RIIWPTVEDVR 
Sbjct: 421 FGKEFKNSPLVYQFSSLGSLDEKWMTELATSMSSGLSEDKTPLGLGEARIIWPTVEDVRC 480

Query: 481 SLEGYAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFLL 540
           SLEGYAAGNA+PSP+KNVE+EFLKKYW+KWKATHTGRCRAMPHIKTF RYNGQ +AWFLL
Sbjct: 481 SLEGYAAGNAIPSPLKNVEREFLKKYWSKWKATHTGRCRAMPHIKTFTRYNGQKLAWFLL 540

Query: 541 TSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVK-NGFGFSCTKDKSSCKDTSGSMA 600
           TS+NLSKAAWGALQKNNSQLMIRSYELGVLFLP+  K + +GFSC  ++   KD  G +A
Sbjct: 541 TSANLSKAAWGALQKNNSQLMIRSYELGVLFLPSPYKIHAYGFSCVDNEVLSKDKCGVLA 600

Query: 601 NSRNRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVY 660
           +S   + KLV+L W G     DS+ EV+PLPVPYELPP+ YSS+D+PWSW+R+Y +KDVY
Sbjct: 601 DSEVLRTKLVTLAWQGT---KDSSCEVIPLPVPYELPPQPYSSEDIPWSWDRRYTKKDVY 650

Query: 661 GQVWPRQV 668
           GQVWPR V
Sbjct: 661 GQVWPRLV 650

BLAST of Spo14921.1 vs. ExPASy Swiss-Prot
Match: TYDP1_ARATH (Tyrosyl-DNA phosphodiesterase 1 OS=Arabidopsis thaliana GN=TDP1 PE=1 SV=1)

HSP 1 Score: 714.9 bits (1844), Expect = 8.100e-205
Identity = 336/482 (69.71%), Postives = 392/482 (81.33%), Query Frame = 1

		  

Query: 186 EAIRHFHVSDDGLPLTFRLMKVQGLPEWANTSCVSIDNVIEGDVLVAVLSNYMVDIDWLL 245
           EAIR F   ++ LP TFRL+ V  LP+WANTSCVSI++VIEGDV+ A+LSNYMVDIDWL+
Sbjct: 128 EAIRRFCPPNEKLPSTFRLLSVDALPDWANTSCVSINDVIEGDVVAAILSNYMVDIDWLM 187

Query: 246 SACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAMLLVYPRG 305
           SACP L  +PQV+V+HGE DG  +Y++R KP+NWILHKP LPIS+GTHHSKA+ LVYPRG
Sbjct: 188 SACPKLANIPQVMVIHGEGDGRQEYIQRKKPANWILHKPRLPISFGTHHSKAIFLVYPRG 247

Query: 306 VRVIVHTANLINVDWNNKSQGLWMQDFPWKDQN-EASKGCPFESDLIDYLQALKLPEFTA 365
           VRV+VHTANLI+VDWNNKSQGLWMQDFPWKD + +  KGC FE DLIDYL  LK PEFTA
Sbjct: 248 VRVVVHTANLIHVDWNNKSQGLWMQDFPWKDDDKDPPKGCGFEGDLIDYLNVLKWPEFTA 307

Query: 366 NFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQCTFDDE 425
           N P  G +KI+A+FFKKF+Y +A VRLIASVPGYH+G NL KWGHMKLR+IL++C FD E
Sbjct: 308 NLPGRGNVKINAAFFKKFDYSDATVRLIASVPGYHTGFNLNKWGHMKLRTILQECIFDRE 367

Query: 426 FKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVRWSLEG 485
           F++SPLIYQFSSLGSLDEKW+ E   SLSSG++ DK+ LG G+  IIWPTVEDVR SLEG
Sbjct: 368 FRRSPLIYQFSSLGSLDEKWLAEFGNSLSSGITEDKTPLGPGDSLIIWPTVEDVRCSLEG 427

Query: 486 YAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFLLTSSN 545
           YAAGNA+PSP+KNVEK FLKKYWA+WKA H+ R RAMPHIKTF RYN Q IAWFLLTSSN
Sbjct: 428 YAAGNAIPSPLKNVEKPFLKKYWARWKADHSARGRAMPHIKTFTRYNDQKIAWFLLTSSN 487

Query: 546 LSKAAWGALQKNNSQLMIRSYELGVLFLPTVVK-NGFGFSCTKDKSSCKDTSGSMANSRN 605
           LSKAAWGALQKNNSQLMIRSYELGVLFLP+ +K  G  FSCT+   S         +   
Sbjct: 488 LSKAAWGALQKNNSQLMIRSYELGVLFLPSPIKTQGCVFSCTESNPSVMKAKQETKDEVE 547

Query: 606 RKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVYGQVW 665
           ++ KLV++TW G    D    E++ LPVPY+LPPK YS +DVPWSW+R Y +KDVYGQVW
Sbjct: 548 KRSKLVTMTWQG----DRDLPEIISLPVPYQLPPKPYSPEDVPWSWDRGYSKKDVYGQVW 605

BLAST of Spo14921.1 vs. ExPASy Swiss-Prot
Match: TYDP1_MOUSE (Tyrosyl-DNA phosphodiesterase 1 OS=Mus musculus GN=Tdp1 PE=1 SV=2)

HSP 1 Score: 296.6 bits (758), Expect = 6.900e-79
Identity = 187/496 (37.70%), Postives = 269/496 (54.23%), Query Frame = 1

		  

Query: 195 DDGLPLTFRLMKVQGLPEWANTSCVSIDNVIE---GDVLVAVLSNYMVDIDWLLSACPML 254
           D G P  F L +V G+    N+  + I +++    G ++ +   NY  D+DWL+   P  
Sbjct: 160 DKGNPFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPE 219

Query: 255 KKVPQVLVVHGES-DGTADYMKRNKP-SNWILHKPPLPISYGTHHSKAMLLVYPRGVRVI 314
            +   +L+VHG+  +  AD   + KP +N  L +  L I++GTHH+K MLL+Y  G+RV+
Sbjct: 220 FRKNPILLVHGDKREAKADLHAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVV 279

Query: 315 VHTANLINVDWNNKSQGLWMQD-FPWKDQNEASKG---CPFESDLIDYLQALKLPEFTAN 374
           +HT+NLI  DW+ K+QG+W+   +P  DQ   + G     F++DL  YL A   P     
Sbjct: 280 IHTSNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSYLTAYNAPP---- 339

Query: 375 FPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILE-QCTFDDE 434
                 L+      ++ +     V LI S PG   GS+   WGH +LR +L+       +
Sbjct: 340 ------LQEWIDIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLRKLLQAHAPSTPK 399

Query: 435 FKKSPLIYQFSSLGSL---DEKWM-TELRTSL----SSGLSADKSALGLGEPRIIWPTVE 494
            +  P++ QFSS+GSL   + KW+ +E + SL      G    KSA+ L    +I+P+VE
Sbjct: 400 GECWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAVPL---HLIYPSVE 459

Query: 495 DVRWSLEGYAAGNAVPSPIKNVEKE-FLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQ-- 554
           +VR SLEGY AG ++P  I+  EK+ +L  Y+ KW A  +GR  AMPHIKT++R +    
Sbjct: 460 NVRTSLEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFS 519

Query: 555 NIAWFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDK----- 614
            +AWFL+TS+NLSKAAWGAL+KN +QLMIRSYELGVLFLP    + FG    K K     
Sbjct: 520 KLAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLP----SAFGLDTFKVKQKFFS 579

Query: 615 SSCKDTSGSMANSRNRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWS 664
           SSC+ T+                                  PVPY+LPP+LY S+D PW 
Sbjct: 580 SSCEPTAS--------------------------------FPVPYDLPPELYRSKDRPWI 606

BLAST of Spo14921.1 vs. ExPASy Swiss-Prot
Match: TYDP1_HUMAN (Tyrosyl-DNA phosphodiesterase 1 OS=Homo sapiens GN=TDP1 PE=1 SV=2)

HSP 1 Score: 290.8 bits (743), Expect = 3.800e-77
Identity = 197/551 (35.75%), Postives = 286/551 (51.91%), Query Frame = 1

		  

Query: 138 RAREDLNA-GASTEQSTLPHGIG--RKLKENIDNVSVESKKQNLSVSGNSEEAIRHFHVS 197
           +  +D++A    T Q T  HG     +LKE  D      + Q++            + + 
Sbjct: 111 KKEKDISAPNDGTAQRTENHGAPACHRLKEEEDEYETSGEGQDI------------WDML 170

Query: 198 DDGLPLTFRLMKVQGLPEWANTSCVSIDNVIE---GDVLVAVLSNYMVDIDWLLSACPML 257
           D G P  F L +V G+    N+  + I +++    G ++ +   NY  D+DWL+   P  
Sbjct: 171 DKGNPFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPE 230

Query: 258 KKVPQVLVVHGES-DGTADYMKRNKP-SNWILHKPPLPISYGTHHSKAMLLVYPRGVRVI 317
            +   +L+VHG+  +  A    + KP  N  L +  L I++GTHH+K MLL+Y  G+RV+
Sbjct: 231 FRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVV 290

Query: 318 VHTANLINVDWNNKSQGLWMQDFPWK--DQNEASKGCP--FESDLIDYLQALKLPEFTAN 377
           +HT+NLI+ DW+ K+QG+W+     +  D    S   P  F++DLI YL A   P     
Sbjct: 291 IHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLISYLMAYNAPS---- 350

Query: 378 FPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQCTFDDEF 437
                 LK       K +     V LI S PG   GS    WGH +L+ +L+        
Sbjct: 351 ------LKEWIDVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLKKLLKDHASSMPN 410

Query: 438 KKS-PLIYQFSSLGSL---DEKWM-TELRTSL----SSGLSADKSALGLGEPRIIWPTVE 497
            +S P++ QFSS+GSL   + KW+ +E + S+        +  KS++ L    +I+P+VE
Sbjct: 411 AESWPVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVPL---YLIYPSVE 470

Query: 498 DVRWSLEGYAAGNAVPSPIKNVEKE-FLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQ-- 557
           +VR SLEGY AG ++P  I+  EK+ +L  Y+ KW A  +GR  AMPHIKT++R +    
Sbjct: 471 NVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFS 530

Query: 558 NIAWFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPTVVKNGFGFSCTKDKSSCKD 617
            IAWFL+TS+NLSKAAWGAL+KN +QLMIRSYELGVLFLP    + FG    K K   K 
Sbjct: 531 KIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLP----SAFGLDSFKVKQ--KF 590

Query: 618 TSGSMANSRNRKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQY 664
            +G                         S   +   PVPY+LPP+LY S+D PW W   Y
Sbjct: 591 FAG-------------------------SQEPMATFPVPYDLPPELYGSKDRPWIWNIPY 605

BLAST of Spo14921.1 vs. ExPASy Swiss-Prot
Match: TYDP1_RAT (Tyrosyl-DNA phosphodiesterase 1 OS=Rattus norvegicus GN=Tdp1 PE=1 SV=1)

HSP 1 Score: 286.6 bits (732), Expect = 7.100e-76
Identity = 183/519 (35.26%), Postives = 274/519 (52.79%), Query Frame = 1

		  

Query: 167 DNVSVESKKQNLSVSGNSEEAIRHFHVSDDGLPLTFRLMKVQGLPEWANTSCVSIDNVIE 226
           D+   +   +    SG  ++    + + D   P  F L +V G+    N+  + I +++ 
Sbjct: 135 DSHRAQRADEEYETSGEGQDI---WDMLDKENPFQFYLTRVSGIKAKYNSKALHIKDILS 194

Query: 227 ---GDVLVAVLSNYMVDIDWLLSACPMLKKVPQVLVVHGES-DGTADYMKRNKP-SNWIL 286
              G ++ +   NY  D++WL+   P   +   +L+VHG+  +  AD   + KP +N  L
Sbjct: 195 PLFGTLVSSAQFNYCFDVNWLIKQYPPEFRKKPILLVHGDKREAKADLHAQAKPYANISL 254

Query: 287 HKPPLPISYGTHHSKAMLLVYPRGVRVIVHTANLINVDWNNKSQGLWMQD-FPWKDQNEA 346
            +  L I++GTHH+K MLL+Y  G+RV++HT+NLI  DW+ K+QG+W+   +P   Q   
Sbjct: 255 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIYQGNH 314

Query: 347 SKG---CPFESDLIDYLQALKLPEFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPG 406
           + G     F++DL  YL A   P           L+      ++ +     V LI S PG
Sbjct: 315 TSGESSTHFKADLTSYLMAYNAPP----------LQEWIDIIQEHDLSETNVYLIGSTPG 374

Query: 407 YHSGSNLKKWGHMKLRSILE-QCTFDDEFKKSPLIYQFSSLGSL---DEKWM-TELRTSL 466
              GS+   WGH +LR +L+         +  P++ QFSS+GSL   + KW+ +E + SL
Sbjct: 375 RFQGSHKDNWGHFRLRKLLQAHAPSAPRGECWPVVGQFSSIGSLGPDESKWLCSEFKESL 434

Query: 467 ----SSGLSADKSALGLGEPRIIWPTVEDVRWSLEGYAAGNAVPSPIKNVEKE-FLKKYW 526
                 G +  +SA+ L    +I+P+VE+VR SLEGY AG ++P  I+  EK+ +L  Y+
Sbjct: 435 LAVREEGRTPGRSAVPL---HLIYPSVENVRTSLEGYPAGGSLPYGIQTAEKQRWLHPYF 494

Query: 527 AKWKATHTGRCRAMPHIKTFVRYNGQ--NIAWFLLTSSNLSKAAWGALQKNNSQLMIRSY 586
            KW A  +GR  AMPHIKT++R +     +AWFL+TS+NLSKAAWGAL+KN +QLMIRSY
Sbjct: 495 HKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGAQLMIRSY 554

Query: 587 ELGVLFLPTVVKNGFGFSCTKDKSSCKDTSGSMANSRNRKIKLVSLTWPGRDDHDDSNSE 646
           ELGVLFLP    + FG    K K                                 S+  
Sbjct: 555 ELGVLFLP----SAFGLDTFKVKQKF---------------------------FSSSSEP 606

Query: 647 VVPLPVPYELPPKLYSSQDVPWSWERQY-RQKDVYGQVW 664
           +   PVPY+LPP+LY S+D PW W   Y +  D +G +W
Sbjct: 615 MASFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 606

BLAST of Spo14921.1 vs. ExPASy Swiss-Prot
Match: TYDP1_DROME (Probable tyrosyl-DNA phosphodiesterase OS=Drosophila melanogaster GN=gkt PE=2 SV=1)

HSP 1 Score: 173.7 bits (439), Expect = 6.700e-42
Identity = 128/373 (34.32%), Postives = 191/373 (51.21%), Query Frame = 1

		  

Query: 227 GDVLVAVLSNYMVDIDWLLSA---CPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHK 286
           G++   V  N+MVDI WLL       +L K P +L+   ES       K  +    I  K
Sbjct: 181 GEIESTVQINFMVDIGWLLGHYYFAGILDK-PLLLLYGDESPELLSIGKFKQQVTAIRVK 240

Query: 287 PPLPISYGTHHSKAMLLVYPRG-VRVIVHTANLINVDWNNKSQGLWMQD----FPWKDQN 346
            P P  + T H+K M L Y  G +RV++ TANL   DW+N++QGLW+       P     
Sbjct: 241 MPTP--FATSHTKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPALPVDADT 300

Query: 347 EASKGCP-FESDLIDYLQALKLPEFTANFPALGRLKIDASFFKKFNYGNAAVRLIASVPG 406
            A +    F+ DL+ YL   K+ +     P + R++   S F   N     V  + SVPG
Sbjct: 301 GAGESLTGFKQDLMLYLVEYKISQLQ---PWIARIR--NSDFSAIN-----VFFLGSVPG 360

Query: 407 YHSGSNLK--KWGHMKLRSILEQ--CTFDDEFKKSPLIYQFSSLGSLDEKWMTELRTSLS 466
            H   +++   WGH +L S+L +     DD     P++ Q SS+GSL       ++    
Sbjct: 361 GHREGSVRGHPWGHARLASLLAKHAAPIDDRI---PVVCQSSSIGSLGANVQAWIQQDFV 420

Query: 467 SGLSADKSALG----LGEPRIIWPTVEDVRWSLEGYAAGNAVPSPIKNVEKE-FLKKYWA 526
           + L  D + +G    +   ++I+P+  +V  S +G   G  +P      +K+ +LK Y  
Sbjct: 421 NSLKKDSTPVGKLRQMPPFKMIYPSYGNVAGSHDGMLGGGCLPYGKNTNDKQPWLKDYLQ 480

Query: 527 KWKATHTGRCRAMPHIKTFVRYN--GQNIAWFLLTSSNLSKAAWGALQKNNS---QLMIR 577
           +WK++   R RAMPHIK++ R+N   Q++ WF+LTS+NLSKAAWG   KN++    L I 
Sbjct: 481 QWKSSDRFRSRAMPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKNSNIQPCLRIA 537

BLAST of Spo14921.1 vs. TAIR (Arabidopsis)
Match: AT5G15170.1 (tyrosyl-DNA phosphodiesterase-related)

HSP 1 Score: 714.9 bits (1844), Expect = 4.600e-206
Identity = 336/482 (69.71%), Postives = 392/482 (81.33%), Query Frame = 1

		  

Query: 186 EAIRHFHVSDDGLPLTFRLMKVQGLPEWANTSCVSIDNVIEGDVLVAVLSNYMVDIDWLL 245
           EAIR F   ++ LP TFRL+ V  LP+WANTSCVSI++VIEGDV+ A+LSNYMVDIDWL+
Sbjct: 128 EAIRRFCPPNEKLPSTFRLLSVDALPDWANTSCVSINDVIEGDVVAAILSNYMVDIDWLM 187

Query: 246 SACPMLKKVPQVLVVHGESDGTADYMKRNKPSNWILHKPPLPISYGTHHSKAMLLVYPRG 305
           SACP L  +PQV+V+HGE DG  +Y++R KP+NWILHKP LPIS+GTHHSKA+ LVYPRG
Sbjct: 188 SACPKLANIPQVMVIHGEGDGRQEYIQRKKPANWILHKPRLPISFGTHHSKAIFLVYPRG 247

Query: 306 VRVIVHTANLINVDWNNKSQGLWMQDFPWKDQN-EASKGCPFESDLIDYLQALKLPEFTA 365
           VRV+VHTANLI+VDWNNKSQGLWMQDFPWKD + +  KGC FE DLIDYL  LK PEFTA
Sbjct: 248 VRVVVHTANLIHVDWNNKSQGLWMQDFPWKDDDKDPPKGCGFEGDLIDYLNVLKWPEFTA 307

Query: 366 NFPALGRLKIDASFFKKFNYGNAAVRLIASVPGYHSGSNLKKWGHMKLRSILEQCTFDDE 425
           N P  G +KI+A+FFKKF+Y +A VRLIASVPGYH+G NL KWGHMKLR+IL++C FD E
Sbjct: 308 NLPGRGNVKINAAFFKKFDYSDATVRLIASVPGYHTGFNLNKWGHMKLRTILQECIFDRE 367

Query: 426 FKKSPLIYQFSSLGSLDEKWMTELRTSLSSGLSADKSALGLGEPRIIWPTVEDVRWSLEG 485
           F++SPLIYQFSSLGSLDEKW+ E   SLSSG++ DK+ LG G+  IIWPTVEDVR SLEG
Sbjct: 368 FRRSPLIYQFSSLGSLDEKWLAEFGNSLSSGITEDKTPLGPGDSLIIWPTVEDVRCSLEG 427

Query: 486 YAAGNAVPSPIKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTFVRYNGQNIAWFLLTSSN 545
           YAAGNA+PSP+KNVEK FLKKYWA+WKA H+ R RAMPHIKTF RYN Q IAWFLLTSSN
Sbjct: 428 YAAGNAIPSPLKNVEKPFLKKYWARWKADHSARGRAMPHIKTFTRYNDQKIAWFLLTSSN 487

Query: 546 LSKAAWGALQKNNSQLMIRSYELGVLFLPTVVK-NGFGFSCTKDKSSCKDTSGSMANSRN 605
           LSKAAWGALQKNNSQLMIRSYELGVLFLP+ +K  G  FSCT+   S         +   
Sbjct: 488 LSKAAWGALQKNNSQLMIRSYELGVLFLPSPIKTQGCVFSCTESNPSVMKAKQETKDEVE 547

Query: 606 RKIKLVSLTWPGRDDHDDSNSEVVPLPVPYELPPKLYSSQDVPWSWERQYRQKDVYGQVW 665
           ++ KLV++TW G    D    E++ LPVPY+LPPK YS +DVPWSW+R Y +KDVYGQVW
Sbjct: 548 KRSKLVTMTWQG----DRDLPEIISLPVPYQLPPKPYSPEDVPWSWDRGYSKKDVYGQVW 605

BLAST of Spo14921.1 vs. TAIR (Arabidopsis)
Match: AT5G07400.1 (forkhead-associated domain-containing protein / FHA domain-containing protein)

HSP 1 Score: 65.9 bits (159), Expect = 1.100e-10
Identity = 54/195 (27.69%), Postives = 83/195 (42.56%), Query Frame = 1

		  

Query: 234 LSNYMVDIDWLLSACPMLKKVPQVLVVHGES-------DGTADYMKRNKPSNWILHKP-P 293
           L+ +  DI W L+ C   + +P  +  H          D        N P+  +++ P P
Sbjct: 402 LATFTSDILWFLTCCDTPRHLPVTIACHNAERCWSSNPDARTAVPLPNYPNVTMVYPPFP 461

Query: 294 LPISYGT---------HHSKAMLLVYPRGVRVIVHTANLINVDWNNKSQGLWMQDFPWK- 353
             I++G          HH K  +L     +RVI+ +ANL+   WN+ +  +W QDFP + 
Sbjct: 462 EEIAFGKDRTNRGIACHHPKLFILQRKDSIRVIITSANLVARQWNDVTNTVWWQDFPRRA 521

Query: 354 --DQNEASKGCPFESDLIDYLQALKLPEFTANFPAL-GRLKIDASF-------FKKFNYG 401
             D       C  E++       LK P+F A        L  D          F K+N+ 
Sbjct: 522 DPDLLSLFGHCQRETN-----HGLK-PDFCAQLAGFAASLLTDVPSQAHWILEFTKYNFE 581

The following BLAST results are available for this feature:
BLAST of Spo14921.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match NameE-valueIdentityDescription
gi|902233940|gb|KNA23261.1|0.0e+098.6hypothetical protein SOVF_0262... [more]
gi|731370342|ref|XP_010665690.1|0.0e+082.7PREDICTED: tyrosyl-DNA phospho... [more]
gi|870843494|gb|KMS96656.1|0.0e+082.7hypothetical protein BVRB_8g20... [more]
gi|731370345|ref|XP_010665691.1|0.0e+082.7PREDICTED: tyrosyl-DNA phospho... [more]
gi|870843493|gb|KMS96655.1|0.0e+082.7hypothetical protein BVRB_8g20... [more]
back to top
BLAST of Spo14921.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniprotKB/TrEMBL)
Total hits: 5
Match NameE-valueIdentityDescription
A0A0K9RUQ3_SPIOL0.0e+098.6Uncharacterized protein OS=Spi... [more]
A0A0J8B6P0_BETVU0.0e+082.7Uncharacterized protein OS=Bet... [more]
A0A0J8E0L9_BETVU0.0e+082.7Uncharacterized protein OS=Bet... [more]
E0CVK9_VITVI1.6e-26366.9Putative uncharacterized prote... [more]
A0A067JLC5_JATCU2.6e-25864.9Uncharacterized protein OS=Jat... [more]
back to top
BLAST of Spo14921.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy SwissProt)
Total hits: 5
Match NameE-valueIdentityDescription
TYDP1_ARATH8.1e-20569.7Tyrosyl-DNA phosphodiesterase ... [more]
TYDP1_MOUSE6.9e-7937.7Tyrosyl-DNA phosphodiesterase ... [more]
TYDP1_HUMAN3.8e-7735.7Tyrosyl-DNA phosphodiesterase ... [more]
TYDP1_RAT7.1e-7635.2Tyrosyl-DNA phosphodiesterase ... [more]
TYDP1_DROME6.7e-4234.3Probable tyrosyl-DNA phosphodi... [more]
back to top
BLAST of Spo14921.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 2
Match NameE-valueIdentityDescription
AT5G15170.14.6e-20669.7tyrosyl-DNA phosphodiesterase-... [more]
AT5G07400.11.1e-1027.6forkhead-associated domain-con... [more]
back to top
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000253Forkhead-associated (FHA) domainGENE3D2.60.200.20coord: 18..111
score: 4.9
IPR008984SMAD/FHA domainunknownSSF49879SMAD/FHA domaincoord: 19..106
score: 1.29
IPR010347Tyrosyl-DNA phosphodiesterase IPANTHERPTHR12415TYROSYL-DNA PHOSPHODIESTERASE 1coord: 1..667
score: 7.2E
IPR010347Tyrosyl-DNA phosphodiesterase IPFAMPF06087Tyr-DNA_phosphocoord: 202..641
score: 1.9
IPR027415Tyrosyl-DNA phosphodiesterase C-terminal domainGENE3D3.30.870.20coord: 389..665
score: 8.2
NoneNo IPR availableGENE3D3.30.870.10coord: 198..385
score: 1.3
NoneNo IPR availablePANTHERPTHR12415:SF0TYROSYL-DNA PHOSPHODIESTERASE 1coord: 1..667
score: 7.2E
NoneNo IPR availableunknownSSF56024Phospholipase D/nucleasecoord: 625..664
score: 3.53E-71coord: 389..582
score: 3.53E-71coord: 199..413
score: 1.05

GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006281 DNA repair
cellular_component GO:0005634 nucleus
molecular_function GO:0008081 phosphoric diester hydrolase activity
molecular_function GO:0005515 protein binding