Spo11971.1 (mRNA)

Overview
Name: Spo11971.1
Type: mRNA
Organism: Spinacia oleracea (Spinach)
Description: Endoglucanase (3.2.1.4)
Location: chr4:8661106..8677152 (+)
Sequence length: 4452
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

NTGCACAGCTCAAACGTATGGGGAGGATCATACGAAGTAGCAGTACACAGAGACGAACACGAACACAACAACAACAACAACAACAACAACAAAAGCAGAGAAATGGAAGATTGGGATAAATCAAGCTTACTTTACCAAGAACAACTCCATAATCATCACAATCAAAATCACACTCATGAATTAGATCAAGTTCAACAAGGTTGGTTATTAATACCACAAGACAAAACCCGCAAAAACAGGAAGAACAAGTATGNACAACTCCATAATCATCACAATCAAAATCACACTCATGAATTAGATCAAGTTCAACAAGGTTGGTTATTAATACCACAAGACAAAACCCGCAAAAACAGGAAGAACAAGTATGTTGATTTTGGGTGCATTATTGTTAGCAAGAAGATACTCAAATGGGCATTTTGGTCTTTTCTAATTGCGTTCATTGTTATTGGGTTGCCGATTATTATTGCTAAGTCTTTGCCTAAGCATAAACATGGACCTCCTCCTGCTGATAATTACACTGTTGCCCTTAATATGGCTCTTCGGTTCTTCAATGCCCAGAAATGTAAGTTCTTTTTTCTTCTCTGTTTTTTTTAAAAAATATTATTTAATTTTAAAATGAAATTATTATTTAATATTTACTTCCTTTTTTGTGTGGGATTATATTTAATGTGAAAATCTTTGTGATTTTTGGTGCTATAAATAAGTTTCCCGTTGATATATTGATTAGGGCCGAATTACGTATGCTTCATATTTATAATGAAGCCCGAGCCCGGCCCAAAACATCGAATTACTTGAAATTTGCTTCCTTTTTTGTGTGGGATTATATTTAATGTGAAATTGTTTGTGATTTTTGGTGCTATAAATAAGTTTCCCGTTGATATTTGCGTATATCCTACATTGATTAGGGCCGAATTACGTATTTGTCATGTTTAGAATGATGCCCGAGCCCGGCCCAAAACATCGGAAGCCGCTTTGTTCTTTTTTATTCTCTGTTTTTCTAAATATAATTTTATTTTAAAATGAAATTATTTAAAATTTACTTCCTTTTTTGTGTGGGATTATATTTAATGTGAAATTGTTTGTGATTTTTGGTGCTATAAATAAGTTTCCCGTTGATATTTGCGTATATCCTACATTGATTAGGGCCGAATTACGTATTTGTCATGTTTAGAATGATGCCCGAGCCCGGCCCAAAACATCGGAAGCCGCTTTGTTCTTTTTTATTCTCTGTTTTTCTAAATATAATTTTATTTTAAAATGAAATTATTTAAAATTTACTTCCTTTTTTGTGTGGGATTATATTTAATGTGAAAATCTTTGTGATTTTTGGTGCTATAAATAAGTTTCCCGTTGAATCGTTGATATGTGCAATTAAGCCAATTATGTATGTGTATTGTATTTCCCATATTGATTAGGGCCAAATTACGTAGGTGTAATGTTTAGATTAAAGCCCGAGCCCGGCCCAAACCTCGAAAATCCGCTTTGGCCCGTTGATATATGCAACTATGAATGTGTATAACTGTATATCCCATATTGATTAGGGCCGAATTACGTATGTGTCATGTTTAAATTAAAGGCCCGAGCCCGGCCCAAAACCTCAAAACCCGTTTTAGCCCGTAAAGTTCGTACCAAATTTATGGTTATCATAATGAGTTTTTGACTTAAAGTCCTTCTCACCCCTATTTTTTATCACCTTTAGTGCGTGTATATCCGATATTGATTAGGAAATTAGTACTAGAACGTAATCATAAGCAATTACATATTAGTAAAATCTTAAAGAAAATGATATTTGAAAAATGTACATTAAAAATAATCCAACAACATAATACAATAACATTTGATTTTTATCTATTATTCCGTAGTAAGAAAGTTTGGTAAAAAATAATGTATGAATAGTGCAATCGTTAAACGGTTACTCCGTACAAATATTCAAGAACGGAGAGAGTAAGATTGAGTTTGGGAACCAAGTAAACTTGGAAGTGTGAGGAAAAATAAAACTA
TGAAAATCTGTAGAAAAGCAAATCTGAAGAAATATGTAGAAATCACAAATTTTAATTTGTTTGGCTAATTATTATGGAAAATAAGAAAAATTGAAATATCTCATATATTTCACTTCTTTAATTTTGTTTTCCTCATATTTCCACAGTTTTCCTATCACTTTTGAGGTAAAGTGTGAAAAATAAAACAGGCAAAATATAACTAGTTGATTTCCACAAATTTCCACACATTTCACTTTTCTCACTTTTCTATTTAAATACCAAACAAGCTAAAATTAAGAAATTCTCATATTTTCACAATTTTACACATTTTTCTCATTAAACTAGAGTTTCTAAACACAGGGTAAGTGAATACTACTTAACCCTAAACTATGCATATATTGTAAGGAAAAATATATAATTACAATTATGTGTATATCCTATAATGATTAGGAAATTTTGGTATAAATGGTAGTAAATCACATGATTTTGAATTTTTTTTTTTTTTTTGAAAATGTTGTGATTTGTTTGGGATAATCTTGATTCAGTGAATTCGAACGTTANTTTTTTTTTTTTGAAAATGTTGTGATTTGTTTGGGATAATCTTGATTCAGTGAATTCGAACGTTATTGACAGCTAAGACTGAACAATGATGCACTTGGCTCAAAGAAAGTTTTTTTCCTGAAACGGTGAATTATTTTCTGAAAAGCAACAGCTAGTCTACAAGAAAAGTCCCATCGAAAATTTGGAAAAGCCTATCTAAAGTCAAAAGTAATTTTTGGAATTTCACCCATTTACTATAGTAAGGTTGAAATTTCCTTACTCTTGCATTATTTCATGTCTTTGGTTAATTATTGAGACTATTGATAGAGATGCATGTGATGCGTGTTTCAAATTTCAATTTCAAAATACGATATTCACATTGAAAGATTGTGTACGGAGTATGAAATATTAGTAAGGGCATTTTTGATAATATAGGTTTATAATAAAGTTACATTATACTATAGGTGTATCATTCATGTGTAGTACATATATATTGGGAAATAATTCATGTGATGCATGTGGTGCGTGTTTCAAATTTCAAAATACGATATTCATATTGAAAGATTGTGTACGGAGTATGAAATATTAGTAAGGGCATTTTTTATAATTTAGTTTTATAAAAAGTTACATTATACATTTATACTATAGGTGTATCATGTATGGTACATATACAATGGGAAATAATTTTGTTACCACAAAGGATTGGTATAGGAACATTTATTATAAGACGCAATTTTTCGTGTTTTATTTACAAGGTGGTGTGCTAAAATCAATATAAGGTGTCACAAAATCAGTGGAGATGATTTCTTAATAAGATAACCAAAAGACCGTCTAACAGTTTACCACGGAGTACTATGTAACTGTTAGACAACATGTTATGTTACCAATTGATAAATTTTAATGAGCCGAGCCGAGTCGACCTGCACACTCTCCTCCAACGACGAAGGTCTATCGATGTAGGCCCGAGTTAATCCAACATGGACTCGACTTGGCTCGTGAAATTTATCACCAGTCTCCTACATATTCGTCATACTATTAGGTACTCCGTATGGTATATTAATATAACAGTTAAATAAGCGTAAAACGGTCTGATAGAAGGAAGATTAATTGAATAGTTAATTAGAGAAATAAAATACTCCCTCTGTCCCGGAATACTTGACCTGTTTTCCTTATCGGGTCGTCCCTTAATACTTGACCTGTTTCTAAAAATGGAAATATTCTAACAATATTATATTATTTCTCACTCCACCCCTATTAACCCACCTACCCCCTATTTCATACAAAAAAATAATTAAAAATTCAACCCCTACTCTCCCCCAACCCCACCTCTTAACCCACGTCCCTTAAACTCTGTGCCGGTCAAACCGGGTCGAGTATTCCGGGACGGAGGGAGTAGGCGTGTATCCCCTAATTTAAGGTCAATTAAAAATTATGGATTGAATCAATAAACATGCCTTGATATAAGGAGTTCAAAAAACGGG
GACTTTTTTTGTGTGTCGTGCAAAAGGATGGCACTTCTTTTTGCTACCTTTTGACATTCGTCTCGTTAGTTGGTTTTTTGGTTGCGTATTTCCATAGGCAACATTCACTTTGAACTCCGTGTGGGGCCCTTACAAAATTGGTTACGTAATATTTCCCCTTTTTTTATTACTACCTTCGTTTTTTAATGTTTATAATTTACGCTTACTGTTTACACGGGCTTCAATGTAATATTTAACCATTTATATATATTCAATTCCGTATATGAAAAAATTACAAAAAAATGATATTATGAAAATACATAACGAGACCATTCCAACAAGATCTTACATAATAACATTTTAATGTATACAACAATGAGAATTTACGGTCAAAGTTTTCATACTTTTGACACATTTTAAGGCGTAAAGAACTTTCAGGAATGGAGGTAGTAGAACTAATAAAGAATTATAGCGTGTAAACGAGATTTGATCATATATTTGACTACCCTTATGATCTTATCTACAAGTCAATGCATTTATAGTATGATGGTAGAATTATTTTTTATTTTTTTGACCAGTAAGATGTGGACCCCTCTCGGATCAACCCGGTCCGAAATAGCAGGATAAACACTTTGAGAATGCGGGGTGGGAAGATTAGGGGATACCCATTTCAAAATGTCTCTAGTGATAATTGAAACCTCAACCTCAACCTCAAAGTTTAGAATGTTTTCTCCTTTCCACTCGGCAAAAGTTATGTTTGGGTTCAATTGTTTTTTTTAATGAAATTAACTTATTTGAACTTGTTTTTTTTTTCAATCTTATATTTTTAACTTATTTTTTGTTGAATTTTTCTAAACTTATTTTGTCTAAAAAAACTGTGGACTTATTTTTTCTAAATTAAATCGAAGTAAGTTAAAATAAGTTGAAACACTCAGATTCATTCATAGTATAATACTAGAATAGATCACGTTCATTTATAGCTTTGGCCATATTATAATTGGTTATTTTCTAGAGTAGGTGTTATTCTTCTCCTTCCATACGAAGATTTGACTATAAGACAACTGGCTTGGGAATGTTCTATAACTTCATTGTCATTAACTCATGATTAGGTGATTTTTTAAGAATAACAAACATTAAAAAATAATGTTGTCAACCCTTAAAAATTAAATCATTTATAAATTGCCAATAATTATACTCTTTAGTGTATTTAATCAAGTAATCCAAATCCTATAACTTATGTCTAAATGAAATGTTCACAACTCACATAGATGTTGCCTCATACGTGCAAAGTCACCCCTTTATAATTTCTGGATCCGCGACTGTATATGTGCGCTATTTTCTTTTATGAAATTATTTGAACATCTATTTTTTTGACTATAAAAAAATTATAAAATCTTGTGATTCATGCTAGTTTTCTTTTGCTTTACACTATCANAGCTCAAATCTTAACGAGCTTTAAACGGGTTTTAGTAAATGATCGAAAACCTATTCAATTCAAGGGTACTTATAGTGGCCATAAAGATTGACGAAATATGAAGAAGTAGCTAATTTGGAAGTCCAGCAAAATTCTTGTTGATTGTCGTGATTGAGAAGACTAAAAGAGGAAGCAGGAAATATGAATGTATAATTGACAAAATTCAAGCAAATATGCATTGACTATTGAGTTTCTAGATATATAATTCCTAAAATTTGTTTACTTTTTACCATATTAAATTTAGACTTTACCATTATACAAAATAATACATATTCTATATTTAAATTTGTATTATTCTATTAAAATATTTTACAAGCTATAACGGAGTTATAAACGGGCTACTAACGAGTTACAAAAGAGATATAAACGGGCTGTTCGTGAACAATAACGAGCCGAACGGGATGTTGTTCGAGTAAAGCTCGTTTAACAAACGAGCCTTAAATATTTGCTCGAGCTCGTTCAATTGCTTAATGAACGAGCTTTGACCGAGCTTTGACCGAGCTTGTTCGCGAGTTGTTCGCGAACGTCCTAGCTCATTTGCACCCCT
ACAACTCACATAGATGTTGCCTCATACGTGCAAAGTCACCCCTTTATAATTTCTGGATCCGCGACTGTATATGTGCGCTATTTTCTTTTATGAAATTATTTGAACATCTATTTTTTTGACTATAANCGACTGTATATGTGCGCTATTTTCTTTTATGAAATTATTTGAACATCTATTTTTTTGACTATAAAAAAATTATAAAATCTTGTGATTCATGCTAGTTTTCTTTTGCTTTACACTATCATTTTACTAAGTTGCCACCCTAACTTTTAAATCCTGACTCCGCCACTGATTTTGATGTTGCAGCGGGGAAATTGCCGAAGCATAACAACGTATCATGGAGAGGAAACTCGGGATTAAAAGACGGATCAAAGTTAACCGATGTAAAAGGAGGATTAGTTGGTGGATATTATGATTCAGGAGAAAACACTAAGTACCATTTTCCTATGGCATTTGCAATGACAATGCTAAGTTGGAGTGTGATTGAATACCCTCACAAGTACGAGGCCATTGGCGAGTATGATCATGTTCGCGAGCTCATCAAATGGGGGACCGATTACTTGCTCAAAACCTTCAANGCGAGTATGATCATGTTCGCGAGCTCATCAAATGGGGGACCGATTACTTGCTCAAAACCTTCAACAGTTCGGCCACTCAGATCGACAAAATCCACAGCCAGGTTGGTGAGCGCTTCCGAGTTAATAATTTCGTAAATAAATGATCATTTTATTACAAAAATGCCGTTAAATGCGTGCCCCGACCCCCAAGTACAATCATGCTTCATTCATGCCGGCCTCATGTATAGTCAACTTTGCATTTCCGTAATACATCAACACCTACGTAATATAAAGTCAAAGGATAATATTATACAATCAAGAATAAACAAAAATGCATTTTAGTAATGCATAGGCAGCCGTAAAATACACTGTCAATATATTTACAAAATTGCCATTTGATTGAGAAACTTTCTTAATCACTGAGATTAACTCGGAATTTCTCCAGGTTGGTGGTGCACAAAACGGATCCACTATCCCGAGTGACGTCACCTGTTGGGAGAGGCCTGAAGACATGGACTATGAGCGCCCCGTGCAGACAAGCTTTGCAGGGGCGGATCTAGGTGGAGAAATGGCCGCAGCCTTTGCTGCAGCCTCTATAGTTTTCCGTGACAACCAGCTTTACGCGAAGAAGCTCGTCAGAGGGGCAGAAACCGTTTTCCGGTTTGCTAGAGATACCGGTAAAAGAGCCCCGTATAGCAGAGGAAACCCATGGGCTGCACCGTATTATAACTCTACCGGGTATTATGATGAGTACATGTGGGGTGCTACTTGGTTGTATTATGCTACGGGGAATTCAACCTACGGTTTGCTGGCTACTAATCCCGGGATTCCTAAGAACGCGAAGGCGTTTAGGATGAACACGAACACGAGCTACTTGAGTTGGGATAACAAGTTGCCTGCTGCTATGTTGCTGTTGACGAGGTTTAGGATGTTTCTGAATCCTGGTTACCCGTATGAGCAGACTTTGGCGCAGTACCATAATGTTACGAAGCTTAATATGTGCTCGTATCTTAAGCAATTCCCGGTTTATAACCGGACTCGAGGTATGTTTTACTCGAGGTAGAAAGTGTACACACCTTCTACTTCCTATACTTTCCTGCCTGTTTGCTTTTTTTAGTAAACCCGATCATCTTGAGCCCGACCCGAAAATAGTGAGTTTGGGCAGATATGGTAAGCCCGTTTTCAGNTGGATATTATGATTCAGGAGAAAACACTAAGTACCATTTTCCTATGGCATTTGCAATGACAATGCTAAGTTGGAGTGTGATTGAATACCCTCACAAGTACGAGGCCATTGGCGAGTATGATCATGTTCGCGAGCTCATCAAATGGGGGACCGATTACTTGCTCAAAACCTTCAATAGTTCGGCCACTCAGATCGACAAAATCCACAGCCAGGTTGGTGAGCGCTTCCGAGTTAATAATTTCGTAAATAAATGATCA
TTTTATTACAAAAATGCCATTAAATGCGGGCCCGTCCCCCAAGTACAATCATGCTTCATTCATGCCGGCCTCATGTACAGTCAACTTTGCATTTTTCGTAATACATCAACACCTACGTAATATAAAGTCAACGGATAATATTATACAATCAAGAATAAACAAAAATGCATTTTAGTAATACATAGGCAGCCGTAAAATACACTCAATATATTTACAAAATTGCCATTTGATTGAGAAACTTTCTTAATCACTGAGATTAACTCGGAATTTCTCCAGGTTGGTGGTGCACAAAACGGATCCACTATCCCGAGCGACGTCACCTGTTGGGAGAGGCCCGAAGACATGGACTATGAACGCCCCGTGCAAACAAGCTTTGCAGGGGCTGATCTAGGTGGAGAAATGGCCGCGGCCTTTGCTGCAGCCTCCATCGTATTCCGTGACAACCAGCTTTACTCGAAGAAGCTCGTCAGAGGGGCAGAAACCGTGTTCCGGTTTGCTAGGGATACCGGTAAAAGAGCCCCGTATAGCAGAGGAAACCCATGGGCTGCACCGTATTATAACTCAACCGGGTATTACGATGAGTACATGTGGGGTGCTACTTGGTTGTATTATGCAACCGGGAATTCAACCTACGGTTTGCTGGCTACTAATCCCGGGATTCCTAAGAACGCGAAGGCGTTTAGGATGAACACGAACACGAGCTACTTGAGTTGGGATAACAAGTTGCCTGCTGCTATGTTGCTGTTGACGAGGTTTAGGATGTTTCTGAATCCTGGTTACCCGTATGAGCAGACTTTGGCGCAGTACCATAATGTTACGAAGCTTAATATGTGCTCGTATCTTAAGCAATTCCCGGTTTATAACCGGACTCGAGGTATGTTTTACTCGAGGTAGAAAGTGTACACACCTTCTACTTCCTATACTTTCCTGCCTGTTTGCTTTTTTTAGTAAACCCGATCATCTTGAGCCCGACCCGAAAATAGTGAGTTTGGGCAGATATGGTAAGCCCGTTTTCAGTCCCGGTATGGTTTTAAAATTATTGGGCGGGTTTTGGGCAATTCCAAATTGTAATTTTAGTTACAAAATTGGCCCGGTACTGAATGCCCGAAAAGTCTGTTATTATGACCCAAAACAGCAGGTTTTCGGCTGGAAAACGGGCCAAACTGGCCCTTGCCGACATAATGGGCAGATTGTTTTGTCCGTGGGCTGGCCCGAATCGGTCCGTCTAATTTCACTTATAAAATCGTTCCGGTCCAAATACCCAAAAGCCCGCTATTTGGCCCAACAATAGCGGGGTTTGGGCTGAAAAGTGCGAATTTTAGTCTGGCTGAAATAGTGGGCAGAATATTTTGGTCGTGGTGGGCCCGAACCCGGCCCGCATTTGATCGGATCTGCTTTTTAAGTATGTTTTCCCGGTGGATTATTAATGCTGATGTTGGACTGTAATTGATCTGCAGGAGGACTGATAGAGCTGAACAGCGGAGGGCCGAAGCCGTTGCAGTACGTAGTGAATGCTGCATTTTTGGCGAATCTGTTTGCAGATTACATGGAGTCTACTGGGGTGCCTGGATGGTATTGTGGCCCGAATTTTATGCGTCAATCTGATCTTCGTAACTTCGCTCAAACTCAGGTAATTAACATGTTAATAACTCCATTTCTCTTCACCATTTCCTTAGTTCTACATTGAACTTAGGCTTGATCCTCTTGAGCCCGGCCCGAAAATGATGCCTTTGGGCAGATTTTCAGGCCCACTTCCCGGGCCTGCCTGGTTTTTCTCGTGGGTTGTTTTTGATGACCCCAAACTTGAAATTTTGTTATAAAATTGGCCCGGCCAAAAAATCAGTTATGTTGACCCGAAATAGCGGGTTTATGCACAAAACCTTGGCCCGGCCCGAGAAAGTGGGCTGATTTTTTGGTCTCGACCAGTCCGGACCCCTCTTTTGATCATGTCTACATTGAACTAACCATTGTTTTCTGGTTGGCACAGGTTGAGTACATTT
TGGGCAAGAACCCAATGCACATGAGCTATGTGGTGGGCTTCGGAAACAAGTACCCGAAACACGTGCATCACCGGGCTGCGTCGATACCGCGCAAAAACGGAGGGAAATGTAAAGGGGGTTACAAATGGAGGGACAGCAAAAAGCCCAACCCATACACAATTATTGGGGCAATGGTTGGTGGTCCGGACCGTTTTGACAACTTCAAAGATAACCGCAAACTAGACCGCTACACCGAGCCCACATTGGCTGGAAATGCCGGTCTAGTGGCGGCTCTTATCTCGTTGACGGCAACTCCCGGGGTTTCCGGAGTGGACCGGAACCTTATATTTTCGGGTGTTCCCCCGTTTTATACTCCGGCCCCACCTCCCCCCGCACCGTGGAGGCCCTAAGAGAATCGAAGGAAAAAAACAGAAATGTAAATTTTATTTCCTTTAGATTAATGTCCTCAACCAAACGGCCCCTTAATTCATTTTGAAACTCATTTTTTTGGCAGGGGTTTTAGCCATTTTTGCCTTCTAGAAAGGGGTTTTCCCCTTTGTTTATATACCTAGGGATGATTAATATGTTTGATCCATCATTGTTGAATGTTTTGCACATTGATTTTGTATTATCATCATGATTAATGGTAAATGGATGTAATTTTGGGAAGATGATACACATATAAGCCTTGAACTTGAATCGAGTCCAAAACCACTATCCAAATAGCTTAAAAAACGACAAAGTCAACTGCTCGATTAGTCTCTCGAAATATGTTAGTGAGACGTTTGCATGTAGACTAAAATGAAATATTCCTACATTATTTACTAAGGTATTACAACCCTCACCAACCTAAACGACTACTGTAGTCAAACCAGTTCATACAAACAGGTAACAATTAATCGTTAATTGCTAAACAGGGCCTAAAGCTTACCGAGTTGTCTGCTTGATTAGTCTCTCGAAATATGCTAGTGAGATGTTTGCATGTAGACTAAAATGAAATATTCCTACATTATTTACTAAGGTATTACAATTTACAACCCTCACCAACCTAAACGACTATAGTCAAACCAGTTCATACAAACAGGTAACAATTAACCGTTAATTTGCCAAACAAGGCCTAAACTTTACCGAGTTGTCTGCTAGATTAGTCTCTCGAAATATGCTAGTAAGACGTTTGCATGTAGACTAAAATGATATTCCTACGTTATCTACTAAGGTATCACAACCCTCACCAACCTAAACGACTATAGTCAAACCAGTTCATACAAACATGTAATAATTAACCGTTAATTGTCAAACAGGACCTAAAGTTTACCGAGTTGTAAGATTCTCTAGAAGATGTAAAGTAGTCAAACAAACAAGGGTAACATGTAACAGGTTGGAGAACTTTAATCAAAGCAAAGATATGATCAATCCAGATTTGCCCATATGCTGAATCTGGAGATTTACTTTACAGTTTTACACACATGAACTACAAATAGAATTAAAATAATCCAATTACATAAAATTAATATAAAGTCAAATGTAACATAATGAACATAGATCTCTGTCTAAGTTAGAGAAGGAACAATCTTTTCATTTTCATAAATTTTCGTTGTAAGAGTTCATAAAATCGTAGGAAGTGAAAGAAGGTGGTATTGGCCCGGGTTCTTTATGGGTTTAACGGGCTTAAAATGGGCCGTTTGGTTCATGGTTATTGCAACTTGGATCAGCCCGAGAACGGCCACATCGACCAACTCGGCCCGAACCTTGCCTACGCACAACTTTGATAATTATACTCTTGCTCTTCGTAAAGCACTCTTGTTCTTCAATGCCCAGAAATGTAAGTCTCTTTTGCCTTACTTTGAGGGGATCTGTCATTCCATTTTCTACTAATGCTACATCATTAGTGTAAATTTTGATTATGGGGTGTTTGATAATAGGGTTCTAGTGACTTTAAATCAGCTGATGTAGTCTAAACAACTAACATTTAACTAGACCCGATCATCTTGAGCCCCCCTGACCCGACCCGAAAAATGACG
GGTTTGGGCAGATTTAAGACCCATTTTTCGAGCCCGGACTATTTTCAACTCAAAAACTGGAATTTTAGCTATAAAACCGACATGATTCGAAAACCCCACTATTTTCGCCCAAAATAGCGGGTTTGGCTCAAAAAATTAGCCCGAAGTTGGTGGATACTCCGATTCTGGAAAAAGTAGTAGTATGTACTAACATTGAACCAGACTTGATCATCTTGAGCCCAACCCACAAAATGACGGATTTTGGGTAGATTTTCTAGGCTCAATTTTTACTTTTTTTGGGTTTTGGGTAGATTTTTTAGGCTCAATTCAGCCTAAAAATGGCCCCAACAAGCTGCTATTTTTTGCCCGAAAAACGGGTTTGGGTACAAAAATTGCCCTGGAGTTCGGCCTGGTCCTAGATGATGGGATGAAGTTTGGTCACAGCCCGACCCAAACCCGACCCGTCCTCCTTTTGATCAAGTCTACATTAACCAAAAAAAATTTGCAGTAAAACATGAGTCTTTGATTGAATCTAACTGCCATGATCATCTTGTTGCAGCTGGGAAACTACCCAAGAACAATGGAATTCCATGGAGAGGAAACTCGGGCCTACGCGACGGGTCACAACTGAAAGATGTACAGGGAGATGGTTTAGTTGGTGGATACTATGATTCTGGGGAAAACACAAAGTTTCATTTCCCTTTGGCTTACTCCATGACAATGCTAAGTTGGAGCTTGATTGAATACCCTCACAAATATCGTGCCATTAACGAGTATAATCATGTACTCGAACTTATCAAATGGGGAACTGATTACTTGCTCTTGACATTCGATTCTAATGCCACAAAGATCAGCAAAATCTATAGTCAGGTTAGATTATGATCAAGAGTTACTATATTATGTCCGACAAAACTTTGTTTCTTCTGAACTTATCTGAACTTGATATAACTAGTGTTATTCTTATTTGAGCTTTATTTTTGTTCTAGTTGTGAATTGAACTTATCGGGCTTTACCCGAATAAAGTTTTCATGTTTGATGTTAGTGTGCAATTGTGTGAAGTTTCTGAACTAGAGATGGCTAATGAGGTGGTTTGGGTGGGTTTGGTGGGTCGTATGCCGGGGATGGTTGACAAGTACACGACCCACACAATGTAGGTCAGGGTGGGTCGGGTGTGGGTTGTGTCGGGCGAAAACGACAAAAATACATATCGGGTCATGGGTTATGTGGGTTAGGCGAATCGAGTGAGTTTGTTGGGTTTAGACGGGTTTTGAGGGTTAACGCGGGTCCGGTGGGTTTAGGCGGGTTTTGGGGGCTGACGTGGGCCATACAGGTTTATACGATTAATTCCTTAAAAAGTTACAAAATTCATATGAACTTAAATTAATTAATATTTACTCCGTATATTATTGACATACTCACATAGTCACATGTAGTCAATGTTACTCAGACTCTTTGTTTTACCTCAAGTACCCGTGTCGGATCCTCGACACTAGGACATGAGTATGGACACTCCAACACTTCATTTTGGTCTAAAATCAAGGAAACTTTCCACAATTTAGCCGTGTCGGACACTGAGAGATGTACCCAATAGTACCCAAGTCTGAGTAACATAGCATGTGGTTAATGTACATATAAGCATTGAAATCAAAATTTATTAGCATTTTAAAAATACAAAATAGTTTAAATTATGATGATTCGTCAGGTGAGTCACAAGGTTAGGTGTGCAGGATTAGTGTGTCGGGTGGGCTGTTTCATGTGTCAAGTGGGTTGGGTTATTGTGTTGGGGGACTGTTTTGTGGGTTGGGTGGGTCGGGCATGCCGGGTTGGGTCGGGTGGGGTGGGTTGAAATGAGTTATGTTGTAAAAAATTCACCCACGAGGCCACGACACATCAAAAAGAATATTTGGGTGTGTCATGGGTCATAAGTGGGTCCGACCCACAACAAATTGTGTCAGGGTGGGATGTGTTAAACCCGGCCCACCGGCCATTTCTAGTCTACTCTGAAAACCCATCAGAAGTTA
ATCTTCCTGAACATAACAAAGCCTTATTTTAAACAATCTGCCAATTCTTCAGTTATCAGTTATCACTTATCACTAGAATTTCTGTAACAACAACACTAGGTAATTGACTAATTGTCAATATTTTCATGTACAAGATCATACTACTCTTACAGTCTAATAATCTGCATGAACACTATAAATATATACTCAATTTCCAACCTAACATTATCCTTCAAATGTAAAGGTTGGAGGTGCCTATGAAAAATCAAACATACCAGATGACATCACCTGTTGGGAAAGACCAGAAGACATGGATTATCCCAGACCAGTACAAACAACATACGCAGGACCAGAACTCGCTGCAGAAATGACAGCAGCTTTATCATCAGCCTCAATACTATTCAAACAAAACAACCCTACTTACTCCATAAAGCTCATCAAAGCATCCGAATCCCTCTTCGCCTTCGCTAGGGACACTCGTAAAAGACGGCCCTACAGCCGAGGAAACCCGTGGGTCGACCCGTATTACAACTCGACGGGGTACTTTGACGAGTACTTGTGGGGTTCAACGTGGTTGTATTTAGCTACCGGGAATGTTAAGTATTTGGCATTAGCAACTAATCCGGGGATAACTAAGAATGCGATGGGTTTTAGATGGACGAGGAAGATGAGTGTGTTGAGTTGGGATAATAAGGTTCCTTCGGCTTTGATGTTGCTTACAAGGGTGAGGTTGTTTAAAGCTCCTGAATACCCTTATGAAGAGACATTGATTGGGCATCATAAGTTTATTGCTCTCTCTATGTGTTCTTATCTCAAACCCTTTCGTCTCTTTAAGTGGAGTAAAGGTACGGAATTTCGTTCTTTTCACCTGAAAAATCTCTTACATGAACTTAATCAAACTTCTTGTAGTTGTGAATTTGAGTTACCCTACCTGAACTTAACTATTCTGAAACTTATTTTTATTAGGGGGTGTTTGGTTATGATTTACGAGAGGTTTGGAAAAAAAGAGAGTTTTTTGAAGTGAATTAGAGTTTTGACCACTTCAAGAAGCTAATGTTTGATTATGAGAGGTTTGGAAGAGAGTTTTGGGATGAAAAACTCAATATAGGAGCTTTTTGTATTATAGGTGTATAGTGACAACTTTGTCCTAATATTGTCTAAAATTACAACTTTTACCGACCTACGTTACTTAAACAACTACCAGCTAATTTACCACACACTTTACACAAACAACTAATTCAACCAGTTGGTCAAACCAACTAATATTAACAGCTAACAGCTAGCCAAACCATCTAACATCTACCAACTAACACCTACTTGCCAAACAGGGCCTAAGTGAGAACAACAAAGCCGAAATATTCTACATTTTGTGATATAATCATGTTTCTTCATTTCCTCGAATTATTTTTTGCTGTTTACAATCTGATTCAGGATTGCTGCAGGAGGACTGATACAGTTTAATGATGGAGGGGAAGAATCTCTTCAATATGTAGCAAATGCTGCATTTTTAGCAAACCTATTTGCTGATTACCTGAATGCTACTGATAGTCCGGGATGGTTATGTGGCCCTAATTTCTTCCCTATTGCTACACTCCGGGAATTCGCCTCATCTCAGGTTTACTTCCGTTTCAGTTCCGATGTTATCATGTGTTGAACCTCCTTAACTAACTAAACTGACTGACAACTGTTTATGTTGACACAGATAGACTACATATTGGGTAAAAACCCCCTAGACCTAAGCTATGTGGTGGGGTACGGTGACAAGTACCCGAACCGTGTACATCACCGAGCAGCTTCAATCCCGTCAGACGGTCTGAAGTACTCGTGTAAAGGTGGATACAAATGGAGGGACAGCACAAAACCGAATCCCAACACCATTGTAGGAGCCATGGTTGGCGGACCTGACCGGTTCGATAATTTCCACGATGACCGTAGTGCGTCCGCGGGACCCACATTAGCCGGAAACGCCGGGCTCGTGGCAGCCCTTGTTTCACTGACTAGTGGTGGTGGTTATGGTGTTG
ATAAGAGTTTCTTGTTTTCAAACCTCCCACCACCACCAATACCATAA

mRNA sequence

NTGCACAGCTCAAACGTATGGGGAGGATCATACGAAGTAGCAGTACACAGAGACGAACACGAACACAACAACAACAACAACAACAACAACAAAAGCAGAGAAATGGAAGATTGGGATAAATCAAGCTTACTTTACCAAGAACAACTCCATAATCATCACAATCAAAATCACACTCATGAATTAGATCAAGTTCAACAAGGTTGGTTATTAATACCACAAGACAAAACCCGCAAAAACAGGAAGAACAAGTATGNACAACTCCATAATCATCACAATCAAAATCACACTCATGAATTAGATCAAGTTCAACAAGGTTGGTTATTAATACCACAAGACAAAACCCGCAAAAACAGGAAGAACAAGTATGTTGATTTTGGGTGCATTATTGTTAGCAAGAAGATACTCAAATGGGCATTTTGGTCTTTTCTAATTGCGTTCATTGTTATTGGGTTGCCGATTATTATTGCTAAGTCTTTGCCTAAGCATAAACATGGACCTCCTCCTGCTGATAATTACACTGTTGCCCTTAATATGGCTCTTCGGTTCTTCAATGCCCAGAAATCGGGGAAATTGCCGAAGCATAACAACGTATCATGGAGAGGAAACTCGGGATTAAAAGACGGATCAAAGTTAACCGATGTAAAAGGAGGATTAGTTGGTGGATATTATGATTCAGGAGAAAACACTAAGTACCATTTTCCTATGGCATTTGCAATGACAATGCTAAGTTGGAGTGTGATTGAATACCCTCACAAGTACGAGGCCATTGGCGAGTATGATCATGTTCGCGAGCTCATCAAATGGGGGACCGATTACTTGCTCAAAACCTTCAANGCGAGTATGATCATGTTCGCGAGCTCATCAAATGGGGGACCGATTACTTGCTCAAAACCTTCAACAGTTCGGCCACTCAGATCGACAAAATCCACAGCCAGGTTGGTTGGTGGTGCACAAAACGGATCCACTATCCCGAGTGACGTCACCTGTTGGGAGAGGCCTGAAGACATGGACTATGAGCGCCCCGTGCAGACAAGCTTTGCAGGGGCGGATCTAGGTGGAGAAATGGCCGCAGCCTTTGCTGCAGCCTCTATAGTTTTCCGTGACAACCAGCTTTACGCGAAGAAGCTCGTCAGAGGGGCAGAAACCGTTTTCCGGTTTGCTAGAGATACCGGTAAAAGAGCCCCGTATAGCAGAGGAAACCCATGGGCTGCACCGTATTATAACTCTACCGGGTATTATGATGAGTACATGTGGGGTGCTACTTGGTTGTATTATGCTACGGGGAATTCAACCTACGGTTTGCTGGCTACTAATCCCGGGATTCCTAAGAACGCGAAGGCGTTTAGGATGAACACGAACACGAGCTACTTGAGTTGGGATAACAAGTTGCCTGCTGCTATGTTGCTGTTGACGAGGTTTAGGATGTTTCTGAATCCTGGTTACCCGTATGAGCAGACTTTGGCGCAGTACCATAATGTTACGAAGCTTAATATGTGCTCGTATCTTAAGCAATTCCCGGTTTATAACCGGACTCGAGGAGAAAACACTAAGTACCATTTTCCTATGGCATTTGCAATGACAATGCTAAGTTGGAGTGTGATTGAATACCCTCACAAGTACGAGGCCATTGGCGAGTATGATCATGTTCGCGAGCTCATCAAATGGGGGACCGATTACTTGCTCAAAACCTTCAATAGTTCGGCCACTCAGATCGACAAAATCCACAGCCAGGTTGGTGGTGCACAAAACGGATCCACTATCCCGAGCGACGTCACCTGTTGGGAGAGGCCCGAAGACATGGACTATGAACGCCCCGTGCAAACAAGCTTTGCAGGGGCTGATCTAGGTGGAGAAATGGCCGCGGCCTTTGCTGCAGCCTCCATCGTATTCCGTGACAACCAGCTTTACTCGAAGAAGCTCGTCAGAGGGGCAGAAACCGTGTTCCGGTTTGCTAGGGATACCGGTAAAAGAGCCCCGTATAGCAGAGGAAACCCATGGGC
TGCACCGTATTATAACTCAACCGGGTATTACGATGAGTACATGTGGGGTGCTACTTGGTTGTATTATGCAACCGGGAATTCAACCTACGGTTTGCTGGCTACTAATCCCGGGATTCCTAAGAACGCGAAGGCGTTTAGGATGAACACGAACACGAGCTACTTGAGTTGGGATAACAAGTTGCCTGCTGCTATGTTGCTGTTGACGAGGTTTAGGATGTTTCTGAATCCTGGTTACCCGTATGAGCAGACTTTGGCGCAGTACCATAATGTTACGAAGCTTAATATGTGCTCGTATCTTAAGCAATTCCCGGTTTATAACCGGACTCGAGGAGGACTGATAGAGCTGAACAGCGGAGGGCCGAAGCCGTTGCAGTACGTAGTGAATGCTGCATTTTTGGCGAATCTGTTTGCAGATTACATGGAGTCTACTGGGGTGCCTGGATGGTATTGTGGCCCGAATTTTATGCGTCAATCTGATCTTCGTAACTTCGCTCAAACTCAGGTTGAGTACATTTTGGGCAAGAACCCAATGCACATGAGCTATGTGGTGGGCTTCGGAAACAAGTACCCGAAACACGTGCATCACCGGGCTGCGTCGATACCGCGCAAAAACGGAGGGAAATGTAAAGGGGGTTACAAATGGAGGGACAGCAAAAAGCCCAACCCATACACAATTATTGGGGCAATGGTTGGTGGTCCGGACCGTTTTGACAACTTCAAAGATAACCGCAAACTAGACCGCTACACCGAGCCCACATTGGCTGGAAATGCCGGTCTAGTGGCGGCTCTTATCTCGTTGACGGCAACTCCCGGGGTTTCCGGAGTGGACCGGAACCTTATATTTTCGGGTGTTCCCCCGTTTTATACTCCGGCCCCACCTCCCCCCGCACCCCCGAGAACGGCCACATCGACCAACTCGGCCCGAACCTTGCCTACGCACAACTTTGATAATTATACTCTTGCTCTTCGTAAAGCACTCTTGTTCTTCAATGCCCAGAAATCTGGGAAACTACCCAAGAACAATGGAATTCCATGGAGAGGAAACTCGGGCCTACGCGACGGGTCACAACTGAAAGATGTACAGGGAGATGGTTTAGTTGGTGGATACTATGATTCTGGGGAAAACACAAAGTTTCATTTCCCTTTGGCTTACTCCATGACAATGCTAAGTTGGAGCTTGATTGAATACCCTCACAAATATCGTGCCATTAACGAGTATAATCATGTACTCGAACTTATCAAATGGGGAACTGATTACTTGCTCTTGACATTCGATTCTAATGCCACAAAGATCAGCAAAATCTATAGTCAGGTTGGAGGTGCCTATGAAAAATCAAACATACCAGATGACATCACCTGTTGGGAAAGACCAGAAGACATGGATTATCCCAGACCAGTACAAACAACATACGCAGGACCAGAACTCGCTGCAGAAATGACAGCAGCTTTATCATCAGCCTCAATACTATTCAAACAAAACAACCCTACTTACTCCATAAAGCTCATCAAAGCATCCGAATCCCTCTTCGCCTTCGCTAGGGACACTCGTAAAAGACGGCCCTACAGCCGAGGAAACCCGTGGGTCGACCCGTATTACAACTCGACGGGGTACTTTGACGAGTACTTGTGGGGTTCAACGTGGTTGTATTTAGCTACCGGGAATGTTAAGTATTTGGCATTAGCAACTAATCCGGGGATAACTAAGAATGCGATGGGTTTTAGATGGACGAGGAAGATGAGTGTGTTGAGTTGGGATAATAAGGTTCCTTCGGCTTTGATGTTGCTTACAAGGGTGAGGTTGTTTAAAGCTCCTGAATACCCTTATGAAGAGACATTGATTGGGCATCATAAGTTTATTGCTCTCTCTATGTGTTCTTATCTCAAACCCTTTCGTCTCTTTAAGTGGAGTAAAGGAGGACTGATACAGTTTAATGATGGAGGGGAAGAATCTCTTCAATATGTAGCAAATGCTGCATTTTTAGCAAACCTATTTGCTGATT
ACCTGAATGCTACTGATAGTCCGGGATGGTTATGTGGCCCTAATTTCTTCCCTATTGCTACACTCCGGGAATTCGCCTCATCTCAGATAGACTACATATTGGGTAAAAACCCCCTAGACCTAAGCTATGTGGTGGGGTACGGTGACAAGTACCCGAACCGTGTACATCACCGAGCAGCTTCAATCCCGTCAGACGGTCTGAAGTACTCGTGTAAAGGTGGATACAAATGGAGGGACAGCACAAAACCGAATCCCAACACCATTGTAGGAGCCATGGTTGGCGGACCTGACCGGTTCGATAATTTCCACGATGACCGTAGTGCGTCCGCGGGACCCACATTAGCCGGAAACGCCGGGCTCGTGGCAGCCCTTGTTTCACTGACTAGTGGTGGTGGTTATGGTGTTGATAAGAGTTTCTTGTTTTCAAACCTCCCACCACCACCAATACCATAA

Coding sequence (CDS)

ATGGAAGATTGGGATAAATCAAGCTTACTTTACCAAGAACAACTCCATAATCATCACAATCAAAATCACACTCATGAATTAGATCAAGTTCAACAAGGTTGGTTATTAATACCACAAGACAAAACCCGCAAAAACAGGAAGAACAAGTATGNACAACTCCATAATCATCACAATCAAAATCACACTCATGAATTAGATCAAGTTCAACAAGGTTGGTTATTAATACCACAAGACAAAACCCGCAAAAACAGGAAGAACAAGTATGTTGATTTTGGGTGCATTATTGTTAGCAAGAAGATACTCAAATGGGCATTTTGGTCTTTTCTAATTGCGTTCATTGTTATTGGGTTGCCGATTATTATTGCTAAGTCTTTGCCTAAGCATAAACATGGACCTCCTCCTGCTGATAATTACACTGTTGCCCTTAATATGGCTCTTCGGTTCTTCAATGCCCAGAAATCGGGGAAATTGCCGAAGCATAACAACGTATCATGGAGAGGAAACTCGGGATTAAAAGACGGATCAAAGTTAACCGATGTAAAAGGAGGATTAGTTGGTGGATATTATGATTCAGGAGAAAACACTAAGTACCATTTTCCTATGGCATTTGCAATGACAATGCTAAGTTGGAGTGTGATTGAATACCCTCACAAGTACGAGGCCATTGGCGAGTATGATCATGTTCGCGAGCTCATCAAATGGGGGACCGATTACTTGCTCAAAACCTTCAANGCGAGTATGATCATGTTCGCGAGCTCATCAAATGGGGGACCGATTACTTGCTCAAAACCTTCAACAGTTCGGCCACTCAGATCGACAAAATCCACAGCCAGGTTGGTTGGTGGTGCACAAAACGGATCCACTATCCCGAGTGACGTCACCTGTTGGGAGAGGCCTGAAGACATGGACTATGAGCGCCCCGTGCAGACAAGCTTTGCAGGGGCGGATCTAGGTGGAGAAATGGCCGCAGCCTTTGCTGCAGCCTCTATAGTTTTCCGTGACAACCAGCTTTACGCGAAGAAGCTCGTCAGAGGGGCAGAAACCGTTTTCCGGTTTGCTAGAGATACCGGTAAAAGAGCCCCGTATAGCAGAGGAAACCCATGGGCTGCACCGTATTATAACTCTACCGGGTATTATGATGAGTACATGTGGGGTGCTACTTGGTTGTATTATGCTACGGGGAATTCAACCTACGGTTTGCTGGCTACTAATCCCGGGATTCCTAAGAACGCGAAGGCGTTTAGGATGAACACGAACACGAGCTACTTGAGTTGGGATAACAAGTTGCCTGCTGCTATGTTGCTGTTGACGAGGTTTAGGATGTTTCTGAATCCTGGTTACCCGTATGAGCAGACTTTGGCGCAGTACCATAATGTTACGAAGCTTAATATGTGCTCGTATCTTAAGCAATTCCCGGTTTATAACCGGACTCGAGGAGAAAACACTAAGTACCATTTTCCTATGGCATTTGCAATGACAATGCTAAGTTGGAGTGTGATTGAATACCCTCACAAGTACGAGGCCATTGGCGAGTATGATCATGTTCGCGAGCTCATCAAATGGGGGACCGATTACTTGCTCAAAACCTTCAATAGTTCGGCCACTCAGATCGACAAAATCCACAGCCAGGTTGGTGGTGCACAAAACGGATCCACTATCCCGAGCGACGTCACCTGTTGGGAGAGGCCCGAAGACATGGACTATGAACGCCCCGTGCAAACAAGCTTTGCAGGGGCTGATCTAGGTGGAGAAATGGCCGCGGCCTTTGCTGCAGCCTCCATCGTATTCCGTGACAACCAGCTTTACTCGAAGAAGCTCGTCAGAGGGGCAGAAACCGTGTTCCGGTTTGCTAGGGATACCGGTAAAAGAGCCCCGTATAGCAGAGGAAACCCATGGGCTGCACCGTATTATAACTCAACCGGGTATTACGATGAGTACATGTGGGGTGCTACTTGGTTGTATTATGCAACCGGGAATTCAACCTACGGTTTGCTGGCTAC
TAATCCCGGGATTCCTAAGAACGCGAAGGCGTTTAGGATGAACACGAACACGAGCTACTTGAGTTGGGATAACAAGTTGCCTGCTGCTATGTTGCTGTTGACGAGGTTTAGGATGTTTCTGAATCCTGGTTACCCGTATGAGCAGACTTTGGCGCAGTACCATAATGTTACGAAGCTTAATATGTGCTCGTATCTTAAGCAATTCCCGGTTTATAACCGGACTCGAGGAGGACTGATAGAGCTGAACAGCGGAGGGCCGAAGCCGTTGCAGTACGTAGTGAATGCTGCATTTTTGGCGAATCTGTTTGCAGATTACATGGAGTCTACTGGGGTGCCTGGATGGTATTGTGGCCCGAATTTTATGCGTCAATCTGATCTTCGTAACTTCGCTCAAACTCAGGTTGAGTACATTTTGGGCAAGAACCCAATGCACATGAGCTATGTGGTGGGCTTCGGAAACAAGTACCCGAAACACGTGCATCACCGGGCTGCGTCGATACCGCGCAAAAACGGAGGGAAATGTAAAGGGGGTTACAAATGGAGGGACAGCAAAAAGCCCAACCCATACACAATTATTGGGGCAATGGTTGGTGGTCCGGACCGTTTTGACAACTTCAAAGATAACCGCAAACTAGACCGCTACACCGAGCCCACATTGGCTGGAAATGCCGGTCTAGTGGCGGCTCTTATCTCGTTGACGGCAACTCCCGGGGTTTCCGGAGTGGACCGGAACCTTATATTTTCGGGTGTTCCCCCGTTTTATACTCCGGCCCCACCTCCCCCCGCACCCCCGAGAACGGCCACATCGACCAACTCGGCCCGAACCTTGCCTACGCACAACTTTGATAATTATACTCTTGCTCTTCGTAAAGCACTCTTGTTCTTCAATGCCCAGAAATCTGGGAAACTACCCAAGAACAATGGAATTCCATGGAGAGGAAACTCGGGCCTACGCGACGGGTCACAACTGAAAGATGTACAGGGAGATGGTTTAGTTGGTGGATACTATGATTCTGGGGAAAACACAAAGTTTCATTTCCCTTTGGCTTACTCCATGACAATGCTAAGTTGGAGCTTGATTGAATACCCTCACAAATATCGTGCCATTAACGAGTATAATCATGTACTCGAACTTATCAAATGGGGAACTGATTACTTGCTCTTGACATTCGATTCTAATGCCACAAAGATCAGCAAAATCTATAGTCAGGTTGGAGGTGCCTATGAAAAATCAAACATACCAGATGACATCACCTGTTGGGAAAGACCAGAAGACATGGATTATCCCAGACCAGTACAAACAACATACGCAGGACCAGAACTCGCTGCAGAAATGACAGCAGCTTTATCATCAGCCTCAATACTATTCAAACAAAACAACCCTACTTACTCCATAAAGCTCATCAAAGCATCCGAATCCCTCTTCGCCTTCGCTAGGGACACTCGTAAAAGACGGCCCTACAGCCGAGGAAACCCGTGGGTCGACCCGTATTACAACTCGACGGGGTACTTTGACGAGTACTTGTGGGGTTCAACGTGGTTGTATTTAGCTACCGGGAATGTTAAGTATTTGGCATTAGCAACTAATCCGGGGATAACTAAGAATGCGATGGGTTTTAGATGGACGAGGAAGATGAGTGTGTTGAGTTGGGATAATAAGGTTCCTTCGGCTTTGATGTTGCTTACAAGGGTGAGGTTGTTTAAAGCTCCTGAATACCCTTATGAAGAGACATTGATTGGGCATCATAAGTTTATTGCTCTCTCTATGTGTTCTTATCTCAAACCCTTTCGTCTCTTTAAGTGGAGTAAAGGAGGACTGATACAGTTTAATGATGGAGGGGAAGAATCTCTTCAATATGTAGCAAATGCTGCATTTTTAGCAAACCTATTTGCTGATTACCTGAATGCTACTGATAGTCCGGGATGGTTATGTGGCCCTAATTTCTTCCCTATTGCTACACTCCGGGAATTCGCCTCATCTCAGATAGACTACATATTGG
GTAAAAACCCCCTAGACCTAAGCTATGTGGTGGGGTACGGTGACAAGTACCCGAACCGTGTACATCACCGAGCAGCTTCAATCCCGTCAGACGGTCTGAAGTACTCGTGTAAAGGTGGATACAAATGGAGGGACAGCACAAAACCGAATCCCAACACCATTGTAGGAGCCATGGTTGGCGGACCTGACCGGTTCGATAATTTCCACGATGACCGTAGTGCGTCCGCGGGACCCACATTAGCCGGAAACGCCGGGCTCGTGGCAGCCCTTGTTTCACTGACTAGTGGTGGTGGTTATGGTGTTGATAAGAGTTTCTTGTTTTCAAACCTCCCACCACCACCAATACCATAA

Protein sequence

MEDWDKSSLLYQEQLHNHHNQNHTHELDQVQQGWLLIPQDKTRKNRKNKYXQLHNHHNQNHTHELDQVQQGWLLIPQDKTRKNRKNKYVDFGCIIVSKKILKWAFWSFLIAFIVIGLPIIIAKSLPKHKHGPPPADNYTVALNMALRFFNAQKSGKLPKHNNVSWRGNSGLKDGSKLTDVKGGLVGGYYDSGENTKYHFPMAFAMTMLSWSVIEYPHKYEAIGEYDHVRELIKWGTDYLLKTFXASMIMFASSSNGGPITCSKPSTVRPLRSTKSTARLVGGAQNGSTIPSDVTCWERPEDMDYERPVQTSFAGADLGGEMAAAFAAASIVFRDNQLYAKKLVRGAETVFRFARDTGKRAPYSRGNPWAAPYYNSTGYYDEYMWGATWLYYATGNSTYGLLATNPGIPKNAKAFRMNTNTSYLSWDNKLPAAMLLLTRFRMFLNPGYPYEQTLAQYHNVTKLNMCSYLKQFPVYNRTRGENTKYHFPMAFAMTMLSWSVIEYPHKYEAIGEYDHVRELIKWGTDYLLKTFNSSATQIDKIHSQVGGAQNGSTIPSDVTCWERPEDMDYERPVQTSFAGADLGGEMAAAFAAASIVFRDNQLYSKKLVRGAETVFRFARDTGKRAPYSRGNPWAAPYYNSTGYYDEYMWGATWLYYATGNSTYGLLATNPGIPKNAKAFRMNTNTSYLSWDNKLPAAMLLLTRFRMFLNPGYPYEQTLAQYHNVTKLNMCSYLKQFPVYNRTRGGLIELNSGGPKPLQYVVNAAFLANLFADYMESTGVPGWYCGPNFMRQSDLRNFAQTQVEYILGKNPMHMSYVVGFGNKYPKHVHHRAASIPRKNGGKCKGGYKWRDSKKPNPYTIIGAMVGGPDRFDNFKDNRKLDRYTEPTLAGNAGLVAALISLTATPGVSGVDRNLIFSGVPPFYTPAPPPPAPPRTATSTNSARTLPTHNFDNYTLALRKALLFFNAQKSGKLPKNNGIPWRGNSGLRDGSQLKDVQGDGLVGGYYDSGENTKFHFPLAYSMTMLSWSLIEYPHKYRAINEYNHVLELIKWGTDYLLLTFDSNATKISKIYSQVGGAYEKSNIPDDITCWERPEDMDYPRPVQTTYAGPELAAEMTAALSSASILFKQNNPTYSIKLIKASESLFAFARDTRKRRPYSRGNPWVDPYYNSTGYFDEYLWGSTWLYLATGNVKYLALATNPGITKNAMGFRWTRKMSVLSWDNKVPSALMLLTRVRLFKAPEYPYEETLIGHHKFIALSMCSYLKPFRLFKWSKGGLIQFNDGGEESLQYVANAAFLANLFADYLNATDSPGWLCGPNFFPIATLREFASSQIDYILGKNPLDLSYVVGYGDKYPNRVHHRAASIPSDGLKYSCKGGYKWRDSTKPNPNTIVGAMVGGPDRFDNFHDDRSASAGPTLAGNAGLVAALVSLTSGGGYGVDKSFLFSNLPPPPIP
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name    Type
Spo11971        Spo11971       gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name    Unique Name           Type
Spo11971.1      Spo11971.1-protein    polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Spo11971.1.exon.1     Spo11971.1.exon.1     exon
Spo11971.1.exon.2     Spo11971.1.exon.2     exon
Spo11971.1.exon.3     Spo11971.1.exon.3     exon
Spo11971.1.exon.4     Spo11971.1.exon.4     exon
Spo11971.1.exon.5     Spo11971.1.exon.5     exon
Spo11971.1.exon.6     Spo11971.1.exon.6     exon
Spo11971.1.exon.7     Spo11971.1.exon.7     exon
Spo11971.1.exon.8     Spo11971.1.exon.8     exon
Spo11971.1.exon.9     Spo11971.1.exon.9     exon
Spo11971.1.exon.10    Spo11971.1.exon.10    exon
Spo11971.1.exon.11    Spo11971.1.exon.11    exon
Spo11971.1.exon.12    Spo11971.1.exon.12    exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Spo11971.1.utr5p.1    Spo11971.1.utr5p.1    five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name         Unique Name          Type
Spo11971.1.CDS.1     Spo11971.1.CDS.1     CDS
Spo11971.1.CDS.2     Spo11971.1.CDS.2     CDS
Spo11971.1.CDS.3     Spo11971.1.CDS.3     CDS
Spo11971.1.CDS.4     Spo11971.1.CDS.4     CDS
Spo11971.1.CDS.5     Spo11971.1.CDS.5     CDS
Spo11971.1.CDS.6     Spo11971.1.CDS.6     CDS
Spo11971.1.CDS.7     Spo11971.1.CDS.7     CDS
Spo11971.1.CDS.8     Spo11971.1.CDS.8     CDS
Spo11971.1.CDS.9     Spo11971.1.CDS.9     CDS
Spo11971.1.CDS.10    Spo11971.1.CDS.10    CDS
Spo11971.1.CDS.11    Spo11971.1.CDS.11    CDS
Spo11971.1.CDS.12    Spo11971.1.CDS.12    CDS


Homology
BLAST of Spo11971.1 vs. NCBI nr
Match: gi|731326101|ref|XP_010673848.1| (PREDICTED: endoglucanase 12-like [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 896.3 bits (2315), Expect = 6.800e-257
Identity = 423/513 (82.46%), Positives = 463/513 (90.25%), Query Frame = 1

Query: 936  STNSARTLPTHNFDNYTLALRKALLFFNAQKSGKLPKNNGIPWRGNSGLRDGSQLKDVQG 995
            STNSA+ +PTHNFD+Y+ AL KALLFFNAQKSGKLPKNN IPWRGNSGL DGSQLKDV+G
Sbjct: 30   STNSAQNIPTHNFDHYSTALHKALLFFNAQKSGKLPKNNEIPWRGNSGLLDGSQLKDVKG 89

Query: 996  DGLVGGYYDSGENTKFHFPLAYSMTMLSWSLIEYPHKYRAINEYNHVLELIKWGTDYLLL 1055
             GLVGGYYDSGENTKFHFPLA+SMTMLSWS+IEYPHKYRAINEYNH  ELIKWGTDYLLL
Sbjct: 90   -GLVGGYYDSGENTKFHFPLAFSMTMLSWSMIEYPHKYRAINEYNHTRELIKWGTDYLLL 149

Query: 1056 TFDSNATKISKIYSQVGGAYEKSNIPDDITCWERPEDMDYPRPVQTTYAGPELAAEMTAA 1115
            TFDSNA+KI+KIYSQVGGAYE SN+PDDI+CW+RPEDMDYPRPVQT YAGPELA EM AA
Sbjct: 150  TFDSNASKINKIYSQVGGAYENSNLPDDISCWQRPEDMDYPRPVQTAYAGPELAGEMAAA 209

Query: 1116 LSSASILFKQNNPTYSIKLIKASESLFAFARDTRKRRPYSRGNPWVDPYYNSTGYFDEYL 1175
            L+SASI++  +NP YS KLIK +E+LF FARD  KRRPYSRGNPW++PYYNSTGYFDEYL
Sbjct: 210  LASASIVY-NDNPAYSKKLIKGAEALFTFARDPSKRRPYSRGNPWIEPYYNSTGYFDEYL 269

Query: 1176 WGSTWLYLATGNVKYLALATNPGITKNAMGFRWTRKMSVLSWDNKVPSALMLLTRVRLFK 1235
            W +TWLY ATGN+KYL LATNPGITKNAM  +WT+K S+LSWDNKVPS+LMLLTR+R+F 
Sbjct: 270  WSATWLYFATGNLKYLKLATNPGITKNAMTSKWTKKTSILSWDNKVPSSLMLLTRIRMFH 329

Query: 1236 APEYPYEETLIGHHKFIALSMCSYLKPFRLFKWSKGGLIQFNDGGEESLQYVANAAFLAN 1295
            AP YPYE+ L  HHK IALSMCSYLKPF LF WSKGGLIQ NDGG +SLQYVANAAFLAN
Sbjct: 330  APGYPYEQALAEHHKLIALSMCSYLKPFHLFSWSKGGLIQLNDGGPQSLQYVANAAFLAN 389

Query: 1296 LFADYLNATDSPGWLCGPNFFPIATLREFASSQIDYILGKNPLDLSYVVGYGDKYPNRVH 1355
            LFADYLNA+D+ GW CGPNFF IATLR FASSQI+YILGKNPL LSYVVGYG+K+PN VH
Sbjct: 390  LFADYLNASDTSGWFCGPNFFSIATLRNFASSQIEYILGKNPLKLSYVVGYGEKFPNHVH 449

Query: 1356 HRAASIPSDGLKYSCKGGYKWRDSTKPNPNTIVGAMVGGPDRFDNFHDDRS--ASAGPTL 1415
            HRAASIPSDGLKYSCKGGYKW+DSTKPNPNTI GAMVGGPD+FD+FHDDRS   S+GPTL
Sbjct: 450  HRAASIPSDGLKYSCKGGYKWKDSTKPNPNTIEGAMVGGPDQFDHFHDDRSDRGSSGPTL 509

Query: 1416 AGNAGLVAALVSLTSGGGYGVDKSFLFSNLPPP 1447
            A NAGLVAALVSLT  GGYGVDK+FLFSNL PP
Sbjct: 510  AANAGLVAALVSLTGSGGYGVDKNFLFSNLSPP 540
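
The percentages in an HSP statistics line are the matched-residue counts divided by the alignment length. As a minimal sketch (the function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool), they can be recomputed from the counts like this:

```python
import re

# Hypothetical helper: matches "Identity = 423/513" style count pairs.
HSP_STATS = re.compile(r"(Identity|Positives)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(stats_line):
    """Return {label: percent} for each 'Label = matched/aligned' pair."""
    return {
        label: round(100.0 * int(matched) / int(aligned), 2)
        for label, matched, aligned in HSP_STATS.findall(stats_line)
    }

# Statistics line of the first HSP reported for Spo11971.1.
line = "Identity = 423/513 (82.46%), Positives = 463/513 (90.25%), Query Frame = 1"
print(hsp_percentages(line))
```

Recomputing the values this way is a quick consistency check when HSP lines are copied out of a report by hand.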

BLAST of Spo11971.1 vs. NCBI nr
Match: gi|870863518|gb|KMT14682.1| (hypothetical protein BVRB_4g074500 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 845.9 bits (2184), Expect = 1.100e-241
Identity = 402/456 (88.16%), Positives = 425/456 (93.20%), Query Frame = 1

Query: 479 GENTKYHFPMAFAMTMLSWSVIEYPHKYEAIGEYDHVRELIKWGTDYLLKTFNSSATQID 538
           GENTK+HFPMA+AMTMLSWSVIEYPHKYEAI EYDHVRELIKWGTDYLL TFNSSA  ID
Sbjct: 106 GENTKFHFPMAYAMTMLSWSVIEYPHKYEAINEYDHVRELIKWGTDYLLLTFNSSANLID 165

Query: 539 KIHSQVGGAQNGSTIPSDVTCWERPEDMDYERPVQTSFAGADLGGEMAAAFAAASIVFRD 598
            +HSQVGGAQNGST+PSD+TCWERPE MDYERPVQTSFAGADLGGEMAAA AAASIVFRD
Sbjct: 166 HVHSQVGGAQNGSTVPSDMTCWERPEKMDYERPVQTSFAGADLGGEMAAALAAASIVFRD 225

Query: 599 NQLYSKKLVRGAETVFRFARDTGKRAPYSRGNPWAAPYYNSTGYYDEYMWGATWLYYATG 658
           N+ Y+KKL++GAETVFRFARD GKRAPYSRGN + +PYYNSTGYYDEYMWGATWLYYATG
Sbjct: 226 NRAYAKKLIKGAETVFRFARDEGKRAPYSRGNLYISPYYNSTGYYDEYMWGATWLYYATG 285

Query: 659 NSTYGLLATNPGIPKNAKAFRMNTNTSYLSWDNKLPAAMLLLTRFRMFLNPGYPYEQTLA 718
           NSTYG LATNPGIPKNA+AFRMNTNTSYLSWDNKLPA+MLLLTRFRMFLNPGYPYEQTL 
Sbjct: 286 NSTYGSLATNPGIPKNARAFRMNTNTSYLSWDNKLPASMLLLTRFRMFLNPGYPYEQTLM 345

Query: 719 QYHNVTKLNMCSYLKQFPVYNRTRGGLIELNSGGPKPLQYVVNAAFLANLFADYMESTGV 778
           QYHNVT+LNMCSY+KQF VYN T+GGLIELN+GGPKPLQYV NAAFLANLFADYM+STGV
Sbjct: 346 QYHNVTQLNMCSYMKQFHVYNWTQGGLIELNNGGPKPLQYVANAAFLANLFADYMDSTGV 405

Query: 779 PGWYCGPNFMRQSDLRNFAQTQVEYILGKNPMHMSYVVGFGNKYPKHVHHRAASIPRKNG 838
           PGWYCGP F+RQSDLR FA +QV+YILGKNP+HMSYVVGFGNKYPKHVHHRAASIP KNG
Sbjct: 406 PGWYCGPYFLRQSDLRRFATSQVDYILGKNPLHMSYVVGFGNKYPKHVHHRAASIPHKNG 465

Query: 839 GK--CKGGYKWRDSKKPNPYTIIGAMVGGPDRFDNFKDNRKLDRYTEPTLAGNAGLVAAL 898
            K  CKGGYKWRDSKKPNP TI+GAMVGGPDRFD FKDNRKLDRYTEPTLAGNAGLVAAL
Sbjct: 466 VKYSCKGGYKWRDSKKPNPNTIVGAMVGGPDRFDRFKDNRKLDRYTEPTLAGNAGLVAAL 525

Query: 899 ISLTATPGVSGVDRNLIFSGVPPFYTPAPPPPAPPR 933
           ISLT T G SGVDRN IFSGVPP YTPAPPPPAP R
Sbjct: 526 ISLTTTAG-SGVDRNYIFSGVPPLYTPAPPPPAPWR 560

BLAST of Spo11971.1 vs. NCBI nr
Match: gi|731326097|ref|XP_010673846.1| (PREDICTED: endoglucanase 12-like [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 845.9 bits (2184), Expect = 1.100e-241
Identity = 402/456 (88.16%), Positives = 425/456 (93.20%), Query Frame = 1

		  

Query: 479 GENTKYHFPMAFAMTMLSWSVIEYPHKYEAIGEYDHVRELIKWGTDYLLKTFNSSATQID 538
           GENTK+HFPMA+AMTMLSWSVIEYPHKYEAI EYDHVRELIKWGTDYLL TFNSSA  ID
Sbjct: 179 GENTKFHFPMAYAMTMLSWSVIEYPHKYEAINEYDHVRELIKWGTDYLLLTFNSSANLID 238

Query: 539 KIHSQVGGAQNGSTIPSDVTCWERPEDMDYERPVQTSFAGADLGGEMAAAFAAASIVFRD 598
            +HSQVGGAQNGST+PSD+TCWERPE MDYERPVQTSFAGADLGGEMAAA AAASIVFRD
Sbjct: 239 HVHSQVGGAQNGSTVPSDMTCWERPEKMDYERPVQTSFAGADLGGEMAAALAAASIVFRD 298

Query: 599 NQLYSKKLVRGAETVFRFARDTGKRAPYSRGNPWAAPYYNSTGYYDEYMWGATWLYYATG 658
           N+ Y+KKL++GAETVFRFARD GKRAPYSRGN + +PYYNSTGYYDEYMWGATWLYYATG
Sbjct: 299 NRAYAKKLIKGAETVFRFARDEGKRAPYSRGNLYISPYYNSTGYYDEYMWGATWLYYATG 358

Query: 659 NSTYGLLATNPGIPKNAKAFRMNTNTSYLSWDNKLPAAMLLLTRFRMFLNPGYPYEQTLA 718
           NSTYG LATNPGIPKNA+AFRMNTNTSYLSWDNKLPA+MLLLTRFRMFLNPGYPYEQTL 
Sbjct: 359 NSTYGSLATNPGIPKNARAFRMNTNTSYLSWDNKLPASMLLLTRFRMFLNPGYPYEQTLM 418

Query: 719 QYHNVTKLNMCSYLKQFPVYNRTRGGLIELNSGGPKPLQYVVNAAFLANLFADYMESTGV 778
           QYHNVT+LNMCSY+KQF VYN T+GGLIELN+GGPKPLQYV NAAFLANLFADYM+STGV
Sbjct: 419 QYHNVTQLNMCSYMKQFHVYNWTQGGLIELNNGGPKPLQYVANAAFLANLFADYMDSTGV 478

Query: 779 PGWYCGPNFMRQSDLRNFAQTQVEYILGKNPMHMSYVVGFGNKYPKHVHHRAASIPRKNG 838
           PGWYCGP F+RQSDLR FA +QV+YILGKNP+HMSYVVGFGNKYPKHVHHRAASIP KNG
Sbjct: 479 PGWYCGPYFLRQSDLRRFATSQVDYILGKNPLHMSYVVGFGNKYPKHVHHRAASIPHKNG 538

Query: 839 GK--CKGGYKWRDSKKPNPYTIIGAMVGGPDRFDNFKDNRKLDRYTEPTLAGNAGLVAAL 898
            K  CKGGYKWRDSKKPNP TI+GAMVGGPDRFD FKDNRKLDRYTEPTLAGNAGLVAAL
Sbjct: 539 VKYSCKGGYKWRDSKKPNPNTIVGAMVGGPDRFDRFKDNRKLDRYTEPTLAGNAGLVAAL 598

Query: 899 ISLTATPGVSGVDRNLIFSGVPPFYTPAPPPPAPPR 933
           ISLT T G SGVDRN IFSGVPP YTPAPPPPAP R
Sbjct: 599 ISLTTTAG-SGVDRNYIFSGVPPLYTPAPPPPAPWR 633

BLAST of Spo11971.1 vs. NCBI nr
Match: gi|902209662|gb|KNA16253.1| (hypothetical protein SOVF_090810 [Spinacia oleracea])

HSP 1 Score: 742.3 bits (1915), Expect = 1.600e-210
Identity = 356/357 (99.72%), Positives = 356/357 (99.72%), Query Frame = 1

		  

Query: 1093 MDYPRPVQTTYAGPELAAEMTAALSSASILFKQNNPTYSIKLIKASESLFAFARDTRKRR 1152
            MDYPRPVQTTYAGPELAAEMTAALSSASILFKQNNPTYS KLIKASESLFAFARDTRKRR
Sbjct: 1    MDYPRPVQTTYAGPELAAEMTAALSSASILFKQNNPTYSKKLIKASESLFAFARDTRKRR 60

Query: 1153 PYSRGNPWVDPYYNSTGYFDEYLWGSTWLYLATGNVKYLALATNPGITKNAMGFRWTRKM 1212
            PYSRGNPWVDPYYNSTGYFDEYLWGSTWLYLATGNVKYLALATNPGITKNAMGFRWTRKM
Sbjct: 61   PYSRGNPWVDPYYNSTGYFDEYLWGSTWLYLATGNVKYLALATNPGITKNAMGFRWTRKM 120

Query: 1213 SVLSWDNKVPSALMLLTRVRLFKAPEYPYEETLIGHHKFIALSMCSYLKPFRLFKWSKGG 1272
            SVLSWDNKVPSALMLLTRVRLFKAPEYPYEETLIGHHKFIALSMCSYLKPFRLFKWSKGG
Sbjct: 121  SVLSWDNKVPSALMLLTRVRLFKAPEYPYEETLIGHHKFIALSMCSYLKPFRLFKWSKGG 180

Query: 1273 LIQFNDGGEESLQYVANAAFLANLFADYLNATDSPGWLCGPNFFPIATLREFASSQIDYI 1332
            LIQFNDGGEESLQYVANAAFLANLFADYLNATDSPGWLCGPNFFPIATLREFASSQIDYI
Sbjct: 181  LIQFNDGGEESLQYVANAAFLANLFADYLNATDSPGWLCGPNFFPIATLREFASSQIDYI 240

Query: 1333 LGKNPLDLSYVVGYGDKYPNRVHHRAASIPSDGLKYSCKGGYKWRDSTKPNPNTIVGAMV 1392
            LGKNPLDLSYVVGYGDKYPNRVHHRAASIPSDGLKYSCKGGYKWRDSTKPNPNTIVGAMV
Sbjct: 241  LGKNPLDLSYVVGYGDKYPNRVHHRAASIPSDGLKYSCKGGYKWRDSTKPNPNTIVGAMV 300

Query: 1393 GGPDRFDNFHDDRSASAGPTLAGNAGLVAALVSLTSGGGYGVDKSFLFSNLPPPPIP 1450
            GGPDRFDNFHDDRSASAGPTLAGNAGLVAALVSLTSGGGYGVDKSFLFSNLPPPPIP
Sbjct: 301  GGPDRFDNFHDDRSASAGPTLAGNAGLVAALVSLTSGGGYGVDKSFLFSNLPPPPIP 357

BLAST of Spo11971.1 vs. NCBI nr
Match: gi|702252418|ref|XP_010065082.1| (PREDICTED: endoglucanase 12 [Eucalyptus grandis])

HSP 1 Score: 710.7 bits (1833), Expect = 5.300e-201
Identity = 336/512 (65.62%), Positives = 406/512 (79.30%), Query Frame = 1

		  

Query: 941  RTLPTHNF-----DNYTLALRKALLFFNAQKSGKLPKNNGIPWRGNSGLRDGSQLKDVQG 1000
            +T+P H       DNYT+AL KALLFFNAQKSGKLPKNN IPWRGNSGL DG+   DV+G
Sbjct: 99   KTVPRHKARPPPPDNYTVALHKALLFFNAQKSGKLPKNNPIPWRGNSGLTDGNGTTDVKG 158

Query: 1001 DGLVGGYYDSGENTKFHFPLAYSMTMLSWSLIEYPHKYRAINEYNHVLELIKWGTDYLLL 1060
             GLVGGYYD+G+NTKFHFP+A++M+MLSWS+IEY HKY A+ EYNH  +LIKWGTDYLLL
Sbjct: 159  -GLVGGYYDAGDNTKFHFPMAFAMSMLSWSVIEYSHKYEAVGEYNHARDLIKWGTDYLLL 218

Query: 1061 TFDSNATKISKIYSQVGGAYEKSNIPDDITCWERPEDMDYPRPVQTTYAGPELAAEMTAA 1120
            TF+S+A+KI KIYSQVGG+   S IPDD  CW RPEDMDYPRPVQT  +GP+LA EM AA
Sbjct: 219  TFNSSASKIDKIYSQVGGSQNGSKIPDDHNCWTRPEDMDYPRPVQTANSGPDLAGEMAAA 278

Query: 1121 LSSASILFKQNNPTYSIKLIKASESLFAFARDTRKRRPYSRGNPWVDPYYNSTGYFDEYL 1180
            L+SASI+F+ ++  YS KL+K + ++FAFARD+ +R PYSRGNP++DPYYNSTGYFDEY+
Sbjct: 279  LASASIVFR-DDAAYSKKLVKGAATVFAFARDSGRRTPYSRGNPYIDPYYNSTGYFDEYM 338

Query: 1181 WGSTWLYLATGNVKYLALATNPGITKNAMGFRWTRKMSVLSWDNKVPSALMLLTRVRLFK 1240
            WG+ WLY ATGN  Y++LATNPGI KN+  F   R +SVLSWDNK+P+A++LLTR+R+F 
Sbjct: 339  WGAAWLYYATGNKSYISLATNPGIPKNSKAFYMVRDLSVLSWDNKLPAAMLLLTRLRVFL 398

Query: 1241 APEYPYEETLIGHHKFIALSMCSYLKPFRLFKWSKGGLIQFNDGGEESLQYVANAAFLAN 1300
             P YPYE+ L  +     L+MCSYL  F +F W+KGG+I+ N G  + LQY+ANAAFLA+
Sbjct: 399  NPGYPYEDMLSMYQNVTGLTMCSYLHQFNVFNWTKGGMIELNHGNPQPLQYMANAAFLAS 458

Query: 1301 LFADYLNATDSPGWLCGPNFFPIATLREFASSQIDYILGKNPLDLSYVVGYGDKYPNRVH 1360
            L+ DY+NAT  PGW CGPNF    TLR FA+SQIDYILGKNP+ +SYVVG+G KYP  VH
Sbjct: 459  LYVDYMNATGVPGWNCGPNFITSDTLRSFATSQIDYILGKNPMKMSYVVGFGAKYPKHVH 518

Query: 1361 HRAASIPSDGLKYSCKGGYKWRDSTKPNPNTIVGAMVGGPDRFDNFHDDRS--ASAGPTL 1420
            HR ASIP+D  KYSC GG+KWRD+   NPNTI GAMVGGPDRFD F D R+  +   PTL
Sbjct: 519  HRGASIPNDHKKYSCTGGWKWRDTRNNNPNTINGAMVGGPDRFDKFKDVRTNYSYTEPTL 578

Query: 1421 AGNAGLVAALVSLTSGGGYGVDKSFLFSNLPP 1446
            AGNAGLVAALVSLT  GG  +DK+ +FS +PP
Sbjct: 579  AGNAGLVAALVSLTESGGRAIDKNSIFSAVPP 608

BLAST of Spo11971.1 vs. UniProtKB/TrEMBL
Match: A0A0J8CRT0_BETVU (Endoglucanase OS=Beta vulgaris subsp. vulgaris GN=BVRB_4g074510 PE=3 SV=1)

HSP 1 Score: 896.3 bits (2315), Expect = 4.700e-257
Identity = 423/513 (82.46%), Positives = 463/513 (90.25%), Query Frame = 1

		  

Query: 936  STNSARTLPTHNFDNYTLALRKALLFFNAQKSGKLPKNNGIPWRGNSGLRDGSQLKDVQG 995
            STNSA+ +PTHNFD+Y+ AL KALLFFNAQKSGKLPKNN IPWRGNSGL DGSQLKDV+G
Sbjct: 30   STNSAQNIPTHNFDHYSTALHKALLFFNAQKSGKLPKNNEIPWRGNSGLLDGSQLKDVKG 89

Query: 996  DGLVGGYYDSGENTKFHFPLAYSMTMLSWSLIEYPHKYRAINEYNHVLELIKWGTDYLLL 1055
             GLVGGYYDSGENTKFHFPLA+SMTMLSWS+IEYPHKYRAINEYNH  ELIKWGTDYLLL
Sbjct: 90   -GLVGGYYDSGENTKFHFPLAFSMTMLSWSMIEYPHKYRAINEYNHTRELIKWGTDYLLL 149

Query: 1056 TFDSNATKISKIYSQVGGAYEKSNIPDDITCWERPEDMDYPRPVQTTYAGPELAAEMTAA 1115
            TFDSNA+KI+KIYSQVGGAYE SN+PDDI+CW+RPEDMDYPRPVQT YAGPELA EM AA
Sbjct: 150  TFDSNASKINKIYSQVGGAYENSNLPDDISCWQRPEDMDYPRPVQTAYAGPELAGEMAAA 209

Query: 1116 LSSASILFKQNNPTYSIKLIKASESLFAFARDTRKRRPYSRGNPWVDPYYNSTGYFDEYL 1175
            L+SASI++  +NP YS KLIK +E+LF FARD  KRRPYSRGNPW++PYYNSTGYFDEYL
Sbjct: 210  LASASIVY-NDNPAYSKKLIKGAEALFTFARDPSKRRPYSRGNPWIEPYYNSTGYFDEYL 269

Query: 1176 WGSTWLYLATGNVKYLALATNPGITKNAMGFRWTRKMSVLSWDNKVPSALMLLTRVRLFK 1235
            W +TWLY ATGN+KYL LATNPGITKNAM  +WT+K S+LSWDNKVPS+LMLLTR+R+F 
Sbjct: 270  WSATWLYFATGNLKYLKLATNPGITKNAMTSKWTKKTSILSWDNKVPSSLMLLTRIRMFH 329

Query: 1236 APEYPYEETLIGHHKFIALSMCSYLKPFRLFKWSKGGLIQFNDGGEESLQYVANAAFLAN 1295
            AP YPYE+ L  HHK IALSMCSYLKPF LF WSKGGLIQ NDGG +SLQYVANAAFLAN
Sbjct: 330  APGYPYEQALAEHHKLIALSMCSYLKPFHLFSWSKGGLIQLNDGGPQSLQYVANAAFLAN 389

Query: 1296 LFADYLNATDSPGWLCGPNFFPIATLREFASSQIDYILGKNPLDLSYVVGYGDKYPNRVH 1355
            LFADYLNA+D+ GW CGPNFF IATLR FASSQI+YILGKNPL LSYVVGYG+K+PN VH
Sbjct: 390  LFADYLNASDTSGWFCGPNFFSIATLRNFASSQIEYILGKNPLKLSYVVGYGEKFPNHVH 449

Query: 1356 HRAASIPSDGLKYSCKGGYKWRDSTKPNPNTIVGAMVGGPDRFDNFHDDRS--ASAGPTL 1415
            HRAASIPSDGLKYSCKGGYKW+DSTKPNPNTI GAMVGGPD+FD+FHDDRS   S+GPTL
Sbjct: 450  HRAASIPSDGLKYSCKGGYKWKDSTKPNPNTIEGAMVGGPDQFDHFHDDRSDRGSSGPTL 509

Query: 1416 AGNAGLVAALVSLTSGGGYGVDKSFLFSNLPPP 1447
            A NAGLVAALVSLT  GGYGVDK+FLFSNL PP
Sbjct: 510  AANAGLVAALVSLTGSGGYGVDKNFLFSNLSPP 540

BLAST of Spo11971.1 vs. UniProtKB/TrEMBL
Match: A0A0J8CMD5_BETVU (Endoglucanase OS=Beta vulgaris subsp. vulgaris GN=BVRB_4g074500 PE=3 SV=1)

HSP 1 Score: 845.9 bits (2184), Expect = 7.300e-242
Identity = 402/456 (88.16%), Positives = 425/456 (93.20%), Query Frame = 1

		  

Query: 479 GENTKYHFPMAFAMTMLSWSVIEYPHKYEAIGEYDHVRELIKWGTDYLLKTFNSSATQID 538
           GENTK+HFPMA+AMTMLSWSVIEYPHKYEAI EYDHVRELIKWGTDYLL TFNSSA  ID
Sbjct: 106 GENTKFHFPMAYAMTMLSWSVIEYPHKYEAINEYDHVRELIKWGTDYLLLTFNSSANLID 165

Query: 539 KIHSQVGGAQNGSTIPSDVTCWERPEDMDYERPVQTSFAGADLGGEMAAAFAAASIVFRD 598
            +HSQVGGAQNGST+PSD+TCWERPE MDYERPVQTSFAGADLGGEMAAA AAASIVFRD
Sbjct: 166 HVHSQVGGAQNGSTVPSDMTCWERPEKMDYERPVQTSFAGADLGGEMAAALAAASIVFRD 225

Query: 599 NQLYSKKLVRGAETVFRFARDTGKRAPYSRGNPWAAPYYNSTGYYDEYMWGATWLYYATG 658
           N+ Y+KKL++GAETVFRFARD GKRAPYSRGN + +PYYNSTGYYDEYMWGATWLYYATG
Sbjct: 226 NRAYAKKLIKGAETVFRFARDEGKRAPYSRGNLYISPYYNSTGYYDEYMWGATWLYYATG 285

Query: 659 NSTYGLLATNPGIPKNAKAFRMNTNTSYLSWDNKLPAAMLLLTRFRMFLNPGYPYEQTLA 718
           NSTYG LATNPGIPKNA+AFRMNTNTSYLSWDNKLPA+MLLLTRFRMFLNPGYPYEQTL 
Sbjct: 286 NSTYGSLATNPGIPKNARAFRMNTNTSYLSWDNKLPASMLLLTRFRMFLNPGYPYEQTLM 345

Query: 719 QYHNVTKLNMCSYLKQFPVYNRTRGGLIELNSGGPKPLQYVVNAAFLANLFADYMESTGV 778
           QYHNVT+LNMCSY+KQF VYN T+GGLIELN+GGPKPLQYV NAAFLANLFADYM+STGV
Sbjct: 346 QYHNVTQLNMCSYMKQFHVYNWTQGGLIELNNGGPKPLQYVANAAFLANLFADYMDSTGV 405

Query: 779 PGWYCGPNFMRQSDLRNFAQTQVEYILGKNPMHMSYVVGFGNKYPKHVHHRAASIPRKNG 838
           PGWYCGP F+RQSDLR FA +QV+YILGKNP+HMSYVVGFGNKYPKHVHHRAASIP KNG
Sbjct: 406 PGWYCGPYFLRQSDLRRFATSQVDYILGKNPLHMSYVVGFGNKYPKHVHHRAASIPHKNG 465

Query: 839 GK--CKGGYKWRDSKKPNPYTIIGAMVGGPDRFDNFKDNRKLDRYTEPTLAGNAGLVAAL 898
            K  CKGGYKWRDSKKPNP TI+GAMVGGPDRFD FKDNRKLDRYTEPTLAGNAGLVAAL
Sbjct: 466 VKYSCKGGYKWRDSKKPNPNTIVGAMVGGPDRFDRFKDNRKLDRYTEPTLAGNAGLVAAL 525

Query: 899 ISLTATPGVSGVDRNLIFSGVPPFYTPAPPPPAPPR 933
           ISLT T G SGVDRN IFSGVPP YTPAPPPPAP R
Sbjct: 526 ISLTTTAG-SGVDRNYIFSGVPPLYTPAPPPPAPWR 560

BLAST of Spo11971.1 vs. UniProtKB/TrEMBL
Match: A0A0K9R9R4_SPIOL (Endoglucanase OS=Spinacia oleracea GN=SOVF_090810 PE=3 SV=1)

HSP 1 Score: 742.3 bits (1915), Expect = 1.100e-210
Identity = 356/357 (99.72%), Positives = 356/357 (99.72%), Query Frame = 1

		  

Query: 1093 MDYPRPVQTTYAGPELAAEMTAALSSASILFKQNNPTYSIKLIKASESLFAFARDTRKRR 1152
            MDYPRPVQTTYAGPELAAEMTAALSSASILFKQNNPTYS KLIKASESLFAFARDTRKRR
Sbjct: 1    MDYPRPVQTTYAGPELAAEMTAALSSASILFKQNNPTYSKKLIKASESLFAFARDTRKRR 60

Query: 1153 PYSRGNPWVDPYYNSTGYFDEYLWGSTWLYLATGNVKYLALATNPGITKNAMGFRWTRKM 1212
            PYSRGNPWVDPYYNSTGYFDEYLWGSTWLYLATGNVKYLALATNPGITKNAMGFRWTRKM
Sbjct: 61   PYSRGNPWVDPYYNSTGYFDEYLWGSTWLYLATGNVKYLALATNPGITKNAMGFRWTRKM 120

Query: 1213 SVLSWDNKVPSALMLLTRVRLFKAPEYPYEETLIGHHKFIALSMCSYLKPFRLFKWSKGG 1272
            SVLSWDNKVPSALMLLTRVRLFKAPEYPYEETLIGHHKFIALSMCSYLKPFRLFKWSKGG
Sbjct: 121  SVLSWDNKVPSALMLLTRVRLFKAPEYPYEETLIGHHKFIALSMCSYLKPFRLFKWSKGG 180

Query: 1273 LIQFNDGGEESLQYVANAAFLANLFADYLNATDSPGWLCGPNFFPIATLREFASSQIDYI 1332
            LIQFNDGGEESLQYVANAAFLANLFADYLNATDSPGWLCGPNFFPIATLREFASSQIDYI
Sbjct: 181  LIQFNDGGEESLQYVANAAFLANLFADYLNATDSPGWLCGPNFFPIATLREFASSQIDYI 240

Query: 1333 LGKNPLDLSYVVGYGDKYPNRVHHRAASIPSDGLKYSCKGGYKWRDSTKPNPNTIVGAMV 1392
            LGKNPLDLSYVVGYGDKYPNRVHHRAASIPSDGLKYSCKGGYKWRDSTKPNPNTIVGAMV
Sbjct: 241  LGKNPLDLSYVVGYGDKYPNRVHHRAASIPSDGLKYSCKGGYKWRDSTKPNPNTIVGAMV 300

Query: 1393 GGPDRFDNFHDDRSASAGPTLAGNAGLVAALVSLTSGGGYGVDKSFLFSNLPPPPIP 1450
            GGPDRFDNFHDDRSASAGPTLAGNAGLVAALVSLTSGGGYGVDKSFLFSNLPPPPIP
Sbjct: 301  GGPDRFDNFHDDRSASAGPTLAGNAGLVAALVSLTSGGGYGVDKSFLFSNLPPPPIP 357

BLAST of Spo11971.1 vs. UniProtKB/TrEMBL
Match: A0A059DC05_EUCGR (Endoglucanase OS=Eucalyptus grandis GN=EUGRSUZ_A00393 PE=3 SV=1)

HSP 1 Score: 710.7 bits (1833), Expect = 3.700e-201
Identity = 336/512 (65.62%), Positives = 406/512 (79.30%), Query Frame = 1

		  

Query: 941  RTLPTHNF-----DNYTLALRKALLFFNAQKSGKLPKNNGIPWRGNSGLRDGSQLKDVQG 1000
            +T+P H       DNYT+AL KALLFFNAQKSGKLPKNN IPWRGNSGL DG+   DV+G
Sbjct: 99   KTVPRHKARPPPPDNYTVALHKALLFFNAQKSGKLPKNNPIPWRGNSGLTDGNGTTDVKG 158

Query: 1001 DGLVGGYYDSGENTKFHFPLAYSMTMLSWSLIEYPHKYRAINEYNHVLELIKWGTDYLLL 1060
             GLVGGYYD+G+NTKFHFP+A++M+MLSWS+IEY HKY A+ EYNH  +LIKWGTDYLLL
Sbjct: 159  -GLVGGYYDAGDNTKFHFPMAFAMSMLSWSVIEYSHKYEAVGEYNHARDLIKWGTDYLLL 218

Query: 1061 TFDSNATKISKIYSQVGGAYEKSNIPDDITCWERPEDMDYPRPVQTTYAGPELAAEMTAA 1120
            TF+S+A+KI KIYSQVGG+   S IPDD  CW RPEDMDYPRPVQT  +GP+LA EM AA
Sbjct: 219  TFNSSASKIDKIYSQVGGSQNGSKIPDDHNCWTRPEDMDYPRPVQTANSGPDLAGEMAAA 278

Query: 1121 LSSASILFKQNNPTYSIKLIKASESLFAFARDTRKRRPYSRGNPWVDPYYNSTGYFDEYL 1180
            L+SASI+F+ ++  YS KL+K + ++FAFARD+ +R PYSRGNP++DPYYNSTGYFDEY+
Sbjct: 279  LASASIVFR-DDAAYSKKLVKGAATVFAFARDSGRRTPYSRGNPYIDPYYNSTGYFDEYM 338

Query: 1181 WGSTWLYLATGNVKYLALATNPGITKNAMGFRWTRKMSVLSWDNKVPSALMLLTRVRLFK 1240
            WG+ WLY ATGN  Y++LATNPGI KN+  F   R +SVLSWDNK+P+A++LLTR+R+F 
Sbjct: 339  WGAAWLYYATGNKSYISLATNPGIPKNSKAFYMVRDLSVLSWDNKLPAAMLLLTRLRVFL 398

Query: 1241 APEYPYEETLIGHHKFIALSMCSYLKPFRLFKWSKGGLIQFNDGGEESLQYVANAAFLAN 1300
             P YPYE+ L  +     L+MCSYL  F +F W+KGG+I+ N G  + LQY+ANAAFLA+
Sbjct: 399  NPGYPYEDMLSMYQNVTGLTMCSYLHQFNVFNWTKGGMIELNHGNPQPLQYMANAAFLAS 458

Query: 1301 LFADYLNATDSPGWLCGPNFFPIATLREFASSQIDYILGKNPLDLSYVVGYGDKYPNRVH 1360
            L+ DY+NAT  PGW CGPNF    TLR FA+SQIDYILGKNP+ +SYVVG+G KYP  VH
Sbjct: 459  LYVDYMNATGVPGWNCGPNFITSDTLRSFATSQIDYILGKNPMKMSYVVGFGAKYPKHVH 518

Query: 1361 HRAASIPSDGLKYSCKGGYKWRDSTKPNPNTIVGAMVGGPDRFDNFHDDRS--ASAGPTL 1420
            HR ASIP+D  KYSC GG+KWRD+   NPNTI GAMVGGPDRFD F D R+  +   PTL
Sbjct: 519  HRGASIPNDHKKYSCTGGWKWRDTRNNNPNTINGAMVGGPDRFDKFKDVRTNYSYTEPTL 578

Query: 1421 AGNAGLVAALVSLTSGGGYGVDKSFLFSNLPP 1446
            AGNAGLVAALVSLT  GG  +DK+ +FS +PP
Sbjct: 579  AGNAGLVAALVSLTESGGRAIDKNSIFSAVPP 608

BLAST of Spo11971.1 vs. UniProtKB/TrEMBL
Match: V4UHI7_9ROSI (Endoglucanase OS=Citrus clementina GN=CICLE_v10014583mg PE=3 SV=1)

HSP 1 Score: 705.7 bits (1820), Expect = 1.200e-199
Identity = 334/522 (63.98%), Positives = 415/522 (79.50%), Query Frame = 1

		  

Query: 940  ARTLPTHNF-----DNYTLALRKALLFFNAQKSGKLPKNNGIPWRGNSGLRDGSQLKDVQ 999
            A++LP H       D+YT AL KALLFFNAQKSGKLPKNNGIPWRGNSGL DG+   DV+
Sbjct: 113  AKSLPKHKSAPPSPDDYTHALHKALLFFNAQKSGKLPKNNGIPWRGNSGLSDGNSTTDVK 172

Query: 1000 GDGLVGGYYDSGENTKFHFPLAYSMTMLSWSLIEYPHKYRAINEYNHVLELIKWGTDYLL 1059
            G GLVGGYYD+G+NTKFHFP++++MTMLSWSLIEY HKY++I EY+H+ +LIKWGTDYLL
Sbjct: 173  G-GLVGGYYDAGDNTKFHFPMSFAMTMLSWSLIEYNHKYQSIGEYDHMRDLIKWGTDYLL 232

Query: 1060 LTFDSNATKISKIYSQVGGAYEKSNIPDDITCWERPEDMDYPRPVQTTYAGPELAAEMTA 1119
            LTF+S+ATKI KIY+QVGG+   ++ P+D  CW+RPEDMDYPRPVQT  AGP+LA EM A
Sbjct: 233  LTFNSSATKIDKIYAQVGGSQNGTSEPNDHNCWQRPEDMDYPRPVQTINAGPDLAGEMAA 292

Query: 1120 ALSSASILFKQNNPTYSIKLIKASESLFAFARDTRKRRPYSRGNPWVDPYYNSTGYFDEY 1179
            AL++ASI+FK N   YS KL+K ++++F FAR+  KRRPY RGNP+++PYYNS+GYFDEY
Sbjct: 293  ALAAASIVFKDNT-AYSRKLVKGAKTVFDFAREGGKRRPYCRGNPFIEPYYNSSGYFDEY 352

Query: 1180 LWGSTWLYLATGNVKYLALATNPGITKNAMGFRWTRKMSVLSWDNKVPSALMLLTRVRLF 1239
            +WG+ WLY ATGNV YL+LATN G+ KN+  F    + SVLSWDNK+P+A++LLTR R+F
Sbjct: 353  MWGAAWLYYATGNVSYLSLATNSGLPKNSKAFYRIPEKSVLSWDNKLPAAMLLLTRFRIF 412

Query: 1240 KAPEYPYEETLIGHHKFIALSMCSYLKPFRLFKWSKGGLIQFNDGGEESLQYVANAAFLA 1299
             +P YPYE+ L  +H   +L+MCSYL+ F +F W++GG++Q N G  + LQYVANAAFLA
Sbjct: 413  LSPGYPYEDMLRMYHNTTSLTMCSYLEQFHVFNWTRGGMVQLNQGKPQPLQYVANAAFLA 472

Query: 1300 NLFADYLNATDSPGWLCGPNFFPIATLREFASSQIDYILGKNPLDLSYVVGYGDKYPNRV 1359
            +L+ DYLNA+ +PG+ CGPNF  +A LR F++SQIDYILGKNP  +SYVVGYG K+P  V
Sbjct: 473  SLYVDYLNASGAPGFTCGPNFITLAKLRSFSTSQIDYILGKNPTKMSYVVGYGKKFPRHV 532

Query: 1360 HHRAASIPSDGLKYSCKGGYKWRDSTKPNPNTIVGAMVGGPDRFDNFHDDRSAS--AGPT 1419
            HHR ASIPSDG KYSCKGG+KW ++ KPNP+ I GAMV GPDRFD FHD R+      PT
Sbjct: 533  HHRGASIPSDGKKYSCKGGWKWSNNPKPNPHNITGAMVAGPDRFDKFHDVRTNHNYTEPT 592

Query: 1420 LAGNAGLVAALVSLTSGGGYGVDKSFLFSNLPP-----PPIP 1450
            LAGNAGLVAAL SLT+  G G+DK+ +FS +PP     PP P
Sbjct: 593  LAGNAGLVAALASLTTSAGIGIDKNTMFSAIPPLYPQSPPPP 632

BLAST of Spo11971.1 vs. ExPASy Swiss-Prot
Match: GUN12_ORYSJ (Endoglucanase 12 OS=Oryza sativa subsp. japonica GN=GLU3 PE=2 SV=2)

HSP 1 Score: 618.6 bits (1594), Expect = 1.700e-175
Identity = 304/512 (59.38%), Positives = 375/512 (73.24%), Query Frame = 1

		  

Query: 949  DNYTLALRKALLFFNAQKSGKLPKNNGIPWRGNSGLRDGSQLKDVQGDGLVGGYYDSGEN 1008
            D YT AL KALLFFNAQKSG+LPKNNGI WRGNSGL DGS L DV+G GLVGGYYD+G+N
Sbjct: 109  DQYTDALHKALLFFNAQKSGRLPKNNGIKWRGNSGLSDGSDLTDVKG-GLVGGYYDAGDN 168

Query: 1009 TKFHFPLAYSMTMLSWSLIEYPHKYRAINEYNHVLELIKWGTDYLLLTFDSNATKISKIY 1068
             KFHFPLA+SMTMLSWS+IEY  KY+A+ EY+HV ELIKWGTDYLLLTF+S+A+ I K+Y
Sbjct: 169  IKFHFPLAFSMTMLSWSVIEYSAKYKAVGEYDHVRELIKWGTDYLLLTFNSSASTIDKVY 228

Query: 1069 SQVGGAYEKSNIPDDITCWERPEDMDYPRPVQTTYAGPELAAEMTAALSSASILFKQNNP 1128
            SQVG A      PDD  CW RPEDM YPRPVQT  + P+L  EM AAL++ASI+F+ +N 
Sbjct: 229  SQVGIAKINGTQPDDHYCWNRPEDMAYPRPVQTAGSAPDLGGEMAAALAAASIVFR-DNA 288

Query: 1129 TYSIKLIKASESLFAFARDTRKRRPYSRGNPWVDPYYNSTGYFDEYLWGSTWLYLATGNV 1188
             YS KL+  + +++ FAR + +R PYSRGN +++ YYNST Y+DEY+W + W+Y ATGN 
Sbjct: 289  AYSKKLVNGAAAVYKFARSSGRRTPYSRGNQYIEYYYNSTSYWDEYMWSAAWMYYATGNN 348

Query: 1189 KYLALATNPGITKNAMGFRWTRKMSVLSWDNKVPSALMLLTRVRLFKAPEYPYEETLIGH 1248
             Y+  AT+P + KNA  F      SV SWDNK+P A +LL+R+R+F  P YPYEE+LIG+
Sbjct: 349  TYITFATDPRLPKNAKAFYSILDFSVFSWDNKLPGAELLLSRLRMFLNPGYPYEESLIGY 408

Query: 1249 HKFIALSMCSYLKPFRLFKWSKGGLIQFNDGGEESLQYVANAAFLANLFADYLNATDSPG 1308
            H   +++MC+Y   F  F ++KGGL QFN G  + LQY    +FLA L+ADY+ + + PG
Sbjct: 409  HNTTSMNMCTYFPRFGAFNFTKGGLAQFNHGKGQPLQYTVANSFLAALYADYMESVNVPG 468

Query: 1309 WLCGPNFFPIATLREFASSQIDYILGKNPLDLSYVVGYGDKYPNRVHHRAASIPSDGLKY 1368
            W CGP F  +  LR FA SQ++YILG NP  +SYVVGYG KYP R+HHR AS P +G+KY
Sbjct: 469  WYCGPYFMTVDDLRSFARSQVNYILGDNPKKMSYVVGYGKKYPRRLHHRGASTPHNGIKY 528

Query: 1369 SCKGGYKWRDSTKPNPNTIVGAMVGGPDRFDNFHDDR--SASAGPTLAGNAGLVAALVSL 1428
            SC GGYKWRD+   +PN +VGAMVGGPD+ D F D R   A   PTL GNAGLVAALV+L
Sbjct: 529  SCTGGYKWRDTKGADPNVLVGAMVGGPDKNDQFKDARLTYAQNEPTLVGNAGLVAALVAL 588

Query: 1429 T-SGGGYG---VDKSFLFSNLPP-----PPIP 1450
            T SG G G   VDK+ +FS +PP     PP P
Sbjct: 589  TNSGRGAGVTAVDKNTMFSAVPPMFPATPPPP 618

BLAST of Spo11971.1 vs. ExPASy Swiss-Prot
Match: GUN7_ARATH (Endoglucanase 7 OS=Arabidopsis thaliana GN=KOR2 PE=2 SV=1)

HSP 1 Score: 589.7 bits (1519), Expect = 8.500e-167
Identity = 296/566 (52.30%), Positives = 387/566 (68.37%), Query Frame = 1

		  

Query: 891  GLVAALISLTATPGVSGVDRNLIFSGVPPFYTPAPPPPAPPRTATSTNSARTLPTHNFDN 950
            G +A L  + A P        +I     P +  APPPP                    DN
Sbjct: 85   GSIAVLFLVVALP--------IIIVKSLPRHKSAPPPP--------------------DN 144

Query: 951  YTLALRKALLFFNAQKSGKLPKNNGIPWRGNSGLRDGSQLKDVQGDGLVGGYYDSGENTK 1010
            YTLAL KAL FF+AQKSGKLPK N + WRG+SG +DG  L DV G GLVGGYYD G N K
Sbjct: 145  YTLALHKALQFFDAQKSGKLPKKNKVSWRGDSGTKDG--LPDVVG-GLVGGYYDGGSNVK 204

Query: 1011 FHFPLAYSMTMLSWSLIEYPHKYRAINEYNHVLELIKWGTDYLLLTFDSNATKISKIYSQ 1070
            FHFP+A+SMTMLSWSLIEY HKY+AI+EY+H+ +++KWGTDYLLLTF+++AT++  IY+Q
Sbjct: 205  FHFPMAFSMTMLSWSLIEYSHKYKAIDEYDHMRDVLKWGTDYLLLTFNNSATRLDHIYTQ 264

Query: 1071 VGGAYEKSNIPDDITCWERPEDMDYPRPVQTTYAGPELAAEMTAALSSASILFKQNNPTY 1130
            VGG    S  PDDI CW++PEDM Y RPV ++ +  +L AE++AAL++ASI+F  + P Y
Sbjct: 265  VGGGLRDSESPDDIYCWQKPEDMSYDRPVLSSTSAADLGAEVSAALAAASIVFT-DKPDY 324

Query: 1131 SIKLIKASESLFAFARDTRKRRPYSRGNPWVDPYYNSTGYFDEYLWGSTWLYLATGNVKY 1190
            + KL K +E+L+ F R   +R+ YS G P    +YNST  FDE++W   WLY ATGN  Y
Sbjct: 325  AKKLKKGAETLYPFFRSKSRRKRYSDGQPTAQAFYNSTSMFDEFMWAGAWLYYATGNKTY 384

Query: 1191 LALATNPGITKNAMGFRWTRKMSVLSWDNKVPSALMLLTRVRLFKAPEYPYEETLIGHHK 1250
            +  AT P + + A  F    ++ V SW+NK+P A++L+TR RLF  P +PYE  L  +H 
Sbjct: 385  IQFATTPSVPQTAKAFANRPELMVPSWNNKLPGAMLLMTRYRLFLNPGFPYENMLNRYHN 444

Query: 1251 FIALSMCSYLKPFRLFKWSKGGLIQFNDGGEESLQYVANAAFLANLFADYLNATDSPGWL 1310
               ++MC+YLK + +F  + GGL+Q N G    L+YVA+A+FLA+LFADYLN+T  PGW 
Sbjct: 445  ATGITMCAYLKQYNVFNRTSGGLMQLNLGKPRPLEYVAHASFLASLFADYLNSTGVPGWY 504

Query: 1311 CGPNFFPIATLREFASSQIDYILGKNPLDLSYVVGYGDKYPNRVHHRAASIPSDGLKYSC 1370
            CGP F     L++FA SQIDYILG NPL +SYVVG+G K+P RVHHR A+IP+D  + SC
Sbjct: 505  CGPTFVENHVLKDFAQSQIDYILGDNPLKMSYVVGFGKKFPRRVHHRGATIPNDKKRRSC 564

Query: 1371 KGGYKWRDSTKPNPNTIVGAMVGGPDRFDNFHDDRS--ASAGPTLAGNAGLVAALVSLTS 1430
            + G K+RD+  PNPN I GAMVGGP++FD FHD R+   ++ PTL+GNAGLVAALVSLTS
Sbjct: 565  REGLKYRDTKNPNPNNITGAMVGGPNKFDEFHDLRNNYNASEPTLSGNAGLVAALVSLTS 618

Query: 1431 GGGYGVDKSFLFSNLPP-----PPIP 1450
             GG  +DK+ +F+++PP     PP P
Sbjct: 625  SGGQQIDKNTMFNSVPPLYSPTPPPP 618

BLAST of Spo11971.1 vs. ExPASy Swiss-Prot
Match: GUN9_ORYSJ (Endoglucanase 9 OS=Oryza sativa subsp. japonica GN=GLU1 PE=2 SV=1)

HSP 1 Score: 574.7 bits (1480), Expect = 2.800e-162
Identity = 297/545 (54.50%), Positives = 369/545 (67.71%), Query Frame = 1

		  

Query: 913  IFSGVPPFYTPAPPPPAPPRTATSTNSARTLPTHNFDNYTLALRKALLFFNAQKSGKLPK 972
            I   +P  + P PPP                     D++T+ALRKAL+FFNAQKSGKLPK
Sbjct: 95   IAKAIPRHHRPPPPP---------------------DDFTVALRKALMFFNAQKSGKLPK 154

Query: 973  NNGIPWRGNSGLRDGSQLKDVQGDGLVGGYYDSGENTKFHFPLAYSMTMLSWSLIEYPHK 1032
            NN + WRGNS ++DG     V G  LVGGYYD+G+  KF+FP A+SMT+LSWS+IEY  K
Sbjct: 155  NNNVHWRGNSCMKDGLSDPAV-GRSLVGGYYDAGDAVKFNFPAAFSMTLLSWSVIEYSAK 214

Query: 1033 YRAINEYNHVLELIKWGTDYLLLTFDSNATKISKIYSQVGGAYEK--SNIPDDITCWERP 1092
            Y A+ E  H+ + IKWG DY L TF+S A  I ++  QVG       S  P+D  CW RP
Sbjct: 215  YEAVGELGHIRDTIKWGADYFLKTFNSTADTIDRVVMQVGSGATSPGSTQPNDHYCWMRP 274

Query: 1093 EDMDYPRPVQTTYAGPELAAEMTAALSSASILFKQNNPTYSIKLIKASESLFAFARDTRK 1152
            ED+DYPRPV   +A  +LAAEM A+L++ASI+FK N   YS KL+  + +LF FAR  R 
Sbjct: 275  EDIDYPRPVVECHACSDLAAEMAASLAAASIVFKDNK-AYSQKLVHGATTLFKFARQNRG 334

Query: 1153 RRPYSRGNPWVDPYYNSTGYFDEYLWGSTWLYLATGNVKYLALATNPGITKNAMGFRWTR 1212
            R  YS G      +YNST Y+DE++WG +W+YLATGN  YL LAT+P + K+A  +    
Sbjct: 335  R--YSAGGSDAAKFYNSTSYWDEFVWGGSWMYLATGNSSYLQLATHPKLAKHAGAYWGGP 394

Query: 1213 KMSVLSWDNKVPSALMLLTRVRLFKAPEYPYEETLIGHHKFIALSMCSYLKPFRLFKWSK 1272
               V SWDNK+  A +LL+R+RLF +P YPYEE L   H   ++ MCSYL  F+ F  +K
Sbjct: 395  DYGVFSWDNKLTGAQVLLSRLRLFLSPGYPYEEILRTFHNQTSIIMCSYLPIFKSFNRTK 454

Query: 1273 GGLIQFNDGGEESLQYVANAAFLANLFADYLNATDSPGWLCGPNFFPIATLREFASSQID 1332
            GGLIQ N G  + LQYV NAAFLA+L+ DYL A D+PGW CGP+F+PI TLR FA +QI+
Sbjct: 455  GGLIQLNHGRPQPLQYVVNAAFLASLYGDYLEAADTPGWYCGPHFYPIETLRNFARTQIE 514

Query: 1333 YILGKNPLDLSYVVGYGDKYPNRVHHRAASIPSDGLKYSCKGGYKWRDSTKPNPNTIVGA 1392
            YILGKNPL +SYVVGYG++YP RVHHR ASIP +G+ Y CKGG+KWR++ KPNPN IVGA
Sbjct: 515  YILGKNPLKMSYVVGYGNRYPKRVHHRGASIPKNGVHYGCKGGWKWRETKKPNPNIIVGA 574

Query: 1393 MVGGPDRFDNFHDDRS--ASAGPTLAGNAGLVAALVSLTSGGGYGVDKSFLFSNLPP--- 1450
            MV GPDR D F D R        TLAGNAGLVAALV+L SG G+GVDK+ +FS +PP   
Sbjct: 575  MVAGPDRHDGFKDVRKNYNYTEATLAGNAGLVAALVAL-SGEGHGVDKNTMFSAVPPMFP 613

BLAST of Spo11971.1 vs. ExPASy Swiss-Prot
Match: GUN25_ARATH (Endoglucanase 25 OS=Arabidopsis thaliana GN=KOR PE=1 SV=1)

HSP 1 Score: 559.3 bits (1440), Expect = 1.200e-157
Identity = 277/457 (60.61%), Positives = 337/457 (73.74%), Query Frame = 1

		  

Query: 479 GENTKYHFPMAFAMTMLSWSVIEYPHKYEAIGEYDHVRELIKWGTDYLLKTFNSSATQID 538
           G+  K++FPMA+AMTMLSWSVIEY  KYEA GE  HV+ELIKWGTDY LKTFNS+A  ID
Sbjct: 164 GDAIKFNFPMAYAMTMLSWSVIEYSAKYEAAGELTHVKELIKWGTDYFLKTFNSTADSID 223

Query: 539 KIHSQVGGAQ--NGSTIPSDVTCWERPEDMDYERPVQTSFAG-ADLGGEMAAAFAAASIV 598
            + SQVG     +G+T P+D  CW RPEDMDY+RPV T   G +DL  EMAAA A+ASIV
Sbjct: 224 DLVSQVGSGNTDDGNTDPNDHYCWMRPEDMDYKRPVTTCNGGCSDLAAEMAAALASASIV 283

Query: 599 FRDNQLYSKKLVRGAETVFRFARDTGKRAPYSRGNPWAAPYYNSTGYYDEYMWGATWLYY 658
           F+DN+ YSKKLV GA+ V++F R   +R  YS G   ++ +YNS+ Y+DE++WG  W+YY
Sbjct: 284 FKDNKEYSKKLVHGAKVVYQFGRT--RRGRYSAGTAESSKFYNSSMYWDEFIWGGAWMYY 343

Query: 659 ATGNSTYGLLATNPGIPKNAKAFRMNTNTSYLSWDNKLPAAMLLLTRFRMFLNPGYPYEQ 718
           ATGN TY  L T P + K+A AF         SWDNKL  A LLL+R R+FL+PGYPYE+
Sbjct: 344 ATGNVTYLNLITQPTMAKHAGAFWGGPYYGVFSWDNKLAGAQLLLSRLRLFLSPGYPYEE 403

Query: 719 TLAQYHNVTKLNMCSYLKQFPVYNRTRGGLIELNSGGPKPLQYVVNAAFLANLFADYMES 778
            L  +HN T + MCSYL  F  +NRT GGLIELN G P+PLQY VNAAFLA L++DY+++
Sbjct: 404 ILRTFHNQTSIVMCSYLPIFNKFNRTNGGLIELNHGAPQPLQYSVNAAFLATLYSDYLDA 463

Query: 779 TGVPGWYCGPNFMRQSDLRNFAQTQVEYILGKNPMHMSYVVGFGNKYPKHVHHRAASIPR 838
              PGWYCGPNF   S LR+FA++Q++YILGKNP  MSYVVGFG KYP+HVHHR ASIP+
Sbjct: 464 ADTPGWYCGPNFYSTSVLRDFARSQIDYILGKNPRKMSYVVGFGTKYPRHVHHRGASIPK 523

Query: 839 -KNGGKCKGGYKWRDSKKPNPYTIIGAMVGGPDRFDNFKDNRKLDRYTEPTLAGNAGLVA 898
            K    CKGG+KWRDSKKPNP TI GAMV GPD+ D ++D R    YTEPTLAGNAGLVA
Sbjct: 524 NKVKYNCKGGWKWRDSKKPNPNTIEGAMVAGPDKRDGYRDVRMNYNYTEPTLAGNAGLVA 583

Query: 899 ALISLTATPGVSG-VDRNLIFSGVPPFYTPAPPPPAP 931
           AL++L+     +G +D+N IFS VPP +   PPPPAP
Sbjct: 584 ALVALSGEEEATGKIDKNTIFSAVPPLFPTPPPPPAP 618

BLAST of Spo11971.1 vs. ExPASy Swiss-Prot
Match: GUN10_ORYSJ (Endoglucanase 10 OS=Oryza sativa subsp. japonica GN=GLU2 PE=2 SV=1)

HSP 1 Score: 550.8 bits (1418), Expect = 4.400e-155
Identity = 285/535 (53.27%), Positives = 371/535 (69.35%), Query Frame = 1

		  

Query: 926  PPPAPPRTATSTNSARTLPTHNFDNYTLALRKALLFFNAQKSGKLPKNNGIPWRGNSGLR 985
            PPP PP                 D YT AL KAL+FFNAQ+SG LPK+NG+ WRGNS ++
Sbjct: 102  PPPPPP-----------------DQYTQALHKALMFFNAQRSGPLPKHNGVSWRGNSCMK 161

Query: 986  DGSQLKDVQGDGLVGGYYDSGENTKFHFPLAYSMTMLSWSLIEYPHKYRAINEYNHVLEL 1045
            DG     V+   LVGG+YD+G+  KF++P+A+SMTMLSWS+IEY  KY AI E +HV EL
Sbjct: 162  DGLSDSTVR-KSLVGGFYDAGDAIKFNYPMAWSMTMLSWSVIEYKAKYEAIGELDHVKEL 221

Query: 1046 IKWGTDYLLLTFDSNATKISKIYSQVG-GAYEKSNI-PDDITCWERPEDMDYPRPVQTTY 1105
            IKWGTDYLL TF+S+A  I +I +QVG G   K    P+D  CW RPED+DYPRPV   +
Sbjct: 222  IKWGTDYLLKTFNSSADTIDRIVAQVGVGDTSKGGAQPNDHYCWMRPEDIDYPRPVTECH 281

Query: 1106 AGPELAAEMTAALSSASILFKQNNPTYSIKLIKASESLFAFARDTRKRRPYSRGNPWVDP 1165
            +  +LA+EM AAL++ASI+FK ++ TYS KL++ +++L+ F R  R R  YS        
Sbjct: 282  SCSDLASEMAAALAAASIVFK-DSKTYSDKLVRGAKALYKFGRLQRGR--YSPNGSDQAI 341

Query: 1166 YYNSTGYFDEYLWGSTWLYLATGNVKYLALATNPGITKNAMGFRW--TRKMSVLSWDNKV 1225
            +YNST Y+DE++WG  W+Y ATGN  YL++AT PG+ K+A G  W  +    V +WD+K+
Sbjct: 342  FYNSTSYWDEFVWGGAWMYFATGNNTYLSVATAPGMAKHA-GAYWLDSPNYGVFTWDDKL 401

Query: 1226 PSALMLLTRVRLFKAPEYPYEETLIGHHKFIALSMCSYLKPFRLFKWSKGGLIQFNDGGE 1285
            P A +LL+R+RLF +P YPYEE L   H      MCSYL  +  F ++KGG+IQ N G  
Sbjct: 402  PGAQVLLSRLRLFLSPGYPYEEILRTFHNQTDNVMCSYLPMYNSFNFTKGGMIQLNHGRP 461

Query: 1286 ESLQYVANAAFLANLFADYLNATDSPGWLCGPNFFPIATLREFASSQIDYILGKNPLDLS 1345
            + LQYV NAAFLA+L++DYL+A D+PGW CGP F+    LR+FA SQ+DY+LGKNPL +S
Sbjct: 462  QPLQYVVNAAFLASLYSDYLDAADTPGWYCGPTFYTTEVLRKFARSQLDYVLGKNPLKMS 521

Query: 1346 YVVGYGDKYPNRVHHRAASIPSDGLKYSCKGGYKWRDSTKPNPNTIVGAMVGGPDRFDNF 1405
            YVVG+G+KYP R HHR ASIP +G+KY CKGG+KWR++ KPNPN ++GA+V GPDR D F
Sbjct: 522  YVVGFGNKYPKRAHHRGASIPHNGVKYGCKGGFKWRETKKPNPNILIGALVAGPDRHDGF 581

Query: 1406 HDDRS--ASAGPTLAGNAGLVAALVSLTS-GGGYGVDKSFLFSNLPP----PPIP 1450
             D R+      PTLA NAGLVAAL+SLT+     G+DK+ +FS +PP    PP P
Sbjct: 582  KDVRTNYNYTEPTLAANAGLVAALISLTNIHVKSGIDKNTIFSAVPPMFPTPPPP 614

BLAST of Spo11971.1 vs. TAIR (Arabidopsis)
Match: AT1G65610.1 (Six-hairpin glycosidases superfamily protein)

HSP 1 Score: 589.7 bits (1519), Expect = 4.800e-168
Identity = 296/566 (52.30%), Positives = 387/566 (68.37%), Query Frame = 1

		  

Query: 891  GLVAALISLTATPGVSGVDRNLIFSGVPPFYTPAPPPPAPPRTATSTNSARTLPTHNFDN 950
            G +A L  + A P        +I     P +  APPPP                    DN
Sbjct: 85   GSIAVLFLVVALP--------IIIVKSLPRHKSAPPPP--------------------DN 144

Query: 951  YTLALRKALLFFNAQKSGKLPKNNGIPWRGNSGLRDGSQLKDVQGDGLVGGYYDSGENTK 1010
            YTLAL KAL FF+AQKSGKLPK N + WRG+SG +DG  L DV G GLVGGYYD G N K
Sbjct: 145  YTLALHKALQFFDAQKSGKLPKKNKVSWRGDSGTKDG--LPDVVG-GLVGGYYDGGSNVK 204

Query: 1011 FHFPLAYSMTMLSWSLIEYPHKYRAINEYNHVLELIKWGTDYLLLTFDSNATKISKIYSQ 1070
            FHFP+A+SMTMLSWSLIEY HKY+AI+EY+H+ +++KWGTDYLLLTF+++AT++  IY+Q
Sbjct: 205  FHFPMAFSMTMLSWSLIEYSHKYKAIDEYDHMRDVLKWGTDYLLLTFNNSATRLDHIYTQ 264

Query: 1071 VGGAYEKSNIPDDITCWERPEDMDYPRPVQTTYAGPELAAEMTAALSSASILFKQNNPTY 1130
            VGG    S  PDDI CW++PEDM Y RPV ++ +  +L AE++AAL++ASI+F  + P Y
Sbjct: 265  VGGGLRDSESPDDIYCWQKPEDMSYDRPVLSSTSAADLGAEVSAALAAASIVFT-DKPDY 324

Query: 1131 SIKLIKASESLFAFARDTRKRRPYSRGNPWVDPYYNSTGYFDEYLWGSTWLYLATGNVKY 1190
            + KL K +E+L+ F R   +R+ YS G P    +YNST  FDE++W   WLY ATGN  Y
Sbjct: 325  AKKLKKGAETLYPFFRSKSRRKRYSDGQPTAQAFYNSTSMFDEFMWAGAWLYYATGNKTY 384

Query: 1191 LALATNPGITKNAMGFRWTRKMSVLSWDNKVPSALMLLTRVRLFKAPEYPYEETLIGHHK 1250
            +  AT P + + A  F    ++ V SW+NK+P A++L+TR RLF  P +PYE  L  +H 
Sbjct: 385  IQFATTPSVPQTAKAFANRPELMVPSWNNKLPGAMLLMTRYRLFLNPGFPYENMLNRYHN 444

Query: 1251 FIALSMCSYLKPFRLFKWSKGGLIQFNDGGEESLQYVANAAFLANLFADYLNATDSPGWL 1310
               ++MC+YLK + +F  + GGL+Q N G    L+YVA+A+FLA+LFADYLN+T  PGW 
Sbjct: 445  ATGITMCAYLKQYNVFNRTSGGLMQLNLGKPRPLEYVAHASFLASLFADYLNSTGVPGWY 504

Query: 1311 CGPNFFPIATLREFASSQIDYILGKNPLDLSYVVGYGDKYPNRVHHRAASIPSDGLKYSC 1370
            CGP F     L++FA SQIDYILG NPL +SYVVG+G K+P RVHHR A+IP+D  + SC
Sbjct: 505  CGPTFVENHVLKDFAQSQIDYILGDNPLKMSYVVGFGKKFPRRVHHRGATIPNDKKRRSC 564

Query: 1371 KGGYKWRDSTKPNPNTIVGAMVGGPDRFDNFHDDRS--ASAGPTLAGNAGLVAALVSLTS 1430
            + G K+RD+  PNPN I GAMVGGP++FD FHD R+   ++ PTL+GNAGLVAALVSLTS
Sbjct: 565  REGLKYRDTKNPNPNNITGAMVGGPNKFDEFHDLRNNYNASEPTLSGNAGLVAALVSLTS 618

Query: 1431 GGGYGVDKSFLFSNLPP-----PPIP 1450
             GG  +DK+ +F+++PP     PP P
Sbjct: 625  SGGQQIDKNTMFNSVPPLYSPTPPPP 618

BLAST of Spo11971.1 vs. TAIR (Arabidopsis)
Match: AT5G49720.1 (glycosyl hydrolase 9A1)

HSP 1 Score: 559.3 bits (1440), Expect = 6.900e-159
Identity = 277/457 (60.61%), Positives = 337/457 (73.74%), Query Frame = 1

		  

Query: 479 GENTKYHFPMAFAMTMLSWSVIEYPHKYEAIGEYDHVRELIKWGTDYLLKTFNSSATQID 538
           G+  K++FPMA+AMTMLSWSVIEY  KYEA GE  HV+ELIKWGTDY LKTFNS+A  ID
Sbjct: 164 GDAIKFNFPMAYAMTMLSWSVIEYSAKYEAAGELTHVKELIKWGTDYFLKTFNSTADSID 223

Query: 539 KIHSQVGGAQ--NGSTIPSDVTCWERPEDMDYERPVQTSFAG-ADLGGEMAAAFAAASIV 598
            + SQVG     +G+T P+D  CW RPEDMDY+RPV T   G +DL  EMAAA A+ASIV
Sbjct: 224 DLVSQVGSGNTDDGNTDPNDHYCWMRPEDMDYKRPVTTCNGGCSDLAAEMAAALASASIV 283

Query: 599 FRDNQLYSKKLVRGAETVFRFARDTGKRAPYSRGNPWAAPYYNSTGYYDEYMWGATWLYY 658
           F+DN+ YSKKLV GA+ V++F R   +R  YS G   ++ +YNS+ Y+DE++WG  W+YY
Sbjct: 284 FKDNKEYSKKLVHGAKVVYQFGRT--RRGRYSAGTAESSKFYNSSMYWDEFIWGGAWMYY 343

Query: 659 ATGNSTYGLLATNPGIPKNAKAFRMNTNTSYLSWDNKLPAAMLLLTRFRMFLNPGYPYEQ 718
           ATGN TY  L T P + K+A AF         SWDNKL  A LLL+R R+FL+PGYPYE+
Sbjct: 344 ATGNVTYLNLITQPTMAKHAGAFWGGPYYGVFSWDNKLAGAQLLLSRLRLFLSPGYPYEE 403

Query: 719 TLAQYHNVTKLNMCSYLKQFPVYNRTRGGLIELNSGGPKPLQYVVNAAFLANLFADYMES 778
            L  +HN T + MCSYL  F  +NRT GGLIELN G P+PLQY VNAAFLA L++DY+++
Sbjct: 404 ILRTFHNQTSIVMCSYLPIFNKFNRTNGGLIELNHGAPQPLQYSVNAAFLATLYSDYLDA 463

Query: 779 TGVPGWYCGPNFMRQSDLRNFAQTQVEYILGKNPMHMSYVVGFGNKYPKHVHHRAASIPR 838
              PGWYCGPNF   S LR+FA++Q++YILGKNP  MSYVVGFG KYP+HVHHR ASIP+
Sbjct: 464 ADTPGWYCGPNFYSTSVLRDFARSQIDYILGKNPRKMSYVVGFGTKYPRHVHHRGASIPK 523

Query: 839 -KNGGKCKGGYKWRDSKKPNPYTIIGAMVGGPDRFDNFKDNRKLDRYTEPTLAGNAGLVA 898
            K    CKGG+KWRDSKKPNP TI GAMV GPD+ D ++D R    YTEPTLAGNAGLVA
Sbjct: 524 NKVKYNCKGGWKWRDSKKPNPNTIEGAMVAGPDKRDGYRDVRMNYNYTEPTLAGNAGLVA 583

Query: 899 ALISLTATPGVSG-VDRNLIFSGVPPFYTPAPPPPAP 931
           AL++L+     +G +D+N IFS VPP +   PPPPAP
Sbjct: 584 ALVALSGEEEATGKIDKNTIFSAVPPLFPTPPPPPAP 618

BLAST of Spo11971.1 vs. TAIR (Arabidopsis)
Match: AT4G24260.1 (glycosyl hydrolase 9A3)

HSP 1 Score: 530.8 bits (1366), Expect = 2.600e-150
Identity = 260/455 (57.14%), Positives = 324/455 (71.21%), Query Frame = 1

		  

Query: 479 GENTKYHFPMAFAMTMLSWSVIEYPHKYEAIGEYDHVRELIKWGTDYLLKTFNSSATQID 538
           G++ K++FPM++AMTMLSWSVIEY  KY+A GE +HV+ELIKWGTDY LKTFNSSA  I 
Sbjct: 165 GDSIKFNFPMSYAMTMLSWSVIEYSAKYQAAGELEHVKELIKWGTDYFLKTFNSSADNIY 224

Query: 539 KIHSQVGGAQNG--STIPSDVTCWERPEDMDYERPVQTSFAG-ADLGGEMAAAFAAASIV 598
            +  QVG   +G  S + +D  CW RPED+ Y+R V   ++  +DL  EMAAA A+ASIV
Sbjct: 225 VMVEQVGSGVSGRGSELHNDHYCWMRPEDIHYKRTVSQCYSSCSDLAAEMAAALASASIV 284

Query: 599 FRDNQLYSKKLVRGAETVFRFARDTGKRAPYSRGNPWAAPYYNSTGYYDEYMWGATWLYY 658
           F+DN+LYSK LV GA+T++RFA  T  R  YS+    ++ +YNS+ + DE +WG  WLYY
Sbjct: 285 FKDNRLYSKNLVHGAKTLYRFA--TTSRNRYSQNGKESSKFYNSSMFEDELLWGGAWLYY 344

Query: 659 ATGNSTYGLLATNPGIPKNAKAFRMNTNTSYLSWDNKLPAAMLLLTRFRMFLNPGYPYEQ 718
           ATGN TY    T+  + + A AF  +      SWDNKLP A LLLTR R+FL+PGYPYE 
Sbjct: 345 ATGNVTYLERVTSHHMAEKAGAFGNSPYYGVFSWDNKLPGAQLLLTRMRLFLSPGYPYED 404

Query: 719 TLAQYHNVTKLNMCSYLKQFPVYNRTRGGLIELNSGGPKPLQYVVNAAFLANLFADYMES 778
            L+++HN T   MCSYL  +  +NRT GGLI+LN G P+PLQYV NAAFLA LF+DY+E+
Sbjct: 405 MLSEFHNQTGRVMCSYLPYYKKFNRTNGGLIQLNHGAPQPLQYVANAAFLAALFSDYLEA 464

Query: 779 TGVPGWYCGPNFMRQSDLRNFAQTQVEYILGKNPMHMSYVVGFGNKYPKHVHHRAASIPR 838
              PGWYCGPNF     LRNF+++Q++YILGKNP  MSYVVG+G +YPK VHHR ASIP+
Sbjct: 465 ADTPGWYCGPNFYTTEFLRNFSRSQIDYILGKNPRKMSYVVGYGQRYPKQVHHRGASIPK 524

Query: 839 KNGGKCKGGYKWRDSKKPNPYTIIGAMVGGPDRFDNFKDNRKLDRYTEPTLAGNAGLVAA 898
                C GG+KW+ SKK NP  I GAMV GPD+ D F D R    YTEPTLAGNAGLVAA
Sbjct: 525 NMKETCTGGFKWKKSKKNNPNAINGAMVAGPDKHDGFHDIRTNYNYTEPTLAGNAGLVAA 584

Query: 899 LISLTATPGVSGVDRNLIFSGVPPFYTPAPPPPAP 931
           L++L+    V G+D+N +FS VPP     PPPPAP
Sbjct: 585 LVALSGEKAVGGIDKNTMFSAVPPLVMATPPPPAP 617

BLAST of Spo11971.1 vs. TAIR (Arabidopsis)
Match: AT1G75680.1 (glycosyl hydrolase 9B7)

HSP 1 Score: 339.3 bits (869), Expect = 1.100e-92
Identity = 209/485 (43.09%), Positives = 279/485 (57.53%), Query Frame = 1

		  

Query: 951  YTLALRKALLFFNAQKSGKLPKNNGIPWRGNSGLRDGSQLKDVQGDGLVGGYYDSGENTK 1010
            Y  AL+ AL FF+ QKSGKL +NN IPWRG+SGL+DGS+        L  G YD+G++ K
Sbjct: 58   YADALKLALQFFDIQKSGKL-ENNKIPWRGDSGLKDGSE----DNLDLSKGLYDAGDHIK 117

Query: 1011 FHFPLAYSMTMLSWSLIEYPHKYRAINEYNHVLELIKWGTDYLLLTFDSNATKISKIYSQ 1070
            F FP+A++ T+LSWS++EY  +  A+N+ +   + ++W TDYL+    S+    + +Y Q
Sbjct: 118  FGFPMAFTATVLSWSILEYGDQMNAVNQLDPAKDSLRWITDYLIKAHPSD----NVLYIQ 177

Query: 1071 VGGAYEKSNIPDDITCWERPEDMDYPRP---VQTTYAGPELAAEMTAALSSASILFKQNN 1130
            VG          D  CWERPEDM   RP   +     G E+AAE  AA++SAS++FK ++
Sbjct: 178  VGDPKV------DHPCWERPEDMKEKRPLTKIDVDTPGTEVAAETAAAMASASLVFKDSD 237

Query: 1131 PTYSIKLIKASESLFAFARDTRKRRPYSRGNPWVDPYYNSTGYFDEYLWGSTWLYLATGN 1190
            PTYS  L+K ++ LF FA DT KR  YS   P V  +YNSTGY DE LW ++WLY AT +
Sbjct: 238  PTYSATLLKHAKQLFNFA-DT-KRGSYSVNIPEVQKFYNSTGYGDELLWAASWLYHATED 297

Query: 1191 VKYLALATNPGITKNAMGFRWTRKMSVLSWDNKVPSALMLLTRVRLFKAPEYPYEETLIG 1250
              YL   +N G    + G       +  SWDNK+    +LL+R+  FK  +    + L  
Sbjct: 298  KTYLDYVSNHGKEFASFG-----NPTWFSWDNKLAGTQVLLSRLLFFKK-DLSGSKGLGN 357

Query: 1251 HHKFIALSMCSYL--KPFRLFKWSKGGLIQFNDGGEESLQYVANAAFLANLFADYLNATD 1310
            +       MC  L   P      + GGLI  ++    S+Q   ++AFLA+LF+DY+  + 
Sbjct: 358  YRNTAKAVMCGLLPKSPTSTASRTNGGLIWVSEWN--SMQQSVSSAFLASLFSDYMLTSR 417

Query: 1311 SPGWLCGPNFFPIATLREFASSQIDYILGKNPLDLSYVVGYGDKYPNRVHHRAASIPSDG 1370
                 C    F    LR+FA SQ DY+LGKNPL  S+VVGYGDKYP  VHHR ASIP+D 
Sbjct: 418  IHKISCDGKIFKATELRDFAKSQADYMLGKNPLGTSFVVGYGDKYPQFVHHRGASIPADA 477

Query: 1371 LKYSCKGGYKWRDSTKPNPNTIVGAMVGGPDRFDNFHDDRS--ASAGPTLAGNAGLVAAL 1429
                C  G+KW +STKPNPN   GA+VGGP   + F D R       PT   NA LV  L
Sbjct: 478  TT-GCLDGFKWFNSTKPNPNIAYGALVGGPFFNETFTDSRENPMQNEPTTYNNALLVGLL 516

BLAST of Spo11971.1 vs. TAIR (Arabidopsis)
Match: AT1G19940.1 (glycosyl hydrolase 9B5)

HSP 1 Score: 323.6 bits (828), Expect = 6.400e-88
Identity = 199/492 (40.45%), Positives = 283/492 (57.52%), Query Frame = 1

		  

Query: 947  NFDNYTLALRKALLFFNAQKSGKLPKNNGIPWRGNSGLRDGSQLKDVQGDGLVGGYYDSG 1006
            N  NY  AL+ A+ FF+ QKSGKL +NN I WRG+SGL+DGS+       GL    YD+G
Sbjct: 45   NVKNYANALKIAMQFFDIQKSGKL-ENNEISWRGDSGLKDGSEASIDLSKGL----YDAG 104

Query: 1007 ENTKFHFPLAYSMTMLSWSLIEYPHKYRAINEYNHVLELIKWGTDYLLLTFDSNATKISK 1066
            ++ KF FP+A++ T+LSWS++EY  +  ++N  +H  + +KW TD+L+    S     + 
Sbjct: 105  DHMKFGFPMAFTATVLSWSILEYGDQMASLNLLDHAKDSLKWTTDFLINAHPSP----NV 164

Query: 1067 IYSQVGGAYEKSNIPDDITCWERPEDMDYPRP---VQTTYAGPELAAEMTAALSSASILF 1126
            +Y QVG          D  CW+RPE M   R    + T   G E+AAE  AA+++AS++F
Sbjct: 165  LYIQVGDPVT------DHKCWDRPETMTRKRTLTKIDTKTPGTEVAAETAAAMAAASLVF 224

Query: 1127 KQNNPTYSIKLIKASESLFAFARDTRKRRPYSRGNPWVDPYYNSTGYFDEYLWGSTWLYL 1186
            K+++  YS  L+K ++ LF FA + R    YS   P V  YYNSTGY DE LW ++WLY 
Sbjct: 225  KESDTKYSSTLLKHAKQLFDFADNNRGS--YSVNIPEVQSYYNSTGYGDELLWAASWLYH 284

Query: 1187 ATGNVKYLALATNPGITKNAMGFRWTRKMSVLSWDNKVPSALMLLTRVRLFKAPEYPYEE 1246
            AT +  YL       +++N   F      S  SWDNK+P   +LL+R+  FK       +
Sbjct: 285  ATEDQTYLDF-----VSENGEEFGNFGSPSWFSWDNKLPGTHILLSRLTFFK-KGLSGSK 344

Query: 1247 TLIGHHKFIALSMCSYL--KPFRLFKWSKGGLIQFNDGGEESLQYVANAAFLANLFADYL 1306
             L G  +     MC  +   P      + GGLI  ++    +LQ+  ++AFLA L++DY+
Sbjct: 345  GLQGFKETAEAVMCGLIPSSPTATSSRTDGGLIWVSEW--NALQHPVSSAFLATLYSDYM 404

Query: 1307 NATDSPGWLCGPNFFPIATLREFASSQIDYILGKNPLDLSYVVGYGDKYPNRVHHRAASI 1366
              +      C    F  + LR+FA SQ DY+LGKNP  +SY+VGYG+KYP  VHHR ASI
Sbjct: 405  LTSGVKELSCSDQSFKPSDLRKFARSQADYMLGKNPEKMSYLVGYGEKYPEFVHHRGASI 464

Query: 1367 PSDGLKYSCKGGYKWRDSTKPNPNTIVGAMVGGPDRFDNFHDDRSASA--GPTLAGNA-- 1426
            P+D     CK G+KW +S +PNPN   GA+VGGP   D F D R+ S    P+   +A  
Sbjct: 465  PADATT-GCKDGFKWLNSDEPNPNVAYGALVGGPFLNDTFIDARNNSMQNEPSTYNSALV 510

Query: 1427 -GLVAALVSLTS 1429
             GL+++LV+ +S
Sbjct: 525  VGLLSSLVTTSS 510
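The percentages in each HSP summary line follow directly from the residue counts printed beside them. As a minimal sketch (the function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tooling), the snippet below parses one such line and recomputes percent identity and percent positives from the counts:

```python
import re

# Pattern for the identity/positives part of an HSP summary line.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling
# that some report generators emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identical": ident,
        "positive": pos,
        "aligned_length": ident_len,
        # Percentages are recomputed from the counts, not copied from the text.
        "pct_identity": round(100 * ident / ident_len, 2),
        "pct_positive": round(100 * pos / pos_len, 2),
    }


# Summary line of the AT5G49720.1 hit reported above.
stats = parse_hsp_stats(
    "Identity = 277/457 (60.61%), Positives = 337/457 (73.74%), Query Frame = 1"
)
print(stats["pct_identity"], stats["pct_positive"])
```

Recomputing from the counts rather than trusting the printed percentage gives a quick consistency check when the values are copied into other tables.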

The following BLAST results are available for this feature:
BLAST of Spo11971.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match Name                          E-value   Identity (%)  Description
gi|731326101|ref|XP_010673848.1|    6.8e-257  82.4          PREDICTED: endoglucanase 12-li...
gi|870863518|gb|KMT14682.1|         1.1e-241  88.1          hypothetical protein BVRB_4g07...
gi|731326097|ref|XP_010673846.1|    1.1e-241  88.1          PREDICTED: endoglucanase 12-li...
gi|902209662|gb|KNA16253.1|         1.6e-210  99.7          hypothetical protein SOVF_0908...
gi|702252418|ref|XP_010065082.1|    5.3e-201  65.6          PREDICTED: endoglucanase 12 [E...
BLAST of Spo11971.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniprotKB/TrEMBL)
Total hits: 5
Match Name        E-value   Identity (%)  Description
A0A0J8CRT0_BETVU  4.7e-257  82.4          Endoglucanase OS=Beta vulgaris...
A0A0J8CMD5_BETVU  7.3e-242  88.1          Endoglucanase OS=Beta vulgaris...
A0A0K9R9R4_SPIOL  1.1e-210  99.7          Endoglucanase OS=Spinacia oler...
A0A059DC05_EUCGR  3.7e-201  65.6          Endoglucanase OS=Eucalyptus gr...
V4UHI7_9ROSI      1.2e-199  63.9          Endoglucanase OS=Citrus clemen...
BLAST of Spo11971.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy SwissProt)
Total hits: 5
Match Name   E-value   Identity (%)  Description
GUN12_ORYSJ  1.7e-175  59.3          Endoglucanase 12 OS=Oryza sati...
GUN7_ARATH   8.5e-167  52.3          Endoglucanase 7 OS=Arabidopsis...
GUN9_ORYSJ   2.8e-162  54.5          Endoglucanase 9 OS=Oryza sativ...
GUN25_ARATH  1.2e-157  60.6          Endoglucanase 25 OS=Arabidopsi...
GUN10_ORYSJ  4.4e-155  53.2          Endoglucanase 10 OS=Oryza sati...
BLAST of Spo11971.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 5
Match Name   E-value   Identity (%)  Description
AT1G65610.1  4.8e-168  52.3          Six-hairpin glycosidases super...
AT5G49720.1  6.9e-159  60.6          glycosyl hydrolase 9A1
AT4G24260.1  2.6e-150  57.1          glycosyl hydrolase 9A3
AT1G75680.1  1.1e-92   43.0          glycosyl hydrolase 9B7
AT1G19940.1  6.4e-88   40.4          glycosyl hydrolase 9B5
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR001701  Glycoside hydrolase family 9  PFAM  PF00759  Glyco_hydro_9
    coord: 951..1423, score: 5.9E-106
    coord: 138..461, score: 2.9E-68
    coord: 479..896, score: 2.2
IPR008928  Six-hairpin glycosidase-like  unknown  SSF48208  Six-hairpin glycosidases
    coord: 949..1432, score: 6.38E-111
    coord: 135..258, score: 5.95E-72
    coord: 292..477, score: 5.95E-72
    coord: 479..902, score: 4.68E
IPR012341  Six-hairpin glycosidase  GENE3D  1.50.10.10
    coord: 479..899, score: 6.4E-115
    coord: 136..250, score: 5.5E-84
    coord: 949..1427, score: 5.6E-129
    coord: 283..474, score: 5.5
IPR018221  Glycoside hydrolase family 9, His active site  PROSITE  PS00592  GLYCOSYL_HYDROL_F9_1
    coord: 813..829, score: -
    coord: 1341..1357, scor
None  No IPR available  PANTHER  PTHR22298  ENDO-1,4-BETA-GLUCANASE
    coord: 292..300, score: 1.3E-268
    coord: 565..933, score: 1.3E-268
    coord: 86..258, score: 1.3E
None  No IPR available  PANTHER  PTHR22298:SF36  ENDOGLUCANASE 21-RELATED
    coord: 565..933, score: 1.3E-268
    coord: 292..300, score: 1.3E-268
    coord: 86..258, score: 1.3E
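Each InterPro alignment entry gives a `coord: start..end` range on the translated sequence. The sketch below, which assumes the coordinates are 1-based and inclusive (an assumption, since the page does not state the convention), extracts those ranges and reports the length of each match. The helper name `match_spans` is hypothetical.

```python
import re

# Matches entries such as "coord: 951..1423".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def match_spans(text):
    """Return (start, end, length) for each 'coord: a..b' entry in text.

    Lengths assume 1-based inclusive coordinates (an assumption).
    """
    spans = []
    for start, end in COORD.findall(text):
        start, end = int(start), int(end)
        spans.append((start, end, end - start + 1))
    return spans


# The three Pfam PF00759 (Glyco_hydro_9) matches listed above.
pfam = "coord: 951..1423 score: 5.9E-106 coord: 138..461 score: 2.9E-68 coord: 479..896"
for start, end, length in sorted(match_spans(pfam)):
    print(f"{start}..{end}: {length} residues")
```

Sorting the spans by start position makes it easier to see that the Pfam matches occupy three separate regions of the translated sequence.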

GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0005975      carbohydrate metabolic process
biological_process  GO:0030245      cellulose catabolic process
biological_process  GO:0005982      starch metabolic process
biological_process  GO:0005985      sucrose metabolic process
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003824      catalytic activity
molecular_function  GO:0004553      hydrolase activity, hydrolyzing O-glycosyl compounds
molecular_function  GO:0008810      cellulase activity