Spo28038 (gene)

Overview
Name: Spo28038
Type: gene
Organism: Spinacia oleracea (Spinach)
Description: GATA transcription factor 9
Location: chr3 : 85133551 .. 85156124 (-)
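The location field packs the chromosome, start, end, and strand into one string. A minimal sketch of how it could be unpacked in Python is shown here; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'chr3 : 85133551 .. 85156124 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The record itself does not confirm this convention.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "span": end - start + 1,
    }

location = parse_location("chr3 : 85133551 .. 85156124 (-)")
print(location["seqid"], location["strand"], location["span"])
```

The "(-)" strand means the gene is annotated on the reverse strand of chr3.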
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCGCTTCTGGAAAAAAGGTTCTTGCAATGTCTGGCAATGGTGAAAATGGGGATGATCCAATATCAAATACCCATTTTTCATTTGGTCAATTGTTAGACTTGGGTGTTATTAATCAATATGGGCAGTTGCTTGGTGAATATAGCAGATTGGATGAAGAACTCTGGCTTTCGAAACCCGAGCCTGATGATGTAGTGGATAATCTGGATAGAAGCGATGATGGGGTTAATCATTTTGCAACTTCTGTAAGTTTAATTAACTTCTGTTCAATTTGTGCTCAGTTAATTCTACCGATTATATAATCAATACCAAAAAAAGTTTGTTTTGTTTTGTTAAATTTCTGAATAAGTTCTGCTGTTTACCTGTTGATAATTTTATGAACATATGTGTAGTTTGATGTTGGTTGCGTTAAGTGTATGATTCTAATGATTGATGGGAGCTTCAAAATCAATAACTTTTACGGAGTATTTGTTAATAGTGCAATTTTTTTTGAGGAAAACATTAGGCTGGCATTTTATTGCAAATCAAGTTCCACCAAAGTTATCGTACTTTTGGGGGAATAAATCCATAGTTCCATACAAAACAAGTTCAGCAATACCCTTTGCCACAAGGTAAGCAACAGTATTCTCGGCCCTTTTAACATGAGAAACAACATAAAAAGTAAAACATTGTGATAGTAAAATAGTAAGCCTCTTTGTTCCAGCCACAATCAGATTCCCTTCATGGTTTCGAATCACAACACCAAGCCCCACCATCACGCCCTCAACCACATGAGCATCAACATTGATTTTCATTATGGAGTATTTGTTAATAGTGCAAATTAAATGGCATAATCATAAATAAATTTAATTTCAAGTGAGACCAAATTATTCAATTGAGCTCAAATTTAATTCATTTAATTCACTTTTTTCCTTATCTATCTATATATATATAATATATTAAAAGGCGTTGTGAAAAATGTATATGTGCCACATAGTACTCTCCCCTTTACGCCACATCATCCACTCACCAAGTGTTATGCCATGTCATAGACAACCAAAACTAATGCAATGAGATTCAAACACAAGACCTCATGTGTAGAAGATAAGTTTCATTACCATCTTAACCAACCACCAATTGTGGTTTATTAATTCACATTAATTTATAAATAAATGTTAACATAAGTCTTAACACAACTAAATTTTTACATGACATTTTATTTTCTTTGGACTATTTTGAAAAAAGTAATGACAATTATAGCATTAAAACATGAAAAACATGTTGACGTAAATTATATATATATATATATATATNCACATCATCCACTCACCAAGTGTTATGCCATGTCATAGACAACCAAAACTAATGCAATGAGATTCAAACACAAGACCTCATGTGTAGAAGATAAGTTTCATTACCATCTTAACCAACCACCAATTGTGGTTTATTAATTCACATTAATTTATAAATAAATGTTAACATAAGTCTTAACACAACTAAATTTTTACATGACATTTTATTTTCTTTGGACTATTTTGAAAAAAGTAATGACAATTATAGCATTAAAACATGAAAAACATNTATATATATATATATATATATATATATATATATATATATATATATATATATATATATAATTTGGTGTTTAATAATTTGAAGCACTTAGTTCCACTAGAAAACAAATTATACAAATGGTAAAATGATTTAATTGAAAAATTACTTAGCATGATAAATGACGTAGTGTGTCTTTGTAATTGACGAACCTCATGGATGTTTTCACTTTATGGAATACATGTATTTTGGTCAAATATTGAGCTCAAGAAAAATCTAAAATCTTAAGTATTGAATGTAGCAACCGGGGCATCGCCCGGGCCACACACTAGTTCTTTTTTATTTCCATCTCATTTTTTTTTTCCTTACCTATTTAGTTACAAAAATGTTATGTGATTTCGACTATTCACAAATAACAAAAATAACAGTAATTGATTTCAATTTTCCACGATGGATCATGTGTTAA
ACGTTGAAAATATTTCACTTTCGGAATAATGTGTTGAAAGGGATCTCCATATTGATCAATATGCTTGCCTAATAACATTGCTTAGCCTAGCATCCAAAGTTGGATTACAGTTGGGATAAAGATTTCGCGGAATCTAGTGAAAGAATAAACGAAAAATATTCTTGGGGTGAAAAACATTTGTGGATCAACGTTGTTAGTTATATGTATGTTGTTGCTAGATAGGGATAAGGTTGCGCGGTTTTGAAGTTTCTTAGACTGTAGCGTTAGTGCAATCCGTACTTATTGTTGTAAGAGGGGACTAGAGATCATTTAGATGACCAACTAGTTATGTTGGAAAGATGCAAAACCTTCTCTGCTTCGTTTGTCCCCAGTGGAGTCGTAAGATTTGAGATGTGAATTGTTCCATATCAGACTGCAGTTAGTGCAATCAATGTACTGCATGCAGATAAGGTGACTTTCGTGGCAGATTTACAGTTTTTTGAATAAGAGGAAATGGAGAGAAAGAGAGGGAGGGAAAACAAGTTCCATTGTTTGGATAAGGAAATGGAGGGAGTGAGAGGAATCCATTTTCCCTTCTTCTTTAATAAACAATATCCTTCCGAAATTGGAGAGAATTAGAGGGAAAATGGAGCTTCACCTTCTGCCTCCTTTCCTCCTCCTCCCTTTCGTTTCCTGTCTATCTTGCTATTCATGTTGGTATTTCCCTCAGTTGCAGTCATAGAGTTTAAGGGTTTGGCTATATATTTGTGGATGACGAGATGCAGCAAAAGGATCTTTTAGTAGGGTGAATGAGACTATGAGAGGATAAGGAGGTGGAAGGACACAGGTAGCTAAACACCTCCAGTCCTCCACAGAGTAAATGCTCAATTCCTTTACCATCCACCATAACCAAGGGAAACCCAAACTTCGTATGCTGTTTTATTTATGCATGAAAAAGAAAAAAATTCATAGGGTCTTCCTCTTGGAGATTCTATGTCTCATTCTAGGAACTGTAAGACTGAGTTAACCCCTCTTTTTTAATGTTCCATTGTGTTTCAGATATGTTGTACCCGTATCATTTTGAGTTCCATTGATCTGATTATGTATTTAGAATGCCGTTGCTACTTCTTAAATCCTGCCATCTCATGTCCTGTATGGAGAATTTGTATTTGGCTAGCTTTTTGAATCAGAGCCATCTGTGGTTGAGTGATATGAACTTTTACGCATAAGTCACTAAAGGATAAAACTTGTAGTCAATGTATTCAGTGAGACAGCTTTTTTGTTTTACATTTTCCAAAAAAGATCAAGCCGGAAAGGATGATCAGTTCTTTGTCCATACAGAACATCTTTAGCAACAGAATGGGGTTAACTTCACAAAAACATAGTTAGCTTTTCTGATGATATTCAATCTTGTAACTTCTTTCTACTTCTCATCTGTCTGTTGTGAACTTGTAAAGACTGTCTGCTCTTTTCAATCACCAGTTTCAGCTTCCTGCTATGTTAAAATTGACTGGGAATACTTTTCTTTTTCTTTTATGATATGAGTATTTTCTTGTTGTATGCTCATATGAACATGTCTCTTATCTGGACTACTATTTGGTTTTTTGCCTCCTTTCCTGAAGAGCATGCCTTTTTCTCCTCTCTTTTCTTCCTCCCCATTTTATTTTGTTTTACGTTCAAGTCAAAGCTCTCTGACTAGTATTATAAGAGCTGCAATTTAGTAAAGTGTTCTGCGGTACTTCTTTTAGATTGGTTTGACATTTTCACTGTATTAGTTATGCCGTTTGAACGCCTTGAGACCACTTACTACAAATTCAGTTCCATGCCCAATTTAAGATCTTCTGATTCCAGTCTAACTAGTGTATAAATAGATTACCAAAACAAATAAAAAGAAAGAAAAGGTAACGGAAATAAGAATTGATTAGGTACGACTTGGCAAAGTTAAATTGACCTAGGATATTGTGCTGGTTAGAAACTTTCGTTCTTGTGTTTCATCTGTAGATTCCTACTTGGTCATATTC
TTGCTTCTAGAATCAATTCATCAGTTGGTCCCCTCTTCTCTATCCGACCCTCTTTTCTCTTTTGTATCTATGCTACTGCTTTCAACCTACTTAGGTTTACTGGTTTACTGTTGGATATTTTCTCTATTGATGATTTTTCCAGGCATAACCTCAACTAAGCTTGATTGCAAGGATAAGACTGATGCATCTACATCTTGTCGTAATTCGTGTATTAATGCTTAATGAGTTTACGCATTTATGAACATGTAGAGACACATCATTCCTTGAAATGTACTGAGTTTGTTATACTAATATTTGATAGGCTAAACATTTATCTGAAGATGTTTGTAACATAAATTGAATGTACTCAGTTTTGCACTCATAACAAATCATAAAAATGATGACATTTGGCCATTTTTTGTTAATTCAAAGTTCAAAGTTTATGTTGTGATACTAGCTGTCATTTGAAATAAAAAATAAAATTAGTTTTGTGATATGTTCCTCTTTCTTTCCATCTCATAATGAAAACTTCTTTGATGTGGTGCAGCCTTCAGAATTTGTAGACAAGACTGTCTGTGATGGAGGTATGGCAACTGTGAAAGAAAAGTTGACCTCCTTGAAGGATTGTTCTCAGAATCTGTCAAGCCCAATTTCAGTTTTCCAAACGAAGAGCTCATACTCAGATGACATAACTACAATATGTTCTCAAAGCAAATGCCTTCCACCCACTAAACACAAATTTTCAGGAAGCCTTCAGTTTTTGGGCACCACTATGTCTACTAGAGAGCAACGGTGTGGGCTTCAATCTGAATCAGAGTACTCTATGGAGTCCTTATCAGATGAAAATGAAGAGTCTATTCCATCCAAATCGGTACGGACTAGGAAAATTAGTGATTTACTTGACTTGGATTATGACCCCGCAATCAAGAAATGCATGCATTGCGAAACAACAGATACCCCGCAGTGGAGAACAGGGCCCTTAGGACGAGGAACCCTCTGTAATGCGTGTGGAGTTCAGTATAAACAGGGTAGGCTTTTTAGCGTCTACCGTCCTTTAAGGAGTCCGGCATTTGATCCTTCTTTACACTCCAATTTGCCCAAGAAAGTCCACGAGATGAGAAAAGGTAGCTCAAATGAAGGGGATCCAAATCCTCCTACTCCTAAACCCAAAAAAAAGAAAGTTGAGGAAATGAGGATGAGTAACAAAAGTGAGAGGGATCCAAGCCCGACTCCTAAACCCAAGAAAGTTGTGCCTACTCCTGAACTCAAGAAAGTTGAGGAGATGAGGATGGGTAACTATAGTGAGATGGCTCGGAACCTTTCTTTTGAACAAAGAAAAGCCGAGGAGATGAAGAAGGGTAACAATAGTGACATGGCCCGGGACCTTTCTTTTGAACAAAGAAAAGCCGAGGAGATGATCTTGGGGAACGAGAATGAAAGATATCAGAACCGTGCCCATAGAGCAAAGAAATTTGAGGAGATGAGAGAGAAGGCTAGCAATGTAGTTCAATCAGATTCTAGAAAGAGGAACCGGGAGTTTGATTCTTATCCGACTGTAAAACGCAAACTAGCGAGAAAAGTGGCCAAGAAGTGCCTACATTGTGAATCAACAGAGACCCCGCAATGGAGGGAAGGGCCCATGGGACGAAAGACACTTTGCAATGCATGTGGCGTCCAATACAAAGCTGGTAGACTATTTCCTGAGTACCGTCCTTTAAAAAGTCCTACTTATAATCCGTCTTTGCACTCCCATTTACCCAAGAAGGTCCTAAAGATGAGAATGAGCAGTTCTGAGAATTGTGCAACTTCTGGCAACCATGAAAAGCCAGCTTGAGTTTAACTTCTTAAGCAATTTTCTCGCAGAATGTATATGGTACTGTTCGCCTCTGCTAGTTCCCCGCAACTCACAGCTTCTTGTGCTCAGTATCCCCCTTTCTCTGTTTTTATGTACACATTACAATATGATAGCTGGATAGGAATGCCTGTTAGCCGCTAAAAATAGCTCAAGTTTGCGGGTTGAG
CTGGCTAATTACAAGGGAAGAAATAGAGTAAAGAGTCATTGAATACTCTGAACCACATGTTATCTTTTTGGATTTTCATAGCTGGTGTCCCTTTTAGAATCTGTCTCCTAGTTATTTCATTCTATGTATAGTGTAATCTTAAAGCAGGTAGTATTGATTTCTTTGGTACAATATGATTATGTGAAGCAGTTAATTTTGACATCAGTAAATTATCAAAGGAGAATGTGTTCATTATGCTTTATCATGAGATCCAATTCCGAAATCAAGGAACATTAGTGTACGGGATACTCCCGTAGCCCAATTTAATACAAGAATAGCATTTTCTAGAATTATGGGCCCAAAAGACAAAGTCCTAAGGCCCAAATCAATGACCTAATAAATCCATCAAACACCTATTAATACCTATTTATGGGGGAAGGTTAAGAGGATCTCCTCCACTATCTCTCATCTCTCCCTACTTTATCTCTCATATGATACTTTTTTTTTTTTTTTTTTTTNTTTAAGATCTTCTGATTCCAGTCTAACTAGTGTATAAATAGATTACCAAAACAAATAAAAAGAAAGAAAAGGTAACGGAAATAAGAATTGATTAGGTACGACTTGGCAAAGTTAAATTGACCTAGGATATTGTGCTGGTTAGAAACTTTCGTTCTTGTGTTTCATCTGTAGATTCCTACTTGGTCATATTCTTGCTTCTAGAATCAATTCATCAGTTGGTCCCCTCTTCTCTATCCGACCCTCTTTTCTCTTTTGTATCTATGCTACTGCTTTCAACCTACTTAGGTTTACTGGTTTACTGTTGGATATTTTCTCTATTGATGATTTTTCCAGGCATAACCTCAACTAAGCTTGATTGCAAGGATAAGACTGATGCATCTACATCTTGTCGTAATTCGTGTATTAATGCTTAATGAGTTTACGCATTTATGAACATGTAGAGACACATCATTCCTTGAAATGTACTGAGTTTGTTATACTAATATTTGATAGGCTAAACATTTATCTGAAGATGTTTGTAACATAAATTGAATGTACTCAGTTTTGCACTCATAACAAATCATAAAAATGATGACATTTGGCCATTTTTTGTTAATTCAAAGTTCAAAGTTTATGTTGTGATACTAGCTGTCATTTGAAATAAAAAATAAAATTAGTTTTGTGATATGTTCCTCTTTCTTTCCATCTCATAATGAAAACTTCTTTGATGTGGTGCAGCCTTCAGAATTTGTAGACAAGACTGTCTGTGATGGAGGTATGGCAACTGTGAAAGAAAAGTTGACCTCCTTGAAGGATTGTTCTCAGAATCTGTCAAGCCCAATTTCAGTTTTCCAAACGAAGAGCTCATACTCAGATGACATAACTACAATATGTTCTCAAAGCAAATGCCTTCCACCCACTAAACACAAATTTTCAGGAAGCCTTCAGTTTTTGGGCACCACTATGTCTACTAGAGAGCAACGGTGTGGGCTTCAATCTGAATCAGAGTACTCTATGGAGTCCTTATCAGATGAAAATGAAGAGTCTATTCCATCCAAATCGGTACGGACTAGGAAAATTAGTGATTTACTTGACTTGGATTATGACCCCGCAATCAAGAAATGCATGCATTGCGAAACAACAGATACCCCGCAGTGGAGAACAGGGCCCTTAGGACGAGGAACCCTCTGTAATGCGTGTGGAGTTCAGTATAAACAGGGTAGGCTTTTTAGCGTCTACCGTCCTTTAAGGAGTCCGGCATTTGATCCTTCTTTACACTCCAATTTGCCCAAGAAAGTCCACGAGATGAGAAAAGGTAGCTCAAATGAAGGGGATCCAAATCCTCCTACTCCTAAACCCAAAAAAAAGAAAGTTGAGGAAATGAGGATGAGTAACAAAAGTGAGAGGGATCCAAGCCCGACTCCTAAACCCAAGAAAGTTGTGCCTACTCCTGAACTCAAGAAAGTTGAGGAGATGAGGATGGGTAACTATAGTGAGATGGCTCGGAACCTTTCTTTTGAACA
AAGAAAAGCCGAGGAGATGAAGAAGGGTAACAATAGTGACATGGCCCGGGACCTTTCTTTTGAACAAAGAAAAGCCGAGGAGATGATCTTGGGGAACGAGAATGAAAGATATCAGAACCGTGCCCATAGAGCAAAGAAATTTGAGGAGATGAGAGAGAAGGCTAGCAATGTAGTTCAATCAGATTCTAGAAAGAGGAACCGGGAGTTTGATTCTTATCCGACTGTAAAACGCAAACTAGCGAGAAAAGTGGCCAAGAAGTGCCTACATTGTGAATCAACAGAGACCCCGCAATGGAGGGAAGGGCCCATGGGACGAAAGACACTTTGCAATGCATGTGGCGTCCAATACAAAGCTGGTAGACTATTTCCTGAGTACCGTCCTTTAAAAAGTCCTACTTATAATCCGTCTTTGCACTCCCATTTACCCAAGAAGGTCCTAAAGATGAGAATGAGCAGTTCTGAGAATTGTGCAACTTCTGGCAACCATGAAAAGCCAGCTTGAGTTTAACTTCTTAAGCAATTTTCTCGCAGAATGTATATGGTACTGTTCGCCTCTGCTAGTTCCCCGCAACTCACAGCTTCTTGTGCTCAGTATCCCCCTTTCTCTGTTTTTATGTACACATTACAATATGATAGCTGGATAGGAATGCCTGTTAGCCGCTAAAAATAGCTCAAGTTTGCGGGTTGAGCTGGCTAATTACAAGGGAAGAAATAGAGTAAAGAGTCATTGAATACTCTGAACCACATGTTATCTTTTTGGATTTTCATAGCTGGTGTCCCTTTTAGAATCTGTCTCCTAGTTATTTCATTCTATGTATAGTGTAATCTTAAAGCAGGTAGTATTGATTTCTTTGGTACAATATGATTATGTGAAGCAGTTAATTTTGACATCAGTAAATTATCAAAGGAGAATGTGTTCATTATGCTTTATCATGAGATCCAATTCCGAAATCAAGGAACATTAGTGTACGGGATACTCCCGTAGCCCAATTTAATACAAGAATAGCATTTTCTAGAATTATGGGCCCAAAAGACAAAGTCCTAAGGCCCAAATCAATGACCTAATAAATCCATCAAACACCTATTAATACCTATTTATGGGGGAAGGTTAAGAGGATCTCCTCCACTATCTCTCATCTCTCCCTACTTTATCTCTCATATGATACTTTTTTTTTTTTTTTTTTTTACTTTTTAATTTTTTTTTTTTTGAGAGGTAAGAAGAAAGTACATTAGGAGGCCCTCCGCACGTGGGTAACTCCTAAGTAATCGTTTGAGATAATAGAAGACAACATAGGGCTAATCCCCGGATTACATGCATAGTACTCGGAATCAGATGACCAACGTTTGGCAATCCAATCTGCCGTCTGATTAGCTTCTCTGTAGATGTGCGAGATCTCCCAAAAGTCAAAGTGTTGAGAAAGAAGCTTTTTGATATCCTTGATGATGGTGTCAAGTTTCCATGGTGTTGCCCATATTCCCTTCACGGTGTTTATGACTAATAGATTGTCTCCTTCTATGCATAAATTCTTGATTTATAGATGAATTGCTTCTTGAATGCCTTTATGGAGGGCGAGCGCTTCAGTCATGTAGACTTGAGAGTTGCCGATGTTGGAAGTCGCGGCACTTAAGTGTTGGCCCATATGGTCTCTGATGATGAAGCCTGCTGCAGCAGAGGAGTTTTTGCGTGATGCATCAAAGTTGAGTTTGAAATGGTTTGGAGGTGGTGGATGCCAACGGACAGGAATGTTGGTGGAAGTGGATGGTGAGGAACCTGAAAGGAGTCCTGTTTCAAGGGCAGAGTCCAGTTTGTTTATGGAAATCCATTCTGAGTGAGCTGAAATGGCTACAAAGTAGATTTTGACTGGGTTAAACGGTCTATTTTTGAAGACGCAGTTATTTCTTTCTTTCCAGATTGCCCAAATCATAAAAACAACCTTAATCTGGGTTTCAGAAAATTTTCGCAGAGAGCTTAGGAGCTCCATAATAGGTTGAAACTGAAA
GGAGAAGTCAAGCCACGTACCTTTTTACGAGTGGATAGTTCCAGGCGTTTTTGACAATTGGACACTGAAGAAAGAGATGATCGATTGTTTCGACGTAAGTGTCGCATAGAGGGCAAATATCTGTGTTTAAGATGTGACGGGAGAAAAGAATGTGCCGGACTGCGATGCTTTTATGACCAATTTCCCATAGAAAAATTTTGATTTTTGGTGGAATATCTAATTTCCAAAGCTAGTTGAATTCCCATTTACTAGAGTTCTGTGGTATTTTATGGGCGAGCCAAGTTGCTGATTTGATAGAGAAGTCCCCCGATGTGGTGGGTCCCCAAATTGGCATGTCATCTGTTGTGATTAGTGGGAGGGGGAGGCCTTTGATTTTCAAAACTATGTGCTCAGGTAGGAGAGTCGCTAAAAGAGGGAGATTCTATTGTTGACCCGACAGGATGAACTTTGAAACCCGAAGGTCTTTATCAATAGATTCTCGATTTGAGTTTGTAAGGGACAACAAATTTTCTTGAGAAATCCAATTATCCAACCAGAAAAGGACAGAGGTCCCATTCCCGATTTTCCAACGGATTCCTTTACGGAGAATTTCACGTTGACAGAGAATCTGTTTCCAGATCTGCGAGTCTTTTTGTTTTGGTTTGCATGGAAGAAAGGTCGTGTTATTAAGATACTTTTTAGTCATAATAGAAACCCAAAGGTTGGAGGGGTTTGTCATTATTTTCCATCCTAATTTTGCCAAAAATGCTCTGTTGAGGGGTCTAAGTTTGCGTAGACCCAGGCCTCCCTGGGATTTCGGTTTGCAGATAGTGTCCCATGAAATGAGGGGCGTTGAGTTGGATTTGTCTGTTTGCTTCCAAAAGAAGTTACTCTGGGTAGCGTCTAGATGGTTGCAAATTTTTTTCGGAAGGAGGAAAGAGGAGCATACGTAGGCTGGGAGAGCTTCAAGATTGCTCTGAACTAGTGTTAATCTTCCTGCTTTAGAAAGGTAGTTTGCATGTAACATTTGAATTCTATTTTGATTTTTGTGCATTATGTTGTTATAGTCGGTTTTTGAAGGTTTGTAGCTGGAAAAATACGCACCTAGGTATCTACCTAGTGACATCGAGCTTTGCAAGTTGAAACTTCCTGCAATATTTAGTCTCCTAGTGTTGCAAATTTGTTTAGAAAAGACAATAGCGGACTTGTGGAAGTTCACTAATTGGCCCGAGAGGGTAAAAAATTCTGTAATAATATTCTTAAGATTGCTACATGAAGTGTTTGTTGCTTTACTGAAAAGAAGACTATTGTCTGCAAACATGAGGCAAGGAACTTTCGGGGTATGAGGAGCAATTTTAAAACCAATGCCAGAAGCAGTATTGAGTCCTGCTCTGGTTAGCATAATGATAAAGACGTCCATACAAATGAGAAAGAGGTAAGGTGAAAGCAGGTCACCTTGACGAAGACCTCTAGTAGGCTTAAAATGTTCTGTAGTTGAACCGTTGATTTTTAATGAGTAGGAAACCGTGGTCACACAACATTTGACCCATTTTATCCATTTGGTTGGGAAACCAAAGGCTCCTAGGGTTGCCCAAAGAAAGTCCCATTCTACCCGATCGTAGGCTTTTTCCATGTCTAATTTGAGGGCGCACCAACCTACCTTCGATTTAGATTTTTGAAAGACATTAAGAATTTCTTGAACAATTAGGAGGTTGTCATGAATTGATCTTTCTGGCGTAAACGCATTTTGGTAGGGGCTTATATGGAGTTGAAGAATTGGAGTGAGTCTTGAAACTAACAGTTTTGCAATAGCCTTGTACATTGTGTTGCACAAGCTAATTGGTCTGAAGTGATTTGGATTTTCCGGGAGATTTTTTTTCGGAATAAGGGCAATGATGGTATGATTGAGGGCCGATGGGATTGATCCATGGTGGAAAAAGCTTTTACGCTGTCGTGACTTCTTTTGCGATAATGTTCCAATATGCTTGAAATTTTTTTGGACCGTATCCATCTGAGCCCGG
ATATTTGAGTGGGTTCATTTCATAAAAGGCTGCTTTGATTTCTTCATCTGAAAAAGGGGCGATGATGTGGTTTTGTTCATCTCGGGAGAGGACTTTTGGTAGGAAACATAGGTCGATATTCTTGTTGCTTGGGCTATTGGAGGTAAACCGTTTCTTGAAATATTGAACCAGGATCGATTGAACTGTAGCTGGATTATCTTCCCATTGTTGTAGGTCATTTTTTAAACGGTGAATAAGATTCCTAGCAGGTTGCGATTTAGTTCGTTGGTGAAAGAAATGTGAATTGCGATCTCCTTTTGTTAACCATTCTTTACGCGCGAATTTTTTCCAGCTATTCTGGTCGTATAAGAGAAGTTTCTCTCGTTGCTTGATCATTCTAAGCAAGTGGTCATGTAGAATTCTGTTAAAGGGTTGATTAAGAAGTTGGGCTTGGAGACTTTGAATTTTGTCCAAATTCAGAGATAATTTATTGTTCATATGACCGTATTTCGATTGAGCCCAATCTTTGAGTTTTATTTTGATTGCTCGTAGTTTTTGACCTATTCTAAAAAACCTGGTTCCCGGGATATTCGTTTGCCATTCTTTGCGAACAATATCATAGGTATCCCTGTCTAGGGCCTATAGATTCTGAAATCTATATAATCTATTTTTGGTGTTGGTAGATGTATTGGTATCTAAAAAAATGGGGGCATGATCTGAGACTGTAAAGTTTCCATAGCTTAGACGGGTGTCAGGGAAGATGATGAGCGACATTAGCTTAGGTTTTTGGTTCATTTTATTAGCATTTTATTATTATTTTTGCTTAGTTTTATCGTTTTTAGTTCGTTTGTCGCATTTTTGCATTAATCGTAATATTTTCTCGTCTTGCGTAGATTTTAGTCGTTTTTGCTTAAAAAGTGCCTTTTGTGCGCATTCTCATTGCGTAAGGGTCGTTTCGAGTATTGACATGTTAATATTGTGTCATTATGCTTGACGATGTTTAAAATGATTTTACGATTTGTAAAAATGCGATACGAATTATTAATACGATTTTCGATTTTTAATAACGTGCTTTGCGATTTGCTAACTTGCTATACGATTCTTTGCATGAGCAGCGACGAAGCCTTGTGTGTGTGCATACCAACAGCAGCAATACAAGCAACACACACGAGCCTCGCAAGTGCAGCCACACGAGCCAAGGCACAGCAGCCCCAGCAGCTGCACTCGTGCACACACAGCAACAGCAAGTACAGTACAGCAGTAGCGCGCATTCAGCTTCGTTGCATCCATTTCCAAGGCTCTTTGCAGCAGCTCTCTTGCTCACCTTTAAGCCATCACACAGCAGCCTAGCTTGCACTCTTCATTCAGCACGTACAAATAGCCACAGCTGCTGTACACGAGCAAGCAACCAGCAGCCAAGTGCAGCCTCGTGCACGAGTATGGCAGCAACCTTGCTTGCCTTGTGTGTGCGGTTCATGAGCCCTTGCAGCAGCCTTGTGGGATTATTTTCATGAGCTTTGAAGCTCCTTAACAGCAGCCCTACACAGCCACAGCAAGGCAAAGCATTGCAGCAAGAACACGCAGCAAGAGCGCAGCAGAATACAAGCAGCCACAGTACGCGCAGAACCTGAAATGAGCAGTCTGTGCGCTCGAATCCTCTTTGTGCGATCGAATGCTTCTGTGCGATCGAGTTTCAGGACCTGTGTGGTCAAACAAACTGCAGTTTAGAGATCAGAATGTTCCATTTTTTGCTGTGTGACCGAATGCTTGTCCCTGTTCGACCGCACAGAATTTGTGCGACCACACAACACCCTTTGTGCGACGGAATTGGAATGCGGGGCTGCTATTATAAAAGCTGAAAATCGCAGCATTGTAATTAGGCTTTCATTTTGTTAATTACGTAATTTTAGATTTTAGGCTTAGCTTAGAATTCTCTCTCTAGAATTAGTTTAGGGCTTTGATTAGAGATTAGGATTAAGGATTCTTAGCCTCAAATTCTTGATTCTAAATTCAATTTAATC
ATCAAATTTTCAGATTTCTTTTCTATTTCTTTTAAACTTTGTTCTTCATTTTGTTCTTCAAGTTTGATGCTAATTCTTCAATTCAAGGTATAATTCTAATTCCTCTTTTGATTTGTTCTTTAATTCTTTAATTGCTTCACTAATTTCGTTTATGATTTGATTTCTTGCTTGAATTCTAGGATTAGTTTGTTATTTTGAATTTCATTGATTTTTCCCCAATTTTTGTGGTTTTAATTTTCTGAATTTTATTGTTTCAATTAGCTTCATTAGTTCAATTAGTTTCATGATTAGGATTAATATTAGTTTAATGTTGATGTTAATCATGCTAATTGAGTAGTTTAGTCTAGAGGTTAGGGATGAAGCCTTGGCATTGAATTTGGGGAACTTTAGGGAAAATTCATGATTGATGTTGTGATTAAATGTGATTTAGTGATAGGATTTCCATGACTAGTTGAGTAGTAGTCGCGAATGCTATTTGATTAGAATTGATTGGAATGTATGATTTAGGTTTGGCAAGACATTGTGAGCCTATTTTGTGCAAGTGAATAATAGTTCCTATTATATGCAAGCTTATCTAGTTTAGGATTATGCGAAAGCTTTCCATAGGCTAGGTTGCTTCGCGTGCTCCATCCGAGAGGTGGCGAGCATGTCATTAGTTTTCATTCCCCTATTTACTATGTCGTTGCATATTCATGAATGTCTTGACCTTGTTATACCTTTGATTCGTTTCCCATTGCCGATTCCTGAATCCTCTAGAATTTCTTTATTTGTTTGCATTTCTAGCTTAGATTAGATTAGTTCAAAAACTCAAATTCTATTTCGAACTAGCTTGTGAACGCATAGATTGAATAGTCAAACATATCCATTCCTATGGGACGATCCCTATACTTGCCGCTATAGTCAATATAGCCGGTAGTTTAGGTGTTTTATAAATTTTGTTTGATGAGGTGAATTTCTATTCAACGACCGAAAAACACCTTATCAAAATGGCGCCGTTGCCGGGGAATGGCATCATTGTTTGATTTTTCTTTCTAGTTCTCTAGTCTAGTTCATTTTATCTTTTTTCTTTTAGTTTTCAAATAAAAATCAAATCAAAAATGTCAAGAGTGTTCTATATTCCTCAAGTTCTTCATGAACCATTTTACATGTATTATAATAGAGTTCATGCGGAGCTTGAACATCGTCGCATCCATGGAACTTCTAATGGTTTAAAGGGATGGCATGTTTGGGATTGCATATACTATGGAGTGAATATGGATACATATGTGATTCTTCAATCCAATTGTAATGGAGAGCTTCATCTCATGCATTGGAAAGATGCCATGTCTTTCTTTACGTGGTTTGCTAGAGTTCAATATTTTGACTCTCGTGCTTATTCTCGTAATTTTTCGCATCCATGCATGAATGTCAATGATTTCCCTCATGTTTCTAATGATAATGCTCATGAGCTTGATGACAACCATTCTCCCCATGTTGATTCTCAAAGCTCCTTGCATTGTGAATTTTCAAAATCTTGTCCGTGTCACAATTATATTGATGATGATGATTATGATTATGATTATGATTCATGTGATGCTACGGATGATAATCAATCTAGTGATGAGAATGATGTTGAACTTAGAGGTTGTTCCTCTTTCTCCCACCCCCCCCCCCCCCCCCCCCNNNNNNNCTCACCCCATTGATGATTGCCCATTGCATGTTGAGAATGATTTGATTCATTTGGGTTGTGATTGTTCTATTGAGGATGTCATTGAGTGTGAAGCACCTAATTTGGTGGTTTCGGATGATTCCGAGAATTGTATTGAAGTTGATGATGTGAATTTGGTTGCAAAATCCCTTGTTGAGATTTCTTGTGTTCCTTGTGATGTGAGTGAAGTTATTTGTAAAACTTTTGTTCCCACTCCCTTCCCTCATACCCCTTTCCGAATCAAGAAACTCGAGATTTATCCTTCCTTTCCCATCTCTCTTCCTTATTGCCTCCGAACCCCACCACTTGAGGA
TTTTCCACCTCCTATCCATTCCCCATTGCCTTGTTTGAACAATAATGAGGAAATTGGCTCTCTTGCCACCCAAAAAGAGTTCGTCGAGGAGCATGAGGCTTCTCTTTTCCCTTTTTTCTCCTTCTCCTTTTGATTCTTCTTCTTATATTCTTATTTTTGCTTTTGATTTTTCTTATGATTTTTCTTATTTTTGGTTCCCTACCCCCCGTAGGCATAGAGTTGCATATTACTTTGTGCTTGCATCTCATTTGACTCTTGCCTGCCTGTTACACTTTGATATGTTTTTCATTGCATATGACAAACTATTGCGATCATTGTCGGGGTACTTACTAGAGGTAAGAGGCGGCAAGCTAATGACGTAAAAAGAGCGCTTCTCGGGAGGCAACCCGTGGGTGTTGGCCATTTGGCCCATGTATACTAATTTTTCTTGAATTTTGAAGTTTTTGTGCCTCAAGCTATCTTGATGTGAAAATCTTTGCATCCATCTTTGCCATGTCCGGCTATTCTTCTTGGTGAGTTCGTTCGTTATTACGGTTCTTTGCGGGACTTGCTCCCACGACATTTCCGGTAATATCCTTCTAACTCACATGCATTCTTTGGGGTCGGGCATCATACATGCGGGGCATTGGTTGGATGGGTAGTTGTGCCCCTCCTAGTTTCGTTTTAGGCCTTGAGGACATGGCCTAATTCGAGCTTGGGGGGAGATTCAATTGTTTGCTACTTTGATCTCCATGTGATGGATTGCTTGATTTTGTGAATATTTTGGATTTTGTGCATATTTCTTGATTAGTTTCATTGATTTTGTGAATATTTTCTTTATTTTCATTTTCTTTTTCATTTCTTTTCTTTTCCTTTCCTTTTATTTTTCATTTCATTTCTTTCTTTTTCCTTTTCGATCTTTGTCAAGGCAAGCTTTTCTAGCTTTTTAGGCATTATTCTTGATTTGGGCAAATAAAATTGGAACTTGTAGGTTAGAATGCTTTTATTCCTAATTGGTAGATGGTTGCACGGTTTTCAATGTCTTGGGTCGAATCTAAGATTTAGGTGTTAAGAAAGTTGGTTGAACTTTGGGTAGCATGCGCCTTGAACCCTTGGCTTGTGGTAAGATGGTGTCGATCTCTCTAGTGACTTGAAACCGGAGAGAGTGGTATTTGCTCCCATGTAAGAATCATGTTCATTAGCCCAAACACATTGTTTACACATATTGAGTGGCATGGATCCTTAGCATTGACTTTCTTATTCCCCGAGTTTGACTATTTCCTTCCCGAGACAGCGACCGAACTACTTTAGGGGACGATAGAGGCATTTCTTTTGGTGACTATTTTTCTTTTTAGTTTAGGACTTTATTTTCTATTTTTCATATATTTTTGTTTGCATTTACCCCCTTGCCAGCCATTTGAGCCTTGCCCCTTCATTTCTTATGAGCACATTACAAGCCTAATAGCCTTTGTTTTATTTTCACCCTTAGAAGTGATAGTGAAGCTTTGTGTTATGATGCTTGAGGTTTTGAATGATTATGCAAAGTTGAGGAATGATTGTTATTGGTATGAATGGCAAAGTTTGGGAAGTATGTTGTATATAGTGAATAAATAAGCGAAAGTAAAGTTTTGAAAAAGCCAAAAATAGAGAAAAACAAGGCATGAGAAAAGTTTCATTTGAAACACAAAAAAAAGAGAAAATCAAATCAAGAAAAGTTCAAAAAGAGTTTTAGTTTAGTGTTGTTGTTGAATAAGCTTGGGGGTACCCCAAATAAGTGGTATGAGTTTATTTTTGGTTCGATTTACTTGAGTTGAGTTCTATCATTGCGCTTCACTTTAGGTATGAGCACCAAATTACCAAATTTGTTCCACACCTTACCCCTAGCCTACGTTTCACCCTAAAAAGTCCTCTTGACCCTTTTGAGGTGAGTCATTGTCGGTGGAGGGAGGATTTGGTCAAGTTTATGGAATGGGATTGTCATGCTAAGATGTCGGGTAGCCTAATCTTGTTTTCCGGCGCCT
TCGCGGTGTCTCGAGACGCTATTCTCAAGAATGGATAGGATCTAGAGATTTGACATGGTATGCTCGGTTGAAGGGGAATTGGCTAGTGTTTGCCCTTAGTTCTCTTTGCTTGGCACTTGAATCTTTCACTCGATTTAGGACATTGGTTAGCGTGCATTTGTTTATTTAGTTAGTAATATTAGCATTCTCTTGTGCTCGTTCCTTAATAAAGGCCCTTATCTTGAGTTTTGTAGTTTCGTTTTTTATTTTCTTTTCCTTTCTTTTCTTTTCCTTTTCTTTTTTTTCTTTTTCTTGGTTTGTTTTTCATTTAGGTCTTGCCTTGATTCAATTTTTGGAAGAGTTATGGGTGTTTCTCTTTGGAAACCCTTGCAAGATCCCACTCGCCCTCATGAGCGATTGTGGGGTTCAAAGAGCTTGTTGCATGCGCTTAACTGCAACCGTGATTCCTACGAAAGTGAGTTAGTGTGCATTCTTGTAGATCTTATTTTCGTAATTTTGATTTTTAATAAATAAGCTCATTCTCCTCCCCGTTTCTCATTCTAGCTTTGCTTGAGGACAAGCAAAGGGTTAGCTTGGGGGAGTTTGATGAGCGACATTTATGTCGCTCTTAGCTTAGGATTTTGGTTCATTTTATTATTATTTTTGCTTAGTTTTATCGTTTTTAGTTCGTTTATAGCATTTTTGCATTAATCGTAATATTTTCTCGTCTTGCGTAGATTTTAGTCGTTTTTGCTTAAAACGTGCCTTTTGTGCGCATTCTCATTGCGTAAGGGTCGTTTCGAGTATTGACGTGTTAATATTGGGTCATTATGCTTGACGATGTTTAAAATGATTTTACGATTTGCAAAAATGCGATACGAATTATTAATACGATTTTTGATTTTTAATAACGTGCTTTGCGATTTGCTAACTTGCTATACGAATCTTTGCATGAGCAGCAACGAAGCCTTGTGTGTGTGCATACCAACAGCAGCAATACAAGCAACACACACGAGCCTCGCAAGTGCAGCCACACGAGCCAAGGCACAACAGCCCCAGCAGCTGCACTCGTGCACACACAACAACATCAAGTACAGTACAACAGTAGCGCGCATTCAGCTTCGTTGCATCCATTTCCAAGGCTCTTTGCAGCAGCTCTCTTACTCACCACAGCTGCTGTACACGAGCAAGCAACCAGCAGCCAAGTGCAGCCTCGTGCACGAGTATGGCAGCAACCTTGCTTGCCTTGTGTGTGCGGTTCATGAGCCCTTGCAGCAGCCCTGTGGGATTATTTTCATGAGCTTTGAAGCTCCTTAACAGCATCCCTACACAGCCACAGCAAGGCAAAGCAGTGCAGCAATAACACGCAGCAAGAGCGCAGCAGAATACAAGCAGCCACAGTACGCGCAGAACCTGAAATGAGCAGTATGTGTGCTCGAATCCTCTTTGTGCGATCGAATGCTTCTGTGCGATCGAGTTTCAGGACCTGTGTGGTCGAACAAACTGCAGTTTAGAGATCAGAATGTTCCATTTTTGCTGTGCGACCGAATGCTTGTCCCTGTTCGACCGCACAGAATTTGTGCGACCACACAACACCCTTTGTGCGACCGAATTGGAATGCAGGGCTGCTATTATAAAAGCCGAAAATCGCAGCATTGTAATTAGGCTTTCATTTTGTTAATTACGTAATTTTAGATTTTAGGCTTAGCTTAGAATTCTCTCTCTAGAATTAGTTTAGGGCTTTGATTAGAGATTAGGATTAAGGATTCTTAGCCTCAAATTCTTGATTCTAAATTCAATTTAATCATCAAATTTTCAGATTTCTTTTCTATTTCTTTTGAGCTATTTTCTTCATTTTGTTCTTCAAGTTTGATGCTAATTCTTCAATTCAAGGTATAATTCTAATTCTTCTTTTGATTTGTTCTTTAATTCTTTAATTGCTTCACTAATTTCGTTTATGCTTTGATTTCATGCTTGAATTCTAGGATTAGTTTGTTATTTTGAATTTCATTGATTTTTCC
CCAATTTTGTGGTTTTAATTTTCTGAATTTTATTGTTTAAATTAGCATCATTAGTTCAATTAGTTTCATGATTAGGATTAATATTAGTTTAATGTTGATGTTAATCATGCTAATTGAGTAGTTTAGTCTAGAGGTTAGGGATGAAGCCTTGTCATTGAATTTGGGGAACTTTAGGGAAAATTCATGATTGATGTTGTGATTAAATGTGATTTAGTGATAGGATTTCCATTCATGATTGAGTTTGGCAAGACATTGTGAGCCTATTTTGTGCAAGTGAATGATAGTTCCTATTATATGCAAGCTTATCTAGTTTAGGATTATGTGAAAGCTTTCCATAGGCTAGGTTGCTTCGCGTGCTCCATCCGAGAGGGGGCGATCACGTCATTAGTTTTCATTCCCCTATTTGCTATGTCGTTGCATATTCATGAATGTCTTGATCTTGTTATACCTTTGATTCGATTCCCATTATCGATTCCCGAATCCTCTAGAATTTCTTTATTTGTTTGCATTTCTAGCTTAGATTAGATTAGTTCAAAAACTCAAATTCTATTTCGAACTAGTTTGTGAACGCATAGATTGAATAGTCAAACATATCCATTCCTATGGGACAATCCCTATACTTGCCGCTATAGTCAATATAGCCGGTAGTTTAGGTGTTTTATAAATTTTGTTTGATGAGGTTAATTTCTATTCAACGACGGAAAAAACCTTATCAGAAGACCGTGTTAAAACTGTTGTGAGCAACGACTCTATCTAGACGTTGTAGAAGTAAGTCTTGTCGATTAGCTTTCCATGAAAATGAGGGTTGAGTGCTAGCTACATCATAAGCGTTAAGGTACATTAGTAGGGATGGTAACCGCTGGCATTGGGTGTCCGTTATTGGGTTTCCACCGAGTTTATCGTCAGGGGTTCGTAATTCATTAAAATCTCCAATTATTAACCATGGCTGGGATAAATTGTCATGGAAAGTACGGAGGTACTCCCAGAAAAAATCTTTATCATTTTCCTGGGCCGGGCAGTAAACTCCCGTTACTACAAACGTTTTGTTATTTGGTTTGTACAGGATTGACAGGTGAGCACACCTTTCTGAAACTGAGTGGTTTAAGACTGAGATGTTATCGTTGTTCCAACAAAACCATAGTCCACCACTATGGTTGGTGGGGTCAATGTTTAAGTGTTGGTGGAAATTTAGCGTTCTTAAAAGTGTTTTGCTGTTTGAATTATTGGTTCTTGTTTCTTGAATACAAAGAATATCCGGTCGGAAGTTATTTTTCATGTGAATGAGTTCCGCAAGGGCTTGACCTCGCTTCCCTCCTTGAATATTCCAAGAAAGTATTTTCATAATAAACACAAAAGTGGTAAGTATTCTTTCAAGATCGAAAGTGACAAGCTACAATGAGCAGTTCGTGCGGAGTTTAAACTCCGTTTCCTTTGAACAGGTAATAGACCCAATTTTGTTAGGATTGGAAAAAAACCCGCAGTCACCGGAGTTCTAGTTGGAATATGACTAAAGCAGTTCTTGTGTGACGTATTTAGAGTAGAGAAAGTAGTTGTAAAAGGAGTGTCCCGAATACTGGAAATAAAAACTAATTTCGAAAGAGAGTTTGGGGGGGGCGGATATATAAATCCGGTTCCGATAAGGAGTGAGCTCCTTAGGCAGTTCGAGTAGTAGTAATAAACTACAACATATCAAGTAGGGAGTTAAGATCCCTAGTATACATACAAAAGATAAAACCAGGGCGGATATATGAATCCGATTCGGATAAGGAGTAAGATCGTGTTGTAGAAAATAGCAAAAATTAAACGGATTTTTCATGAAATGCCCCTGAGGTTTATCTTAATGCACCAAATACCCTTAAACTTTCCAGAATGCACCAAATACCCCCGAGGTTTAGTATAAATACACAAAATACCCAAAATGACTATTTTCCGTTAACTCCGTTAACTTATCCGTTAACTTTAACTTTAATTTAATTTAATTTTTATTTTTTCTCTTTCCC
TTTTAATTTCTTATTTTTTATTTTTTGTTTTTTTCTCTCTCCCCTTCCTTCTTCCCTGTCTTCCATGGCAGCCCTTCTCTTCATCACCACCACCACACACACCACCCTGCTAACCCCTTCTCTTCCTCACCACCACCCCACACCGCCACCGACACCAGCCTGCTCCACGGCAGATAAGCACATCGATTTTCCTCTCTCCTCAAAATCCAGATCCCAAAAAAATTGATCCCCAAATCAGCAATCTAGTCGCAATTATCGTAACGGAGGGGGAAGTGGGGGGTGGGCAACGACGCCGGTGAGGAAGGGGGAGTGGGGTTTGGTGGCGGCGGTAGTGGGGAGCAATGAGGAAGGTGGGGGAAGTGCAGGGAAGGGAAACACGGGTTTGGTGGTGGCGGCGGCAGTGGGGAGCAATGAGGAAGATGGGGGAAGTGCGGGGGAGGGAAACGTCATCGGTGAGGAAGGGGGAGTGGGGTTTGGTGGTGGCGGCATGGCGGCGGTGGGGTTTGGTGGTGGTGGCGTTCTCTCTCCTCTAATTCGTTGGTGCTGCGAGGATGAAGGTCATGTGGTTGCTTAA

mRNA sequence

ATGGCCGCTTCTGGAAAAAAGGTTCTTGCAATGTCTGGCAATGGTGAAAATGGGGATGATCCAATATCAAATACCCATTTTTCATTTGGTCAATTGTTAGACTTGGGTGTTATTAATCAATATGGGCAGTTGCTTGGTGAATATAGCAGATTGGATGAAGAACTCTGGCTTTCGAAACCCGAGCCTGATGATGTAGTGGATAATCTGGATAGAAGCGATGATGGGCCTTCAGAATTTGTAGACAAGACTGTCTGTGATGGAGGTATGGCAACTGTGAAAGAAAAGTTGACCTCCTTGAAGGATTGTTCTCAGAATCTGTCAAGCCCAATTTCAGTTTTCCAAACGAAGAGCTCATACTCAGATGACATAACTACAATATGTTCTCAAAGCAAATGCCTTCCACCCACTAAACACAAATTTTCAGGAAGCCTTCAGTTTTTGGGCACCACTATGTCTACTAGAGAGCAACGGTGTGGGCTTCAATCTGAATCAGAGTACTCTATGGAGTCCTTATCAGATGAAAATGAAGAGTCTATTCCATCCAAATCGGTACGGACTAGGAAAATTAGTGATTTACTTGACTTGGATTATGACCCCGCAATCAAGAAATGCATGCATTGCGAAACAACAGATACCCCGCAGTGGAGAACAGGGCCCTTAGGACGAGGAACCCTCTGTAATGCGTGTGGAGTTCAGTATAAACAGGGTAGGCTTTTTAGCGTCTACCGTCCTTTAAGGAGTCCGGCATTTGATCCTTCTTTACACTCCAATTTGCCCAAGAAAGTCCACGAGATGAGAAAAGGTAGCTCAAATGAAGGGGATCCAAATCCTCCTACTCCTAAACCCAAAAAAAAGAAAGTTGAGGAAATGAGGATGAGTAACAAAAGTGAGAGGGATCCAAGCCCGACTCCTAAACCCAAGAAAGTTGTGCCTACTCCTGAACTCAAGAAAGTTGAGGAGATGAGGATGGGTAACTATAGTGAGATGGCTCGGAACCTTTCTTTTGAACAAAGAAAAGCCGAGGAGATGAAGAAGGGTAACAATAGTGACATGGCCCGGGACCTTTCTTTTGAACAAAGAAAAGCCGAGGAGATGATCTTGGGGAACGAGAATGAAAGATATCAGAACCGTGCCCATAGAGCAAAGAAATTTGAGGAGATGAGAGAGAAGGCTAGCAATGTAGTTCAATCAGATTCTAGAAAGAGGAACCGGGAGTTTGATTCTTATCCGACTCCTTCAGAATTTGTAGACAAGACTGTCTGTGATGGAGGTATGGCAACTGTGAAAGAAAAGTTGACCTCCTTGAAGGATTGTTCTCAGAATCTGTCAAGCCCAATTTCAGTTTTCCAAACGAAGAGCTCATACTCAGATGACATAACTACAATATGTTCTCAAAGCAAATGCCTTCCACCCACTAAACACAAATTTTCAGGAAGCCTTCAGTTTTTGGGCACCACTATGTCTACTAGAGAGCAACGGTGTGGGCTTCAATCTGAATCAGAGTACTCTATGGAGTCCTTATCAGATGAAAATGAAGAGTCTATTCCATCCAAATCGGTACGGACTAGGAAAATTAGTGATTTACTTGACTTGGATTATGACCCCGCAATCAAGAAATGCATGCATTGCGAAACAACAGATACCCCGCAGTGGAGAACAGGGCCCTTAGGACGAGGAACCCTCTGTAATGCGTGTGGAGTTCAGTATAAACAGGGTAGGCTTTTTAGCGTCTACCGTCCTTTAAGGAGTCCGGCATTTGATCCTTCTTTACACTCCAATTTGCCCAAGAAAGTCCACGAGATGAGAAAAGGTAGCTCAAATGAAGGGGATCCAAATCCTCCTACTCCTAAACCCAAAAAAAAGAAAGTTGAGGAAATGAGGATGAGTAACAAAAGTGAGAGGGATCCAAGCCCGACTCCTAAACCCAAGAAAGTTGTGCCTACTCCTGAACTCAAGAAAGTTGAGGAGATGAGGATGGGTAACTATAGTGAGATGGCTCG
GAACCTTTCTTTTGAACAAAGAAAAGCCGAGGAGATGAAGAAGGGTAACAATAGTGACATGGCCCGGGACCTTTCTTTTGAACAAAGAAAAGCCGAGGAGATGATCTTGGGGAACGAGAATGAAAGATATCAGAACCGTGCCCATAGAGCAAAGAAATTTGAGGAGATGAGAGAGAAGGCTAGCAATGTAGTTCAATCAGATTCTAGAAAGAGGAACCGGGAGTTTGATTCTTATCCGACTCAATCTAGTCGCAATTATCGTAACGGAGGGGGAAGTGGGGGGTGGGCAACGACGCCGGTGAGGAAGGGGGAGTGGGGTTTGGTGGCGGCGGTAGTGGGGAGCAATGAGGAAGGTGGGGGAAGTGCAGGGAAGGGAAACACGGGTTTGGTGGTGGCGGCGGCAGTGGGGAGCAATGAGGAAGATGGGGGAAGTGCGGGGGAGGGAAACGTCATCGGTGAGGAAGGGGGAGTGGGGTTTGGTGGTGGCGGCATGGCGGCGGTGGGGTTTGGTGGTGGTGGCGTTCTCTCTCCTCTAATTCGTTGGTGCTGCGAGGATGAAGGTCATGTGGTTGCTTAA

Coding sequence (CDS)

ATGGCCGCTTCTGGAAAAAAGGTTCTTGCAATGTCTGGCAATGGTGAAAATGGGGATGATCCAATATCAAATACCCATTTTTCATTTGGTCAATTGTTAGACTTGGGTGTTATTAATCAATATGGGCAGTTGCTTGGTGAATATAGCAGATTGGATGAAGAACTCTGGCTTTCGAAACCCGAGCCTGATGATGTAGTGGATAATCTGGATAGAAGCGATGATGGGCCTTCAGAATTTGTAGACAAGACTGTCTGTGATGGAGGTATGGCAACTGTGAAAGAAAAGTTGACCTCCTTGAAGGATTGTTCTCAGAATCTGTCAAGCCCAATTTCAGTTTTCCAAACGAAGAGCTCATACTCAGATGACATAACTACAATATGTTCTCAAAGCAAATGCCTTCCACCCACTAAACACAAATTTTCAGGAAGCCTTCAGTTTTTGGGCACCACTATGTCTACTAGAGAGCAACGGTGTGGGCTTCAATCTGAATCAGAGTACTCTATGGAGTCCTTATCAGATGAAAATGAAGAGTCTATTCCATCCAAATCGGTACGGACTAGGAAAATTAGTGATTTACTTGACTTGGATTATGACCCCGCAATCAAGAAATGCATGCATTGCGAAACAACAGATACCCCGCAGTGGAGAACAGGGCCCTTAGGACGAGGAACCCTCTGTAATGCGTGTGGAGTTCAGTATAAACAGGGTAGGCTTTTTAGCGTCTACCGTCCTTTAAGGAGTCCGGCATTTGATCCTTCTTTACACTCCAATTTGCCCAAGAAAGTCCACGAGATGAGAAAAGGTAGCTCAAATGAAGGGGATCCAAATCCTCCTACTCCTAAACCCAAAAAAAAGAAAGTTGAGGAAATGAGGATGAGTAACAAAAGTGAGAGGGATCCAAGCCCGACTCCTAAACCCAAGAAAGTTGTGCCTACTCCTGAACTCAAGAAAGTTGAGGAGATGAGGATGGGTAACTATAGTGAGATGGCTCGGAACCTTTCTTTTGAACAAAGAAAAGCCGAGGAGATGAAGAAGGGTAACAATAGTGACATGGCCCGGGACCTTTCTTTTGAACAAAGAAAAGCCGAGGAGATGATCTTGGGGAACGAGAATGAAAGATATCAGAACCGTGCCCATAGAGCAAAGAAATTTGAGGAGATGAGAGAGAAGGCTAGCAATGTAGTTCAATCAGATTCTAGAAAGAGGAACCGGGAGTTTGATTCTTATCCGACTCCTTCAGAATTTGTAGACAAGACTGTCTGTGATGGAGGTATGGCAACTGTGAAAGAAAAGTTGACCTCCTTGAAGGATTGTTCTCAGAATCTGTCAAGCCCAATTTCAGTTTTCCAAACGAAGAGCTCATACTCAGATGACATAACTACAATATGTTCTCAAAGCAAATGCCTTCCACCCACTAAACACAAATTTTCAGGAAGCCTTCAGTTTTTGGGCACCACTATGTCTACTAGAGAGCAACGGTGTGGGCTTCAATCTGAATCAGAGTACTCTATGGAGTCCTTATCAGATGAAAATGAAGAGTCTATTCCATCCAAATCGGTACGGACTAGGAAAATTAGTGATTTACTTGACTTGGATTATGACCCCGCAATCAAGAAATGCATGCATTGCGAAACAACAGATACCCCGCAGTGGAGAACAGGGCCCTTAGGACGAGGAACCCTCTGTAATGCGTGTGGAGTTCAGTATAAACAGGGTAGGCTTTTTAGCGTCTACCGTCCTTTAAGGAGTCCGGCATTTGATCCTTCTTTACACTCCAATTTGCCCAAGAAAGTCCACGAGATGAGAAAAGGTAGCTCAAATGAAGGGGATCCAAATCCTCCTACTCCTAAACCCAAAAAAAAGAAAGTTGAGGAAATGAGGATGAGTAACAAAAGTGAGAGGGATCCAAGCCCGACTCCTAAACCCAAGAAAGTTGTGCCTACTCCTGAACTCAAGAAAGTTGAGGAGATGAGGATGGGTAACTATAGTGAGATGGCTCG
GAACCTTTCTTTTGAACAAAGAAAAGCCGAGGAGATGAAGAAGGGTAACAATAGTGACATGGCCCGGGACCTTTCTTTTGAACAAAGAAAAGCCGAGGAGATGATCTTGGGGAACGAGAATGAAAGATATCAGAACCGTGCCCATAGAGCAAAGAAATTTGAGGAGATGAGAGAGAAGGCTAGCAATGTAGTTCAATCAGATTCTAGAAAGAGGAACCGGGAGTTTGATTCTTATCCGACTCAATCTAGTCGCAATTATCGTAACGGAGGGGGAAGTGGGGGGTGGGCAACGACGCCGGTGAGGAAGGGGGAGTGGGGTTTGGTGGCGGCGGTAGTGGGGAGCAATGAGGAAGGTGGGGGAAGTGCAGGGAAGGGAAACACGGGTTTGGTGGTGGCGGCGGCAGTGGGGAGCAATGAGGAAGATGGGGGAAGTGCGGGGGAGGGAAACGTCATCGGTGAGGAAGGGGGAGTGGGGTTTGGTGGTGGCGGCATGGCGGCGGTGGGGTTTGGTGGTGGTGGCGTTCTCTCTCCTCTAATTCGTTGGTGCTGCGAGGATGAAGGTCATGTGGTTGCTTAA

Protein sequence

MAASGKKVLAMSGNGENGDDPISNTHFSFGQLLDLGVINQYGQLLGEYSRLDEELWLSKPEPDDVVDNLDRSDDGPSEFVDKTVCDGGMATVKEKLTSLKDCSQNLSSPISVFQTKSSYSDDITTICSQSKCLPPTKHKFSGSLQFLGTTMSTREQRCGLQSESEYSMESLSDENEESIPSKSVRTRKISDLLDLDYDPAIKKCMHCETTDTPQWRTGPLGRGTLCNACGVQYKQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEGDPNPPTPKPKKKKVEEMRMSNKSERDPSPTPKPKKVVPTPELKKVEEMRMGNYSEMARNLSFEQRKAEEMKKGNNSDMARDLSFEQRKAEEMILGNENERYQNRAHRAKKFEEMREKASNVVQSDSRKRNREFDSYPTPSEFVDKTVCDGGMATVKEKLTSLKDCSQNLSSPISVFQTKSSYSDDITTICSQSKCLPPTKHKFSGSLQFLGTTMSTREQRCGLQSESEYSMESLSDENEESIPSKSVRTRKISDLLDLDYDPAIKKCMHCETTDTPQWRTGPLGRGTLCNACGVQYKQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEGDPNPPTPKPKKKKVEEMRMSNKSERDPSPTPKPKKVVPTPELKKVEEMRMGNYSEMARNLSFEQRKAEEMKKGNNSDMARDLSFEQRKAEEMILGNENERYQNRAHRAKKFEEMREKASNVVQSDSRKRNREFDSYPTQSSRNYRNGGGSGGWATTPVRKGEWGLVAAVVGSNEEGGGSAGKGNTGLVVAAAVGSNEEDGGSAGEGNVIGEEGGVGFGGGGMAAVGFGGGGVLSPLIRWCCEDEGHVVA
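
The protein sequence above corresponds to the conceptual translation of the coding sequence. As a minimal sketch (not the database's own pipeline), the following Python snippet translates a CDS with the standard genetic code. The function name `translate` is illustrative, and the two test fragments are the first 30 and the last 18 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Codons containing ambiguous bases (for example N) are rendered as 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCCGCTTCTGGAAAAAAGGTTCTTGCA"))  # MAASGKKVLA
print(translate("GGTCATGTGGTTGCTTAA"))              # GHVVA
```

Both results agree with the start (MAASGKKVLA) and end (GHVVA) of the protein sequence shown in this record.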
Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
Spo28038.1      Spo28038.1     mRNA


Homology
BLAST of Spo28038.1 vs. NCBI nr
Match: gi|902234036|gb|KNA23303.1| (hypothetical protein SOVF_026070 [Spinacia oleracea])

HSP 1 Score: 800.4 bits (2066), Expect = 3.000e-228
Identity = 409/418 (97.85%), Positives = 411/418 (98.33%), Query Frame = 1

		  

Query: 1   MAASGKKVLAMSGNGENGDDPISNTHFSFGQLLDLGVINQYGQLLGEYSRLDEELWLSKP 60
           MAASGKKVLAMSGNGENGDDPISNTHFSFGQLLDLGVINQYGQLLGEYSRLDEELWLS+P
Sbjct: 1   MAASGKKVLAMSGNGENGDDPISNTHFSFGQLLDLGVINQYGQLLGEYSRLDEELWLSEP 60

Query: 61  EPDDVVDNLDRSDDG-------PSEFVDKTVCDGGMATVKEKLTSLKDCSQNLSSPISVF 120
           EPDDVVDNLDRSDDG       PSEFVDKTVCDGGMATVKEKLTSLKDCSQNLSSPISVF
Sbjct: 61  EPDDVVDNLDRSDDGVNHFATSPSEFVDKTVCDGGMATVKEKLTSLKDCSQNLSSPISVF 120

Query: 121 QTKSSYSDDITTICSQSKCLPPTKHKFSGSLQFLGTTMSTREQRCGLQSESEYSMESLSD 180
           QTKSSYSDDITTICSQSKCLPPTKHKFSGSLQFLGTTMSTREQRCGLQSESEYSMESLSD
Sbjct: 121 QTKSSYSDDITTICSQSKCLPPTKHKFSGSLQFLGTTMSTREQRCGLQSESEYSMESLSD 180

Query: 181 ENEESIPSKSVRTRKISDLLDLDYDPAIKKCMHCETTDTPQWRTGPLGRGTLCNACGVQY 240
           ENEESIPSKSVRTRKISDLLDLDYDPAIKKCMHCETTDTPQWRTGPLGRGTLCNACGVQY
Sbjct: 181 ENEESIPSKSVRTRKISDLLDLDYDPAIKKCMHCETTDTPQWRTGPLGRGTLCNACGVQY 240

Query: 241 KQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEGDPNPPTPKPKKKKVEEMRMS 300
           KQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEGDPNPPTPKPKKKKVEEMRMS
Sbjct: 241 KQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEGDPNPPTPKPKKKKVEEMRMS 300

Query: 301 NKSERDPSPTPKPKKVVPTPELKKVEEMRMGNYSEMARNLSFEQRKAEEMKKGNNSDMAR 360
           NKSERDPSPTPKPKKVVPTPELKKVEEMRMGNYSEMARNLSFEQRKAEEMKKGNNSDMAR
Sbjct: 301 NKSERDPSPTPKPKKVVPTPELKKVEEMRMGNYSEMARNLSFEQRKAEEMKKGNNSDMAR 360

Query: 361 DLSFEQRKAEEMILGNENERYQNRAHRAKKFEEMREKASNVVQSDSRKRNREFDSYPT 412
           DLSFEQRKA+EMILGNENERYQNRAHRAKKFEEMREKASNVVQSDSRKRNREFDSYPT
Sbjct: 361 DLSFEQRKADEMILGNENERYQNRAHRAKKFEEMREKASNVVQSDSRKRNREFDSYPT 418
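
The percentages in each HSP summary are the match counts divided by the alignment length. As a rough sketch, the Python snippet below recomputes them from the reported fractions; the helper name `check_hsp_fractions` is hypothetical, and the parsing assumes the "label = matches/length (percent%)" layout used in the summaries on this page.

```python
import re

def check_hsp_fractions(summary):
    """Recompute each 'label = matches/length (percent%)' entry in an HSP summary.

    Returns a list of (label, matches, length, reported, recomputed) tuples.
    """
    results = []
    for label, matches, length, reported in re.findall(
        r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)", summary
    ):
        recomputed = round(100.0 * int(matches) / int(length), 2)
        results.append((label, int(matches), int(length), float(reported), recomputed))
    return results

summary = "Identity = 409/418 (97.85%), Positives = 411/418 (98.33%), Query Frame = 1"
for label, matches, length, reported, recomputed in check_hsp_fractions(summary):
    print(label, f"{matches}/{length}", reported, recomputed)
```

For the first hit, 409 identical positions over an alignment of 418 columns gives 97.85%, matching the reported value.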

BLAST of Spo28038.1 vs. NCBI nr
Match: gi|731340716|ref|XP_010681549.1| (PREDICTED: uncharacterized protein LOC104896501 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 340.9 bits (873), Expect = 6.500e-90
Identity = 215/435 (49.43%), Positives = 265/435 (60.92%), Query Frame = 1

		  

Query: 1   MAASGKKVLAMSGNGENG---------DDPISNTHFSFGQLLDLGVINQYGQLLGEYSRL 60
           M AS  K  A+ G+             D+P        GQ L+LG++NQYGQL+GEYSRL
Sbjct: 1   MTASENKGFAIQGDFSGSSNLAHEIRDDNPRRINQILIGQGLNLGIVNQYGQLMGEYSRL 60

Query: 61  DEELWLSKPEPDDVVDNLDRSDD--------------GPSEFVDKTVCDGGMATVKEKLT 120
           D+ELWLSKPE ++ V+  +   +                SE+VD  VCD G++ VKEK +
Sbjct: 61  DDELWLSKPEDEEAVEKCNEGVELASVPFYENIVGQGQASEYVDDPVCDVGVS-VKEKAS 120

Query: 121 SLKDCSQNL---SSPISVFQTKSSYSDDITTICSQSKCLPPTKHKFSGSLQFLGTTMSTR 180
           SLK  SQ     SSPISV Q   S S+  TT+ S  K   PTK   SG  QF GTT S R
Sbjct: 121 SLKHYSQKHYKNSSPISVLQKGWSCSNVSTTVRSHRKRRQPTKRNSSGGFQFPGTTSSAR 180

Query: 181 EQRCGLQSESEYSMES-LSDENEESIPSKSVRTRKISDLLDLDYDPAIKKCMHCETTDTP 240
           E     QSESE SMES LSDE +E IP+K VRTRKIS+L++ D +PA KKCMHC TT+TP
Sbjct: 181 EHSWAFQSESENSMESSLSDETDEYIPAKQVRTRKISELINFDSNPAAKKCMHCGTTETP 240

Query: 241 QWRTGPLGRGTLCNACGVQYKQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEG 300
           QWRTGP+GR T+CNACGVQYK GRL+  YRPL+SP FDPSLHSNLPKK            
Sbjct: 241 QWRTGPMGRKTVCNACGVQYKHGRLYPDYRPLKSPTFDPSLHSNLPKK------------ 300

Query: 301 DPNPPTPKPKKKKVEEMRMSNKSERDPSPTPKPKKVVPTPELKKVEEMRMGNYSEMARNL 360
                        V+ MRM N SER P PTPKP         KKV+ ++ GN   + +NL
Sbjct: 301 -------------VQMMRMGNVSERVPHPTPKP---------KKVQWIKTGNNIPIGQNL 360

Query: 361 SFEQRKAEEMKKGNNSDMARDLSFEQRKAEEMILGNENERYQNRAHRAKKFEEMREKASN 409
           + E +K  E + G NS+M ++L+ E    + M  GN+NER QNRAHRAKKFEE REKAS+
Sbjct: 361 TSELKKL-ETRMGKNSEMGQNLTSEPENFDGMCFGNKNERDQNRAHRAKKFEERREKASD 398

BLAST of Spo28038.1 vs. NCBI nr
Match: gi|1021047816|gb|KZN05596.1| (hypothetical protein DCAR_006433 [Daucus carota subsp. sativus])

HSP 1 Score: 129.4 bits (324), Expect = 3.000e-26
Identity = 92/229 (40.17%), Positives = 112/229 (48.91%), Query Frame = 1

		  

Query: 77  SEFVDKTVCDGGMATVKEKLTSLKDCSQN---LSSPISVFQTKSSYSDDITTICSQSKCL 136
           S FV+ +   GGM   KE +    + SQN    SSP+SV ++ SS S      CS  K +
Sbjct: 88  STFVEDSFSGGGMTLNKENVPLNNNASQNQFRTSSPVSVLESSSSSS------CSDGKIV 147

Query: 137 P-----------------PTKHKFSGSLQFLGTTMSTREQRCGL------QSESEYSMES 196
           P                 P   +   ++QF+  T S      GL       SESE  +ES
Sbjct: 148 PLSPSHRGPQRARSKRPRPANFRPRSTMQFVTPTSSVTPDNSGLVNVPMISSESENFVES 207

Query: 197 LS-------DENEE------SIPSKSVRTRKISDLLDLDYDPAIKKCMHCETTDTPQWRT 256
           L         EN++      S PS S      S L       +++KC+HCE T TPQWR 
Sbjct: 208 LRIKIPKPVSENKKKQKQKPSFPSSSAEMNCSSPL---QQQASVRKCLHCEITKTPQWRA 267

Query: 257 GPLGRGTLCNACGVQYKQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMR 267
           GPLG  TLCNACGV+YK GRLF  YRP  SP F PSLHSN  KKV EMR
Sbjct: 268 GPLGPKTLCNACGVRYKSGRLFPEYRPAASPTFVPSLHSNSHKKVVEMR 307

BLAST of Spo28038.1 vs. NCBI nr
Match: gi|747074652|ref|XP_011084325.1| (PREDICTED: GATA transcription factor 8 [Sesamum indicum])

HSP 1 Score: 121.7 bits (304), Expect = 6.200e-24
Identity = 93/242 (38.43%), Positives = 116/242 (47.93%), Query Frame = 1

		  

Query: 413 SEFVDKTVCDGGMATVKEKLTSLKDCSQN---LSSPISVFQTKSSYSDDITTICSQSKCL 472
           S F++ +    GM   KE L    + S N    SSPISV ++ SS S   T+  S++K +
Sbjct: 90  STFMEDSFSGQGMTVGKENLCPEMEPSDNQFQTSSPISVLESSSSSS---TSSSSEAKAM 149

Query: 473 P-----------------PTKHKFSGSLQFLGTTMSTREQ-----RCGLQSESEYSMESL 532
           P                 P       ++Q +  T S  E        GL SESE   ES 
Sbjct: 150 PLSPSHRGPQRARSKRPRPATFNPRAAIQLISPTSSFIETPQPFVAPGLSSESENFAEST 209

Query: 533 SDENE------ESIPSKSVRTRKISDLLDLDYDP---AIKKCMHCETTDTPQWRTGPLGR 592
           + +        E    K ++     D  D   +P   A++KCMHCE T TPQWR GP+G 
Sbjct: 210 TMKKSPKPAGLEPKKKKKIKIASPVDAEDAHQNPPAQAVRKCMHCEITKTPQWRAGPMGP 269

Query: 593 GTLCNACGVQYKQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMR-KGSSNEGDPNPPTPK 620
            TLCNACGV+YK GRLF  YRP  SP F PSLHSN  KKV EMR K         P T K
Sbjct: 270 KTLCNACGVRYKSGRLFPEYRPAASPTFVPSLHSNSHKKVLEMRGKVGPGTATTTPTTTK 328

BLAST of Spo28038.1 vs. NCBI nr
Match: gi|590565104|ref|XP_007009854.1| (Plant-specific GATA-type zinc finger transcription factor family protein isoform 2 [Theobroma cacao])

HSP 1 Score: 121.3 bits (303), Expect = 8.100e-24
Identity = 91/237 (38.40%), Positives = 117/237 (49.37%), Query Frame = 1

		  

Query: 413 SEFVDKTVCDGGMATVKEKLTSL-KDCSQN---LSSPISVFQTKSSYSDDITTICSQSKC 472
           S FVD + C   +   KE+ +S+ KD SQ+    SSP+SV ++ SS S + T   S    
Sbjct: 91  SNFVDDSQCGASLTIKKEESSSITKDSSQHQFQTSSPVSVLESSSSCSGEKTLPRSPETA 150

Query: 473 LP------------PTKHKFSGSLQFLGTTMSTREQRCG-------LQSESE-YSMESLS 532
            P            PT      ++Q +  T S  E           + S+SE Y+   L 
Sbjct: 151 APGRRGRARSKRPRPTTFNPRPAIQLISPTSSVNENDIPQPFVVPKVPSDSENYAESRLL 210

Query: 533 DENEESIPSKSVRTRKISDLLDLDYDPA----------IKKCMHCETTDTPQWRTGPLGR 592
            +    +  +  + +KI   L L   PA          ++KCMHCE T TPQWR GP+G 
Sbjct: 211 IKIPRQVNPEHKKKKKIK--LSLPTAPADNNQNSSGQAVRKCMHCEITKTPQWRAGPMGP 270

Query: 593 GTLCNACGVQYKQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEGDPNPPT 616
            TLCNACGV+YK GRLF  YRP  SP F PSLHSN  KKV EMR    N+G   P T
Sbjct: 271 KTLCNACGVRYKSGRLFPEYRPAASPTFVPSLHSNSHKKVIEMR----NKGGAAPTT 321

BLAST of Spo28038.1 vs. UniProtKB/TrEMBL
Match: A0A0K9RV05_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_026070 PE=4 SV=1)

HSP 1 Score: 800.4 bits (2066), Expect = 2.100e-228
Identity = 409/418 (97.85%), Positives = 411/418 (98.33%), Query Frame = 1

		  

Query: 1   MAASGKKVLAMSGNGENGDDPISNTHFSFGQLLDLGVINQYGQLLGEYSRLDEELWLSKP 60
           MAASGKKVLAMSGNGENGDDPISNTHFSFGQLLDLGVINQYGQLLGEYSRLDEELWLS+P
Sbjct: 1   MAASGKKVLAMSGNGENGDDPISNTHFSFGQLLDLGVINQYGQLLGEYSRLDEELWLSEP 60

Query: 61  EPDDVVDNLDRSDDG-------PSEFVDKTVCDGGMATVKEKLTSLKDCSQNLSSPISVF 120
           EPDDVVDNLDRSDDG       PSEFVDKTVCDGGMATVKEKLTSLKDCSQNLSSPISVF
Sbjct: 61  EPDDVVDNLDRSDDGVNHFATSPSEFVDKTVCDGGMATVKEKLTSLKDCSQNLSSPISVF 120

Query: 121 QTKSSYSDDITTICSQSKCLPPTKHKFSGSLQFLGTTMSTREQRCGLQSESEYSMESLSD 180
           QTKSSYSDDITTICSQSKCLPPTKHKFSGSLQFLGTTMSTREQRCGLQSESEYSMESLSD
Sbjct: 121 QTKSSYSDDITTICSQSKCLPPTKHKFSGSLQFLGTTMSTREQRCGLQSESEYSMESLSD 180

Query: 181 ENEESIPSKSVRTRKISDLLDLDYDPAIKKCMHCETTDTPQWRTGPLGRGTLCNACGVQY 240
           ENEESIPSKSVRTRKISDLLDLDYDPAIKKCMHCETTDTPQWRTGPLGRGTLCNACGVQY
Sbjct: 181 ENEESIPSKSVRTRKISDLLDLDYDPAIKKCMHCETTDTPQWRTGPLGRGTLCNACGVQY 240

Query: 241 KQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEGDPNPPTPKPKKKKVEEMRMS 300
           KQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEGDPNPPTPKPKKKKVEEMRMS
Sbjct: 241 KQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEGDPNPPTPKPKKKKVEEMRMS 300

Query: 301 NKSERDPSPTPKPKKVVPTPELKKVEEMRMGNYSEMARNLSFEQRKAEEMKKGNNSDMAR 360
           NKSERDPSPTPKPKKVVPTPELKKVEEMRMGNYSEMARNLSFEQRKAEEMKKGNNSDMAR
Sbjct: 301 NKSERDPSPTPKPKKVVPTPELKKVEEMRMGNYSEMARNLSFEQRKAEEMKKGNNSDMAR 360

Query: 361 DLSFEQRKAEEMILGNENERYQNRAHRAKKFEEMREKASNVVQSDSRKRNREFDSYPT 412
           DLSFEQRKA+EMILGNENERYQNRAHRAKKFEEMREKASNVVQSDSRKRNREFDSYPT
Sbjct: 361 DLSFEQRKADEMILGNENERYQNRAHRAKKFEEMREKASNVVQSDSRKRNREFDSYPT 418

BLAST of Spo28038.1 vs. UniProtKB/TrEMBL
Match: A0A0J8EYH1_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_6g141930 PE=4 SV=1)

HSP 1 Score: 340.9 bits (873), Expect = 4.500e-90
Identity = 215/435 (49.43%), Positives = 265/435 (60.92%), Query Frame = 1

		  

Query: 1   MAASGKKVLAMSGNGENG---------DDPISNTHFSFGQLLDLGVINQYGQLLGEYSRL 60
           M AS  K  A+ G+             D+P        GQ L+LG++NQYGQL+GEYSRL
Sbjct: 1   MTASENKGFAIQGDFSGSSNLAHEIRDDNPRRINQILIGQGLNLGIVNQYGQLMGEYSRL 60

Query: 61  DEELWLSKPEPDDVVDNLDRSDD--------------GPSEFVDKTVCDGGMATVKEKLT 120
           D+ELWLSKPE ++ V+  +   +                SE+VD  VCD G++ VKEK +
Sbjct: 61  DDELWLSKPEDEEAVEKCNEGVELASVPFYENIVGQGQASEYVDDPVCDVGVS-VKEKAS 120

Query: 121 SLKDCSQNL---SSPISVFQTKSSYSDDITTICSQSKCLPPTKHKFSGSLQFLGTTMSTR 180
           SLK  SQ     SSPISV Q   S S+  TT+ S  K   PTK   SG  QF GTT S R
Sbjct: 121 SLKHYSQKHYKNSSPISVLQKGWSCSNVSTTVRSHRKRRQPTKRNSSGGFQFPGTTSSAR 180

Query: 181 EQRCGLQSESEYSMES-LSDENEESIPSKSVRTRKISDLLDLDYDPAIKKCMHCETTDTP 240
           E     QSESE SMES LSDE +E IP+K VRTRKIS+L++ D +PA KKCMHC TT+TP
Sbjct: 181 EHSWAFQSESENSMESSLSDETDEYIPAKQVRTRKISELINFDSNPAAKKCMHCGTTETP 240

Query: 241 QWRTGPLGRGTLCNACGVQYKQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEG 300
           QWRTGP+GR T+CNACGVQYK GRL+  YRPL+SP FDPSLHSNLPKK            
Sbjct: 241 QWRTGPMGRKTVCNACGVQYKHGRLYPDYRPLKSPTFDPSLHSNLPKK------------ 300

Query: 301 DPNPPTPKPKKKKVEEMRMSNKSERDPSPTPKPKKVVPTPELKKVEEMRMGNYSEMARNL 360
                        V+ MRM N SER P PTPKP         KKV+ ++ GN   + +NL
Sbjct: 301 -------------VQMMRMGNVSERVPHPTPKP---------KKVQWIKTGNNIPIGQNL 360

Query: 361 SFEQRKAEEMKKGNNSDMARDLSFEQRKAEEMILGNENERYQNRAHRAKKFEEMREKASN 409
           + E +K  E + G NS+M ++L+ E    + M  GN+NER QNRAHRAKKFEE REKAS+
Sbjct: 361 TSELKKL-ETRMGKNSEMGQNLTSEPENFDGMCFGNKNERDQNRAHRAKKFEERREKASD 398

BLAST of Spo28038.1 vs. UniProtKB/TrEMBL
Match: A0A0D3HED9_9ORYZ (Uncharacterized protein OS=Oryza barthii PE=4 SV=1)

HSP 1 Score: 129.8 bits (325), Expect = 1.600e-26
Identity = 134/463 (28.94%), Positives = 193/463 (41.68%), Query Frame = 1

		  

Query: 203 KCMHCETTDTPQWRTGPLGRGTLCNACGVQYKQGRLFSVYRPLRSPAFDPSLHSNLPKKV 262
           +C HC TT+TPQWR GP G  TLCNACG++Y+ G+L   YRPLRSP F P LHSN   +V
Sbjct: 66  QCRHCGTTETPQWRHGPEGHRTLCNACGMRYRSGKLVPEYRPLRSPTFSPELHSNRHHRV 125

Query: 263 HEMRKGSSNEGDPNPPTPKPK---KKKVEEMRMSNKSERDPSPT------PKPKKVVPTP 322
            ++R+    +     P    +   + K EE  ++  S +D   T      P P+ V   P
Sbjct: 126 LQLRRRPGPQSAAPSPAAVARCGGEAKEEEEELAWVSNKDAFATVETTMAPSPRVVETPP 185

Query: 323 ELK----KVEEMRMGNYSEMARNLSFEQRKAEEMKKGNN-----SDMARD---------- 382
           E       +       +S+  R +    R+ +E     N     + MAR           
Sbjct: 186 EHDHRPPNIPTTSPEPHSDRPRRVVQLPRRLQEPSASANLAHAVAAMARAGRECAHCGTT 245

Query: 383 ------LSFEQRKAEEMILGNENERYQ-NRAHRAKKFEEMREKASNVVQSDSRKRNREFD 442
                 L  + R+      GN+    Q N    ++  +E ++K+ +   S  RKR+    
Sbjct: 246 KTPAWRLGPDSRRKLCNACGNKYRSGQLNSTTFSQNSQEQKKKSKSSACSRERKRSAVAA 305

Query: 443 SYPTPSEFVDKTVC------DGGMATVKEKLTSLKDCSQNLSSPISVFQTKS------SY 502
           +        D          DGG         +L D +          + +       S 
Sbjct: 306 TVVVGGGLRDDAAAIADEHLDGGDLQALLDDVALDDVAARGGGDAGEAKEEEEELEWLSN 365

Query: 503 SDDITTICSQSKCLPPTKHKF-----SGSLQFLGTTMSTREQRCGLQSESEYSMESLSDE 562
            D   T+ + S  LP  + K          +  G+T +       L  E +   E+    
Sbjct: 366 KDAFPTVETMSPALPENRTKVPVPPAGWQCRHCGSTETP------LWRERDGPAEAEHVR 425

Query: 563 NEESIPSKSVRT--RKISDLL------DLDYDPAIKKCMHCETTDTPQWRTGPLGRGTLC 606
            EE+ P+ +  T  R+I DLL      +       ++C HC TT TP W +GP  RG LC
Sbjct: 426 KEETPPNITPATKHRRIVDLLRCSTALNTAATAVERRCTHCGTTKTPAWLSGPDSRGKLC 485

BLAST of Spo28038.1 vs. UniProtKB/TrEMBL
Match: A0A166DRT1_DAUCA (Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_006433 PE=4 SV=1)

HSP 1 Score: 129.4 bits (324), Expect = 2.100e-26
Identity = 92/229 (40.17%), Positives = 112/229 (48.91%), Query Frame = 1

		  

Query: 77  SEFVDKTVCDGGMATVKEKLTSLKDCSQN---LSSPISVFQTKSSYSDDITTICSQSKCL 136
           S FV+ +   GGM   KE +    + SQN    SSP+SV ++ SS S      CS  K +
Sbjct: 88  STFVEDSFSGGGMTLNKENVPLNNNASQNQFRTSSPVSVLESSSSSS------CSDGKIV 147

Query: 137 P-----------------PTKHKFSGSLQFLGTTMSTREQRCGL------QSESEYSMES 196
           P                 P   +   ++QF+  T S      GL       SESE  +ES
Sbjct: 148 PLSPSHRGPQRARSKRPRPANFRPRSTMQFVTPTSSVTPDNSGLVNVPMISSESENFVES 207

Query: 197 LS-------DENEE------SIPSKSVRTRKISDLLDLDYDPAIKKCMHCETTDTPQWRT 256
           L         EN++      S PS S      S L       +++KC+HCE T TPQWR 
Sbjct: 208 LRIKIPKPVSENKKKQKQKPSFPSSSAEMNCSSPL---QQQASVRKCLHCEITKTPQWRA 267

Query: 257 GPLGRGTLCNACGVQYKQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMR 267
           GPLG  TLCNACGV+YK GRLF  YRP  SP F PSLHSN  KKV EMR
Sbjct: 268 GPLGPKTLCNACGVRYKSGRLFPEYRPAASPTFVPSLHSNSHKKVVEMR 307

BLAST of Spo28038.1 vs. UniProtKB/TrEMBL
Match: A0A061FP83_THECC (Plant-specific GATA-type zinc finger transcription factor family protein isoform 1 OS=Theobroma cacao GN=TCM_043158 PE=4 SV=1)

HSP 1 Score: 121.3 bits (303), Expect = 5.700e-24
Identity = 91/237 (38.40%), Positives = 117/237 (49.37%), Query Frame = 1

		  

Query: 413 SEFVDKTVCDGGMATVKEKLTSL-KDCSQN---LSSPISVFQTKSSYSDDITTICSQSKC 472
           S FVD + C   +   KE+ +S+ KD SQ+    SSP+SV ++ SS S + T   S    
Sbjct: 164 SNFVDDSQCGASLTIKKEESSSITKDSSQHQFQTSSPVSVLESSSSCSGEKTLPRSPETA 223

Query: 473 LP------------PTKHKFSGSLQFLGTTMSTREQRCG-------LQSESE-YSMESLS 532
            P            PT      ++Q +  T S  E           + S+SE Y+   L 
Sbjct: 224 APGRRGRARSKRPRPTTFNPRPAIQLISPTSSVNENDIPQPFVVPKVPSDSENYAESRLL 283

Query: 533 DENEESIPSKSVRTRKISDLLDLDYDPA----------IKKCMHCETTDTPQWRTGPLGR 592
            +    +  +  + +KI   L L   PA          ++KCMHCE T TPQWR GP+G 
Sbjct: 284 IKIPRQVNPEHKKKKKIK--LSLPTAPADNNQNSSGQAVRKCMHCEITKTPQWRAGPMGP 343

Query: 593 GTLCNACGVQYKQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEGDPNPPT 616
            TLCNACGV+YK GRLF  YRP  SP F PSLHSN  KKV EMR    N+G   P T
Sbjct: 344 KTLCNACGVRYKSGRLFPEYRPAASPTFVPSLHSNSHKKVIEMR----NKGGAAPTT 394

BLAST of Spo28038.1 vs. ExPASy Swiss-Prot
Match: GATA8_ARATH (GATA transcription factor 8 OS=Arabidopsis thaliana GN=GATA8 PE=2 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 4.500e-22
Identity = 80/220 (36.36%), Positives = 107/220 (48.64%), Query Frame = 1

		  

Query: 412 PSEFVDKTVCDGGMATVKEKLTSLKDCSQ-NLSSPISVFQTKSSYSDDITTICSQSKCLP 471
           P  FV++T+ +    +      S    SQ   SSP+SV ++ SS S    T    +  + 
Sbjct: 86  PPSFVEETLVEKKEDSFSTNTDSSSSHSQFRSSSPVSVLESSSSSSQTTNT----TSLVL 145

Query: 472 PTKHKFSGSLQFLGTTMS---TREQRCGLQSES--EYSMESLSDENEESIPSKSVRTRKI 531
           P KH    + +           ++  CG  S        + LSD N+  I  K  +  KI
Sbjct: 146 PGKHGRPRTKRPRPPVQDKDRVKDNVCGGDSRLIIRIPKQFLSDHNK-MINKKKKKKAKI 205

Query: 532 -----SDLLDLDYDP-----------AIKKCMHCETTDTPQWRTGPLGRGTLCNACGVQY 591
                S  +DL+ +             ++KCMHCE T TPQWR GP+G  TLCNACGV+Y
Sbjct: 206 TSSSSSSGIDLEVNGNNVDSYSSEQYPLRKCMHCEVTKTPQWRLGPMGPKTLCNACGVRY 265

Query: 592 KQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEG 610
           K GRLF  YRP  SP F P+LHSN  KKV EMR    ++G
Sbjct: 266 KSGRLFPEYRPAASPTFTPALHSNSHKKVAEMRNKRCSDG 300

BLAST of Spo28038.1 vs. ExPASy Swiss-Prot
Match: GAT11_ARATH (GATA transcription factor 11 OS=Arabidopsis thaliana GN=GATA11 PE=2 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 5.400e-20
Identity = 46/90 (51.11%), Positives = 60/90 (66.67%), Query Frame = 1

		  

Query: 186 TRKISDLLDL-DYDPAIKKCMHCETTDTPQWRTGPLGRGTLCNACGVQYKQGRLFSVYRP 245
           TR +S  L+  + D  ++KC HCETT TPQWR GP G  TLCNACGV+++ GRL   YRP
Sbjct: 203 TRTVSSTLEASNSDGIVRKCTHCETTKTPQWREGPSGPKTLCNACGVRFRSGRLVPEYRP 262

Query: 246 LRSPAFDPSLHSNLPKKVHEMRKGSSNEGD 275
             SP F P++HSN  +K+ EMR+    + D
Sbjct: 263 ASSPTFIPAVHSNSHRKIIEMRRKDDEQFD 292

BLAST of Spo28038.1 vs. ExPASy Swiss-Prot
Match: GAT13_ARATH (Putative GATA transcription factor 13 OS=Arabidopsis thaliana GN=GATA13 PE=3 SV=2)

HSP 1 Score: 96.7 bits (239), Expect = 1.300e-18
Identity = 52/110 (47.27%), Positives = 68/110 (61.82%), Query Frame = 1

		  

Query: 165 EYSMESLSDENEESIPSKSVRTRKISDLLDLDYDPAIKKCMHCETTDTPQWRTGPLGRGT 224
           E S +   DE+E S  S +++ RK +    L       KC HCETT TPQWR GP GR T
Sbjct: 162 EMSQQFAPDESESSEIS-ALKKRKKNKSRRL-------KCTHCETTTTPQWREGPNGRKT 221

Query: 225 LCNACGVQYKQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEGD 275
           LCNACG++++ GRL   YRP  SP F P++HSNL KK+  MR   +++ D
Sbjct: 222 LCNACGIRFRSGRLVLEYRPAASPTFIPTVHSNLHKKIIYMRMKDNDQFD 263

BLAST of Spo28038.1 vs. ExPASy Swiss-Prot
Match: GATA9_ARATH (GATA transcription factor 9 OS=Arabidopsis thaliana GN=GATA9 PE=2 SV=1)

HSP 1 Score: 96.7 bits (239), Expect = 1.300e-18
Identity = 48/102 (47.06%), Positives = 63/102 (61.76%), Query Frame = 1

		  

Query: 170 SLSDENEESIPSKSVRTRKISDLLDLDYDPAI----KKCMHCETTDTPQWRTGPLGRGTL 229
           SL+D +E +   K  R ++     D+D D       ++C+HC T  TPQWRTGP+G  TL
Sbjct: 161 SLADSDETNPKKKQRRVKEQDFAGDMDVDCGESGGGRRCLHCATEKTPQWRTGPMGPKTL 220

Query: 230 CNACGVQYKQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRK 268
           CNACGV+YK GRL   YRP  SP F  + HSN  +KV E+R+
Sbjct: 221 CNACGVRYKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRR 262

BLAST of Spo28038.1 vs. ExPASy Swiss-Prot
Match: GAT10_ARATH (GATA transcription factor 10 OS=Arabidopsis thaliana GN=GATA10 PE=2 SV=1)

HSP 1 Score: 96.3 bits (238), Expect = 1.800e-18
Identity = 42/70 (60.00%), Positives = 49/70 (70.00%), Query Frame = 1

		  

Query: 198 DPAIKKCMHCETTDTPQWRTGPLGRGTLCNACGVQYKQGRLFSVYRPLRSPAFDPSLHSN 257
           D  ++ C HCET  TPQWR GP G  TLCNACGV++K GRL   YRP  SP F PS+HSN
Sbjct: 214 DGIVRICTHCETITTPQWRQGPSGPKTLCNACGVRFKSGRLVPEYRPASSPTFIPSVHSN 273

Query: 258 LPKKVHEMRK 268
             +K+ EMRK
Sbjct: 274 SHRKIIEMRK 283

BLAST of Spo28038.1 vs. TAIR (Arabidopsis)
Match: AT3G54810.2 (Plant-specific GATA-type zinc finger transcription factor family protein)

HSP 1 Score: 108.2 bits (269), Expect = 2.500e-23
Identity = 80/220 (36.36%), Positives = 107/220 (48.64%), Query Frame = 1

		  

Query: 412 PSEFVDKTVCDGGMATVKEKLTSLKDCSQ-NLSSPISVFQTKSSYSDDITTICSQSKCLP 471
           P  FV++T+ +    +      S    SQ   SSP+SV ++ SS S    T    +  + 
Sbjct: 86  PPSFVEETLVEKKEDSFSTNTDSSSSHSQFRSSSPVSVLESSSSSSQTTNT----TSLVL 145

Query: 472 PTKHKFSGSLQFLGTTMS---TREQRCGLQSES--EYSMESLSDENEESIPSKSVRTRKI 531
           P KH    + +           ++  CG  S        + LSD N+  I  K  +  KI
Sbjct: 146 PGKHGRPRTKRPRPPVQDKDRVKDNVCGGDSRLIIRIPKQFLSDHNK-MINKKKKKKAKI 205

Query: 532 -----SDLLDLDYDP-----------AIKKCMHCETTDTPQWRTGPLGRGTLCNACGVQY 591
                S  +DL+ +             ++KCMHCE T TPQWR GP+G  TLCNACGV+Y
Sbjct: 206 TSSSSSSGIDLEVNGNNVDSYSSEQYPLRKCMHCEVTKTPQWRLGPMGPKTLCNACGVRY 265

Query: 592 KQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEG 610
           K GRLF  YRP  SP F P+LHSN  KKV EMR    ++G
Sbjct: 266 KSGRLFPEYRPAASPTFTPALHSNSHKKVAEMRNKRCSDG 300

BLAST of Spo28038.1 vs. TAIR (Arabidopsis)
Match: AT1G08010.1 (GATA transcription factor 11)

HSP 1 Score: 101.3 bits (251), Expect = 3.100e-21
Identity = 46/90 (51.11%), Positives = 60/90 (66.67%), Query Frame = 1

		  

Query: 186 TRKISDLLDL-DYDPAIKKCMHCETTDTPQWRTGPLGRGTLCNACGVQYKQGRLFSVYRP 245
           TR +S  L+  + D  ++KC HCETT TPQWR GP G  TLCNACGV+++ GRL   YRP
Sbjct: 203 TRTVSSTLEASNSDGIVRKCTHCETTKTPQWREGPSGPKTLCNACGVRFRSGRLVPEYRP 262

Query: 246 LRSPAFDPSLHSNLPKKVHEMRKGSSNEGD 275
             SP F P++HSN  +K+ EMR+    + D
Sbjct: 263 ASSPTFIPAVHSNSHRKIIEMRRKDDEQFD 292

BLAST of Spo28038.1 vs. TAIR (Arabidopsis)
Match: AT2G28340.1 (GATA transcription factor 13)

HSP 1 Score: 96.7 bits (239), Expect = 7.600e-20
Identity = 52/110 (47.27%), Positives = 68/110 (61.82%), Query Frame = 1

		  

Query: 165 EYSMESLSDENEESIPSKSVRTRKISDLLDLDYDPAIKKCMHCETTDTPQWRTGPLGRGT 224
           E S +   DE+E S  S +++ RK +    L       KC HCETT TPQWR GP GR T
Sbjct: 162 EMSQQFAPDESESSEIS-ALKKRKKNKSRRL-------KCTHCETTTTPQWREGPNGRKT 221

Query: 225 LCNACGVQYKQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRKGSSNEGD 275
           LCNACG++++ GRL   YRP  SP F P++HSNL KK+  MR   +++ D
Sbjct: 222 LCNACGIRFRSGRLVLEYRPAASPTFIPTVHSNLHKKIIYMRMKDNDQFD 263

BLAST of Spo28038.1 vs. TAIR (Arabidopsis)
Match: AT4G32890.1 (GATA transcription factor 9)

HSP 1 Score: 96.7 bits (239), Expect = 7.600e-20
Identity = 48/102 (47.06%), Positives = 63/102 (61.76%), Query Frame = 1

		  

Query: 170 SLSDENEESIPSKSVRTRKISDLLDLDYDPAI----KKCMHCETTDTPQWRTGPLGRGTL 229
           SL+D +E +   K  R ++     D+D D       ++C+HC T  TPQWRTGP+G  TL
Sbjct: 161 SLADSDETNPKKKQRRVKEQDFAGDMDVDCGESGGGRRCLHCATEKTPQWRTGPMGPKTL 220

Query: 230 CNACGVQYKQGRLFSVYRPLRSPAFDPSLHSNLPKKVHEMRK 268
           CNACGV+YK GRL   YRP  SP F  + HSN  +KV E+R+
Sbjct: 221 CNACGVRYKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRR 262

BLAST of Spo28038.1 vs. TAIR (Arabidopsis)
Match: AT1G08000.1 (GATA transcription factor 10)

HSP 1 Score: 96.3 bits (238), Expect = 9.900e-20
Identity = 42/70 (60.00%), Positives = 49/70 (70.00%), Query Frame = 1

		  

Query: 198 DPAIKKCMHCETTDTPQWRTGPLGRGTLCNACGVQYKQGRLFSVYRPLRSPAFDPSLHSN 257
           D  ++ C HCET  TPQWR GP G  TLCNACGV++K GRL   YRP  SP F PS+HSN
Sbjct: 214 DGIVRICTHCETITTPQWRQGPSGPKTLCNACGVRFKSGRLVPEYRPASSPTFIPSVHSN 273

Query: 258 LPKKVHEMRK 268
             +K+ EMRK
Sbjct: 274 SHRKIIEMRK 283
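
Each HSP block above carries a statistics line of the form "Identity = a/b (x%), ... Query Frame = n". The following is a minimal sketch, assuming Python and assuming the line layout exactly as it appears on this page, of how those counts can be parsed and the percentages recomputed as a consistency check. The function name `parse_hsp_stats` and the dictionary keys are hypothetical and are not part of any BLAST tool.

```python
import re

# Pattern for the per-HSP statistics line as printed on this page.
# "Posi?tives" also accepts the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Query Frame = (\d+)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages, or None if the line does not match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, length, ident_pct, pos, pos_len, pos_pct, frame = m.groups()
    return {
        "identical": int(ident),
        "aligned_length": int(length),
        # Percent identity = identical positions / alignment length (gaps included).
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "reported_identity_pct": float(ident_pct),
        "positives": int(pos),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_positives_pct": float(pos_pct),
        "frame": int(frame),
    }


if __name__ == "__main__":
    example = "Identity = 215/435 (49.43%), Positives = 265/435 (60.92%), Query Frame = 1"
    print(parse_hsp_stats(example))
```

Comparing `identity_pct` with `reported_identity_pct` for each block is a quick way to spot a statistics line that was damaged during export.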

The following BLAST results are available for this feature:
BLAST of Spo28038.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match Name                          E-value   Identity (%)  Description
gi|902234036|gb|KNA23303.1|         3.0e-228  97.8          hypothetical protein SOVF_0260...
gi|731340716|ref|XP_010681549.1|    6.5e-90   49.4          PREDICTED: uncharacterized pro...
gi|1021047816|gb|KZN05596.1|        3.0e-26   40.1          hypothetical protein DCAR_0064...
gi|747074652|ref|XP_011084325.1|    6.2e-24   38.4          PREDICTED: GATA transcription ...
gi|590565104|ref|XP_007009854.1|    8.1e-24   38.4          Plant-specific GATA-type zinc ...
BLAST of Spo28038.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniprotKB/TrEMBL)
Total hits: 5
Match Name         E-value   Identity (%)  Description
A0A0K9RV05_SPIOL   2.1e-228  97.8          Uncharacterized protein OS=Spi...
A0A0J8EYH1_BETVU   4.5e-90   49.4          Uncharacterized protein OS=Bet...
A0A0D3HED9_9ORYZ   1.6e-26   28.9          Uncharacterized protein OS=Ory...
A0A166DRT1_DAUCA   2.1e-26   40.1          Uncharacterized protein OS=Dau...
A0A061FP83_THECC   5.7e-24   38.4          Plant-specific GATA-type zinc ...
BLAST of Spo28038.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy SwissProt)
Total hits: 5
Match Name    E-value  Identity (%)  Description
GATA8_ARATH   4.5e-22  36.3          GATA transcription factor 8 OS...
GAT11_ARATH   5.4e-20  51.1          GATA transcription factor 11 O...
GAT13_ARATH   1.3e-18  47.2          Putative GATA transcription fa...
GATA9_ARATH   1.3e-18  47.0          GATA transcription factor 9 OS...
GAT10_ARATH   1.8e-18  60.0          GATA transcription factor 10 O...
BLAST of Spo28038.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 5
Match Name    E-value  Identity (%)  Description
AT3G54810.2   2.5e-23  36.3          Plant-specific GATA-type zinc ...
AT1G08010.1   3.1e-21  51.1          GATA transcription factor 11
AT2G28340.1   7.6e-20  47.2          GATA transcription factor 13
AT4G32890.1   7.6e-20  47.0          GATA transcription factor 9
AT1G08000.1   9.9e-20  60.0          GATA transcription factor 10
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR Term   IPR Description             Source   Source Term      Source Description                                 Alignment
IPR000679  Zinc finger, GATA-type      PFAM     PF00320          GATA                                               coord: 204..237, score: 7.9E-14; coord: 540..573, score: 7.9
IPR000679  Zinc finger, GATA-type      SMART    SM00401          GATA_3                                             coord: 198..252, score: 2.7E-14; coord: 534..588, score: 2.7
IPR000679  Zinc finger, GATA-type      PROSITE  PS00344          GATA_ZN_FINGER_1                                   coord: 540..565, score: -; coord: 204..229, scor
IPR000679  Zinc finger, GATA-type      PROFILE  PS50114          GATA_ZN_FINGER_2                                   coord: 202..234, score: 11.464; coord: 538..570, score: 11
IPR013088  Zinc finger, NHR/GATA-type  GENE3D   3.30.50.10       -                                                  coord: 203..236, score: 3.1E-13; coord: 539..572, score: 3.1
None       No IPR available            PANTHER  PTHR10071        TRANSCRIPTION FACTOR GATA GATA BINDING FACTOR      coord: 522..603, score: 6.3E-42; coord: 202..234, score: 6.3
None       No IPR available            PANTHER  PTHR10071:SF200  SUBFAMILY NOT NAMED                                coord: 522..603, score: 6.3E-42; coord: 202..234, score: 6.3
None       No IPR available            unknown  SSF57716         Glucocorticoid receptor-like (DNA-binding domain)  coord: 538..592, score: 1.43E-12; coord: 202..256, score: 1.43

GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0006468 protein phosphorylation
biological_process GO:0009069 serine family amino acid metabolic process
cellular_component GO:0005667 transcription factor complex
molecular_function GO:0003700 DNA-binding transcription factor activity
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0005524 ATP binding
molecular_function GO:0005509 calcium ion binding
molecular_function GO:0004674 protein serine/threonine kinase activity
RNA-Seq Expression
   



Co-expression
Gene      r value
Spo15700  0.70