Spo04041.1 (mRNA)

Overview
NameSpo04041.1
TypemRNA
OrganismSpinacia oleracea (Spinach)
DescriptionPoly(A) polymerase
LocationSpoScf_01559 : 73544 .. 83229 (+)
Sequence length2761
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTAGCTCGGGAGCGGCCAATCAACATAACGGGCAACGACTGGGCATAACCGAGCCAATATCGTTTGGAGGACCAATGGAATATGATGTGACCAAAACCCACGAACTTGAAAAGGTCAATTCAATATTATTTTTTTGTTTAGTTTGTTGCTTTTGTTGATCATCATTTTGTGGATTGCTCGAAGCCAGACTTAAGTGAGAATCGTGGTTTTACAGTTTTTAGAAAATGTGGGTTTGTATGAGCATCAGGAAGAAGCCGTGAGAAGGGAGGAAGTGCTTGGGAGATTAGACCAGGTTACGACGCTTCTTATTGTATTTTGTATGTTTGTCTTGTTTTTTTGAATTGCAATGCTGTTTTGAGCAGAGATTAACAACATCTTGGGTGTTTCATTGTTGGGTTTTGCATAGATTGTAAAGCTGTGGGTGAAAACAATTAGCCGAGCCAAGGGTTTGAATGAGCAATTGGTACAGGAAGCAAATGCAAAGATTTTCACGTTTGGGTCGTATCGCCTTGGGGTATGAAACCATATTTAGTTTTTTTAGTTTTAGTTTTTGGAGTTAGTTCTTCCTGATCTTTCTCATGAAGAATTCCTACTCTTTTGAATGAAATTCATCATATCTGCTTCCATTGGAGAATTTAGTTATGTTGAACGCCTAAAGGGGCCTCCATTTATCCTTGCTTAAATGTTACTGACTCTTGAATATATGGAAGGCCTTAACTGGGAAATATTGAAATAACTCGTTCTTTTGTTATGAAGCTACCATCCAATCGTATCAGTAGACTGTTAAGATGATAATTATAAGAGCTGACAAACGTGTTGTGTTGGGTCGTGTTCGTGTTCATGTTGAATATAAACGGGTTAAGAAACCTGAACACTAACCCGACTTTTTAAAGTAAACGTGTTGTGTCGTATTGACCTGTTTTATTAAACGTGTCATAAACCTTTGACACTAACCCGCTTTTTTCATGTCTGGTTCATGTCATGTCCGACTTTGTCAGCTCTAATAATTATGTTACTGGAAAGCCAATTAATGAGTTGCAAGCGTCACAAGGGTATTGTAGTTAGCTTCCTAGTTTGTTCAATGTATGATATTTAATTTTGTGTTTCTAGATCATGATCGAGTTTCTAAGTCAGGTGAGCAGTGCTGTATGCTTCTTCTGTGGAGTCTTCTAAGTTTTAACTGCAATGCGCTCAAGTTGCGATTCTCTAATAGGCTTGTCCTGCGGTTCAGGTGGATGACAAGTAGTAGAGTTCGTTATGGTTTGTCCCCATTTGTGCAAATTGCATTAACTATACTCTTTTTATAATCCAAGTATCATGAGGTGTAGATGATAATATAGCAAATAGAAAAATTGACCAACATTCAGTGAAAAATAAGTACTACTTATGACAAATGAAGTAGTAATGGTCGCTGAATTAATATCGTTGAGACAACCTTTCTGTGTAAGCTTGAAATGGAGGGAAAAGACTAGGTCAGTAACTCGGGTTTGATTCAAATTATAAATGTAGTCGTAACTGTTAACAAGCTCTGTTGTGCTACAGATGACTTAGGTGAACTGGTATTTTTTCCAGCAATATTTTCTTTTAAGCTTGCAATTTTGGCTTCTGAATGTTTTAAGATGGCTAAGTAGATTGTTTAGGCTCTTTTTTAGCGTGCACAGAGTATGCATGTGCTGATTGCTAAAGAGTGAAAACAAACAGCCACCTCTAATTGCAAGTCTCTATTGCCACTGAGGTACCCCAGATGACTTATTCGGAAGGCAAGATTTTATAAGTTTATCTTCAGGTCTGAGTGTGTGTACTTTGGCATTGAGTAGGATCAACACTAAGACTTGTTCGAAAGGAGAAGACAAAAGATACCCATTTTGCCGGATATGTGTATGCCGTCTTACCAATAGTGGGTTTTGGTCGCATCTCTTTCTTCATTGTGAGATGGCAAGTAATTTTTGAAGGAGTGTGAAGGAGGTTATGTGATATGTGTTGGGACTACTGGGTTTTCTCTTGTTATATTTGGGATCTGCTGCAGGTTCTGTTTGTAGGCTTTGGAAGGAAAAAGGAGAAGATGTTGTGGCATAATGTTTTTATGGCTTTATTGTGATGTCTGTGGTTAGAATAGAATGGGAGAATCTTCTCGGTAACTTCTGTAACTATTGATTTGCTGTGGGATGGAATAAATTTAGCTTCTTTTTGGGCTATTCTAAGGCAGTGGGGAAATTCAAAGAAGCCTGGCTATTCTGCAATCTGTTTTGTTTGGCTGTTTAGAGGACAACTTGTCCTCTGTTCTTGTACCTTCTGCTGCCTGCTCATACATCGGAAATTTTCTTCATCAAAAAAACTAAAACGAGATTTTGTGAAAGAGACAAGGTTGTGTTTAGAAAAGGGAAACACATGACAGTTTAAACAAACTGGGCCAGTCTGTTATGTCTAACTGTATAAAGGACAACTTGTCTTCTGTACCTGACTACTTGTACTTTGTGCTGTCTGTTCATATATTTATATAAGTTCTTTATCTAAAGAAATAAAAAAATTGGTGAATCATTTGACCTGTGGAATTAGGCTAAAGTCGTGTATAGAAAAGGAAATCACATGGTTTTCTACTGTATAAACAACCTTGTAATCTTTTCTTGTTCGAAGTAATCTACTATATCTTTCTTCCCAAACTCCAATAGCACCTTCTTGTCTATGCTCCAAGTTGGATGAAATCAAAGATAGTATTCTTGGAATGTCCTAAGAAGTATAATTAGTGTTTGGTAGCTTGTGCTTTTTCTTGGAATTTACATATCAATTATAATTATATATATTTTTCAAGCATGTTTTTAGTGTAAAAGTGTCAACTTTGCTGAAATTTCATAAAATTTTATTGTCATTGTCGTTAGGTGCATGGTCCTGGAGCTGATATAGATACCCTGTGTGTCGGCCCTAGACACGCGACACGAGAGGTAAGATGTAACGTTTATCTGCTTGTATATCTTGTTTCCTTTTTGAGTGCCCTTTCACTTGTTATACTTTTGCTGGATGTAGGATGATTTCTTTGGAGAACTCCACAGGATGCTTTCTGAGATGCCAGAAGTTACAGAGTTGCACCCTGTGCCTGATGCTTTTGTGCCAGTGATGAGATTTAAGTTTAATGGGGTGTCCATAGATCTTCTTTATGCAAAATTGTCGCTCTGGGTGATTCCTGAAGTGAGTAGAATCCCTATATAAATGCTGTTATCTCATTTTCATTTTATATTCACGAACTTGGTTTCTTTTTGGATTTTGTGTTTCTGGGGAGTAATCGGATAACAATGTGTTCTTGCTCATTGCATACTCCATGCGCTTGAGGCTCACTGAAGAACTGCAAGTTTGATACTAGTCAATGTCATGTACTCCTAGTAAAATGTGTCTTAGCTGGAAGGCTAGTATATGTTAATATGTGATAACATATGTAGTATAGAAGGTGAAGTAGCTGACCCCAAATTTTTGCTGGATGCATACATTATCACCGTTGCTTCAAAAGTCACACTTCTAGCCGAAGAAGACTTATACATCTGATGTAACTATGATTATGGCCTTAAAAGTATCTTCTTTTGGAGTCAAAGTTCTTCATTGTTTTGTGACGGCCTCGCAGGGTTGGGTCTTTACAGTTAAAAGTTCCTGAAATACCTATTCCCAACTTATTCCAAACTTTTATTTGTCAGTTTGAGGAAGGTTTTAGCAATTTTAACAGTTTACAGTGCGAGAATAGAACTAATCTAATATTATATTTTTTTCCTTTCATCTTATTGCTGAAGTCTCTCAACAATCAGGAATTTTCCTGTTCAATGTTGAAATATATGCTTTTCTTATTTGATTCGTATGTGCTTTGTATCTTATAATGTAGGACTTGGATGTCTCCCAAGATTCAATTTTGCAAAATGCAGACGAGCAGACTGTTCGCAGTCTTAATGGTTGTAGAGTTACAGATCAAATTTTGCGCTTGGTGCCAAATATACAGGTCTGCTATTGCTTGTAAAGTCTGATTTACAATGTCTAAGTTGTTTTTTTTCAAATGGGTGCTCATGATGGTGAAGGAGCCAAATATTCTTGTGACGTTTTGGAGTTGAGTAACCTCTGCAAAGTTGTGTTTGTTTTAAAAATAACTATATTTGCTAAACTTTAACCTCGTGGTGCAGAATTTCCGTACTACACTTAGATGCATGAGATTTTGGGCAAAGCGCCGTGGTGTCTATTCAAATGTAAGTATGAGTTATTTCCTCGTTTTATTTAAAATATATCAAATTAACGTTTTAATAGAGGTTCCTATAACTGATTTTCATTACTATCGGTAGTTGCTCCCCTTTTTATGGGATCTGTGGGTATTGACCGTCACGACTGTTTCATTGTCAGGTGGCTGGGTTTCTTGGTGGTATTAATTGGGCACTACTAGTCGCTCGTATTTGTCAGTTGTACCCCAATGCATTACCCAATATGTTAGTCTCTCGCTTCTTTAGGGTTTTTACACAATGGAGGTGGCCCAACCCTGTCATGCTCTGTGATATTGAAGAGGGATCACTTGGTCTGCAAGTTTGGGATCCGAGGAGGAATCCCAAGGATAGATACCATTTGATGCCTATAATAACTCCTGCGTATCCATCCATGAACTCTAGTTATAATGTCTCATCAAGTACTCTGCGAATTATGACAGAAGAGTTCCAGAGGGGACATGAAATTTGTGAGGTATTGTTTCCTGAATAATGTTTATGCCAGGTTTGCTCGTGTATGACTACCTCTTTCTATAGTGTGATGTATCTTTTTCCCTCTTGCCCTTTTAGAATGACCTCGGGGAGGGGAGGATTGTGTATACTGTTGCACTCCACTAGGAGGATCTGCTTTGTGTTTTATGTTAGTATTATTCTTCTATATATTTTGATCAATTATTTGTGATGTAGTGCATATGACAACTTACTACCGGGTTATGAATTCTGCATTTGACTGGTCAATAACTTACTTCGGTTAGTCCATACTTTTTTGTGTATTGGTGACTTCCTATAGCAGAGGTAACAAGTAGCCAAAAATGGTAGTGTTATATGATGGCACGAAAGACAACGAGAATACACAAGAGGATGGATTATGTGGACTAATTGAAGGTGTTGCAATGAACAGAGAATAGAACATTGAGTAAATAGACAAAAAAAGGAAAATGAGAGGAAGGAGGGGAAAGCGCAAATCATTAAAGTCGTGTGTTGTGCATGTGACACTTTCTTTACTTGCTATAACAGGTTACCTTTGTCAAAGGACCACTCTCATAAAATACATCAAAAAGTATGGACCAACCTTAGTTAATTAGTAGTTTGGATTGTTTAAAGAAGATTGTCCATAGTTGGATTGTTCATATCAACTTTTTTCTTTATCGTTGATCATTTATATAACTAGTCCTACATGCAAATGTGTGCTTAAATGTATGCAGTATGCACTTCTTTACTTCGTAATTTTTTTTAAAGATTTAAATATTAAGTAGACAGGTACATTTTAGATGTTTTACGTAGACACCAAAGTGCGCAAGAGTTCTCTTTTATAATAGAGATGATTCAGGTTTTGTTTTGGAGCTCTGTAATAGTTCACTATTACCATTATTATGGAAAAGATTAAGACGTTAGCGTGCTCCGTTTAGTTCGTAGTAATGAAAGGCGGCAATATGAATTAGTTTATATGTAAATTTGTAGGGAAACTCTTGTGTCTGTGCCCATGGTAATGCTTATTCACCCCAAAGTATAGGGTAATTGGGCGGTAATGGATGTTTATGAAGAAAACAACAACTTTTGGGATGCGGTTGATTAGGGTGTTACTTCCCTTGCAATATACACTTAAATACATTCTCATGGCTGTCACTTATCAACAACTACCAAACGAGGTCATAATGAAACTTATGAAGAAAAACACAAGTTTGTTGATGGGGTTGCTTTTCCATGTACATTAGGATGATTTTCATGGTGGCACTCGGGTATGCTTATGTGTTCTGTATCAGTTCCCTTATGGTTTTCAGTCTTATCACTGCTTTTCAGTATATAATTCAGTCTGTCACTTTGGTTCGAGCTTGAGATATTCCGTTTGGCACATGAATTGAGTTTGACTTTAATCGATTTATTGTAGAGTAGATCATTAGTAGCCTCATGAATTGTCTCCTTAGGAGTTAGGAGATTCCTGAGTATTGGTGGAAAGACTTGTAAAATGTTGGAGGAATTTCTTTAGGGTATTGAATTATTTAGTCTGCTGATAAGTCTGACATCATAGTCATAGAGGGAAATCTCTTCTGCTTCCTAGTGCTCCTTGGGTACCTTGAAACCCTGGGTCCATTTTAATCATGGGGACTTTAGATGGCATAGGTAGCAAATCAACCAGCAGAAACCTCTCTGGAAGTAGGAGCTTTGACTCCTGAAACCTATGGGCTATAGATTAGCAGAAAATCTCAAAAAAGAAACTGCCTTATTGCATTTCAGGCCAGTTTTGATCAAATGGCAGAAGTTGAGTCTCATACCCTGGACATTTGCTTTTGGGGAATTTGACTTTTTTTCTCTTTCAAATGCCAACGAATAAGACATCTAGTTTTCTAATAAGCCTTCAAGGTCAGTGTAGGGTCCATATTAGTGGCCATGAATTTAAATGACTTTGTATCCTGTTCAAGTGTATCTAGAAGGTTGCGATTTATTGATTCACGCCTGTAAGGCATTTCTTATTTTGGATTGGGCAGGGGTTGAGTGTGTTTCCCATGAAATTTCTTTTCTTTGGTCTTGTATCTTCTCATTGTACTATGCTTTTATTGTCTCTCTACTTTTGTCTTTGTCTTTTCCAACTCATTGTGTGTGTGTGTGTTTTTGTGTGTGTTGGGGGGGGGGGGGNGGGGGGGGGAGGTTGTGTTTCGTGTTGTACTTTTATTGGAGGGTCAGTTTATTTGGTTCGTTCCCCCGTTTCTATGAATCAATATTGTGGCTTATGTTATGATGTTGATTGCTACTTACTGCTTTTATAGTTTTATGCTGATGTTTTAATTTGATTCTTGGCATGTTTATACTTTATAGTAGCTGATTGTTGACTTCTAATTTCTAAACATGGGTATGTCTCAGCTGTTCTGGTCTTCCCTTGTTTTAGCATTTGAGTTTTGGCTATACTGCTACTCTGATGACATGCTGGACTTCATATATTTATTTGTTCTTCCTGTGTATTCTTCCTTCTATAGTTCTGTTTGGTGACTGCATGCACTTTCAGGCAATGGAATCTAGCAATGGTGACTGGGATATGCTTTTTGAGCCGTATCCTTTCTTCGAAGCGTATAAAAACTATTTGCAGATAGACATCAACGCTGAAAATGCTGATGACTATCAGAAATGGAAAGGTTGGGTAGAGTCTCGTCTTCGTACACTTACGTTAAAGGTAACAAAGCAGTTATTGCATGTCCAGATTGAAGTTTTAAGCTCTGTATCCTCTGTGGAATCCATTCCTGAACTTTGTTAATGCATGTCCAGATTGAGAGGCATACGTATAATATGCTTCAGTGCCATCCTCACCCAGGTGAATTCACAGATAATTCCAAACCTTTTCATAGCTGTTATTTCATGGGTTTGCAACGTAAAAAAGAGGTTCCAGCTAGTGAAGGTGAACAGTTTGATATAAGGATGACTGTTGACGATTTTAGGCAAACTGTCAATATGTATACCATGTGGAAACCTGGAATGGAGATCCGCGTAAGCCATGTAAAGCGAAGAAGTATACCTGCTTTTGTATTTCCTGGGGGTATTAGACCTCGTCCTGCAAAATCTTCAGGAGATAAGCGATCTTTAGAGATGAAGTCTGGTTTTGATAAGAAGCTAGATGATGGGAAGAAGAGAAAAAAGGAGAATGGTGGCGATAGCACGACTAGTGTTGGAGTGGGATTAACCAAGAACTCAGCTTCTTTGCCTTGTTTAAGTGAAGGAGGTCATGCAGGTAGTCCCATCAGCACCGCAACTTCTTTAGTAAAGGTGGACCATCTAGAAATGGATGGAGTTATTGAGCCGAAGACTGAGAAAATAGATACAGAAATGTTGCATCCTGCAACCAATGGTGCACTTAATGGATCTGTCCAACAAAGCCCGCCTTTGAGGACTGCATCAGCTCCTATTGCGTCATCTAACTCCGAAGAAGCAGAGACGTTGGCAATTGAGAAAATGATGGCAGGTCCATATGGCACCCATCAAGCATTACCTGAGCTTGATGAGTTAGAGGTTGACATTGAAAATGCTGATCAAGTTGGAGAAGTTGCTGAGCTGTCGCTGGAGTCTGAAGCATTGGAAATTGGAATTCCTGGTGCTACTGCTGCCCATTCCAGTTCATTTGCTATTGCGAGTTTGGAGGAGCTTGAGGTCTTATTCTATTCTTAGATCTGTCTCTTTTTAACTATCACTTAAATTACCAATAAATGTGTCACACCTATGTTTTGGAGGCCTTGAAAACGACCATGCGTAAAGCACAAAATGTGGTCTACCCAAATGGGTAGATTACCGACCTCTGGTAGACTGATAATCAGTGTATAATTTTGGGTAATCTCCTATTCAATGGGTATATGTAGCAATCTCCCGCATTTGTTTTGACATGACCTCTCTGCCAGCTTGTAGAGTGTGTTTGTATGATTAGCAATAACTTGGTTGGGTTAGGAGGGGAGGGAAGAAAATAAGGGGGAGTCTGTTTTGCTTTTCTTTTGATAAACAGTATTTTCTGAAGTTGGAAGTATTCTTCTTTTCCTTCTTTTACTTCATCTATCGTGTTATCAAGTTCCCCTTCCGTCAGATATGTGGTATCCAAATACACAAAAAATGTTAAAACCCTAGTTGAGATTGTACCTACTTTTGGTTTGGGGGTTATTTGCTAACTGCATGATCTTTGATCATTTCTGCATGTTCCTTTTATGCAGCCTGCTGAACTAATGGCACCATCTGTTCAAATAAATCCGGCTCCTGTAAAGCCACTTATTAGGTAATTCTCTTTGTATTATTATTATTAGTATGTGGAAGTGCTAATTTCAGTTACCAAAGTTCAATATGGTATATGTTCTGTGCATTTGGCAGGCTCAGCTTCACGTCCTTGGGTAAAGCTACAAGTCACGGCTCGTGAAGTAGCAAGTTAAAAGATTTCAACTTAAGCCAGCTGCAGTGCTTAGGGGTTGATGTTCCGTAAGAATACCTCTCCATCAGAATTTAGAACTATACATTCTGGTGTGAAGGGGTTTGTATAAAATTGTCAATAAATTTTGAATTGAATCAATTAGTGCCTTGTCATCCATGAGATTGTAGGAGGTGCACAGTAGTTTGCGCTACCAACATTTACGAGAACAGTATATAAGGAATTTTTTTTAACGCAACAACGAGAGATTTTATTATATTATTTTAAAAGGCAGGCATTACATCAACATCGTGATTGACCCAAAAACTTGCTACCCGATGAAACCACAATATAAGCAAACCCACCAAAAGTTTCAAAAAACTGTACAACCAAAACAACCAAAAGACCTCAACTGGCAAACAAACTATCATGGCAAACATCCAAATTGAGGGAACTACACCCAAACCAACTGACTCTAGCCTGTTCCTGCACTCGCACTGCTTCCCACTCCCACACTGCTAGCTCTAACTGGGATTGTTGCTGTAACTGTTGCTGGAATAGTGCCAAAGAAAATCGAAGTAGAAACCAC

mRNA sequence

ATGGCTAGCTCGGGAGCGGCCAATCAACATAACGGGCAACGACTGGGCATAACCGAGCCAATATCGTTTGGAGGACCAATGGAATATGATGTGACCAAAACCCACGAACTTGAAAAGTTTTTAGAAAATGTGGGTTTGTATGAGCATCAGGAAGAAGCCGTGAGAAGGGAGGAAGTGCTTGGGAGATTAGACCAGATTGTAAAGCTGTGGGTGAAAACAATTAGCCGAGCCAAGGGTTTGAATGAGCAATTGGTACAGGAAGCAAATGCAAAGATTTTCACGTTTGGGTCGTATCGCCTTGGGGTGCATGGTCCTGGAGCTGATATAGATACCCTGTGTGTCGGCCCTAGACACGCGACACGAGAGGATGATTTCTTTGGAGAACTCCACAGGATGCTTTCTGAGATGCCAGAAGTTACAGAGTTGCACCCTGTGCCTGATGCTTTTGTGCCAGTGATGAGATTTAAGTTTAATGGGGTGTCCATAGATCTTCTTTATGCAAAATTGTCGCTCTGGGTGATTCCTGAAGACTTGGATGTCTCCCAAGATTCAATTTTGCAAAATGCAGACGAGCAGACTGTTCGCAGTCTTAATGGTTGTAGAGTTACAGATCAAATTTTGCGCTTGGTGCCAAATATACAGAATTTCCGTACTACACTTAGATGCATGAGATTTTGGGCAAAGCGCCGTGGTGTCTATTCAAATGTGGCTGGGTTTCTTGGTGGTATTAATTGGGCACTACTAGTCGCTCGTATTTGTCAGTTGTACCCCAATGCATTACCCAATATGTTAGTCTCTCGCTTCTTTAGGGTTTTTACACAATGGAGGTGGCCCAACCCTGTCATGCTCTGTGATATTGAAGAGGGATCACTTGGTCTGCAAGTTTGGGATCCGAGGAGGAATCCCAAGGATAGATACCATTTGATGCCTATAATAACTCCTGCGTATCCATCCATGAACTCTAGTTATAATGTCTCATCAAGTACTCTGCGAATTATGACAGAAGAGTTCCAGAGGGGACATGAAATTTGTGAGGCAATGGAATCTAGCAATGGTGACTGGGATATGCTTTTTGAGCCGTATCCTTTCTTCGAAGCGTATAAAAACTATTTGCAGATAGACATCAACGCTGAAAATGCTGATGACTATCAGAAATGGAAAGGTTGGGTAGAGTCTCGTCTTCGTACACTTACGTTAAAGATTGAGAGGCATACGTATAATATGCTTCAGTGCCATCCTCACCCAGGTGAATTCACAGATAATTCCAAACCTTTTCATAGCTGTTATTTCATGGGTTTGCAACGTAAAAAAGAGGTTCCAGCTAGTGAAGGTGAACAGTTTGATATAAGGATGACTGTTGACGATTTTAGGCAAACTGTCAATATGTATACCATGTGGAAACCTGGAATGGAGATCCGCGTAAGCCATGTAAAGCGAAGAAGTATACCTGCTTTTGTATTTCCTGGGGGTATTAGACCTCGTCCTGCAAAATCTTCAGGAGATAAGCGATCTTTAGAGATGAAGTCTGGTTTTGATAAGAAGCTAGATGATGGGAAGAAGAGAAAAAAGGAGAATGGTGGCGATAGCACGACTAGTGTTGGAGTGGGATTAACCAAGAACTCAGCTTCTTTGCCTTGTTTAAGTGAAGGAGGTCATGCAGGTAGTCCCATCAGCACCGCAACTTCTTTAGTAAAGGTGGACCATCTAGAAATGGATGGAGTTATTGAGCCGAAGACTGAGAAAATAGATACAGAAATGTTGCATCCTGCAACCAATGGTGCACTTAATGGATCTGTCCAACAAAGCCCGCCTTTGAGGACTGCATCAGCTCCTATTGCGTCATCTAACTCCGAAGAAGCAGAGACGTTGGCAATTGAGAAAATGATGGCAGGTCCATATGGCACCCATCAAGCATTACCTGAGCTTGATGAGTTAGAGGTTGACATTGAAAATGCTGATCAAGTTGGAGAAGTTGCTGAGCTGTCGCTGGAGTCTGAAGCATTGGAAATTGGAATTCCTGGTGCTACTGCTGCCCATTCCAGTTCATTTGCTATTGCGAGTTTGGAGGAGCTTGAGCCTGCTGAACTAATGGCACCATCTGTTCAAATAAATCCGGCTCCTGTAAAGCCACTTATTAGGCTCAGCTTCACGTCCTTGGGTAAAGCTACAAGTCACGGCTCGTGAAGTAGCAAGTTAAAAGATTTCAACTTAAGCCAGCTGCAGTGCTTAGGGGTTGATGTTCCGTAAGAATACCTCTCCATCAGAATTTAGAACTATACATTCTGGTGTGAAGGGGTTTGTATAAAATTGTCAATAAATTTTGAATTGAATCAATTAGTGCCTTGTCATCCATGAGATTGTAGGAGGTGCACAGTAGTTTGCGCTACCAACATTTACGAGAACAGTATATAAGGAATTTTTTTTAACGCAACAACGAGAGATTTTATTATATTATTTTAAAAGGCAGGCATTACATCAACATCGTGATTGACCCAAAAACTTGCTACCCGATGAAACCACAATATAAGCAAACCCACCAAAAGTTTCAAAAAACTGTACAACCAAAACAACCAAAAGACCTCAACTGGCAAACAAACTATCATGGCAAACATCCAAATTGAGGGAACTACACCCAAACCAACTGACTCTAGCCTGTTCCTGCACTCGCACTGCTTCCCACTCCCACACTGCTAGCTCTAACTGGGATTGTTGCTGTAACTGTTGCTGGAATAGTGCCAAAGAAAATCGAAGTAGAAACCAC

Coding sequence (CDS)

ATGGCTAGCTCGGGAGCGGCCAATCAACATAACGGGCAACGACTGGGCATAACCGAGCCAATATCGTTTGGAGGACCAATGGAATATGATGTGACCAAAACCCACGAACTTGAAAAGTTTTTAGAAAATGTGGGTTTGTATGAGCATCAGGAAGAAGCCGTGAGAAGGGAGGAAGTGCTTGGGAGATTAGACCAGATTGTAAAGCTGTGGGTGAAAACAATTAGCCGAGCCAAGGGTTTGAATGAGCAATTGGTACAGGAAGCAAATGCAAAGATTTTCACGTTTGGGTCGTATCGCCTTGGGGTGCATGGTCCTGGAGCTGATATAGATACCCTGTGTGTCGGCCCTAGACACGCGACACGAGAGGATGATTTCTTTGGAGAACTCCACAGGATGCTTTCTGAGATGCCAGAAGTTACAGAGTTGCACCCTGTGCCTGATGCTTTTGTGCCAGTGATGAGATTTAAGTTTAATGGGGTGTCCATAGATCTTCTTTATGCAAAATTGTCGCTCTGGGTGATTCCTGAAGACTTGGATGTCTCCCAAGATTCAATTTTGCAAAATGCAGACGAGCAGACTGTTCGCAGTCTTAATGGTTGTAGAGTTACAGATCAAATTTTGCGCTTGGTGCCAAATATACAGAATTTCCGTACTACACTTAGATGCATGAGATTTTGGGCAAAGCGCCGTGGTGTCTATTCAAATGTGGCTGGGTTTCTTGGTGGTATTAATTGGGCACTACTAGTCGCTCGTATTTGTCAGTTGTACCCCAATGCATTACCCAATATGTTAGTCTCTCGCTTCTTTAGGGTTTTTACACAATGGAGGTGGCCCAACCCTGTCATGCTCTGTGATATTGAAGAGGGATCACTTGGTCTGCAAGTTTGGGATCCGAGGAGGAATCCCAAGGATAGATACCATTTGATGCCTATAATAACTCCTGCGTATCCATCCATGAACTCTAGTTATAATGTCTCATCAAGTACTCTGCGAATTATGACAGAAGAGTTCCAGAGGGGACATGAAATTTGTGAGGCAATGGAATCTAGCAATGGTGACTGGGATATGCTTTTTGAGCCGTATCCTTTCTTCGAAGCGTATAAAAACTATTTGCAGATAGACATCAACGCTGAAAATGCTGATGACTATCAGAAATGGAAAGGTTGGGTAGAGTCTCGTCTTCGTACACTTACGTTAAAGATTGAGAGGCATACGTATAATATGCTTCAGTGCCATCCTCACCCAGGTGAATTCACAGATAATTCCAAACCTTTTCATAGCTGTTATTTCATGGGTTTGCAACGTAAAAAAGAGGTTCCAGCTAGTGAAGGTGAACAGTTTGATATAAGGATGACTGTTGACGATTTTAGGCAAACTGTCAATATGTATACCATGTGGAAACCTGGAATGGAGATCCGCGTAAGCCATGTAAAGCGAAGAAGTATACCTGCTTTTGTATTTCCTGGGGGTATTAGACCTCGTCCTGCAAAATCTTCAGGAGATAAGCGATCTTTAGAGATGAAGTCTGGTTTTGATAAGAAGCTAGATGATGGGAAGAAGAGAAAAAAGGAGAATGGTGGCGATAGCACGACTAGTGTTGGAGTGGGATTAACCAAGAACTCAGCTTCTTTGCCTTGTTTAAGTGAAGGAGGTCATGCAGGTAGTCCCATCAGCACCGCAACTTCTTTAGTAAAGGTGGACCATCTAGAAATGGATGGAGTTATTGAGCCGAAGACTGAGAAAATAGATACAGAAATGTTGCATCCTGCAACCAATGGTGCACTTAATGGATCTGTCCAACAAAGCCCGCCTTTGAGGACTGCATCAGCTCCTATTGCGTCATCTAACTCCGAAGAAGCAGAGACGTTGGCAATTGAGAAAATGATGGCAGGTCCATATGGCACCCATCAAGCATTACCTGAGCTTGATGAGTTAGAGGTTGACATTGAAAATGCTGATCAAGTTGGAGAAGTTGCTGAGCTGTCGCTGGAGTCTGAAGCATTGGAAATTGGAATTCCTGGTGCTACTGCTGCCCATTCCAGTTCATTTGCTATTGCGAGTTTGGAGGAGCTTGAGCCTGCTGAACTAATGGCACCATCTGTTCAAATAAATCCGGCTCCTGTAAAGCCACTTATTAGGCTCAGCTTCACGTCCTTGGGTAAAGCTACAAGTCACGGCTCGTGA

Protein sequence

MASSGAANQHNGQRLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVLGRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDVSQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRRNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEPYPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTDNSKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRRSIPAFVFPGGIRPRPAKSSGDKRSLEMKSGFDKKLDDGKKRKKENGGDSTTSVGVGLTKNSASLPCLSEGGHAGSPISTATSLVKVDHLEMDGVIEPKTEKIDTEMLHPATNGALNGSVQQSPPLRTASAPIASSNSEEAETLAIEKMMAGPYGTHQALPELDELEVDIENADQVGEVAELSLESEALEIGIPGATAAHSSSFAIASLEELEPAELMAPSVQINPAPVKPLIRLSFTSLGKATSHGS
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Spo04041Spo04041gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Spo04041.1Spo04041.1-proteinpolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo04041.1.exon.1Spo04041.1.exon.1exon
Spo04041.1.exon.2Spo04041.1.exon.2exon
Spo04041.1.exon.3Spo04041.1.exon.3exon
Spo04041.1.exon.4Spo04041.1.exon.4exon
Spo04041.1.exon.5Spo04041.1.exon.5exon
Spo04041.1.exon.6Spo04041.1.exon.6exon
Spo04041.1.exon.7Spo04041.1.exon.7exon
Spo04041.1.exon.8Spo04041.1.exon.8exon
Spo04041.1.exon.9Spo04041.1.exon.9exon
Spo04041.1.exon.10Spo04041.1.exon.10exon
Spo04041.1.exon.11Spo04041.1.exon.11exon
Spo04041.1.exon.12Spo04041.1.exon.12exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo04041.1.CDS.1Spo04041.1.CDS.1CDS
Spo04041.1.CDS.2Spo04041.1.CDS.2CDS
Spo04041.1.CDS.3Spo04041.1.CDS.3CDS
Spo04041.1.CDS.4Spo04041.1.CDS.4CDS
Spo04041.1.CDS.5Spo04041.1.CDS.5CDS
Spo04041.1.CDS.6Spo04041.1.CDS.6CDS
Spo04041.1.CDS.7Spo04041.1.CDS.7CDS
Spo04041.1.CDS.8Spo04041.1.CDS.8CDS
Spo04041.1.CDS.9Spo04041.1.CDS.9CDS
Spo04041.1.CDS.10Spo04041.1.CDS.10CDS
Spo04041.1.CDS.11Spo04041.1.CDS.11CDS
Spo04041.1.CDS.12Spo04041.1.CDS.12CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo04041.1.utr3p.1Spo04041.1.utr3p.1three_prime_UTR


Homology
BLAST of Spo04041.1 vs. NCBI nr
Match: gi|902237178|gb|KNA24682.1| (hypothetical protein SOVF_013410 [Spinacia oleracea])

HSP 1 Score: 1467.2 bits (3797), Expect = 0.000e+0
Identity = 727/727 (100.00%), Postives = 727/727 (100.00%), Query Frame = 1

		  

Query: 1   MASSGAANQHNGQRLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVL 60
           MASSGAANQHNGQRLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVL
Sbjct: 1   MASSGAANQHNGQRLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVL 60

Query: 61  GRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120
           GRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT
Sbjct: 61  GRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 121 REDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDV 180
           REDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDV
Sbjct: 121 REDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDV 180

Query: 181 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240
           SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL
Sbjct: 181 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240

Query: 241 GGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRR 300
           GGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRR
Sbjct: 241 GGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRR 300

Query: 301 NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEP 360
           NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEP
Sbjct: 301 NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEP 360

Query: 361 YPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD 420
           YPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD
Sbjct: 361 YPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD 420

Query: 421 NSKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRR 480
           NSKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRR
Sbjct: 421 NSKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRR 480

Query: 481 SIPAFVFPGGIRPRPAKSSGDKRSLEMKSGFDKKLDDGKKRKKENGGDSTTSVGVGLTKN 540
           SIPAFVFPGGIRPRPAKSSGDKRSLEMKSGFDKKLDDGKKRKKENGGDSTTSVGVGLTKN
Sbjct: 481 SIPAFVFPGGIRPRPAKSSGDKRSLEMKSGFDKKLDDGKKRKKENGGDSTTSVGVGLTKN 540

Query: 541 SASLPCLSEGGHAGSPISTATSLVKVDHLEMDGVIEPKTEKIDTEMLHPATNGALNGSVQ 600
           SASLPCLSEGGHAGSPISTATSLVKVDHLEMDGVIEPKTEKIDTEMLHPATNGALNGSVQ
Sbjct: 541 SASLPCLSEGGHAGSPISTATSLVKVDHLEMDGVIEPKTEKIDTEMLHPATNGALNGSVQ 600

Query: 601 QSPPLRTASAPIASSNSEEAETLAIEKMMAGPYGTHQALPELDELEVDIENADQVGEVAE 660
           QSPPLRTASAPIASSNSEEAETLAIEKMMAGPYGTHQALPELDELEVDIENADQVGEVAE
Sbjct: 601 QSPPLRTASAPIASSNSEEAETLAIEKMMAGPYGTHQALPELDELEVDIENADQVGEVAE 660

Query: 661 LSLESEALEIGIPGATAAHSSSFAIASLEELEPAELMAPSVQINPAPVKPLIRLSFTSLG 720
           LSLESEALEIGIPGATAAHSSSFAIASLEELEPAELMAPSVQINPAPVKPLIRLSFTSLG
Sbjct: 661 LSLESEALEIGIPGATAAHSSSFAIASLEELEPAELMAPSVQINPAPVKPLIRLSFTSLG 720

Query: 721 KATSHGS 728
           KATSHGS
Sbjct: 721 KATSHGS 727

BLAST of Spo04041.1 vs. NCBI nr
Match: gi|731324606|ref|XP_010673060.1| (PREDICTED: nuclear poly(A) polymerase 1 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 1184.9 bits (3064), Expect = 0.000e+0
Identity = 603/739 (81.60%), Postives = 650/739 (87.96%), Query Frame = 1

		  

Query: 1   MASSGAANQHNGQRLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVL 60
           MA SGA  +++GQRLGITEPIS GGP EYDVTK+HELEK+LE+VGLYE Q EAV REEVL
Sbjct: 1   MAGSGANFRYHGQRLGITEPISCGGPTEYDVTKSHELEKYLEDVGLYESQGEAVSREEVL 60

Query: 61  GRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120
           GRLDQIVKLWVKTISRAKGLNEQLVQ+ANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT
Sbjct: 61  GRLDQIVKLWVKTISRAKGLNEQLVQDANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 121 REDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDV 180
           RE+DFFGELHRMLSEMPEVTELHPVPDA+VPVM+FKF+GVSIDLLYAK SLWVIPEDLD+
Sbjct: 121 REEDFFGELHRMLSEMPEVTELHPVPDAYVPVMKFKFSGVSIDLLYAKSSLWVIPEDLDI 180

Query: 181 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240
           SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQ+FRTTLRCM+FWAKRRGVYSNVAGFL
Sbjct: 181 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQHFRTTLRCMKFWAKRRGVYSNVAGFL 240

Query: 241 GGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRR 300
           GGINWA+LVARICQLYPN+LPNMLVSRFFRV+TQWRWPNPVMLC+IEEGS GLQVWDPR+
Sbjct: 241 GGINWAILVARICQLYPNSLPNMLVSRFFRVYTQWRWPNPVMLCEIEEGSFGLQVWDPRK 300

Query: 301 NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEP 360
           NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGH++CEAMES+  +WD LFEP
Sbjct: 301 NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHDLCEAMESNKAEWDTLFEP 360

Query: 361 YPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD 420
           YPFFEAY+NYLQIDINAENADDY+KWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD
Sbjct: 361 YPFFEAYRNYLQIDINAENADDYRKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD 420

Query: 421 NSKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRR 480
           N KPFHSCYFMGLQRKKEVPA EGEQFDIRMTVD+F+QTVNMYT+WKPGMEIRVSHVKR+
Sbjct: 421 NCKPFHSCYFMGLQRKKEVPAGEGEQFDIRMTVDEFKQTVNMYTLWKPGMEIRVSHVKRK 480

Query: 481 SIPAFVFPGGIRP-RPAKSS-GDKRSLEMKSGFDKKLDDGKKRKKENGGDSTTSVGVGLT 540
           SIP+FVFPGGIRP RP KSS G KRS E+KS  D KLDDG+KRKKEN  D  TS  +   
Sbjct: 481 SIPSFVFPGGIRPSRPTKSSGGSKRSSEIKSDVD-KLDDGRKRKKEN--DMMTSAAI--P 540

Query: 541 KNSASLPCLSEGGHAGSPISTATSLVKVDHLEMDGVIEPKTEKIDTEMLHPATNGALNGS 600
           +NSASLP  S GGH GSPISTATS VKV+HLE+DG  E K E +DTEML    NG LNG+
Sbjct: 541 RNSASLPSSSGGGHTGSPISTATSSVKVEHLELDGYSERKAEIVDTEMLCTRNNGELNGT 600

Query: 601 VQQSPPLRTASAPIASSNSEEAETLAIEKMMAGPYGTHQALPELDELEVDIENADQV--- 660
             QS PLR +SAP   S+S+E E LAIEKMM GPYG+HQ LPELDELE D EN +QV   
Sbjct: 601 EHQSIPLRNSSAPAVLSDSKETEKLAIEKMMLGPYGSHQVLPELDELENDPENGNQVEHS 660

Query: 661 GEVAELSLESEAL----EIGIPGATAAHSSSFAIASLEELEPAELMAPS-VQINPAPV-- 720
           G   +L  E   +    EIG  G TAAH SSF+  SLEELEPAEL  P+ VQI PAPV  
Sbjct: 661 GGAVKLLEEPSPMKASPEIGNLGTTAAHFSSFSNGSLEELEPAELTMPAPVQIIPAPVQN 720

Query: 721 KPLIRLSFTSLGKATSHGS 728
           KPLIRLSFTSLGKATSH S
Sbjct: 721 KPLIRLSFTSLGKATSHSS 734

BLAST of Spo04041.1 vs. NCBI nr
Match: gi|590665099|ref|XP_007036647.1| (Poly(A) polymerase 1 isoform 1 [Theobroma cacao])

HSP 1 Score: 1010.4 bits (2611), Expect = 1.600e-291
Identity = 532/766 (69.45%), Postives = 609/766 (79.50%), Query Frame = 1

		  

Query: 1   MASSGAANQHNGQRLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVL 60
           M S G  N++NGQRLGITEPIS GGP +YDV KT ELEK+L+NVGLYE QEEAV REEVL
Sbjct: 1   MGSPGLGNRNNGQRLGITEPISLGGPTDYDVIKTRELEKYLQNVGLYESQEEAVGREEVL 60

Query: 61  GRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120
           GRLDQ VK WVK ISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT
Sbjct: 61  GRLDQTVKNWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 121 REDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDV 180
           RE+DFFGEL++MLSEMPEV+ELHPVPDA VPVM+FKF GVSIDLLYAKLSLWVIPEDLD+
Sbjct: 121 REEDFFGELYKMLSEMPEVSELHPVPDAHVPVMKFKFKGVSIDLLYAKLSLWVIPEDLDI 180

Query: 181 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240
           SQDSILQN DEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL
Sbjct: 181 SQDSILQNTDEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240

Query: 241 GGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRR 300
           GGINWALLVARICQLYPNALPNMLVSRFFRV+TQWRWPNPVMLC IEEGSLGLQVWDPR+
Sbjct: 241 GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRK 300

Query: 301 NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEP 360
           NPKDRYHLMPIITPAYP MNSSYNVSSSTLRIMT+EFQRG EICEAME++  DWD+LFE 
Sbjct: 301 NPKDRYHLMPIITPAYPCMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKADWDILFES 360

Query: 361 YPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD 420
           Y FFEAYKNYLQIDI+AENADD +KWKGWVESRLR LTLKIERHTYNMLQCHPHPG+F D
Sbjct: 361 YAFFEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFQD 420

Query: 421 NSKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRR 480
            S+PFH  YFMGLQRK+ VP +EGEQFDIR+TV++F+ +VNMYT+WKPGMEIRV+HVKRR
Sbjct: 421 KSRPFHGSYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHSVNMYTLWKPGMEIRVTHVKRR 480

Query: 481 SIPAFVFPGGIRP-RPAK---------------SSGDKRSLEMKSGFDKKLDDGKKRKK- 540
           +IP+FVFPGG+RP RP+K                +G  +S E+K   D + DDGKKRK+ 
Sbjct: 481 NIPSFVFPGGVRPSRPSKVTWDSMRVSDAKVSGHAGPDKSGEVKGVADGQ-DDGKKRKRV 540

Query: 541 ENGGDSTTSVGVGLTKNSASLPCLSEGGHAGSPISTATSL-VKVDHLEMDGVIEPKTEKI 600
           ++ GD+     +  +K   ++P  S  G  GSP+ST +S   K D+ +  G+IE   EK 
Sbjct: 541 DDNGDAQ----LRSSKYITAVPSSSLEGRVGSPVSTVSSCSTKGDYSDATGLIETTREKA 600

Query: 601 DTEMLH-----------PATNGALNGSVQQSPPLRTASAPIASSNSEEAETLAIEKMMAG 660
           ++ M +            + NG ++GSV  +PP++ ++    +S+  EAE LAIEK+M+G
Sbjct: 601 ESNMTNGLINSRSLEELSSHNGEVDGSVGCNPPIKVSA---DASSCTEAENLAIEKIMSG 660

Query: 661 PYGTHQALP-ELDELEVDIENADQVGEV-------AELSLESEALEIGIPGATAAHSSSF 720
           PYG HQA P EL+ELE D+E  +QV  V        E S+   A    +  +  A  S+ 
Sbjct: 661 PYGAHQAFPQELEELEDDLEFRNQVRSVENTKSGPVESSMSDLAGAAPVTSSNGAGPSTS 720

Query: 721 AIAS--LEELEPAELMAP-SVQINPAPV---KPLIRLSFTSLGKAT 724
             AS  +EELEPAEL A  S +I  APV   KPLIRL+FTSLGKA+
Sbjct: 721 LHASGGIEELEPAELTAMISNRIPSAPVAQRKPLIRLNFTSLGKAS 758

BLAST of Spo04041.1 vs. NCBI nr
Match: gi|1012102489|ref|XP_015957750.1| (PREDICTED: nuclear poly(A) polymerase 1-like [Arachis duranensis])

HSP 1 Score: 1003.4 bits (2593), Expect = 2.000e-289
Identity = 528/762 (69.29%), Postives = 607/762 (79.66%), Query Frame = 1

		  

Query: 1   MASSGAANQHNGQ--RLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREE 60
           M S G +N++NGQ  RLGITEPIS GGP EYDV KT ELEK+L++ GLYE+QEEAV REE
Sbjct: 1   MGSPGLSNRNNGQQQRLGITEPISLGGPTEYDVIKTRELEKYLQDAGLYENQEEAVSREE 60

Query: 61  VLGRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRH 120
           VLGRLDQIVK+WVKTISRAKGLN+QLVQEANAKIFTFGSYRLGVHGPGADIDTLCV PRH
Sbjct: 61  VLGRLDQIVKIWVKTISRAKGLNDQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVAPRH 120

Query: 121 ATREDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDL 180
            +RE+DFFGELHRMLSEMPEVTELHPVPDA VPVM FKFNGVSIDLLYA+LSLWVIPEDL
Sbjct: 121 VSREEDFFGELHRMLSEMPEVTELHPVPDAHVPVMGFKFNGVSIDLLYARLSLWVIPEDL 180

Query: 181 DVSQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAG 240
           D+SQ+SILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNV+G
Sbjct: 181 DISQESILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSG 240

Query: 241 FLGGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDP 300
           FLGGINWALLVARICQL+PNALPNMLVSRFFRV+TQWRWPNPV+LC IEEGSLGLQVWDP
Sbjct: 241 FLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQVWDP 300

Query: 301 RRNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLF 360
           RR PKDR+HLMPIITPAYP MNSSYNVSSSTLRIMTEEFQRG+EICEAME++N +WD LF
Sbjct: 301 RRYPKDRFHLMPIITPAYPCMNSSYNVSSSTLRIMTEEFQRGNEICEAMEANNANWDALF 360

Query: 361 EPYPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEF 420
           EPYPFFEAYKNYLQID++AENADD +KWKGWVESRLR LTLKIERHTY MLQCHPHPG+F
Sbjct: 361 EPYPFFEAYKNYLQIDVSAENADDLRKWKGWVESRLRHLTLKIERHTYGMLQCHPHPGDF 420

Query: 421 TDNSKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVK 480
           +D SKPFH  YFMGLQRK+ VP +EGEQFDIR TV++F+ +VNMYT+WKPGM I+VSHVK
Sbjct: 421 SDKSKPFHCSYFMGLQRKQGVPVNEGEQFDIRHTVEEFKHSVNMYTLWKPGMAIQVSHVK 480

Query: 481 RRSIPAFVFPGGIRP-RPAKSSGD-KRSLEMKS---GFDKKLDDGK--------KRKKEN 540
           RRSIP FVFPGG+RP RP K++ D KRS E++    G  +K  +G+        +RK++ 
Sbjct: 481 RRSIPNFVFPGGVRPSRPTKATWDSKRSSELRDSVHGQTEKSQEGQAVALREADERKRKR 540

Query: 541 GGDSTTSVGVGLTKNSASLPCLSEGGHAGS--PISTATSL-VKVDHLEMDGVIEPKTEKI 600
             DS  ++    +K+ ASLP  S   H  S  P+S A+S  +K D  E++ V     EK 
Sbjct: 541 AEDSIDNLRT--SKSFASLPPSSGDVHDDSRNPVSIASSCSMKCDESEVNSV----NEKP 600

Query: 601 DTEML--HPATNGALNGSVQQSPPLRTASAPIASSNSEEAETLAIEKMMAGPYGTHQALP 660
           D + L   P+ +G  NGS +    +      I + NS+EAE LAIEK+++GPY THQALP
Sbjct: 601 DLKSLTGSPSRHGETNGSARSIQQVNHMLTGINTCNSKEAENLAIEKIISGPYDTHQALP 660

Query: 661 -ELDELEVDIENADQV----GEVAELSLESEALEIGIPG-------ATAAHSSSFAIASL 720
            E DELE D+E  +Q     G + + +L+S   E+ + G        T+  +      SL
Sbjct: 661 EEPDELEDDVEYRNQFKNLGGNINKSNLDSSHSELAVVGEPVITEKETSCSNHLLPNESL 720

Query: 721 EELEPAELMAPSVQINPAPV---KPLIRLSFTSLGKATSHGS 728
           EELEPAEL AP +    AP+   KPLIRL+FTSLGKA    S
Sbjct: 721 EELEPAELTAPFISSTAAPLPQRKPLIRLNFTSLGKAADKSS 756

BLAST of Spo04041.1 vs. NCBI nr
Match: gi|1021495596|ref|XP_016190813.1| (PREDICTED: nuclear poly(A) polymerase 1-like [Arachis ipaensis])

HSP 1 Score: 1002.7 bits (2591), Expect = 3.400e-289
Identity = 532/764 (69.63%), Postives = 611/764 (79.97%), Query Frame = 1

		  

Query: 1   MASSGAANQHNGQ--RLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREE 60
           M S G +N++NGQ  RLGITEPIS GGP EYDV KT ELEK+L++ GLYE+QEEAV REE
Sbjct: 1   MGSPGLSNRNNGQQQRLGITEPISLGGPTEYDVIKTRELEKYLQDAGLYENQEEAVSREE 60

Query: 61  VLGRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRH 120
           VLGRLDQIVK+WVKTISRAKGLN+QLVQEANAKIFTFGSYRLGVHGPGADIDTLCV PRH
Sbjct: 61  VLGRLDQIVKIWVKTISRAKGLNDQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVAPRH 120

Query: 121 ATREDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDL 180
            +RE+DFFGELHRMLSEMPEVTELHPVPDA VPVM FKFNGVSIDLLYA+LSLWVIPEDL
Sbjct: 121 VSREEDFFGELHRMLSEMPEVTELHPVPDAHVPVMGFKFNGVSIDLLYARLSLWVIPEDL 180

Query: 181 DVSQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAG 240
           D+SQ+SILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNV+G
Sbjct: 181 DISQESILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSG 240

Query: 241 FLGGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDP 300
           FLGGINWALLVARICQL+PNALPNMLVSRFFRV+TQWRWPNPV+LC IEEGSLGLQVWDP
Sbjct: 241 FLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQVWDP 300

Query: 301 RRNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLF 360
           RR PKDR+HLMPIITPAYP MNSSYNVSSSTLRIMTEEFQRG+EICEAME++N +WD LF
Sbjct: 301 RRYPKDRFHLMPIITPAYPCMNSSYNVSSSTLRIMTEEFQRGNEICEAMEANNANWDALF 360

Query: 361 EPYPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEF 420
           EPYPFFEAYKNYLQID++AENADD +KWKGWVESRLR LTLKIERHTY MLQCHPHPG+F
Sbjct: 361 EPYPFFEAYKNYLQIDVSAENADDLRKWKGWVESRLRHLTLKIERHTYGMLQCHPHPGDF 420

Query: 421 TDNSKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVK 480
           +D SKPFH  YFMGLQRK+ VP +EGEQFDIR TV++F+ +VNMYT+WKPGM I+VSHVK
Sbjct: 421 SDKSKPFHCSYFMGLQRKQGVPVNEGEQFDIRHTVEEFKHSVNMYTLWKPGMAIQVSHVK 480

Query: 481 RRSIPAFVFPGGIRP-RPAKSSGD-KRSLEMKS---GFDKKLDDGK--------KRKKEN 540
           RRSIP FVFPGGIRP RP K++ D KRS E++    G  +K  +G+        +RK++ 
Sbjct: 481 RRSIPNFVFPGGIRPSRPTKATWDSKRSSELRDSGHGQAEKSQEGQAVALREADERKRKR 540

Query: 541 GGDSTTSVGVGLTKNSASLPCLSEGGHAGS--PISTATSL-VKVDHLEMDGVIEPKTEKI 600
             DS  ++    +K+ ASLP  S   H  S  P+S A+S  +K D  E++ V     EK 
Sbjct: 541 AEDSIDNLRT--SKSFASLPPSSGDVHDDSRNPVSIASSCSMKCDESEVNSV----NEKP 600

Query: 601 DTEML--HPATNGALNG--SVQQSPPLRTASAPIASSNSEEAETLAIEKMMAGPYGTHQA 660
           D + L   P+ +G  NG  S+QQ   + T +    + NS+EAE LAIEK+++GPY THQA
Sbjct: 601 DLKSLTGSPSRHGETNGSASIQQVNHMLTGT---NTCNSKEAENLAIEKIISGPYDTHQA 660

Query: 661 LP-ELDELEVDIENADQV----GEVAELSLESEALEIGIPG-------ATAAHSSSFAIA 720
           LP E DELE D+E  +Q     G + + +L+S   E+ + G        T+  +  F   
Sbjct: 661 LPEEPDELEDDVEYRNQFKNLGGNINKSNLDSSHSELAVVGEPVITEKETSCSNHLFPNE 720

Query: 721 SLEELEPAELMAPSVQINPAPV---KPLIRLSFTSLGKATSHGS 728
           SLEELEPAEL AP +    AP+   KPLIRL+FTSLGKA    S
Sbjct: 721 SLEELEPAELTAPFISSTAAPLPQRKPLIRLNFTSLGKAADKSS 755

BLAST of Spo04041.1 vs. UniProtKB/TrEMBL
Match: A0A0K9S0P7_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_013410 PE=4 SV=1)

HSP 1 Score: 1467.2 bits (3797), Expect = 0.000e+0
Identity = 727/727 (100.00%), Postives = 727/727 (100.00%), Query Frame = 1

		  

Query: 1   MASSGAANQHNGQRLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVL 60
           MASSGAANQHNGQRLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVL
Sbjct: 1   MASSGAANQHNGQRLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVL 60

Query: 61  GRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120
           GRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT
Sbjct: 61  GRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 121 REDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDV 180
           REDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDV
Sbjct: 121 REDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDV 180

Query: 181 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240
           SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL
Sbjct: 181 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240

Query: 241 GGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRR 300
           GGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRR
Sbjct: 241 GGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRR 300

Query: 301 NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEP 360
           NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEP
Sbjct: 301 NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEP 360

Query: 361 YPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD 420
           YPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD
Sbjct: 361 YPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD 420

Query: 421 NSKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRR 480
           NSKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRR
Sbjct: 421 NSKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRR 480

Query: 481 SIPAFVFPGGIRPRPAKSSGDKRSLEMKSGFDKKLDDGKKRKKENGGDSTTSVGVGLTKN 540
           SIPAFVFPGGIRPRPAKSSGDKRSLEMKSGFDKKLDDGKKRKKENGGDSTTSVGVGLTKN
Sbjct: 481 SIPAFVFPGGIRPRPAKSSGDKRSLEMKSGFDKKLDDGKKRKKENGGDSTTSVGVGLTKN 540

Query: 541 SASLPCLSEGGHAGSPISTATSLVKVDHLEMDGVIEPKTEKIDTEMLHPATNGALNGSVQ 600
           SASLPCLSEGGHAGSPISTATSLVKVDHLEMDGVIEPKTEKIDTEMLHPATNGALNGSVQ
Sbjct: 541 SASLPCLSEGGHAGSPISTATSLVKVDHLEMDGVIEPKTEKIDTEMLHPATNGALNGSVQ 600

Query: 601 QSPPLRTASAPIASSNSEEAETLAIEKMMAGPYGTHQALPELDELEVDIENADQVGEVAE 660
           QSPPLRTASAPIASSNSEEAETLAIEKMMAGPYGTHQALPELDELEVDIENADQVGEVAE
Sbjct: 601 QSPPLRTASAPIASSNSEEAETLAIEKMMAGPYGTHQALPELDELEVDIENADQVGEVAE 660

Query: 661 LSLESEALEIGIPGATAAHSSSFAIASLEELEPAELMAPSVQINPAPVKPLIRLSFTSLG 720
           LSLESEALEIGIPGATAAHSSSFAIASLEELEPAELMAPSVQINPAPVKPLIRLSFTSLG
Sbjct: 661 LSLESEALEIGIPGATAAHSSSFAIASLEELEPAELMAPSVQINPAPVKPLIRLSFTSLG 720

Query: 721 KATSHGS 728
           KATSHGS
Sbjct: 721 KATSHGS 727

BLAST of Spo04041.1 vs. UniProtKB/TrEMBL
Match: A0A0J8CN87_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_3g063260 PE=4 SV=1)

HSP 1 Score: 1184.9 bits (3064), Expect = 0.000e+0
Identity = 603/739 (81.60%), Postives = 650/739 (87.96%), Query Frame = 1

		  

Query: 1   MASSGAANQHNGQRLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVL 60
           MA SGA  +++GQRLGITEPIS GGP EYDVTK+HELEK+LE+VGLYE Q EAV REEVL
Sbjct: 1   MAGSGANFRYHGQRLGITEPISCGGPTEYDVTKSHELEKYLEDVGLYESQGEAVSREEVL 60

Query: 61  GRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120
           GRLDQIVKLWVKTISRAKGLNEQLVQ+ANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT
Sbjct: 61  GRLDQIVKLWVKTISRAKGLNEQLVQDANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 121 REDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDV 180
           RE+DFFGELHRMLSEMPEVTELHPVPDA+VPVM+FKF+GVSIDLLYAK SLWVIPEDLD+
Sbjct: 121 REEDFFGELHRMLSEMPEVTELHPVPDAYVPVMKFKFSGVSIDLLYAKSSLWVIPEDLDI 180

Query: 181 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240
           SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQ+FRTTLRCM+FWAKRRGVYSNVAGFL
Sbjct: 181 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQHFRTTLRCMKFWAKRRGVYSNVAGFL 240

Query: 241 GGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRR 300
           GGINWA+LVARICQLYPN+LPNMLVSRFFRV+TQWRWPNPVMLC+IEEGS GLQVWDPR+
Sbjct: 241 GGINWAILVARICQLYPNSLPNMLVSRFFRVYTQWRWPNPVMLCEIEEGSFGLQVWDPRK 300

Query: 301 NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEP 360
           NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGH++CEAMES+  +WD LFEP
Sbjct: 301 NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHDLCEAMESNKAEWDTLFEP 360

Query: 361 YPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD 420
           YPFFEAY+NYLQIDINAENADDY+KWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD
Sbjct: 361 YPFFEAYRNYLQIDINAENADDYRKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD 420

Query: 421 NSKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRR 480
           N KPFHSCYFMGLQRKKEVPA EGEQFDIRMTVD+F+QTVNMYT+WKPGMEIRVSHVKR+
Sbjct: 421 NCKPFHSCYFMGLQRKKEVPAGEGEQFDIRMTVDEFKQTVNMYTLWKPGMEIRVSHVKRK 480

Query: 481 SIPAFVFPGGIRP-RPAKSS-GDKRSLEMKSGFDKKLDDGKKRKKENGGDSTTSVGVGLT 540
           SIP+FVFPGGIRP RP KSS G KRS E+KS  D KLDDG+KRKKEN  D  TS  +   
Sbjct: 481 SIPSFVFPGGIRPSRPTKSSGGSKRSSEIKSDVD-KLDDGRKRKKEN--DMMTSAAI--P 540

Query: 541 KNSASLPCLSEGGHAGSPISTATSLVKVDHLEMDGVIEPKTEKIDTEMLHPATNGALNGS 600
           +NSASLP  S GGH GSPISTATS VKV+HLE+DG  E K E +DTEML    NG LNG+
Sbjct: 541 RNSASLPSSSGGGHTGSPISTATSSVKVEHLELDGYSERKAEIVDTEMLCTRNNGELNGT 600

Query: 601 VQQSPPLRTASAPIASSNSEEAETLAIEKMMAGPYGTHQALPELDELEVDIENADQV--- 660
             QS PLR +SAP   S+S+E E LAIEKMM GPYG+HQ LPELDELE D EN +QV   
Sbjct: 601 EHQSIPLRNSSAPAVLSDSKETEKLAIEKMMLGPYGSHQVLPELDELENDPENGNQVEHS 660

Query: 661 GEVAELSLESEAL----EIGIPGATAAHSSSFAIASLEELEPAELMAPS-VQINPAPV-- 720
           G   +L  E   +    EIG  G TAAH SSF+  SLEELEPAEL  P+ VQI PAPV  
Sbjct: 661 GGAVKLLEEPSPMKASPEIGNLGTTAAHFSSFSNGSLEELEPAELTMPAPVQIIPAPVQN 720

Query: 721 KPLIRLSFTSLGKATSHGS 728
           KPLIRLSFTSLGKATSH S
Sbjct: 721 KPLIRLSFTSLGKATSHSS 734

BLAST of Spo04041.1 vs. UniProtKB/TrEMBL
Match: A0A061FVR9_THECC (Poly(A) polymerase 1 isoform 1 OS=Theobroma cacao GN=TCM_012521 PE=4 SV=1)

HSP 1 Score: 1010.4 bits (2611), Expect = 1.100e-291
Identity = 532/766 (69.45%), Postives = 609/766 (79.50%), Query Frame = 1

		  

Query: 1   MASSGAANQHNGQRLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVL 60
           M S G  N++NGQRLGITEPIS GGP +YDV KT ELEK+L+NVGLYE QEEAV REEVL
Sbjct: 1   MGSPGLGNRNNGQRLGITEPISLGGPTDYDVIKTRELEKYLQNVGLYESQEEAVGREEVL 60

Query: 61  GRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120
           GRLDQ VK WVK ISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT
Sbjct: 61  GRLDQTVKNWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 121 REDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDV 180
           RE+DFFGEL++MLSEMPEV+ELHPVPDA VPVM+FKF GVSIDLLYAKLSLWVIPEDLD+
Sbjct: 121 REEDFFGELYKMLSEMPEVSELHPVPDAHVPVMKFKFKGVSIDLLYAKLSLWVIPEDLDI 180

Query: 181 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240
           SQDSILQN DEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL
Sbjct: 181 SQDSILQNTDEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240

Query: 241 GGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRR 300
           GGINWALLVARICQLYPNALPNMLVSRFFRV+TQWRWPNPVMLC IEEGSLGLQVWDPR+
Sbjct: 241 GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRK 300

Query: 301 NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEP 360
           NPKDRYHLMPIITPAYP MNSSYNVSSSTLRIMT+EFQRG EICEAME++  DWD+LFE 
Sbjct: 301 NPKDRYHLMPIITPAYPCMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKADWDILFES 360

Query: 361 YPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD 420
           Y FFEAYKNYLQIDI+AENADD +KWKGWVESRLR LTLKIERHTYNMLQCHPHPG+F D
Sbjct: 361 YAFFEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFQD 420

Query: 421 NSKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRR 480
            S+PFH  YFMGLQRK+ VP +EGEQFDIR+TV++F+ +VNMYT+WKPGMEIRV+HVKRR
Sbjct: 421 KSRPFHGSYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHSVNMYTLWKPGMEIRVTHVKRR 480

Query: 481 SIPAFVFPGGIRP-RPAK---------------SSGDKRSLEMKSGFDKKLDDGKKRKK- 540
           +IP+FVFPGG+RP RP+K                +G  +S E+K   D + DDGKKRK+ 
Sbjct: 481 NIPSFVFPGGVRPSRPSKVTWDSMRVSDAKVSGHAGPDKSGEVKGVADGQ-DDGKKRKRV 540

Query: 541 ENGGDSTTSVGVGLTKNSASLPCLSEGGHAGSPISTATSL-VKVDHLEMDGVIEPKTEKI 600
           ++ GD+     +  +K   ++P  S  G  GSP+ST +S   K D+ +  G+IE   EK 
Sbjct: 541 DDNGDAQ----LRSSKYITAVPSSSLEGRVGSPVSTVSSCSTKGDYSDATGLIETTREKA 600

Query: 601 DTEMLH-----------PATNGALNGSVQQSPPLRTASAPIASSNSEEAETLAIEKMMAG 660
           ++ M +            + NG ++GSV  +PP++ ++    +S+  EAE LAIEK+M+G
Sbjct: 601 ESNMTNGLINSRSLEELSSHNGEVDGSVGCNPPIKVSA---DASSCTEAENLAIEKIMSG 660

Query: 661 PYGTHQALP-ELDELEVDIENADQVGEV-------AELSLESEALEIGIPGATAAHSSSF 720
           PYG HQA P EL+ELE D+E  +QV  V        E S+   A    +  +  A  S+ 
Sbjct: 661 PYGAHQAFPQELEELEDDLEFRNQVRSVENTKSGPVESSMSDLAGAAPVTSSNGAGPSTS 720

Query: 721 AIAS--LEELEPAELMAP-SVQINPAPV---KPLIRLSFTSLGKAT 724
             AS  +EELEPAEL A  S +I  APV   KPLIRL+FTSLGKA+
Sbjct: 721 LHASGGIEELEPAELTAMISNRIPSAPVAQRKPLIRLNFTSLGKAS 758

BLAST of Spo04041.1 vs. UniProtKB/TrEMBL
Match: A0A0D2S678_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_006G193600 PE=4 SV=1)

HSP 1 Score: 989.9 bits (2558), Expect = 1.600e-285
Identity = 520/763 (68.15%), Postives = 593/763 (77.72%), Query Frame = 1

		  

Query: 1   MASSGAANQHNGQRLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVL 60
           M S G    ++GQRLGITEPIS GGP EYDV KT ELEK+L+NVGLYE QEEAV REEVL
Sbjct: 1   MGSPGLGTGNSGQRLGITEPISLGGPTEYDVIKTRELEKYLQNVGLYESQEEAVSREEVL 60

Query: 61  GRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120
           GRLDQIVK WVK ISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT
Sbjct: 61  GRLDQIVKNWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 121 REDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDV 180
           RE+DFFGELH+MLSEMPEV+ELHPVPDA VP+M+FKF GVSIDLLYAKLSLWVIPEDLD+
Sbjct: 121 REEDFFGELHKMLSEMPEVSELHPVPDAHVPIMKFKFKGVSIDLLYAKLSLWVIPEDLDI 180

Query: 181 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240
           SQDSILQN D+QTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL
Sbjct: 181 SQDSILQNTDDQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240

Query: 241 GGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRR 300
           GGINWALLVARICQLYPNALPNMLVSRFFRV+TQWRWPNPVMLC I+EGSLGLQVWDPR+
Sbjct: 241 GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIKEGSLGLQVWDPRK 300

Query: 301 NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEP 360
           NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMT+EFQRG EICEAME++  DWD LFE 
Sbjct: 301 NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKADWDALFEA 360

Query: 361 YPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD 420
           Y FFEAYKNYLQIDI+AEN DD + WKGWVESRLR LTLKIERHTYNMLQCHPHPG+F D
Sbjct: 361 YAFFEAYKNYLQIDISAENDDDLRNWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFQD 420

Query: 421 NSKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRR 480
           NS+PFH  YFMGLQRK  VP +EGEQFDIR+TV++F+ +VN YT+WKPGMEIRVSHVKRR
Sbjct: 421 NSRPFHCSYFMGLQRKLGVPVNEGEQFDIRLTVEEFKHSVNTYTLWKPGMEIRVSHVKRR 480

Query: 481 SIPAFVFPGGIRP-RPAKSSGDKRSL---------------EMKSGFDKKLDDGKKRKKE 540
           SIP+FVFPGG+RP RP+K++ D R                 E+K   D ++ DGKKRK+ 
Sbjct: 481 SIPSFVFPGGVRPSRPSKATWDSRRASDAKVSGHAGSDKPGEVKGAADGQV-DGKKRKR- 540

Query: 541 NGGDSTTSVGVGLTKNSASLPCLSEGGHAGSPISTATSL-VKVDHLEMDGVIEPKTEKID 600
              D +    +  +K   ++P  S    AGSP  T +   +K D+++  G++EP   K +
Sbjct: 541 --ADDSADTQLKNSKYITAVPSSSAEVQAGSPGGTVSPCSLKGDNVDATGLVEPTRGKDE 600

Query: 601 TEMLH----------PATNGALNGSVQQSPPLRTASAPIASSNSEEAETLAIEKMMAGPY 660
           + M +           + N  ++GS++  PP         +S+S+EAE LAIE++M+GPY
Sbjct: 601 SNMTNGSKTSSTDELSSLNSEVDGSLRCIPPHTGLHVTADASSSKEAEKLAIEQIMSGPY 660

Query: 661 GTHQALP-ELDELEVDIENADQVGEV---------AELSLESEALEIGIPGATAAHSSSF 720
            +HQA P E +ELE D+E  ++V  V         A +S  + A  I          S  
Sbjct: 661 VSHQAFPEEPEELEDDLEFRNRVVSVGNTNNGPLQAPVSDAAGAAPIISSNGAGPSISLH 720

Query: 721 AIASLEELEPAELMAPSVQINPAPV---KPLIRLSFTSLGKAT 724
           A  S+EELEPAEL A    I  APV   KPLIRL+FTSLGKA+
Sbjct: 721 ASGSIEELEPAELTA-MTSIPVAPVVQKKPLIRLNFTSLGKAS 758

BLAST of Spo04041.1 vs. UniProtKB/TrEMBL
Match: M5WE13_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001856mg PE=4 SV=1)

HSP 1 Score: 979.5 bits (2531), Expect = 2.100e-282
Identity = 519/761 (68.20%), Postives = 591/761 (77.66%), Query Frame = 1

		  

Query: 1   MASSGAANQHNGQRLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVL 60
           MAS G +N++NG+RLGITEPIS GGP EYDV KT ELEK+L++  LYE QEEAV REEVL
Sbjct: 1   MASPGLSNRNNGKRLGITEPISLGGPTEYDVIKTRELEKYLQDARLYESQEEAVSREEVL 60

Query: 61  GRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120
           GRLDQIVK+WVKTISR KGLNEQLV EANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT
Sbjct: 61  GRLDQIVKIWVKTISRTKGLNEQLVHEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 121 REDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDV 180
           RE+DFFGEL RMLSEMPEVTELHPVPDA VPVM+FKF+GVSIDLLYAKLSLWVIPEDLD+
Sbjct: 121 REEDFFGELQRMLSEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLDI 180

Query: 181 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240
           SQDSILQNADEQTVRSLNGCRVTDQILRLVP+IQNFRTTLRCMR WAKRRGVYSNVAGFL
Sbjct: 181 SQDSILQNADEQTVRSLNGCRVTDQILRLVPSIQNFRTTLRCMRLWAKRRGVYSNVAGFL 240

Query: 241 GGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRR 300
           GGINWALLVARICQLYPNALPNMLVSRFFRV+TQWRWPNPVMLC IEEGSLGLQVWDPRR
Sbjct: 241 GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRR 300

Query: 301 NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEP 360
           NPKD+YHLMPIITPAYPSMNSSYNVSSSTLRIM EEFQRG+EICEAME++  DWD LFE 
Sbjct: 301 NPKDKYHLMPIITPAYPSMNSSYNVSSSTLRIMLEEFQRGNEICEAMEANKADWDTLFES 360

Query: 361 YPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTD 420
           Y FFEAYKNYLQIDI+AENADD++KWKGWVESRLR LTLKIERHTY MLQCHPHPG+F+D
Sbjct: 361 YDFFEAYKNYLQIDISAENADDFRKWKGWVESRLRQLTLKIERHTYGMLQCHPHPGDFSD 420

Query: 421 NSKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRR 480
            S+PFHS YFMGLQRK+ VP +EGEQFDIR TV++F+Q+VN+YT+ + GMEIRVSHVKRR
Sbjct: 421 KSRPFHSSYFMGLQRKQGVPVTEGEQFDIRATVEEFKQSVNLYTLLERGMEIRVSHVKRR 480

Query: 481 SIPAFVFPGGIRP-RPAKSS-GDKRSLEMKSGFDKK-------------LDDGKKRKKEN 540
           +IP FVFPG +RP R +K + G +R  E+K   D +              D G+KRK+ +
Sbjct: 481 NIPNFVFPGEVRPLRLSKVTWGSRRGSELKVSGDSQPDKLCEGKTDLDGSDGGQKRKRVD 540

Query: 541 GGDSTTSVGVGLTKNSASLPCLSEGGHAGSP-----ISTATSLVKVD-HLEMDGVIEPKT 600
               T S      + + SL   S   HA SP      S +T    +D + ++D  I    
Sbjct: 541 DNVETNS------RYAKSLHLSSGEVHAASPPISNISSCSTKCESMDANKKVDDSIADSL 600

Query: 601 EKIDTEMLHPATNGALNGSVQQSPPLRTASAPIASSNSEEAETLAIEKMMAGPYGTHQAL 660
           EKI+     P  NG +  S +  PP  +  A   +S+S+EAE +A+ K MAGPY +HQAL
Sbjct: 601 EKIENPADIPYQNGQIEVSSRCKPPNDSLPAAANTSSSKEAEKMALGKNMAGPYVSHQAL 660

Query: 661 PELDELEVDIENADQVGEVA--------ELSLESEALEIGIPGATAAHSSSFAI-ASLEE 720
           PELDELE D E+  QV + +        E S ES ++   +  +  A  S+ +    LEE
Sbjct: 661 PELDELEDDSEHGHQVKDFSRNMKSSQMEPSEESVSVSAPVNSSNGAGPSTDSYNGGLEE 720

Query: 721 LEPAELMAPSVQINP----APVKPLIRLSFTSLGKATSHGS 728
           LEPAELM PS    P    A  K +IRL+FTSL KA+   S
Sbjct: 721 LEPAELMVPSSNGTPPEPVAQKKSIIRLNFTSLAKASGKSS 755

BLAST of Spo04041.1 vs. ExPASy Swiss-Prot
Match: PAPS1_ARATH (Nuclear poly(A) polymerase 1 OS=Arabidopsis thaliana GN=PAPS1 PE=1 SV=1)

HSP 1 Score: 848.2 bits (2190), Expect = 6.700e-245
Identity = 456/734 (62.13%), Postives = 524/734 (71.39%), Query Frame = 1

		  

Query: 6   AANQHNGQRLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVLGRLDQ 65
           A+ Q NGQR G++EPIS GGP E+DV KT ELEK L++VGLYE +EEAVRREEVLG LDQ
Sbjct: 2   ASVQQNGQRFGVSEPISMGGPTEFDVIKTRELEKHLQDVGLYESKEEAVRREEVLGILDQ 61

Query: 66  IVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREDDF 125
           IVK W+KTISRAKGLN+QL+ EANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATRE DF
Sbjct: 62  IVKTWIKTISRAKGLNDQLLHEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREGDF 121

Query: 126 FGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDVSQDSI 185
           FGEL RMLSEMPEVTELHPVPDA VP+M FK NGVSIDLLYA+L LWVIPEDLD+SQDSI
Sbjct: 122 FGELQRMLSEMPEVTELHPVPDAHVPLMGFKLNGVSIDLLYAQLPLWVIPEDLDLSQDSI 181

Query: 186 LQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINW 245
           LQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNV+GFLGGINW
Sbjct: 182 LQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGFLGGINW 241

Query: 246 ALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRRNPKDR 305
           ALLVARICQLYPNALPN+LVSRFFRVF QW WPN + LC  +EGSLGLQVWDPR NPKDR
Sbjct: 242 ALLVARICQLYPNALPNILVSRFFRVFYQWNWPNAIFLCSPDEGSLGLQVWDPRINPKDR 301

Query: 306 YHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEPYPFFE 365
            H+MPIITPAYP MNSSYNVS STLRIM  EFQRG+EICEAMES+  DWD LFEP+ FFE
Sbjct: 302 LHIMPIITPAYPCMNSSYNVSESTLRIMKGEFQRGNEICEAMESNKADWDTLFEPFAFFE 361

Query: 366 AYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTDNSKPF 425
           AYKNYLQIDI+A N DD +KWKGWVESRLR LTLKIERH + ML CHPHP +F D S+P 
Sbjct: 362 AYKNYLQIDISAANVDDLRKWKGWVESRLRQLTLKIERH-FKMLHCHPHPHDFQDTSRPL 421

Query: 426 HSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRRSIPAF 485
           H  YFMGLQRK+ VPA+EGEQFDIR TV++F+ TVN YT+W PGMEI V H+KRRS+P F
Sbjct: 422 HCSYFMGLQRKQGVPAAEGEQFDIRRTVEEFKHTVNAYTLWIPGMEISVGHIKRRSLPNF 481

Query: 486 VFPGGIRPRPAKSSGDKRSLEMKSGFDKKLDDGKKRKKENGGDSTTSVGVGLTKNSASLP 545
           VFPGG+RP               S   K   D  +R +    +S+TS     T  +  + 
Sbjct: 482 VFPGGVRP---------------SHTSKGTWDSNRRSEHR--NSSTSSAPAATTTTTEMS 541

Query: 546 CLSEGGHAGSPI-------STATSLVKVDHLEMDGVIEPKTEKIDTEMLHPATNGALNGS 605
             S+ G + SP+         + +L           +    E  +    +P+  G++  S
Sbjct: 542 SESKAG-SNSPVDGKKRKWGDSETLTDQPRNSKHIAVSVPVENCEGGSPNPSV-GSICSS 601

Query: 606 VQQSPPLRTASAPIASSNSEEA--------ETLAIEKMMAGPYGTHQALPELDELEVDIE 665
             +       S PI+    E          E+L IEK+      T QA  E +ELE   +
Sbjct: 602 PMKDYCTNGKSEPISKDPPENVVAFSKDPPESLPIEKI-----ATPQA-HETEELEESFD 661

Query: 666 NADQVGEVAELSLESEALEIGIPGATA-AHSSSFAIASLEELEPAELMAPSVQINPA--P 722
             +QV E     +   +    IP   A ++ S F   ++EELE      P     P+   
Sbjct: 662 FGNQVIEQISHKVAVLSATATIPPFEATSNGSPFPYEAVEELEVLPTRQPDAAHRPSVQQ 709

BLAST of Spo04041.1 vs. ExPASy Swiss-Prot
Match: PAPS4_ARATH (Nuclear poly(A) polymerase 4 OS=Arabidopsis thaliana GN=PAPS4 PE=1 SV=1)

HSP 1 Score: 693.7 bits (1789), Expect = 2.100e-198
Identity = 323/484 (66.74%), Postives = 388/484 (80.17%), Query Frame = 1

		  

Query: 16  GITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVLGRLDQIVKLWVKTIS 75
           GIT+P+S  GP   D+ +  ELEK+L + GLYE +++ +RREEVLGR+DQIVK WVK ++
Sbjct: 22  GITKPLSLAGPSSADIKRNVELEKYLVDEGLYESKDDTMRREEVLGRIDQIVKHWVKQLT 81

Query: 76  RAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREDDFFGELHRMLSE 135
           + +G  +Q+V++ANA IFTFGSYRLGVHGPGADIDTLCVGP +  RE+DFF  LH +L+E
Sbjct: 82  QQRGYTDQMVEDANAVIFTFGSYRLGVHGPGADIDTLCVGPSYVNREEDFFIILHDILAE 141

Query: 136 MPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDVSQDSILQNADEQTVR 195
           M EVTELHPVPDA VPVM+FKF G+ IDLLYA +SL V+P+DLD+S  S+L   DE TVR
Sbjct: 142 MEEVTELHPVPDAHVPVMKFKFQGIPIDLLYASISLLVVPQDLDISSSSVLCEVDEPTVR 201

Query: 196 SLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQL 255
           SLNGCRV DQIL+LVPN ++FRTTLRC+++WAK+RGVYSNV GFLGG+NWALLVAR+CQL
Sbjct: 202 SLNGCRVADQILKLVPNFEHFRTTLRCLKYWAKKRGVYSNVTGFLGGVNWALLVARVCQL 261

Query: 256 YPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRRNPKDRYHLMPIITPA 315
           YPNA+P+MLVSRFFRV+TQWRWPNPVMLC IEE  LG  VWD R+N +DRYHLMPIITPA
Sbjct: 262 YPNAIPSMLVSRFFRVYTQWRWPNPVMLCAIEEDELGFPVWDRRKNHRDRYHLMPIITPA 321

Query: 316 YPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEPYPFFEAYKNYLQIDI 375
           YP MNSSYNVS STLR+MTE+FQ G+ I + +E +   W  LFE Y FFEAYKNYLQ+DI
Sbjct: 322 YPCMNSSYNVSQSTLRVMTEQFQFGNNILQEIELNKQHWSSLFEQYMFFEAYKNYLQVDI 381

Query: 376 NAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTDNSKPF-HSCYFMGLQ 435
            A +A+D   WKGWVESR R LTLKIER T  ML CHP P E+ D ++ F H  +FMGLQ
Sbjct: 382 VAADAEDLLAWKGWVESRFRQLTLKIERDTNGMLMCHPQPNEYVDTARQFLHCAFFMGLQ 441

Query: 436 RKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRRSIPAFVFPGGIRPR 495
           R + V   E +QFDIR TVD+FRQ VNMY  WKPGM++ VSHV+RR +P FVFP G R R
Sbjct: 442 RAEGVGGQECQQFDIRGTVDEFRQEVNMYMFWKPGMDVFVSHVRRRQLPPFVFPNGYR-R 501

Query: 496 PAKS 499
           P +S
Sbjct: 502 PRQS 504

BLAST of Spo04041.1 vs. ExPASy Swiss-Prot
Match: PAPS2_ARATH (Nuclear poly(A) polymerase 2 OS=Arabidopsis thaliana GN=PAPS2 PE=1 SV=2)

HSP 1 Score: 685.3 bits (1767), Expect = 7.500e-196
Identity = 319/478 (66.74%), Postives = 380/478 (79.50%), Query Frame = 1

		  

Query: 16  GITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVLGRLDQIVKLWVKTIS 75
           GITEP+S  GP   DV +  ELEKFL + GLYE +EE +RREEV+ R+DQIVK WVK ++
Sbjct: 24  GITEPLSIAGPSAADVKRNLELEKFLVDEGLYESKEETMRREEVVVRIDQIVKHWVKQLT 83

Query: 76  RAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREDDFFGELHRMLSE 135
           R +G  +Q+V++ANA IFTFGSYRLGVHGP ADIDTLCVGP +  RE+DFF     +L+E
Sbjct: 84  RQRGYTDQMVEDANAVIFTFGSYRLGVHGPMADIDTLCVGPSYVNREEDFFIFFRDILAE 143

Query: 136 MPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDVSQDSILQNADEQTVR 195
           M EVTEL PV DA VPVM+FKF G+SIDLLYA +SL VIP+DLD+S  S+L + DEQTVR
Sbjct: 144 MEEVTELQPVTDAHVPVMKFKFQGISIDLLYASISLLVIPQDLDISNSSVLCDVDEQTVR 203

Query: 196 SLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQL 255
           SLNGCRV DQIL+LVPN ++FRTTLRC+++WAK+RGVYSNV GFLGG+NWALLVAR+CQ 
Sbjct: 204 SLNGCRVADQILKLVPNSEHFRTTLRCLKYWAKKRGVYSNVTGFLGGVNWALLVARLCQF 263

Query: 256 YPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRRNPKDRYHLMPIITPA 315
           YPNA+P+MLVSRFFRV+TQWRWPNPVMLC IEE  L   VWDPR+N +DRYHLMPIITPA
Sbjct: 264 YPNAIPSMLVSRFFRVYTQWRWPNPVMLCAIEEDDLSFPVWDPRKNHRDRYHLMPIITPA 323

Query: 316 YPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEPYPFFEAYKNYLQIDI 375
           YP MNSSYNVS STLR+MTE+FQ G+ IC+ +E +   W  LF+ Y FFEAYKNYLQ+D+
Sbjct: 324 YPCMNSSYNVSQSTLRVMTEQFQFGNTICQEIELNKQHWSSLFQQYMFFEAYKNYLQVDV 383

Query: 376 NAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTDNSKPFHSC-YFMGLQ 435
            A +A+D   WKGWVESR R LTLKIER T  ML CHP P E+ D SK F  C +FMGLQ
Sbjct: 384 LAADAEDLLAWKGWVESRFRQLTLKIERDTNGMLMCHPQPNEYVDTSKQFRHCAFFMGLQ 443

Query: 436 RKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRRSIPAFVFPGGIR 493
           R       E +QFDIR TVD+FRQ VNMY  W+PGM++ VSHV+RR +P+FVFP G +
Sbjct: 444 RADGFGGQECQQFDIRGTVDEFRQEVNMYMFWRPGMDVHVSHVRRRQLPSFVFPNGYK 501

BLAST of Spo04041.1 vs. ExPASy Swiss-Prot
Match: PAP_DICDI (Poly(A) polymerase OS=Dictyostelium discoideum GN=papA PE=3 SV=1)

HSP 1 Score: 485.3 bits (1248), Expect = 1.100e-135
Identity = 270/621 (43.48%), Postives = 373/621 (60.06%), Query Frame = 1

		  

Query: 15  LGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVLGRLDQIVKLWVKTI 74
           LG+TEPIS   P   D   + ELE  L +  L+E  EE+ +REE+LG+L+QIV+ W K +
Sbjct: 53  LGVTEPISTAPPSSIDFKLSTELENTLISFNLFESPEESRKREEILGKLNQIVREWAKQV 112

Query: 75  SRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREDDFFGELHRMLS 134
           S  KG  EQ   E  AKIFTFGSYRLGVHGPG+DIDTLCVGP+H  R D FF +L  +L 
Sbjct: 113 SLKKGYPEQTASEVVAKIFTFGSYRLGVHGPGSDIDTLCVGPKHIMRSD-FFDDLSDILK 172

Query: 135 EMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDL-DVSQDSILQNADEQT 194
             PE+TE   V DAFVPV+   F+G+ IDL+YAKL+L  IPE+L D+  +S L+N DE++
Sbjct: 173 VHPEITEFTTVKDAFVPVITMVFSGIPIDLIYAKLALTAIPEELNDLIDESFLKNIDEKS 232

Query: 195 VRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARIC 254
           + SLNGCRVTDQIL+LVPNI NFR  LRC++ WA RRG+YSN+ GFLGG++WALL ARIC
Sbjct: 233 ILSLNGCRVTDQILKLVPNIPNFRMALRCIKLWAIRRGIYSNILGFLGGVSWALLTARIC 292

Query: 255 QLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGS-LGLQVWDPRRNPKDRYHLMPII 314
           QLYPN+ P+ ++ RFF+V+  W+WP P++LC I+EG  LG +VW+P+R   D+ HLMPII
Sbjct: 293 QLYPNSAPSTIIHRFFKVYEIWKWPAPILLCHIQEGGILGPKVWNPKR---DKAHLMPII 352

Query: 315 TPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEPYPFFEAYKNYLQ 374
           TPAYPSMNS+YNVS STL++M  EF RG EI   +E+    W  L E   FF  Y  Y++
Sbjct: 353 TPAYPSMNSTYNVSKSTLQLMKSEFVRGAEITRKIETGECTWKNLLEKCDFFTRYSFYIE 412

Query: 375 IDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTDN----SKPFHSC 434
           ID  + N +D +KW+GW+ES+LR L   +E  T  M    P+P  FT+N    + P   C
Sbjct: 413 IDCYSMNEEDSRKWEGWIESKLRFLISNLE-STPKMKFAVPYPKGFTNNLHKANNPDQIC 472

Query: 435 --YFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPG---MEIRVSHVKRRSIP 494
             +FMGL           +  D+   V +F   +  +   +P    M+I+V ++K++ +P
Sbjct: 473 TSFFMGLSFNFSNTPGADKSVDLTKAVTEFTGIIKDWLRTQPNPDTMDIKVQYIKKKQLP 532

Query: 495 AFVFPGGIRPRPAKSSGDKRSLEMKSGFDKKLDDGKKRKKENGGDS--TTSVGVGLTKNS 554
           AFV   G    P K++  + S    S   KKL       K N   S  TT++    T ++
Sbjct: 533 AFVKDEG-PEEPVKTTKKRSSTGEPSATRKKLKSENSDNKLNSPKSPITTNINSTPTTST 592

Query: 555 ASLPCLSEGGHAGSPISTATSLVKVDHLEMDGVIEPKTEKIDTEMLHP-ATNGALNGSVQ 614
            +    +      +  +T T+ V +       +  P      TE+  P +T+   +    
Sbjct: 593 PTTTANTTTNTTTATTTTTTTTVPITSTPTSNISSPTMN--STELTTPTSTSTTTSNDSI 652

Query: 615 QSPPLRTASAPIASSNSEEAE 622
            +PP  T    +   +++  E
Sbjct: 653 TTPPTTTTINSVQPPSAQPTE 665

BLAST of Spo04041.1 vs. ExPASy Swiss-Prot
Match: PAPOA_HUMAN (Poly(A) polymerase alpha OS=Homo sapiens GN=PAPOLA PE=1 SV=4)

HSP 1 Score: 450.7 bits (1158), Expect = 3.100e-125
Identity = 247/566 (43.64%), Postives = 342/566 (60.42%), Query Frame = 1

		  

Query: 16  GITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVLGRLDQIVKLWVKTIS 75
           GIT PIS   P E D   T +L + L+  G++E +EE  RR  +LG+L+ +VK W++ IS
Sbjct: 21  GITSPISLAAPKETDCVLTQKLIETLKPFGVFEEEEELQRRILILGKLNNLVKEWIREIS 80

Query: 76  RAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREDDFFGELHRMLSE 135
            +K L + +++    KIFTFGSYRLGVH  GADID LCV PRH  R D FF   +  L  
Sbjct: 81  ESKNLPQSVIENVGGKIFTFGSYRLGVHTKGADIDALCVAPRHVDRSD-FFTSFYDKLKL 140

Query: 136 MPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDVSQDSILQNADEQTVR 195
             EV +L  V +AFVPV++  F+G+ ID+L+A+L+L  IPEDLD+  DS+L+N D + +R
Sbjct: 141 QEEVKDLRAVEEAFVPVIKLCFDGIEIDILFARLALQTIPEDLDLRDDSLLKNLDIRCIR 200

Query: 196 SLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQL 255
           SLNGCRVTD+IL LVPNI NFR TLR ++ WAKR  +YSN+ GFLGG++WA+LVAR CQL
Sbjct: 201 SLNGCRVTDEILHLVPNIDNFRLTLRAIKLWAKRHNIYSNILGFLGGVSWAMLVARTCQL 260

Query: 256 YPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRRNPKDRYHLMPIITPA 315
           YPNA+ + LV +FF VF++W WPNPV+L   EE +L L VWDPR NP DRYHLMPIITPA
Sbjct: 261 YPNAIASTLVHKFFLVFSKWEWPNPVLLKQPEECNLNLPVWDPRVNPSDRYHLMPIITPA 320

Query: 316 YPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEPYPFFEAYKNYLQIDI 375
           YP  NS+YNVS ST  +M EEF++G  I + +  S  +W  LFE   FF+ YK+Y+ +  
Sbjct: 321 YPQQNSTYNVSVSTRMVMVEEFKQGLAITDEILLSKAEWSKLFEAPNFFQKYKHYIVLLA 380

Query: 376 NAENADDYQKWKGWVESRLRTLTLKIERHTYNML-----QCHPHPGEFTDNSKPFHSCYF 435
           +A       +W G VES++R L   +E++ +  L     Q  P P E  D  + F + + 
Sbjct: 381 SAPTEKQRLEWVGLVESKIRILVGSLEKNEFITLAHVNPQSFPAPKENPDKEE-FRTMWV 440

Query: 436 MGLQRKKEVPASEGEQFDIRMTVDDF-----RQTVNMYTMWKPGMEIRVSHVKRRSIPAF 495
           +GL  KK    SE    D+   +  F     RQ +N   M++  M+I   HVKR+ +   
Sbjct: 441 IGLVFKK-TENSENLSVDLTYDIQSFTDTVYRQAINS-KMFEVDMKIAAMHVKRKQLHQ- 500

Query: 496 VFPGGIRPRPAKSSGDKRSLEMKSGFDKKLDDGKKRKKENGGDSTTSVGVGLTKNSASLP 555
           + P  +  +  K S +   +++ +  D  LD            S TS       NS+   
Sbjct: 501 LLPNHVLQKKKKHSTE--GVKLTALNDSSLDLSMDSDNSMSVPSPTSATKTSPLNSSG-- 560

Query: 556 CLSEGGHAGSPISTATSLVKVDHLEM 572
             S+G ++ +P  TA S+  +   E+
Sbjct: 561 -SSQGRNSPAPAVTAASVTNIQATEV 576

BLAST of Spo04041.1 vs. TAIR (Arabidopsis)
Match: AT1G17980.1 (poly(A) polymerase 1)

HSP 1 Score: 848.2 bits (2190), Expect = 3.800e-246
Identity = 456/734 (62.13%), Postives = 524/734 (71.39%), Query Frame = 1

		  

Query: 6   AANQHNGQRLGITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVLGRLDQ 65
           A+ Q NGQR G++EPIS GGP E+DV KT ELEK L++VGLYE +EEAVRREEVLG LDQ
Sbjct: 2   ASVQQNGQRFGVSEPISMGGPTEFDVIKTRELEKHLQDVGLYESKEEAVRREEVLGILDQ 61

Query: 66  IVKLWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREDDF 125
           IVK W+KTISRAKGLN+QL+ EANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATRE DF
Sbjct: 62  IVKTWIKTISRAKGLNDQLLHEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREGDF 121

Query: 126 FGELHRMLSEMPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDVSQDSI 185
           FGEL RMLSEMPEVTELHPVPDA VP+M FK NGVSIDLLYA+L LWVIPEDLD+SQDSI
Sbjct: 122 FGELQRMLSEMPEVTELHPVPDAHVPLMGFKLNGVSIDLLYAQLPLWVIPEDLDLSQDSI 181

Query: 186 LQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINW 245
           LQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNV+GFLGGINW
Sbjct: 182 LQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGFLGGINW 241

Query: 246 ALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRRNPKDR 305
           ALLVARICQLYPNALPN+LVSRFFRVF QW WPN + LC  +EGSLGLQVWDPR NPKDR
Sbjct: 242 ALLVARICQLYPNALPNILVSRFFRVFYQWNWPNAIFLCSPDEGSLGLQVWDPRINPKDR 301

Query: 306 YHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEPYPFFE 365
            H+MPIITPAYP MNSSYNVS STLRIM  EFQRG+EICEAMES+  DWD LFEP+ FFE
Sbjct: 302 LHIMPIITPAYPCMNSSYNVSESTLRIMKGEFQRGNEICEAMESNKADWDTLFEPFAFFE 361

Query: 366 AYKNYLQIDINAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTDNSKPF 425
           AYKNYLQIDI+A N DD +KWKGWVESRLR LTLKIERH + ML CHPHP +F D S+P 
Sbjct: 362 AYKNYLQIDISAANVDDLRKWKGWVESRLRQLTLKIERH-FKMLHCHPHPHDFQDTSRPL 421

Query: 426 HSCYFMGLQRKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRRSIPAF 485
           H  YFMGLQRK+ VPA+EGEQFDIR TV++F+ TVN YT+W PGMEI V H+KRRS+P F
Sbjct: 422 HCSYFMGLQRKQGVPAAEGEQFDIRRTVEEFKHTVNAYTLWIPGMEISVGHIKRRSLPNF 481

Query: 486 VFPGGIRPRPAKSSGDKRSLEMKSGFDKKLDDGKKRKKENGGDSTTSVGVGLTKNSASLP 545
           VFPGG+RP               S   K   D  +R +    +S+TS     T  +  + 
Sbjct: 482 VFPGGVRP---------------SHTSKGTWDSNRRSEHR--NSSTSSAPAATTTTTEMS 541

Query: 546 CLSEGGHAGSPI-------STATSLVKVDHLEMDGVIEPKTEKIDTEMLHPATNGALNGS 605
             S+ G + SP+         + +L           +    E  +    +P+  G++  S
Sbjct: 542 SESKAG-SNSPVDGKKRKWGDSETLTDQPRNSKHIAVSVPVENCEGGSPNPSV-GSICSS 601

Query: 606 VQQSPPLRTASAPIASSNSEEA--------ETLAIEKMMAGPYGTHQALPELDELEVDIE 665
             +       S PI+    E          E+L IEK+      T QA  E +ELE   +
Sbjct: 602 PMKDYCTNGKSEPISKDPPENVVAFSKDPPESLPIEKI-----ATPQA-HETEELEESFD 661

Query: 666 NADQVGEVAELSLESEALEIGIPGATA-AHSSSFAIASLEELEPAELMAPSVQINPA--P 722
             +QV E     +   +    IP   A ++ S F   ++EELE      P     P+   
Sbjct: 662 FGNQVIEQISHKVAVLSATATIPPFEATSNGSPFPYEAVEELEVLPTRQPDAAHRPSVQQ 709

BLAST of Spo04041.1 vs. TAIR (Arabidopsis)
Match: AT4G32850.8 (nuclear poly(a) polymerase)

HSP 1 Score: 693.7 bits (1789), Expect = 1.200e-199
Identity = 323/484 (66.74%), Postives = 388/484 (80.17%), Query Frame = 1

		  

Query: 16  GITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVLGRLDQIVKLWVKTIS 75
           GIT+P+S  GP   D+ +  ELEK+L + GLYE +++ +RREEVLGR+DQIVK WVK ++
Sbjct: 22  GITKPLSLAGPSSADIKRNVELEKYLVDEGLYESKDDTMRREEVLGRIDQIVKHWVKQLT 81

Query: 76  RAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREDDFFGELHRMLSE 135
           + +G  +Q+V++ANA IFTFGSYRLGVHGPGADIDTLCVGP +  RE+DFF  LH +L+E
Sbjct: 82  QQRGYTDQMVEDANAVIFTFGSYRLGVHGPGADIDTLCVGPSYVNREEDFFIILHDILAE 141

Query: 136 MPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDVSQDSILQNADEQTVR 195
           M EVTELHPVPDA VPVM+FKF G+ IDLLYA +SL V+P+DLD+S  S+L   DE TVR
Sbjct: 142 MEEVTELHPVPDAHVPVMKFKFQGIPIDLLYASISLLVVPQDLDISSSSVLCEVDEPTVR 201

Query: 196 SLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQL 255
           SLNGCRV DQIL+LVPN ++FRTTLRC+++WAK+RGVYSNV GFLGG+NWALLVAR+CQL
Sbjct: 202 SLNGCRVADQILKLVPNFEHFRTTLRCLKYWAKKRGVYSNVTGFLGGVNWALLVARVCQL 261

Query: 256 YPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRRNPKDRYHLMPIITPA 315
           YPNA+P+MLVSRFFRV+TQWRWPNPVMLC IEE  LG  VWD R+N +DRYHLMPIITPA
Sbjct: 262 YPNAIPSMLVSRFFRVYTQWRWPNPVMLCAIEEDELGFPVWDRRKNHRDRYHLMPIITPA 321

Query: 316 YPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEPYPFFEAYKNYLQIDI 375
           YP MNSSYNVS STLR+MTE+FQ G+ I + +E +   W  LFE Y FFEAYKNYLQ+DI
Sbjct: 322 YPCMNSSYNVSQSTLRVMTEQFQFGNNILQEIELNKQHWSSLFEQYMFFEAYKNYLQVDI 381

Query: 376 NAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTDNSKPF-HSCYFMGLQ 435
            A +A+D   WKGWVESR R LTLKIER T  ML CHP P E+ D ++ F H  +FMGLQ
Sbjct: 382 VAADAEDLLAWKGWVESRFRQLTLKIERDTNGMLMCHPQPNEYVDTARQFLHCAFFMGLQ 441

Query: 436 RKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRRSIPAFVFPGGIRPR 495
           R + V   E +QFDIR TVD+FRQ VNMY  WKPGM++ VSHV+RR +P FVFP G R R
Sbjct: 442 RAEGVGGQECQQFDIRGTVDEFRQEVNMYMFWKPGMDVFVSHVRRRQLPPFVFPNGYR-R 501

Query: 496 PAKS 499
           P +S
Sbjct: 502 PRQS 504

BLAST of Spo04041.1 vs. TAIR (Arabidopsis)
Match: AT2G25850.2 (poly(A) polymerase 2)

HSP 1 Score: 685.3 bits (1767), Expect = 4.200e-197
Identity = 319/478 (66.74%), Postives = 380/478 (79.50%), Query Frame = 1

		  

Query: 16  GITEPISFGGPMEYDVTKTHELEKFLENVGLYEHQEEAVRREEVLGRLDQIVKLWVKTIS 75
           GITEP+S  GP   DV +  ELEKFL + GLYE +EE +RREEV+ R+DQIVK WVK ++
Sbjct: 24  GITEPLSIAGPSAADVKRNLELEKFLVDEGLYESKEETMRREEVVVRIDQIVKHWVKQLT 83

Query: 76  RAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREDDFFGELHRMLSE 135
           R +G  +Q+V++ANA IFTFGSYRLGVHGP ADIDTLCVGP +  RE+DFF     +L+E
Sbjct: 84  RQRGYTDQMVEDANAVIFTFGSYRLGVHGPMADIDTLCVGPSYVNREEDFFIFFRDILAE 143

Query: 136 MPEVTELHPVPDAFVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDVSQDSILQNADEQTVR 195
           M EVTEL PV DA VPVM+FKF G+SIDLLYA +SL VIP+DLD+S  S+L + DEQTVR
Sbjct: 144 MEEVTELQPVTDAHVPVMKFKFQGISIDLLYASISLLVIPQDLDISNSSVLCDVDEQTVR 203

Query: 196 SLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQL 255
           SLNGCRV DQIL+LVPN ++FRTTLRC+++WAK+RGVYSNV GFLGG+NWALLVAR+CQ 
Sbjct: 204 SLNGCRVADQILKLVPNSEHFRTTLRCLKYWAKKRGVYSNVTGFLGGVNWALLVARLCQF 263

Query: 256 YPNALPNMLVSRFFRVFTQWRWPNPVMLCDIEEGSLGLQVWDPRRNPKDRYHLMPIITPA 315
           YPNA+P+MLVSRFFRV+TQWRWPNPVMLC IEE  L   VWDPR+N +DRYHLMPIITPA
Sbjct: 264 YPNAIPSMLVSRFFRVYTQWRWPNPVMLCAIEEDDLSFPVWDPRKNHRDRYHLMPIITPA 323

Query: 316 YPSMNSSYNVSSSTLRIMTEEFQRGHEICEAMESSNGDWDMLFEPYPFFEAYKNYLQIDI 375
           YP MNSSYNVS STLR+MTE+FQ G+ IC+ +E +   W  LF+ Y FFEAYKNYLQ+D+
Sbjct: 324 YPCMNSSYNVSQSTLRVMTEQFQFGNTICQEIELNKQHWSSLFQQYMFFEAYKNYLQVDV 383

Query: 376 NAENADDYQKWKGWVESRLRTLTLKIERHTYNMLQCHPHPGEFTDNSKPFHSC-YFMGLQ 435
            A +A+D   WKGWVESR R LTLKIER T  ML CHP P E+ D SK F  C +FMGLQ
Sbjct: 384 LAADAEDLLAWKGWVESRFRQLTLKIERDTNGMLMCHPQPNEYVDTSKQFRHCAFFMGLQ 443

Query: 436 RKKEVPASEGEQFDIRMTVDDFRQTVNMYTMWKPGMEIRVSHVKRRSIPAFVFPGGIR 493
           R       E +QFDIR TVD+FRQ VNMY  W+PGM++ VSHV+RR +P+FVFP G +
Sbjct: 444 RADGFGGQECQQFDIRGTVDEFRQEVNMYMFWRPGMDVHVSHVRRRQLPSFVFPNGYK 501

BLAST of Spo04041.1 vs. TAIR (Arabidopsis)
Match: AT3G06560.1 (poly(A) polymerase 3)

HSP 1 Score: 290.8 bits (743), Expect = 2.300e-78
Identity = 165/448 (36.83%), Postives = 247/448 (55.13%), Query Frame = 1

		  

Query: 37  LEKFLENVGLYEHQEEAVRREEVLGRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFTFG 96
           L + + N GL    E+ V+R  V+ +L +IV  WVK ++    L +  +   NA I  +G
Sbjct: 21  LRQLMVNEGLIPSLEDEVKRRGVINQLRKIVVRWVKNVAWQHRLPQNQIDATNATILPYG 80

Query: 97  SYRLGVHGPGADIDTLCVGPRHATREDDFFGELHRMLSEMPEVTELHPVPDAFVPVMRFK 156
           SY LGV+G  +DID LC+GP  A+  +DFF  L  ML    EV+ELH V DA VP++RFK
Sbjct: 81  SYGLGVYGSESDIDALCIGPFFASIAEDFFISLRDMLKSRREVSELHCVKDAKVPLIRFK 140

Query: 157 FNGVSIDLLYAKLSLWVIPEDLDVSQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNF 216
           F+G+ +DL YA+L +  IP ++DV     L++ DE + + L+G R    IL+LVP+++ F
Sbjct: 141 FDGILVDLPYAQLRVLSIPNNVDVLNPFFLRDIDETSWKILSGVRANKCILQLVPSLELF 200

Query: 217 RTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQLYPNALPNMLVSRFFRVFTQWR 276
           ++ LRC++ WAKRRGVY N+ GFLGG++ A+L A +C   PNA  + L++ FF  F  W+
Sbjct: 201 QSLLRCVKLWAKRRGVYGNLNGFLGGVHMAILAAFVCGYQPNATLSSLLANFFYTFAHWQ 260

Query: 277 WPNPVMLCDIEEGSLGLQVWDPRRNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEE 336
           WP PV+L +    S G               LMPI  P       +  ++ ST   +  E
Sbjct: 261 WPTPVVLLEDTYPSTGAPP-----------GLMPIQLPCGSHQYCNSTITRSTFYKIVAE 320

Query: 337 FQRGHEICEAMESSNGDWDMLFEPYPFFEAYKNYLQIDINAENADDYQKWKGWVESRLRT 396
           F  GH + +     N  W  LFE YP+   Y  + +I ++A N +D   W GWV+SR R 
Sbjct: 321 FLLGHNLTKDYLKLNFSWKDLFELYPYANTYTWFTKIHLSAANQEDLSDWVGWVKSRFRC 380

Query: 397 LTLKIERHTYNMLQCHPHPGEFTDN-SKPFHSCYFMGLQRKKEVPASEGEQFDIRMTVDD 456
           L +KIE   Y +  C P+P E+ +  +K  +  ++ GLQ  + +  S+ E   I     D
Sbjct: 381 LLIKIE-EVYGI--CDPNPTEYVETYTKQPNIVFYWGLQ-LRTINVSDIESVKI-----D 440

Query: 457 FRQTVNMYTMWKPGMEIRVSHVKRRSIP 484
           F + VN  +       I+++ VK   +P
Sbjct: 441 FLKNVNSGSFRGTVGRIQLTLVKASQLP 448

The following BLAST results are available for this feature:
BLAST of Spo04041.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match NameE-valueIdentityDescription
gi|902237178|gb|KNA24682.1|0.0e+0100.hypothetical protein SOVF_0134... [more]
gi|731324606|ref|XP_010673060.1|0.0e+081.6PREDICTED: nuclear poly(A) pol... [more]
gi|590665099|ref|XP_007036647.1|1.6e-29169.4Poly(A) polymerase 1 isoform 1... [more]
gi|1012102489|ref|XP_015957750.1|2.0e-28969.2PREDICTED: nuclear poly(A) pol... [more]
gi|1021495596|ref|XP_016190813.1|3.4e-28969.6PREDICTED: nuclear poly(A) pol... [more]
back to top
BLAST of Spo04041.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniprotKB/TrEMBL)
Total hits: 5
Match NameE-valueIdentityDescription
A0A0K9S0P7_SPIOL0.0e+0100.Uncharacterized protein OS=Spi... [more]
A0A0J8CN87_BETVU0.0e+081.6Uncharacterized protein OS=Bet... [more]
A0A061FVR9_THECC1.1e-29169.4Poly(A) polymerase 1 isoform 1... [more]
A0A0D2S678_GOSRA1.6e-28568.1Uncharacterized protein OS=Gos... [more]
M5WE13_PRUPE2.1e-28268.2Uncharacterized protein OS=Pru... [more]
back to top
BLAST of Spo04041.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy SwissProt)
Total hits: 5
Match NameE-valueIdentityDescription
PAPS1_ARATH6.7e-24562.1Nuclear poly(A) polymerase 1 O... [more]
PAPS4_ARATH2.1e-19866.7Nuclear poly(A) polymerase 4 O... [more]
PAPS2_ARATH7.5e-19666.7Nuclear poly(A) polymerase 2 O... [more]
PAP_DICDI1.1e-13543.4Poly(A) polymerase OS=Dictyost... [more]
PAPOA_HUMAN3.1e-12543.6Poly(A) polymerase alpha OS=Ho... [more]
back to top
BLAST of Spo04041.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 4
Match NameE-valueIdentityDescription
AT1G17980.13.8e-24662.1poly(A) polymerase 1[more]
AT4G32850.81.2e-19966.7nuclear poly(a) polymerase[more]
AT2G25850.24.2e-19766.7poly(A) polymerase 2[more]
AT3G06560.12.3e-7836.8poly(A) polymerase 3[more]
back to top
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002934Polymerase, nucleotidyl transferase domainPFAMPF01909NTP_transf_2coord: 81..165
score: 2.2
IPR007010Poly(A) polymerase, RNA-binding domainGENE3D3.30.70.590coord: 363..487
score: 4.6
IPR007010Poly(A) polymerase, RNA-binding domainPFAMPF04926PAP_RNA-bindcoord: 363..421
score: 5.5E-13coord: 421..490
score: 6.
IPR007012Poly(A) polymerase, central domainPFAMPF04928PAP_centralcoord: 16..360
score: 1.7E
IPR011068Nucleotidyltransferase, class I, C-terminal-likeunknownSSF55003PAP/Archaeal CCA-adding enzyme, C-terminal domaincoord: 361..487
score: 9.42
IPR014492Poly(A) polymerasePANTHERPTHR10682POLY A POLYMERASEcoord: 1..504
score:
NoneNo IPR availableGENE3D1.10.1410.10coord: 156..361
score: 1.3
NoneNo IPR availablePANTHERPTHR10682:SF10FI03258Pcoord: 1..504
score:
NoneNo IPR availableunknownSSF81301Nucleotidyltransferasecoord: 15..208
score: 1.06
NoneNo IPR availableunknownSSF81631PAP/OAS1 substrate-binding domaincoord: 211..360
score: 8.63

GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0031123 RNA 3'-end processing
biological_process GO:0043631 RNA polyadenylation
cellular_component GO:0005634 nucleus
molecular_function GO:0016779 nucleotidyltransferase activity
molecular_function GO:0004652 polynucleotide adenylyltransferase activity
molecular_function GO:0003723 RNA binding