Spo14001.1 (mRNA)

Overview
NameSpo14001.1
TypemRNA
OrganismSpinacia oleracea (Spinach)
DescriptionPlant/T7A14-6 protein
LocationSpoScf_02380 : 13100 .. 17071 (+)
Sequence length1143
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTGACAAGTGTAATCCGGTTTATTGTGCACAGTGCACAGTCCAATAGCAGTTTACCAGTTTCGGTATTAGTTTCAGTAAACAAAGAAAGTGAGAGTGAGGAAAAGCAGTGTGTGAAAGCATAGGGTAAGAAAGTTGAAAGCTTTAATAGGAGGAGAGTACTCTCACTCTGTAGTTAATTGCTTTGGATCTGAGATTCTGAAATAGAAATACTCGAGCAAAAAGATGACGGGGGTTGGTTCTGGTTCTGGTTCTGGTTCTGGTTCTGGTTCTGGTTCTGGCTGTGGTTCTGGTGGCGGAGCAGCATCAAATATGATAATGTCGAGGCGAGCAGGCGGTGGTTGGTTGAGATGTTGTTTAGTGACGTTTGCTGTTATCTCAGCACTTTGTGTTTCTGGTCCTGCTCTCTATTGGAAATTCAATCTCCATAATCATATTAATATCAATCTGCACCAACAACAAACACAACACCATCTATCTTCTCCTTCTTCTTGTCCAACTTGCTCCTGCGATTGCCCACCTCCTCCTTCTCTCCTTCAGATTGCCCCTGGTTTGTTCTTTATTTGCGATTCAGATGTTTGATTACTTACTTGTATGCCCGATTAGCCTCTAATTTTGATGTTTGATTACTTACTTGTATGCCCGATTAGCCTCTAATTTTGATGTTTCATTACTTACTTGTATGCCCGATTAGCCTCTAATGTTGATGTTTGATTACTTACTTGTATGCCCGATTAGCCTCTAATTTTGATGTTTGATTACTTACTTGTATGCCCGATTAGCCTGCAATTGCATATTCTCTAATTAAGAATCTCGCTTTAACACTGGATTACTGTAAATAGCATAGTATGCACCTAATTAGGATTAGGGATTACTGGATTTGTGTTCTGATATTTCATGTTTTTGCAGGATTGGCCAATCTTTCAGTACCAGGTATGGATTCATTCATAACCCCAATAACTCTGTTTACAAGCTTGTATATTTAAAAGAATAGATCAAGGAGGATGAAACATTTTTAAACCATTTGCACGCAGATTGTGGGAGCAGTGACCCAGATCTGAAGAAAGAAATGGCAAAACAATTTGCGGATCTGTTGACAGAAGAGTTGAAGCTGCAAGAGGCAGTTGCTGCAGAGCACGCGGTGCATATGAATGCAACATTGACAGAAGCAAGAAGAGTAGCTTCTCAGTATCAAAGAGAAGGTGATAAATGCATTGCTGCAACCGAAACCTGCGAGGGAGCCAGAGAGCGTTCTGAGGCTTTCCTTAGGAAGGAGATCAAGTTGACTTCACTTTGGGAACGAAGAGCTCACAAATTGGGCTGGCAACCTGAATTCAACTGAAATCAACTCACTTTCTATTTGTATGTGTCCTTTCCTTCTTCACAAACAACTCTCTTACTGTAGTTTTTCTAGTAATTTGTTGCTCATACTGTAAATTTGACTCTACTTATCATATATAAAGCTTCAGATTGAGATTTTGAGTATCTTGTCCTCAACCCACCAATCCTGCTGCAGAAAAGGGATGATATATTTCTGGACCAAATTGTGTTGGAAAACATGTAGCACAGCTAACAAATTTGGTTATCACTTTGCTGTCACAGCACAAGGAGCAATACGGCTCATTGTGCGCTGTGATAATAACAAATACATAAATTATAAATAGCATCCAGAAATTACAATGAAAATGTGTCCTCTAGGGTGAAATAAATTATAAATTCCTACATAACTGCTACTACTACATTACTACATTAGGTAAGGGTCCAGCCCAAAGTCTTCTCTTCCTGTTGCAGTTTCACCATTTCCCTGCACCAACACAATCACATCAGCAGCTTGCTGGAAGAGTAGAGTACATGAAGAAAATTATAAATTTAGTCTGATGTAATGTGTTATAATCAAGAACTTGTTCATGAGAAACAAGAAAAAGTTATAATTCTATACATAAGTTGGATGTAGTATTGTAGTTAGTTACGACTTACGGTCATACCTGTTTGAGAAATCTCCCCACTATATCCTCCAAGGATTCAGAATTACTCTGGTTTGAACCACTGAGAAGCTTTGGGTCATCCAGCAGGTAATTGTCATCAAACTTGAGAGTTGAGTCGCTATTGGATTTCTGAACTTGGATTTGTGAACAAGTACCTCCCAACTGGCTGAAGGGTGAAAACCCCATCATGTTATTGTTGCTTTCTCTCACATTATCTCCTAGTGTTGGAGGAGTAGCAGACGTGATCACGGAGCTTGAAGTGTTGGAAAAATTACATGAATTCAGAACATAATCCGGCCTTTGATTATTCCAAATTTGTGGTTGTGCCAGATTATTGAAGCTGTTGAAACCACTTGGACAGTCTTGGCTTTGTACATTTCCTTTCGAAGCACCCATGGATGAGAATCCACAAAAAACATCAATGGGGTCATTTTGAATACTCTGGATTTGAGTATTACTTACTGGAACAATGCCATCATTAGAGACATTGATTGGAAGCAAATTTCGAGGGGTTGAAAGGGGTTGAGTTGTAGGAGGTTGAACAGGAGTTCCCCAGTTGGTTTTGGAGCTAGGATCCAGTAAATTAGGAGTAAGATTTGAAGAGGGTCCAAAATCTCTGGTGTTTAATGTTGGGGTTCCATTAATCATCATGGAAGTGGTTGGTGGCATCTGTTGAAATAAACTAGTATTCAAAGGCCCAAATTTGGAAAGAGTATTGAAGGAGTTAGAAAGTTGGGGTGTTTGAAGCATTGGGGAAGAAGAGAGGCCTCTAAGACTAATACCGGAAGAGGTGTTGAAGCGACCAAGCAATCCTCCACCACCTGACTGATCTGGAAATCTACCTGCCCCTGCAAGGCTGCGATATATTCCATAGCCTCCACCAAGAGGACCTATGTGTGCATCCTCTGCCCCACTTCTTCCTGCCTGCTGATTCGACCCTGAATTTACCTTTCGGAGGTAAAGCCTATATTTCTGCACAAAAACAAGTAGCCTTCCGGAATTAGGCATTACTACTTTGTCAATATAATATATATATACTCCTAAAAAAAAGATCCATAATTATAAACACCTGCAAATGGCTTGCAACTTTTTCTCTTGTGAGTCCTTCAACGTCCATTAGGTCAAGGATCTTCTTAGGAAAGGCCTCTGGCAAATGGAAAAGAAAGAAATGAAGCATTAGCAAACTAGTGTTTAAATTAGTCAAGTAATTAGTAAACAAGTAGTGTTGTGTTACTGACTATCAAGGCCTAATTGGTTGACAGCATCCACAAACTTCTGATGTAATTCGACAGACCAAAAAAGTCTGGGCTTTCTGTGGGTTGTTTGGTCCTCGTTCTCATGGCCACTTTCTTCACCTTCATCATCTTCATCTTCATCGTCATCATTATTATTATTATTATTGTTGTTGTTGTCATCATCTCCAGAAATATTAGAACTCTCTACAATAGCTGTTGGCCTTTGTTGTTGTTGCTTGTTCCTCCTGACAACATGTTGCCATATGTTTTGGAGCTCCTGAATCCTGACAGGTTTCACCAAATAGTCACAAGCACCATTCCTTACACCTTGCATCACTCGCTTCGTATCACTATAAGCTGATAGCACTGTCAATTCCAAACAGAAACAGACACCCTCATTTGTTTAAACATCTACTCCAATACAATTCATACTTGAATATAGATTCAAAGATTAACATCCAAATACATCGATCTAGTTGGTACCAACATTTAATCTATTATAATTTTGCAGAGAAAACAATTTCTGGCAATTAAGGTTCATTGGAAGGCAGAAAGGAATTAACTTAGGTAGGGGATGAGCTTACTAATGACAGGTAGATTCATTTCGAGACCTACGAGTTGAAGGAGCTTGATACCATCCATGTCCGGCATCTCAACATCGGTGATCACAATATCAAACTTGTTCTTGTTTCGTTTCAAAATCCTCAGTGCTTCCTTTGCTTGGTTCGTCGTCGTAACTATTCAATTCCAACA

mRNA sequence

TTTGACAAGTGTAATCCGGTTTATTGTGCACAGTGCACAGTCCAATAGCAGTTTACCAGTTTCGGTATTAGTTTCAGTAAACAAAGAAAGTGAGAGTGAGGAAAAGCAGTGTGTGAAAGCATAGGGTAAGAAAGTTGAAAGCTTTAATAGGAGGAGAGTACTCTCACTCTGTAGTTAATTGCTTTGGATCTGAGATTCTGAAATAGAAATACTCGAGCAAAAAGATGACGGGGGTTGGTTCTGGTTCTGGTTCTGGTTCTGGTTCTGGTTCTGGTTCTGGCTGTGGTTCTGGTGGCGGAGCAGCATCAAATATGATAATGTCGAGGCGAGCAGGCGGTGGTTGGTTGAGATGTTGTTTAGTGACGTTTGCTGTTATCTCAGCACTTTGTGTTTCTGGTCCTGCTCTCTATTGGAAATTCAATCTCCATAATCATATTAATATCAATCTGCACCAACAACAAACACAACACCATCTATCTTCTCCTTCTTCTTGTCCAACTTGCTCCTGCGATTGCCCACCTCCTCCTTCTCTCCTTCAGATTGCCCCTGGATTGGCCAATCTTTCAGTACCAGATTGTGGGAGCAGTGACCCAGATCTGAAGAAAGAAATGGCAAAACAATTTGCGGATCTGTTGACAGAAGAGTTGAAGCTGCAAGAGGCAGTTGCTGCAGAGCACGCGGTGCATATGAATGCAACATTGACAGAAGCAAGAAGAGTAGCTTCTCAGTATCAAAGAGAAGGTGATAAATGCATTGCTGCAACCGAAACCTGCGAGGGAGCCAGAGAGCGTTCTGAGGCTTTCCTTAGGAAGGAGATCAAGTTGACTTCACTTTGGGAACGAAGAGCTCACAAATTGGGCTGGCAACCTGAATTCAACTGAAATCAACTCACTTTCTATTTAGAAAACAATTTCTGGCAATTAAGGTTCATTGGAAGGCAGAAAGGAATTAACTTAGGTAGGGGATGAGCTTACTAATGACAGGTAGATTCATTTCGAGACCTACGAGTTGAAGGAGCTTGATACCATCCATGTCCGGCATCTCAACATCGGTGATCACAATATCAAACTTGTTCTTGTTTCGTTTCAAAATCCTCAGTGCTTCCTTTGCTTGGTTCGTCGTCGTAACTATTCAATTCCAACA

Coding sequence (CDS)

ATGACGGGGGTTGGTTCTGGTTCTGGTTCTGGTTCTGGTTCTGGTTCTGGTTCTGGCTGTGGTTCTGGTGGCGGAGCAGCATCAAATATGATAATGTCGAGGCGAGCAGGCGGTGGTTGGTTGAGATGTTGTTTAGTGACGTTTGCTGTTATCTCAGCACTTTGTGTTTCTGGTCCTGCTCTCTATTGGAAATTCAATCTCCATAATCATATTAATATCAATCTGCACCAACAACAAACACAACACCATCTATCTTCTCCTTCTTCTTGTCCAACTTGCTCCTGCGATTGCCCACCTCCTCCTTCTCTCCTTCAGATTGCCCCTGGATTGGCCAATCTTTCAGTACCAGATTGTGGGAGCAGTGACCCAGATCTGAAGAAAGAAATGGCAAAACAATTTGCGGATCTGTTGACAGAAGAGTTGAAGCTGCAAGAGGCAGTTGCTGCAGAGCACGCGGTGCATATGAATGCAACATTGACAGAAGCAAGAAGAGTAGCTTCTCAGTATCAAAGAGAAGGTGATAAATGCATTGCTGCAACCGAAACCTGCGAGGGAGCCAGAGAGCGTTCTGAGGCTTTCCTTAGGAAGGAGATCAAGTTGACTTCACTTTGGGAACGAAGAGCTCACAAATTGGGCTGGCAACCTGAATTCAACTGA

Protein sequence

MTGVGSGSGSGSGSGSGSGCGSGGGAASNMIMSRRAGGGWLRCCLVTFAVISALCVSGPALYWKFNLHNHININLHQQQTQHHLSSPSSCPTCSCDCPPPPSLLQIAPGLANLSVPDCGSSDPDLKKEMAKQFADLLTEELKLQEAVAAEHAVHMNATLTEARRVASQYQREGDKCIAATETCEGARERSEAFLRKEIKLTSLWERRAHKLGWQPEFN
Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Spo14001Spo14001gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Spo14001.1Spo14001.1-proteinpolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo14001.1.exon.1Spo14001.1.exon.1exon
Spo14001.1.exon.2Spo14001.1.exon.2exon
Spo14001.1.exon.3Spo14001.1.exon.3exon
Spo14001.1.exon.4Spo14001.1.exon.4exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo14001.1.utr5p.1Spo14001.1.utr5p.1five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo14001.1.CDS.1Spo14001.1.CDS.1CDS
Spo14001.1.CDS.2Spo14001.1.CDS.2CDS
Spo14001.1.CDS.3Spo14001.1.CDS.3CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spo14001.1.utr3p.1Spo14001.1.utr3p.1three_prime_UTR
Spo14001.1.utr3p.2Spo14001.1.utr3p.2three_prime_UTR


Homology
BLAST of Spo14001.1 vs. NCBI nr
Match: gi|902183879|gb|KNA10197.1| (hypothetical protein SOVF_146280 [Spinacia oleracea])

HSP 1 Score: 417.9 bits (1073), Expect = 1.100e-113
Identity = 216/218 (99.08%), Postives = 216/218 (99.08%), Query Frame = 1

		  

Query: 1   MTGVGSGSGSGSGSGSGSGCGSGGGAASNMIMSRRAGGGWLRCCLVTFAVISALCVSGPA 60
           MTGVG  SGSGSGSGSGSGCGSGGGAASNMIMSRRAGGGWLRCCLVTFAVISALCVSGPA
Sbjct: 1   MTGVG--SGSGSGSGSGSGCGSGGGAASNMIMSRRAGGGWLRCCLVTFAVISALCVSGPA 60

Query: 61  LYWKFNLHNHININLHQQQTQHHLSSPSSCPTCSCDCPPPPSLLQIAPGLANLSVPDCGS 120
           LYWKFNLHNHININLHQQQTQHHLSSPSSCPTCSCDCPPPPSLLQIAPGLANLSVPDCGS
Sbjct: 61  LYWKFNLHNHININLHQQQTQHHLSSPSSCPTCSCDCPPPPSLLQIAPGLANLSVPDCGS 120

Query: 121 SDPDLKKEMAKQFADLLTEELKLQEAVAAEHAVHMNATLTEARRVASQYQREGDKCIAAT 180
           SDPDLKKEMAKQFADLLTEELKLQEAVAAEHAVHMNATLTEARRVASQYQREGDKCIAAT
Sbjct: 121 SDPDLKKEMAKQFADLLTEELKLQEAVAAEHAVHMNATLTEARRVASQYQREGDKCIAAT 180

Query: 181 ETCEGARERSEAFLRKEIKLTSLWERRAHKLGWQPEFN 219
           ETCEGARERSEAFLRKEIKLTSLWERRAHKLGWQPEFN
Sbjct: 181 ETCEGARERSEAFLRKEIKLTSLWERRAHKLGWQPEFN 216

BLAST of Spo14001.1 vs. NCBI nr
Match: gi|731352816|ref|XP_010687736.1| (PREDICTED: uncharacterized protein LOC104901814 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 277.7 bits (709), Expect = 1.700e-71
Identity = 151/205 (73.66%), Postives = 162/205 (79.02%), Query Frame = 1

		  

Query: 14  SGSGSGCGSGGGAASNMIMSRRAGGGWLRCCLVTFAVISALCVSGPALYWKFN--LHNHI 73
           +GS S  GS       M MSRR+   W+RCCLV FAVISALCVS PALYWK +    + I
Sbjct: 4   TGSSSSSGSSASTMMMMSMSRRS---WVRCCLVMFAVISALCVSAPALYWKLHHSFSSSI 63

Query: 74  NINLHQQQTQHHLSSPSSCPTCSCDCPPPPSLLQIAPGLANLSVPDCGSSDPDLKKEMAK 133
           N   HQ+  Q  L   SSCP CSCDCPP PSLLQIAPGLANLSVPDCGSSDPDLKKEMAK
Sbjct: 64  NNITHQKHQQLLLLQSSSCPPCSCDCPPTPSLLQIAPGLANLSVPDCGSSDPDLKKEMAK 123

Query: 134 QFADLLTEELKLQEAVAAEHAVHMNATLTEARRVASQYQREGDKCIAATETCEGARERSE 193
           QF DLLTEELKLQE VAAE+A HMN +L EARRVASQYQREGDKC+AATETCEGARERSE
Sbjct: 124 QFVDLLTEELKLQEGVAAENAQHMNVSLAEARRVASQYQREGDKCVAATETCEGARERSE 183

Query: 194 AFLRKEIKLTSLWERRAHKLGWQPE 217
           A LRKE+K TSLWERRA KLGW+ E
Sbjct: 184 ALLRKEMKFTSLWERRARKLGWEGE 205

BLAST of Spo14001.1 vs. NCBI nr
Match: gi|823154951|ref|XP_012477360.1| (PREDICTED: uncharacterized protein LOC105793002 [Gossypium raimondii])

HSP 1 Score: 243.8 bits (621), Expect = 2.800e-61
Identity = 125/185 (67.57%), Postives = 143/185 (77.30%), Query Frame = 1

		  

Query: 32  MSRRAGGGWLRCCLVTFAVISALCVSGPALYWKFNLHNHININLHQQQTQHHLSSPSSCP 91
           MSRR+ G WLR CLV FAV+SAL V GPALYW+F            ++T   + S SSCP
Sbjct: 1   MSRRSSGTWLRLCLVIFAVVSALAVCGPALYWRF------------KKTLRFVDSKSSCP 60

Query: 92  TCSCDCPPPPSLLQIAPGLANLSVPDCGSSDPDLKKEMAKQFADLLTEELKLQEAVAAEH 151
            C CDCPPP SLL+IAPGLANLSV DCGS+DPDLKKEM KQF DLLTEELKLQEAVA EH
Sbjct: 61  PCICDCPPPLSLLKIAPGLANLSVTDCGSNDPDLKKEMEKQFVDLLTEELKLQEAVAEEH 120

Query: 152 AVHMNATLTEARRVASQYQREGDKCIAATETCEGARERSEAFLRKEIKLTSLWERRAHKL 211
             HMN T  EA+RVASQYQRE +KCIAA ETCEGARER+EA L KE K+T++WE+RA ++
Sbjct: 121 TRHMNITFGEAKRVASQYQREAEKCIAAIETCEGARERAEALLIKERKVTTIWEQRARQM 173

Query: 212 GWQPE 217
           GW+ E
Sbjct: 181 GWEGE 173

BLAST of Spo14001.1 vs. NCBI nr
Match: gi|728841841|gb|KHG21284.1| (Ubiquitin-associated 2 [Gossypium arboreum])

HSP 1 Score: 243.0 bits (619), Expect = 4.700e-61
Identity = 125/185 (67.57%), Postives = 142/185 (76.76%), Query Frame = 1

		  

Query: 32  MSRRAGGGWLRCCLVTFAVISALCVSGPALYWKFNLHNHININLHQQQTQHHLSSPSSCP 91
           MSRR+ G WLR CLV FAV+SAL V GPALYW+F            ++T   + S SSCP
Sbjct: 1   MSRRSSGTWLRLCLVIFAVVSALAVCGPALYWRF------------KKTLRFVDSKSSCP 60

Query: 92  TCSCDCPPPPSLLQIAPGLANLSVPDCGSSDPDLKKEMAKQFADLLTEELKLQEAVAAEH 151
            C CDCPPP SLL+IAPGLANLSV DCGS+DPDLKKEM KQF DLLTEELKLQEAVA EH
Sbjct: 61  PCICDCPPPLSLLKIAPGLANLSVTDCGSNDPDLKKEMEKQFVDLLTEELKLQEAVAEEH 120

Query: 152 AVHMNATLTEARRVASQYQREGDKCIAATETCEGARERSEAFLRKEIKLTSLWERRAHKL 211
             HMN T  EA+RVASQYQRE +KCIAA ETCEGARER+EA L KE K T++WE+RA ++
Sbjct: 121 TRHMNITFGEAKRVASQYQREAEKCIAAIETCEGARERAEALLIKERKATTIWEQRARQM 173

Query: 212 GWQPE 217
           GW+ E
Sbjct: 181 GWEGE 173

BLAST of Spo14001.1 vs. NCBI nr
Match: gi|590572923|ref|XP_007011979.1| (Uncharacterized protein TCM_037096 [Theobroma cacao])

HSP 1 Score: 233.8 bits (595), Expect = 2.900e-58
Identity = 126/185 (68.11%), Postives = 143/185 (77.30%), Query Frame = 1

		  

Query: 32  MSRRAGGGWLRCCLVTFAVISALCVSGPALYWKFNLHNHININLHQQQTQHHLSSPSSCP 91
           MSRRAG   LR CLV FAV+SAL V GPALYW+F            ++T     S SSCP
Sbjct: 1   MSRRAGT-CLRLCLVIFAVVSALGVCGPALYWRF------------KKTLRLGDSKSSCP 60

Query: 92  TCSCDCPPPPSLLQIAPGLANLSVPDCGSSDPDLKKEMAKQFADLLTEELKLQEAVAAEH 151
            C CDCPPP SLL+IAPGLANLSV DCGSSDPDLK+EM KQF DLLTEELKLQEAV AEH
Sbjct: 61  PCICDCPPPLSLLKIAPGLANLSVTDCGSSDPDLKQEMEKQFVDLLTEELKLQEAVTAEH 120

Query: 152 AVHMNATLTEARRVASQYQREGDKCIAATETCEGARERSEAFLRKEIKLTSLWERRAHKL 211
           A H+N T  EA+RVASQYQRE +KCIAATETCEGARER+EA L +E K+T+LWE+RA ++
Sbjct: 121 ARHVNITFGEAKRVASQYQREAEKCIAATETCEGARERAEALLIRERKVTTLWEQRARQM 172

Query: 212 GWQPE 217
           GW+ E
Sbjct: 181 GWEGE 172

BLAST of Spo14001.1 vs. UniProtKB/TrEMBL
Match: A0A0K9QSC9_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_146280 PE=4 SV=1)

HSP 1 Score: 417.9 bits (1073), Expect = 7.400e-114
Identity = 216/218 (99.08%), Postives = 216/218 (99.08%), Query Frame = 1

		  

Query: 1   MTGVGSGSGSGSGSGSGSGCGSGGGAASNMIMSRRAGGGWLRCCLVTFAVISALCVSGPA 60
           MTGVG  SGSGSGSGSGSGCGSGGGAASNMIMSRRAGGGWLRCCLVTFAVISALCVSGPA
Sbjct: 1   MTGVG--SGSGSGSGSGSGCGSGGGAASNMIMSRRAGGGWLRCCLVTFAVISALCVSGPA 60

Query: 61  LYWKFNLHNHININLHQQQTQHHLSSPSSCPTCSCDCPPPPSLLQIAPGLANLSVPDCGS 120
           LYWKFNLHNHININLHQQQTQHHLSSPSSCPTCSCDCPPPPSLLQIAPGLANLSVPDCGS
Sbjct: 61  LYWKFNLHNHININLHQQQTQHHLSSPSSCPTCSCDCPPPPSLLQIAPGLANLSVPDCGS 120

Query: 121 SDPDLKKEMAKQFADLLTEELKLQEAVAAEHAVHMNATLTEARRVASQYQREGDKCIAAT 180
           SDPDLKKEMAKQFADLLTEELKLQEAVAAEHAVHMNATLTEARRVASQYQREGDKCIAAT
Sbjct: 121 SDPDLKKEMAKQFADLLTEELKLQEAVAAEHAVHMNATLTEARRVASQYQREGDKCIAAT 180

Query: 181 ETCEGARERSEAFLRKEIKLTSLWERRAHKLGWQPEFN 219
           ETCEGARERSEAFLRKEIKLTSLWERRAHKLGWQPEFN
Sbjct: 181 ETCEGARERSEAFLRKEIKLTSLWERRAHKLGWQPEFN 216

BLAST of Spo14001.1 vs. UniProtKB/TrEMBL
Match: A0A0J8BUI5_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_8g192490 PE=4 SV=1)

HSP 1 Score: 277.7 bits (709), Expect = 1.200e-71
Identity = 151/205 (73.66%), Postives = 162/205 (79.02%), Query Frame = 1

		  

Query: 14  SGSGSGCGSGGGAASNMIMSRRAGGGWLRCCLVTFAVISALCVSGPALYWKFN--LHNHI 73
           +GS S  GS       M MSRR+   W+RCCLV FAVISALCVS PALYWK +    + I
Sbjct: 4   TGSSSSSGSSASTMMMMSMSRRS---WVRCCLVMFAVISALCVSAPALYWKLHHSFSSSI 63

Query: 74  NINLHQQQTQHHLSSPSSCPTCSCDCPPPPSLLQIAPGLANLSVPDCGSSDPDLKKEMAK 133
           N   HQ+  Q  L   SSCP CSCDCPP PSLLQIAPGLANLSVPDCGSSDPDLKKEMAK
Sbjct: 64  NNITHQKHQQLLLLQSSSCPPCSCDCPPTPSLLQIAPGLANLSVPDCGSSDPDLKKEMAK 123

Query: 134 QFADLLTEELKLQEAVAAEHAVHMNATLTEARRVASQYQREGDKCIAATETCEGARERSE 193
           QF DLLTEELKLQE VAAE+A HMN +L EARRVASQYQREGDKC+AATETCEGARERSE
Sbjct: 124 QFVDLLTEELKLQEGVAAENAQHMNVSLAEARRVASQYQREGDKCVAATETCEGARERSE 183

Query: 194 AFLRKEIKLTSLWERRAHKLGWQPE 217
           A LRKE+K TSLWERRA KLGW+ E
Sbjct: 184 ALLRKEMKFTSLWERRARKLGWEGE 205

BLAST of Spo14001.1 vs. UniProtKB/TrEMBL
Match: A0A0D2S7U2_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_004G291300 PE=4 SV=1)

HSP 1 Score: 243.8 bits (621), Expect = 1.900e-61
Identity = 125/185 (67.57%), Postives = 143/185 (77.30%), Query Frame = 1

		  

Query: 32  MSRRAGGGWLRCCLVTFAVISALCVSGPALYWKFNLHNHININLHQQQTQHHLSSPSSCP 91
           MSRR+ G WLR CLV FAV+SAL V GPALYW+F            ++T   + S SSCP
Sbjct: 1   MSRRSSGTWLRLCLVIFAVVSALAVCGPALYWRF------------KKTLRFVDSKSSCP 60

Query: 92  TCSCDCPPPPSLLQIAPGLANLSVPDCGSSDPDLKKEMAKQFADLLTEELKLQEAVAAEH 151
            C CDCPPP SLL+IAPGLANLSV DCGS+DPDLKKEM KQF DLLTEELKLQEAVA EH
Sbjct: 61  PCICDCPPPLSLLKIAPGLANLSVTDCGSNDPDLKKEMEKQFVDLLTEELKLQEAVAEEH 120

Query: 152 AVHMNATLTEARRVASQYQREGDKCIAATETCEGARERSEAFLRKEIKLTSLWERRAHKL 211
             HMN T  EA+RVASQYQRE +KCIAA ETCEGARER+EA L KE K+T++WE+RA ++
Sbjct: 121 TRHMNITFGEAKRVASQYQREAEKCIAAIETCEGARERAEALLIKERKVTTIWEQRARQM 173

Query: 212 GWQPE 217
           GW+ E
Sbjct: 181 GWEGE 173

BLAST of Spo14001.1 vs. UniProtKB/TrEMBL
Match: A0A0B0PAG6_GOSAR (Ubiquitin-associated 2 OS=Gossypium arboreum GN=F383_02581 PE=4 SV=1)

HSP 1 Score: 243.0 bits (619), Expect = 3.300e-61
Identity = 125/185 (67.57%), Postives = 142/185 (76.76%), Query Frame = 1

		  

Query: 32  MSRRAGGGWLRCCLVTFAVISALCVSGPALYWKFNLHNHININLHQQQTQHHLSSPSSCP 91
           MSRR+ G WLR CLV FAV+SAL V GPALYW+F            ++T   + S SSCP
Sbjct: 1   MSRRSSGTWLRLCLVIFAVVSALAVCGPALYWRF------------KKTLRFVDSKSSCP 60

Query: 92  TCSCDCPPPPSLLQIAPGLANLSVPDCGSSDPDLKKEMAKQFADLLTEELKLQEAVAAEH 151
            C CDCPPP SLL+IAPGLANLSV DCGS+DPDLKKEM KQF DLLTEELKLQEAVA EH
Sbjct: 61  PCICDCPPPLSLLKIAPGLANLSVTDCGSNDPDLKKEMEKQFVDLLTEELKLQEAVAEEH 120

Query: 152 AVHMNATLTEARRVASQYQREGDKCIAATETCEGARERSEAFLRKEIKLTSLWERRAHKL 211
             HMN T  EA+RVASQYQRE +KCIAA ETCEGARER+EA L KE K T++WE+RA ++
Sbjct: 121 TRHMNITFGEAKRVASQYQREAEKCIAAIETCEGARERAEALLIKERKATTIWEQRARQM 173

Query: 212 GWQPE 217
           GW+ E
Sbjct: 181 GWEGE 173

BLAST of Spo14001.1 vs. UniProtKB/TrEMBL
Match: A0A061GJK9_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_037096 PE=4 SV=1)

HSP 1 Score: 233.8 bits (595), Expect = 2.000e-58
Identity = 126/185 (68.11%), Postives = 143/185 (77.30%), Query Frame = 1

		  

Query: 32  MSRRAGGGWLRCCLVTFAVISALCVSGPALYWKFNLHNHININLHQQQTQHHLSSPSSCP 91
           MSRRAG   LR CLV FAV+SAL V GPALYW+F            ++T     S SSCP
Sbjct: 1   MSRRAGT-CLRLCLVIFAVVSALGVCGPALYWRF------------KKTLRLGDSKSSCP 60

Query: 92  TCSCDCPPPPSLLQIAPGLANLSVPDCGSSDPDLKKEMAKQFADLLTEELKLQEAVAAEH 151
            C CDCPPP SLL+IAPGLANLSV DCGSSDPDLK+EM KQF DLLTEELKLQEAV AEH
Sbjct: 61  PCICDCPPPLSLLKIAPGLANLSVTDCGSSDPDLKQEMEKQFVDLLTEELKLQEAVTAEH 120

Query: 152 AVHMNATLTEARRVASQYQREGDKCIAATETCEGARERSEAFLRKEIKLTSLWERRAHKL 211
           A H+N T  EA+RVASQYQRE +KCIAATETCEGARER+EA L +E K+T+LWE+RA ++
Sbjct: 121 ARHVNITFGEAKRVASQYQREAEKCIAATETCEGARERAEALLIRERKVTTLWEQRARQM 172

Query: 212 GWQPE 217
           GW+ E
Sbjct: 181 GWEGE 172

BLAST of Spo14001.1 vs. TAIR (Arabidopsis)
Match: AT4G30996.1 (Protein of unknown function (DUF1068))

HSP 1 Score: 217.2 bits (552), Expect = 9.700e-57
Identity = 117/182 (64.29%), Postives = 133/182 (73.08%), Query Frame = 1

		  

Query: 35  RAGGGWLRCCLVTFAVISALCVSGPALYWKFNLHNHININLHQQQTQHHLSSPSSCPTCS 94
           R  G  +RC LV FAV+SAL V GPALYWKFN           +       + S CP C 
Sbjct: 3   RRSGDCMRC-LVIFAVVSALVVCGPALYWKFN-----------KGFVGSTRANSLCPPCV 62

Query: 95  CDCPPPPSLLQIAPGLANLSVPDCGSSDPDLKKEMAKQFADLLTEELKLQEAVAAEHAVH 154
           CDCPPP SLLQIAPGLANLS+ DCGS DP+LK+EM KQF DLLTEELKLQEAVA EH+ H
Sbjct: 63  CDCPPPLSLLQIAPGLANLSITDCGSDDPELKQEMEKQFVDLLTEELKLQEAVADEHSRH 122

Query: 155 MNATLTEARRVASQYQREGDKCIAATETCEGARERSEAFLRKEIKLTSLWERRAHKLGWQ 214
           MN TL EA+RVASQYQ+E +KC AATE CE ARER+EA L KE K+TSLWE+RA + GW+
Sbjct: 123 MNVTLAEAKRVASQYQKEAEKCNAATEICESARERAEALLIKERKITSLWEKRARQSGWE 172

Query: 215 PE 217
            E
Sbjct: 183 GE 172

BLAST of Spo14001.1 vs. TAIR (Arabidopsis)
Match: AT2G24290.1 (Protein of unknown function (DUF1068))

HSP 1 Score: 208.0 bits (528), Expect = 5.900e-54
Identity = 114/185 (61.62%), Postives = 132/185 (71.35%), Query Frame = 1

		  

Query: 32  MSRRAGGGWLRCCLVTFAVISALCVSGPALYWKFNLHNHININLHQQQTQHHLSSPSSCP 91
           M+RR+G      CLV F+V+SAL V GPALYWK N          +       S+ S CP
Sbjct: 1   MARRSGN--CMRCLVIFSVVSALLVCGPALYWKLN----------KGFVGSARSTNSICP 60

Query: 92  TCSCDCPPPPSLLQIAPGLANLSVPDCGSSDPDLKKEMAKQFADLLTEELKLQEAVAAEH 151
            C CD PPP SLLQIAPGLANLS+  CGS DP+LK+EM K F DLLTEELKLQEAVA EH
Sbjct: 61  PCVCDFPPPLSLLQIAPGLANLSITGCGSDDPELKEEMEKPFVDLLTEELKLQEAVADEH 120

Query: 152 AVHMNATLTEARRVASQYQREGDKCIAATETCEGARERSEAFLRKEIKLTSLWERRAHKL 211
           + HMN TL EA+RVASQYQ+E +KC AATE CE ARER++A L KE K+T LWERRA +L
Sbjct: 121 SRHMNVTLAEAKRVASQYQKEAEKCNAATEICESARERAQALLLKERKITFLWERRARQL 173

Query: 212 GWQPE 217
           GW+ E
Sbjct: 181 GWEGE 173

BLAST of Spo14001.1 vs. TAIR (Arabidopsis)
Match: AT2G32580.1 (Protein of unknown function (DUF1068))

HSP 1 Score: 115.9 bits (289), Expect = 3.100e-26
Identity = 65/159 (40.88%), Postives = 93/159 (58.49%), Query Frame = 1

		  

Query: 56  VSGPALYWKFNLHNHININLHQQQTQHHLSSPSSCPTCSCDCPPPPSLLQIAPGLANLSV 115
           + GP LYW                T+    S +SC  C CDC   P LL I  GL+N S 
Sbjct: 23  ILGPPLYWHL--------------TEALAVSATSCSACVCDCSSLP-LLTIPTGLSNGSF 82

Query: 116 PDCGSSDPDLKKEMAKQFADLLTEELKLQEAVAAEHAVHMNATLTEARRVASQYQREGDK 175
            DC   DP++ ++  K +A+LLTEELK +EA + E    ++  L EA+++ S YQ+E DK
Sbjct: 83  TDCAKRDPEVNEDTEKNYAELLTEELKQREAASMEKHKRVDTGLLEAKKITSSYQKEADK 142

Query: 176 CIAATETCEGARERSEAFLRKEIKLTSLWERRAHKLGWQ 215
           C +  ETCE ARE++E  L ++ KLTS+WE+RA + G++
Sbjct: 143 CNSGMETCEEAREKAEKALVEQKKLTSMWEQRARQKGYK 166

BLAST of Spo14001.1 vs. TAIR (Arabidopsis)
Match: AT1G05070.1 (Protein of unknown function (DUF1068))

HSP 1 Score: 114.4 bits (285), Expect = 8.900e-26
Identity = 70/174 (40.23%), Postives = 97/174 (55.75%), Query Frame = 1

		  

Query: 41  LRCCLVTFAVISALCVSGPALYWKFNLHNHININLHQQQTQHHLSSPSSCPTCSCDCPPP 100
           L+  L    +  A  + GP LYW      H+   L          S SSCP+C C+C   
Sbjct: 8   LKIGLALLGLSMAGYILGPPLYW------HLTEALAAV-------SASSCPSCPCECSTY 67

Query: 101 PSLLQIAPGLANLSVPDCGSSDPDLKKEMAKQFADLLTEELKLQEAVAAEHAVHMNATLT 160
            S + I   L+N S  DC   DP++ ++  K +A+LLTEELKL+EA + E     +  L 
Sbjct: 68  -SAVTIPKELSNASFADCAKHDPEVNEDTEKNYAELLTEELKLREAESLEKHKRADMGLL 127

Query: 161 EARRVASQYQREGDKCIAATETCEGARERSEAFLRKEIKLTSLWERRAHKLGWQ 215
           EA++V S YQ+E DKC +  ETCE ARE++E  L ++ KLTS WE RA + GW+
Sbjct: 128 EAKKVTSSYQKEADKCNSGMETCEEAREKAELALAEQKKLTSRWEERARQKGWR 167

BLAST of Spo14001.1 vs. TAIR (Arabidopsis)
Match: AT4G04360.1 (Protein of unknown function (DUF1068))

HSP 1 Score: 108.6 bits (270), Expect = 4.900e-24
Identity = 69/169 (40.83%), Postives = 97/169 (57.40%), Query Frame = 1

		  

Query: 50  VISALCV----SGPALYWKFNLHNHININLHQQQTQHHLSSPSSCPTCSCDCPPPPSLLQ 109
           V+  LC+    +GP+LYW  +L+  I  +LH           SSCP C CDC   P LL 
Sbjct: 14  VVMGLCIVAYIAGPSLYW--HLNETIADSLH-----------SSCPPCVCDCSSQP-LLS 73

Query: 110 IAPGLANLSVPDCGSSDPDLKKEMAKQFADLLTEELKLQEAVAAEHAVHMNATLTEARRV 169
           I  GL+N S  DC   +    +E    F +++ EELKL+EA A E     +  L +A++ 
Sbjct: 74  IPDGLSNHSFLDCMRHEEG-SEESESSFTEMVAEELKLREAQAQEDEWRADRLLLDAKKA 133

Query: 170 ASQYQREGDKCIAATETCEGARERSEAFLRKEIKLTSLWERRAHKLGWQ 215
           ASQYQ+E DKC    ETCE ARE++EA L ++ +L+ +WE RA + GW+
Sbjct: 134 ASQYQKEADKCSMGMETCELAREKAEAALDEQRRLSYMWELRARQGGWK 167

The following BLAST results are available for this feature:
BLAST of Spo14001.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match NameE-valueIdentityDescription
gi|902183879|gb|KNA10197.1|1.1e-11399.0hypothetical protein SOVF_1462... [more]
gi|731352816|ref|XP_010687736.1|1.7e-7173.6PREDICTED: uncharacterized pro... [more]
gi|823154951|ref|XP_012477360.1|2.8e-6167.5PREDICTED: uncharacterized pro... [more]
gi|728841841|gb|KHG21284.1|4.7e-6167.5Ubiquitin-associated 2 [Gossyp... [more]
gi|590572923|ref|XP_007011979.1|2.9e-5868.1Uncharacterized protein TCM_03... [more]
back to top
BLAST of Spo14001.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniprotKB/TrEMBL)
Total hits: 5
Match NameE-valueIdentityDescription
A0A0K9QSC9_SPIOL7.4e-11499.0Uncharacterized protein OS=Spi... [more]
A0A0J8BUI5_BETVU1.2e-7173.6Uncharacterized protein OS=Bet... [more]
A0A0D2S7U2_GOSRA1.9e-6167.5Uncharacterized protein OS=Gos... [more]
A0A0B0PAG6_GOSAR3.3e-6167.5Ubiquitin-associated 2 OS=Goss... [more]
A0A061GJK9_THECC2.0e-5868.1Uncharacterized protein OS=The... [more]
back to top
BLAST of Spo14001.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy SwissProt)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Spo14001.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 5
Match NameE-valueIdentityDescription
AT4G30996.19.7e-5764.2Protein of unknown function (D... [more]
AT2G24290.15.9e-5461.6Protein of unknown function (D... [more]
AT2G32580.13.1e-2640.8Protein of unknown function (D... [more]
AT1G05070.18.9e-2640.2Protein of unknown function (D... [more]
AT4G04360.14.9e-2440.8Protein of unknown function (D... [more]
back to top
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR010471Protein of unknown function DUF1068PFAMPF06364DUF1068coord: 40..214
score: 6.3
NoneNo IPR availablePANTHERPTHR32254FAMILY NOT NAMEDcoord: 30..216
score: 2.2E
NoneNo IPR availablePANTHERPTHR32254:SF4EXPRESSED PROTEINcoord: 30..216
score: 2.2E