Help  Custom Search
 
ClpB protein, putative. of Plasmodium falciparum 3D7

 Names & Lineage - Classification - Sequence - Domains - Ontologies - Refernces - X-Ref - External Links -Record Info 

NAMES AND LINEAGE

Protein names

Name

ClpB protein, putative.

Synonymous Names

Q8IB03_PLAF7, Casenolytic peptidase B, ClpB.

Taxon Information

Organism

Plasmodium falciparum 3D7

Organism ID

36329 [NCBI] [UNIPROT]

Lineage

Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; Plasmodium; Plasmodium (Laverania); Plasmodium falciparum

CLASSIFICATION

Protein Classification

Protein Family

Belongs to Hsp100 Protein family

Protein Type

Class I Clp Proteins
(ClpB-Caseinolytic peptidase B)

SEQUENCE INFORMATION

Sequence
MVNSFFFCFVIIGLIYVWDITYSKKAKIFFNKNDIFSIKNTHWDIYDKKKYFFIGNNHLKNEESFLPEVR KDYKSQIKEYKNSTNGIIYHNNKNRLSYTINDQVNYDNNMTSGINKKRKVKDSSIHMNNSYEKNRNKNKF ALFMSDEEYTINSDDYTEKAWEAISSLNKIGEKYDSAYVEAEMLLLALLNDSPDGLAERILKESGIDTQL LVQEIDDYLKKQPKMPSGFGEQKILGRTLQTVLSTSKRLKKEFNDEYISIEHLLLSIISEDSKFTRPWLL KYNVNYEKVKKAVEKIRGKKKVTSKTPEMTYQALEKYSRDLTALARAGKLDPVIGRDNEIRRAIQILSRR TKNNPILLGDPGVGKTAIVEGLAIKIVQGDVPDSLKGRKLVSLDMSSLIAGAKYRGDFEERLKSILKEVQ DAEGQVVMFIDEIHTVVGAGAVAEGALDAGNILKPMLARGELRCIGATTVSEYRQFIEKDKALERRFQQI LVEQPSVDETISILRGLKERYEVHHGVRILDSALVQAAVLSDRYISYRFLPDKAIDLIDEAASNLKIQLS SKPIQLENIEKQLIQLEMEKISILGDKQKNLFNYSSVANTHNNNNNSSISSNNSSSYGNAEETEATVDYT KSPNFLKKRINEKEIDRLKMIDRIMSELRKEQRKILDSWSTEKSYVDNIRAIKERIDVVKIEIEKAERYF DLNRAAELRFETLPDLEKQLKKAEENYLNDIPEKSRILKDEVTSEDIVNIVSMSTGIRLNKLLKSEKEKI LNLENELHKQIIGQDDAVKVVTKAVQRSRVGMNNPKRPIASLMFLGPTGVGKTELSKVLADVLFDTPEAV IHFDMSEYMEKHSISKLIGAAPGYVGYEQGGLLTDAVRKKPYSIILFDEIEKAHPDVYNLLLRVIDEGKL SDTKGNVANFRNTIIIFTSNLGSQSILDLANDPNKKEKIKEQVMKSVRETFRPEFYNRIDDHVIFDSLSK KELKEIANIEIRKVANRLFDKNFKITIDDAVFSYIVDKAYDPSFGARPLKRVIQSEIETEIAVRILDETF VENDTINISLKDQKLHFSKS

FASTA Format Composition Secondary Structure

Nucleotide
 ATGGTTAATAGTTTTTTTTTTTGTTTTGTGATAATTGGTCTTATTTATGTATGGGACATCACGTACAGTAAAAAAGCTAAAATATTTTT
TAATAAAAATGATATCTTTAGTATAAAAAATACGCACTGGGATATTTATGATAAGAAGAAATATTTTTTTATTGGAAATAATCATTTAAA
AAATGAAGAAAGTTTTTTACCAGAAGTAAGAAAGGATTATAAATCACAAATAAAAGAATATAAGAATTCAACGAATGGTATTATATATCA
TAATAATAAAAACAGATTAAGTTATACAATAAATGATCAAGTAAATTATGATAATAATATGACAAGTGGTATTAATAAAAAAAGAAAAGT
TAAAGATAGTAGTATACACATGAATAATTCTTATGAAAAAAATAGAAACAAAAATAAATTTGCTTTATTTATGAGTGATGAAGAATATAC
CATTAATTCAGATGATTATACCGAAAAAGCTTGGGAAGCTATTAGCTCCTTAAATAAAATTGGAGAAAAATATGATTCGGCATATGTAGA
AGCTGAAATGTTATTATTAGCTCTACTAAATGATTCACCCGATGGTTTAGCTGAAAGAATATTAAAAGAAAGTGGTATAGATACCCAATT
ATTAGTTCAAGAAATTGATGATTATTTAAAAAAACAACCTAAGATGCCTAGTGGTTTTGGAGAACAGAAAATATTAGGTAGAACTTTACA
AACTGTATTAAGTACTAGTAAAAGATTAAAAAAAGAATTTAATGATGAATATATTTCCATAGAACACCTATTACTAAGTATCATTTCAGA
AGATTCTAAATTTACTAGACCCTGGTTATTAAAATATAATGTAAATTATGAAAAAGTAAAAAAAGCTGTAGAAAAAATTCGAGGAAAAAA
AAAAGTTACTTCTAAAACACCAGAAATGACTTATCAAGCTCTAGAAAAATATAGTAGAGATCTAACAGCTTTGGCAAGAGCAGGAAAATT
AGATCCTGTTATAGGTAGAGATAATGAAATTAGAAGAGCCATACAAATTTTATCCAGAAGAACTAAAAATAATCCTATCTTATTAGGAGA
TCCTGGTGTTGGGAAAACAGCTATTGTTGAAGGGTTAGCCATAAAAATCGTACAAGGAGATGTACCTGACTCATTAAAAGGAAGGAAATT
AGTATCTTTAGATATGTCTTCTCTTATAGCTGGTGCAAAATATAGAGGTGATTTTGAAGAAAGGCTAAAATCAATTCTGAAAGAAGTACA
AGATGCTGAAGGTCAAGTTGTTATGTTTATAGATGAAATCCATACTGTTGTGGGAGCTGGAGCGGTCGCAGAAGGTGCATTAGATGCTGG
TAATATATTAAAACCTATGTTAGCTAGAGGTGAATTACGTTGTATTGGTGCTACGACGGTTAGTGAATATAGACAATTTATAGAAAAGGA
TAAAGCATTAGAAAGAAGATTTCAACAAATTCTTGTTGAACAACCAAGTGTTGATGAAACTATTAGTATATTAAGAGGTCTAAAAGAAAG
ATATGAAGTTCATCATGGTGTACGTATATTAGATTCTGCATTAGTACAAGCTGCTGTTTTATCAGATCGTTATATTAGTTATAGATTCTT
ACCAGATAAAGCGATTGATCTTATTGACGAAGCTGCATCTAATCTTAAAATACAACTATCTAGTAAACCTATTCAATTAGAAAATATAGA
AAAACAACTTATACAATTAGAAATGGAAAAAATATCCATATTAGGAGATAAACAAAAGAATCTATTTAATTATTCTAGTGTAGCTAACAC
ACACAATAATAATAATAATAGTAGTATTAGTAGCAATAACTCGTCATCATATGGTAACGCTGAAGAAACTGAAGCAACTGTTGATTATAC
TAAAAGCCCCAATTTTTTAAAAAAAAGAATTAATGAAAAAGAAATTGATAGATTAAAAATGATCGATCGAATCATGAGCGAATTAAGAAA
AGAACAAAGAAAAATCCTAGATTCTTGGTCCACCGAAAAAAGCTATGTAGATAATATCAGAGCTATTAAAGAAAGAATAGATGTTGTTAA
AATAGAAATTGAAAAAGCTGAAAGATATTTTGATTTAAATAGAGCAGCTGAATTGAGATTTGAAACATTACCTGATTTAGAAAAACAATT
AAAAAAAGCAGAAGAAAATTATCTAAATGATATCCCTGAAAAAAGTAGAATATTAAAAGATGAAGTTACAAGTGAAGATATTGTTAATAT
TGTAAGTATGTCTACCGGTATCAGATTAAATAAATTACTAAAATCTGAAAAAGAAAAAATACTTAATCTTGAAAATGAATTACATAAACA
AATTATCGGTCAAGATGATGCCGTAAAAGTTGTAACCAAAGCTGTTCAAAGATCTAGGGTTGGAATGAATAACCCTAAAAGACCAATAGC
ATCTTTAATGTTTTTAGGACCAACAGGAGTAGGAAAAACGGAATTATCTAAGGTATTGGCAGATGTATTATTTGACACACCAGAAGCAGT
AATTCATTTTGATATGTCTGAATATATGGAGAAGCATTCAATTAGTAAATTAATAGGTGCCGCACCAGGTTATGTGGGATATGAACAAGG
AGGATTATTAACAGATGCAGTACGTAAAAAACCATATTCTATCATTTTATTTGATGAAATAGAAAAAGCACATCCTGATGTATATAATTT
ATTATTAAGAGTTATAGATGAGGGAAAATTATCTGATACCAAAGGAAATGTAGCTAATTTTAGAAATACAATTATTATATTTACATCCAA
TTTAGGAAGTCAAAGTATACTAGATCTAGCTAATGATCCAAATAAAAAAGAAAAAATCAAAGAACAGGTAATGAAATCAGTGAGAGAAAC
ATTTAGACCTGAATTTTATAACAGAATTGATGATCATGTTATATTTGATAGCTTATCAAAAAAAGAATTAAAAGAAATTGCAAATATTGA
AATTAGAAAAGTAGCTAATCGTCTATTTGATAAAAATTTTAAAATAACTATAGACGATGCTGTCTTTTCATATATAGTAGATAAAGCCTA
TGATCCTTCTTTTGGTGCTAGACCTCTTAAAAGAGTTATACAATCTGAAATAGAAACGGAAATTGCTGTAAGAATATTAGATGAAACCTT
TGTAGAAAATGATACTATTAATATATCTCTCAAGGATCAGAAGTTGCACTTTTCAAAAAGTTAA 

Download Fasta Sequence

Composition
NucleotideCountPercentage
Adenine136243 %
Guanine54017 %
Thyamine96330 %
Cytosine34811 %
A+T232573 %
G+C88828 %

Total Bases :   3213 bases

Microsatellites
SequencePosition LengthRepeats
TTTTTTTTTT1125
AAAAAA6923
TTTTTT8423
AAAAAA11323
TTTTTT15223
AAAAAA17623
TTTTTT19123
TATATA26023
ATAATAATA26833
ATAATAATA31933
AAAAAA34523
AAAAAA39423
TTATTATTA54933
AAAAAA65623
AAAAAA74623
ATATAT76723
AAAAAA86623
AAAAAA88023
AAAAAAAAAA89325
ATATAT135123
AAAAAA173523
ACACACAC179524
AATAATAATAATAAT180335
TTTTTT190123
AAAAAA190823
AAAAAA201423
AAAAAA215923
AAAAAA219723
AAAAAA230223
ACACAC250323
ATATAT254023
AAAAAA263423
TATATA268923
TTATTATTA269733
AAAAAA283223
AAAAAA283923
GAGAGA287023
AAAAAA293623
ATATAT303823
ATATAT316923

Microsatellite Map

ATGGTTAATAGTTTTTTTTTTTGTTTTGTGATAATTGGTCTTATTTATGTATGGGACATCACGTACAGTAAAAAAGCTAAAATATTTTTT
AATAAAAATGATATCTTTAGTATAAAAAATACGCACTGGGATATTTATGATAAGAAGAAATATTTTTTTATTGGAAATAATCATTTAAAA
AATGAAGAAAGTTTTTTACCAGAAGTAAGAAAGGATTATAAATCACAAATAAAAGAATATAAGAATTCAACGAATGGTATTATATATCAT
AATAATAAAAACAGATTAAGTTATACAATAAATGATCAAGTAAATTATGATAATAATATGACAAGTGGTATTAATAAAAAAAGAAAAGTT
AAAGATAGTAGTATACACATGAATAATTCTTATGAAAAAAATAGAAACAAAAATAAATTTGCTTTATTTATGAGTGATGAAGAATATACC
ATTAATTCAGATGATTATACCGAAAAAGCTTGGGAAGCTATTAGCTCCTTAAATAAAATTGGAGAAAAATATGATTCGGCATATGTAGAA
GCTGAAATGTTATTATTAGCTCTACTAAATGATTCACCCGATGGTTTAGCTGAAAGAATATTAAAAGAAAGTGGTATAGATACCCAATTA
TTAGTTCAAGAAATTGATGATTATTTAAAAAAACAACCTAAGATGCCTAGTGGTTTTGGAGAACAGAAAATATTAGGTAGAACTTTACAA
ACTGTATTAAGTACTAGTAAAAGATTAAAAAAAGAATTTAATGATGAATATATTTCCATAGAACACCTATTACTAAGTATCATTTCAGAA
GATTCTAAATTTACTAGACCCTGGTTATTAAAATATAATGTAAATTATGAAAAAGTAAAAAAAGCTGTAGAAAAAATTCGAGGAAAAAAA
AAAGTTACTTCTAAAACACCAGAAATGACTTATCAAGCTCTAGAAAAATATAGTAGAGATCTAACAGCTTTGGCAAGAGCAGGAAAATTA
GATCCTGTTATAGGTAGAGATAATGAAATTAGAAGAGCCATACAAATTTTATCCAGAAGAACTAAAAATAATCCTATCTTATTAGGAGAT
CCTGGTGTTGGGAAAACAGCTATTGTTGAAGGGTTAGCCATAAAAATCGTACAAGGAGATGTACCTGACTCATTAAAAGGAAGGAAATTA
GTATCTTTAGATATGTCTTCTCTTATAGCTGGTGCAAAATATAGAGGTGATTTTGAAGAAAGGCTAAAATCAATTCTGAAAGAAGTACAA
GATGCTGAAGGTCAAGTTGTTATGTTTATAGATGAAATCCATACTGTTGTGGGAGCTGGAGCGGTCGCAGAAGGTGCATTAGATGCTGGT
AATATATTAAAACCTATGTTAGCTAGAGGTGAATTACGTTGTATTGGTGCTACGACGGTTAGTGAATATAGACAATTTATAGAAAAGGAT
AAAGCATTAGAAAGAAGATTTCAACAAATTCTTGTTGAACAACCAAGTGTTGATGAAACTATTAGTATATTAAGAGGTCTAAAAGAAAGA
TATGAAGTTCATCATGGTGTACGTATATTAGATTCTGCATTAGTACAAGCTGCTGTTTTATCAGATCGTTATATTAGTTATAGATTCTTA
CCAGATAAAGCGATTGATCTTATTGACGAAGCTGCATCTAATCTTAAAATACAACTATCTAGTAAACCTATTCAATTAGAAAATATAGAA
AAACAACTTATACAATTAGAAATGGAAAAAATATCCATATTAGGAGATAAACAAAAGAATCTATTTAATTATTCTAGTGTAGCTAACACA
CACAATAATAATAATAATAGTAGTATTAGTAGCAATAACTCGTCATCATATGGTAACGCTGAAGAAACTGAAGCAACTGTTGATTATACT
AAAAGCCCCAATTTTTTAAAAAAAAGAATTAATGAAAAAGAAATTGATAGATTAAAAATGATCGATCGAATCATGAGCGAATTAAGAAAA
GAACAAAGAAAAATCCTAGATTCTTGGTCCACCGAAAAAAGCTATGTAGATAATATCAGAGCTATTAAAGAAAGAATAGATGTTGTTAAA
ATAGAAATTGAAAAAGCTGAAAGATATTTTGATTTAAATAGAGCAGCTGAATTGAGATTTGAAACATTACCTGATTTAGAAAAACAATTA
AAAAAAGCAGAAGAAAATTATCTAAATGATATCCCTGAAAAAAGTAGAATATTAAAAGATGAAGTTACAAGTGAAGATATTGTTAATATT
GTAAGTATGTCTACCGGTATCAGATTAAATAAATTACTAAAATCTGAAAAAGAAAAAATACTTAATCTTGAAAATGAATTACATAAACAA
ATTATCGGTCAAGATGATGCCGTAAAAGTTGTAACCAAAGCTGTTCAAAGATCTAGGGTTGGAATGAATAACCCTAAAAGACCAATAGCA
TCTTTAATGTTTTTAGGACCAACAGGAGTAGGAAAAACGGAATTATCTAAGGTATTGGCAGATGTATTATTTGACACACCAGAAGCAGTA
ATTCATTTTGATATGTCTGAATATATGGAGAAGCATTCAATTAGTAAATTAATAGGTGCCGCACCAGGTTATGTGGGATATGAACAAGGA
GGATTATTAACAGATGCAGTACGTAAAAAACCATATTCTATCATTTTATTTGATGAAATAGAAAAAGCACATCCTGATGTATATAATTTA
TTATTAAGAGTTATAGATGAGGGAAAATTATCTGATACCAAAGGAAATGTAGCTAATTTTAGAAATACAATTATTATATTTACATCCAAT
TTAGGAAGTCAAAGTATACTAGATCTAGCTAATGATCCAAATAAAAAAGAAAAAATCAAAGAACAGGTAATGAAATCAGTGAGAGAAACA
TTTAGACCTGAATTTTATAACAGAATTGATGATCATGTTATATTTGATAGCTTATCAAAAAAAGAATTAAAAGAAATTGCAAATATTGAA
ATTAGAAAAGTAGCTAATCGTCTATTTGATAAAAATTTTAAAATAACTATAGACGATGCTGTCTTTTCATATATAGTAGATAAAGCCTAT
GATCCTTCTTTTGGTGCTAGACCTCTTAAAAGAGTTATACAATCTGAAATAGAAACGGAAATTGCTGTAAGAATATTAGATGAAACCTTT
GTAGAAAATGATACTATTAATATATCTCTCAAGGATCAGAAGTTGCACTTTTCAAAAAGTTAA

Gene Information
Molecule Type mRNA
Bases 3213 Bases
Molecular Weight997882.7 Da
Coding Sequence 1..3213
NCBI ReferenceXM_001349322
Notes Plasmodium falciparum 3D7 ClpB protein; putative (PF08_0063) mRNA;complete cds. ClpBprotein;putative

DOMAINS AND MOTIFS   Hide

Domains
Domain Position Length Description Feature Identifier
Signal peptide 1 to 23 23 ER targeting peptide Manually identified
N Domain 169 to 220 52 Clp amino terminal domain PF02861
N Domain 246 to 298 53 Clp amino terminal domain PF02861
NBD_1 355 to 493 139 Nucleotide binding domain 1 PF00004
M Domain 407 to 429 23 Coiled coil Middle domain Manually identified
M Domain 675 to 698 25 Coiled coil Middle domain Manually identified
NBD_2 807 to 972 166 Nucleotide binding domain 2 PF07724
Clp_C 978 to 1067 90 C terminal, D2 small domain PF10431

Graphical View

STRUCTURE   hide

3D Structure

PDB Id

View in Jmol

2P65 Jmol

ONTOLOGIES   Show


LITERATURE REFERENCES   Show


ACCESSIONS   Show


OTHER LINKS   Show



Protein Card Information

Record Information

Entry Name

ClpB_PLFA1

Accession Number

HSP100_0655

Added on

2012-02-02

Updated On

2014-01-20