Help  Custom Search
 
ATPase AAA-2 domain protein. of Acidiphilium cryptum str. JF-5

 Names & Lineage - Classification - Sequence - Domains - Ontologies - Refernces - X-Ref - External Links -Record Info 

NAMES AND LINEAGE

Protein names

Name

ATPase AAA-2 domain protein.

Synonymous Names

A5G325_ACICJ, Caseinolytic peptidase B, ClpB.

Taxon Information

Organism

Acidiphilium cryptum str. JF-5

Organism ID

349163 [NCBI] [UNIPROT]

Lineage

Bacteria; Proteobacteria; Alphaproteobacteria; Rhodospirillales; Acetobacteraceae; Acidiphilium; Acidiphilium cryptum JF-5

CLASSIFICATION

Protein Classification

Protein Family

Belongs to Hsp100 Protein family

Protein Type

Class I Clp Proteins
(ClpB-Caseinolytic peptidase B)

SEQUENCE INFORMATION

Sequence
MAAIDLKSLIARLDDHCRRSLEAAAGLTLSRSHYNVEIEHWLLKLADGTDTDIPLILRHYEIDIGRFLID LNRALDRMKTGNARAPALSPEVVELAKQAWLLASVEHGLSRTRSGHLLWALLADETLARHAREASSHFAK IQPDALKRDFNAITANSIEAQTATRATDGAPAAGGTDDGTPRPGGSAALDQFTEDLTARARAGRIDPILG RDFEIRQAIDILTRRRQNNPIFTGEAGVGKTAVVEGIALKIAQGDVPEALKNVTLRTLDMALLQAGAGMK GEFENRLKSVIDEVKASPKPIILFIDEAHTLIGAGGQAGQNDAANLLKPALARGELRTIAATTWAEYKKY FERDAALTRRFQVVKVEEPSEAMATAMIRGLVGTLEKHHKVRILDEAVSEAVRLSARYIPSRQLPDKAIS LIDTACARVGMSQATEPAPIEDRRRRIDLIDTETGILRREMASGGDHAAREAELIEERARLVAELAELEA RFTTEKQLAGDIAELRTRLEDDANAEDKDALRASLAEKTAALKASQGENPLIFPVVDGQAVAEIVESWTG IPAGRMQSDEINTVLKLKEAMEQRIVGQPHALEAVAQAIRTSRAKLTDPRKPIGVFMMVGTSGTGKTETA LTLANLLYGGEQNMTTINMTEFKEEHKVSLLMGSPPGYVGYGEGGVLTEAVRRRPYSVVLLDEMEKAHPG VQDVFFQVFDKGNMKDGEGRDIDFKNTIIIMTSNAGTDLIEKLFADPETAPDAAALAEALRPELLKYFKP AFLGRVTIVPYFPLSDEVIRQIVVLQLNKIAKRVREGYKAEFTYEPALVETIAARCKESASGARNVENIL SRTLLPELSAEVLARLAEGEMISRVTVGTSEDGMFRYTINQA

FASTA Format Composition Secondary Structure

Nucleotide
 ATGGCTGCTATCGATCTCAAATCTCTGATCGCCCGCCTCGACGACCACTGCCGCCGCAGCCTCGAGGCCGCAGCCGGCCTAACGCTGTC
GCGAAGCCATTACAACGTCGAAATCGAACATTGGCTGCTCAAGCTCGCCGATGGCACCGACACCGACATTCCGCTTATTCTTCGACATTA
CGAGATCGATATCGGTCGTTTCTTGATCGATCTCAACCGCGCGCTCGACCGGATGAAGACCGGCAACGCCCGCGCCCCGGCCCTGTCACC
GGAAGTCGTCGAGCTCGCGAAACAGGCCTGGCTGCTGGCATCGGTCGAACACGGGCTGAGCCGCACGCGCTCGGGTCACCTGCTTTGGGC
CCTGCTGGCCGACGAGACCCTGGCCCGCCACGCCCGCGAGGCCTCCAGCCATTTCGCAAAAATCCAGCCGGACGCACTGAAGCGCGACTT
CAACGCCATCACCGCCAATTCCATCGAGGCGCAGACCGCGACCCGCGCGACCGATGGCGCGCCGGCTGCGGGCGGCACCGATGACGGCAC
GCCCCGTCCCGGCGGCTCGGCGGCGCTGGACCAGTTCACCGAAGATCTGACCGCCCGCGCGCGCGCCGGCCGGATCGACCCGATCCTCGG
CCGCGACTTCGAAATCCGCCAGGCGATCGACATCCTCACCCGCCGCCGGCAGAACAACCCGATCTTCACCGGCGAGGCAGGTGTGGGCAA
GACCGCGGTGGTCGAGGGCATCGCGCTGAAGATCGCCCAGGGCGACGTGCCGGAGGCGCTGAAGAACGTCACGCTGCGCACGCTCGACAT
GGCGCTGCTCCAGGCCGGGGCGGGCATGAAGGGCGAGTTCGAGAACCGGCTGAAATCGGTGATCGACGAGGTGAAGGCCTCGCCGAAGCC
GATCATCCTGTTCATCGACGAGGCGCACACGCTGATCGGCGCCGGCGGCCAGGCCGGCCAGAACGACGCGGCGAACCTGCTCAAGCCCGC
CCTCGCCCGCGGCGAGCTGCGCACCATCGCGGCGACGACCTGGGCCGAATACAAGAAATATTTCGAGCGCGACGCCGCCCTGACCCGCCG
CTTCCAGGTGGTCAAGGTCGAGGAGCCGTCCGAGGCGATGGCCACCGCGATGATCCGCGGCCTCGTCGGCACGCTCGAGAAGCACCACAA
GGTCCGCATCCTCGACGAGGCGGTGAGCGAGGCCGTCAGGCTGTCGGCGCGTTATATTCCCTCACGCCAGCTGCCGGACAAGGCGATCTC
GCTGATCGACACGGCCTGCGCCCGCGTTGGCATGAGCCAGGCGACCGAACCCGCCCCGATCGAGGATCGCCGCCGCCGGATCGACCTGAT
CGACACCGAAACCGGCATCCTCCGCCGTGAGATGGCCAGCGGCGGCGATCACGCCGCCCGCGAGGCCGAACTGATCGAGGAGCGTGCGCG
GCTGGTCGCCGAACTCGCCGAGCTCGAGGCCCGCTTCACGACCGAGAAGCAGCTCGCCGGCGATATCGCCGAGCTGCGCACCCGGCTGGA
AGACGACGCGAACGCGGAGGACAAGGACGCGCTGCGCGCCAGCCTCGCCGAAAAGACCGCGGCGCTGAAGGCCTCGCAGGGCGAGAACCC
GCTGATCTTCCCGGTGGTCGACGGCCAGGCCGTCGCCGAGATCGTCGAGAGCTGGACCGGCATTCCCGCCGGGCGCATGCAGTCCGACGA
GATCAACACGGTGCTGAAGCTGAAGGAGGCGATGGAGCAGCGCATCGTCGGCCAGCCCCATGCGCTCGAGGCGGTCGCCCAGGCGATCCG
CACCTCGCGCGCGAAACTCACCGACCCGCGCAAGCCGATCGGCGTCTTCATGATGGTCGGCACCTCCGGCACCGGCAAGACCGAAACGGC
GCTTACCCTCGCCAACCTGCTCTATGGCGGCGAGCAGAACATGACCACCATCAACATGACCGAGTTCAAGGAAGAACATAAGGTCAGCCT
GCTGATGGGCAGCCCGCCCGGCTATGTCGGCTATGGCGAGGGCGGCGTGCTGACCGAGGCGGTGCGCCGGCGCCCCTATTCGGTGGTGCT
GCTCGACGAGATGGAAAAGGCCCATCCCGGCGTGCAGGACGTCTTCTTCCAGGTGTTCGACAAGGGCAACATGAAGGACGGCGAAGGCCG
CGACATCGACTTCAAGAACACCATCATCATCATGACGTCGAATGCCGGCACCGACCTGATCGAGAAACTCTTCGCCGATCCCGAAACCGC
GCCGGACGCCGCCGCCCTCGCCGAAGCGCTGCGGCCGGAGCTGCTGAAATATTTCAAGCCCGCCTTTCTCGGCCGCGTCACCATCGTCCC
CTATTTCCCGCTCTCCGACGAGGTGATCCGCCAGATCGTGGTGCTCCAGCTCAACAAGATCGCGAAGCGCGTGCGCGAGGGCTACAAGGC
CGAATTCACCTACGAGCCCGCGCTCGTCGAAACCATTGCCGCACGCTGCAAGGAATCGGCCTCGGGCGCGCGCAACGTGGAAAACATCCT
CTCCCGCACGCTGCTGCCCGAACTCTCGGCCGAGGTTCTGGCCAGGCTTGCCGAAGGCGAGATGATTTCCCGCGTGACGGTGGGCACATC
CGAAGACGGGATGTTCCGCTACACCATCAATCAGGCTTAA 

Download Fasta Sequence

Composition
NucleotideCountPercentage
Adenine51320 %
Guanine82732 %
Thyamine37315 %
Cytosine93636 %
A+T88634 %
G+C176367 %

Total Bases :   2649 bases

Microsatellites
SequencePosition LengthRepeats
CGCGCG21623
CGCGCG49223
GCGCGC50523
CGCGCGCGCG59425
CCGCCGCCG66833
CGCCGCCGC132633
GCGGCGGCG138733
GCGCGC156223
CGCGCGCG180424
CATCATCATCAT218034
CGCCGCCGC225533
GCGCGCGC249424

Microsatellite Map

ATGGCTGCTATCGATCTCAAATCTCTGATCGCCCGCCTCGACGACCACTGCCGCCGCAGCCTCGAGGCCGCAGCCGGCCTAACGCTGTCG
CGAAGCCATTACAACGTCGAAATCGAACATTGGCTGCTCAAGCTCGCCGATGGCACCGACACCGACATTCCGCTTATTCTTCGACATTAC
GAGATCGATATCGGTCGTTTCTTGATCGATCTCAACCGCGCGCTCGACCGGATGAAGACCGGCAACGCCCGCGCCCCGGCCCTGTCACCG
GAAGTCGTCGAGCTCGCGAAACAGGCCTGGCTGCTGGCATCGGTCGAACACGGGCTGAGCCGCACGCGCTCGGGTCACCTGCTTTGGGCC
CTGCTGGCCGACGAGACCCTGGCCCGCCACGCCCGCGAGGCCTCCAGCCATTTCGCAAAAATCCAGCCGGACGCACTGAAGCGCGACTTC
AACGCCATCACCGCCAATTCCATCGAGGCGCAGACCGCGACCCGCGCGACCGATGGCGCGCCGGCTGCGGGCGGCACCGATGACGGCACG
CCCCGTCCCGGCGGCTCGGCGGCGCTGGACCAGTTCACCGAAGATCTGACCGCCCGCGCGCGCGCCGGCCGGATCGACCCGATCCTCGGC
CGCGACTTCGAAATCCGCCAGGCGATCGACATCCTCACCCGCCGCCGGCAGAACAACCCGATCTTCACCGGCGAGGCAGGTGTGGGCAAG
ACCGCGGTGGTCGAGGGCATCGCGCTGAAGATCGCCCAGGGCGACGTGCCGGAGGCGCTGAAGAACGTCACGCTGCGCACGCTCGACATG
GCGCTGCTCCAGGCCGGGGCGGGCATGAAGGGCGAGTTCGAGAACCGGCTGAAATCGGTGATCGACGAGGTGAAGGCCTCGCCGAAGCCG
ATCATCCTGTTCATCGACGAGGCGCACACGCTGATCGGCGCCGGCGGCCAGGCCGGCCAGAACGACGCGGCGAACCTGCTCAAGCCCGCC
CTCGCCCGCGGCGAGCTGCGCACCATCGCGGCGACGACCTGGGCCGAATACAAGAAATATTTCGAGCGCGACGCCGCCCTGACCCGCCGC
TTCCAGGTGGTCAAGGTCGAGGAGCCGTCCGAGGCGATGGCCACCGCGATGATCCGCGGCCTCGTCGGCACGCTCGAGAAGCACCACAAG
GTCCGCATCCTCGACGAGGCGGTGAGCGAGGCCGTCAGGCTGTCGGCGCGTTATATTCCCTCACGCCAGCTGCCGGACAAGGCGATCTCG
CTGATCGACACGGCCTGCGCCCGCGTTGGCATGAGCCAGGCGACCGAACCCGCCCCGATCGAGGATCGCCGCCGCCGGATCGACCTGATC
GACACCGAAACCGGCATCCTCCGCCGTGAGATGGCCAGCGGCGGCGATCACGCCGCCCGCGAGGCCGAACTGATCGAGGAGCGTGCGCGG
CTGGTCGCCGAACTCGCCGAGCTCGAGGCCCGCTTCACGACCGAGAAGCAGCTCGCCGGCGATATCGCCGAGCTGCGCACCCGGCTGGAA
GACGACGCGAACGCGGAGGACAAGGACGCGCTGCGCGCCAGCCTCGCCGAAAAGACCGCGGCGCTGAAGGCCTCGCAGGGCGAGAACCCG
CTGATCTTCCCGGTGGTCGACGGCCAGGCCGTCGCCGAGATCGTCGAGAGCTGGACCGGCATTCCCGCCGGGCGCATGCAGTCCGACGAG
ATCAACACGGTGCTGAAGCTGAAGGAGGCGATGGAGCAGCGCATCGTCGGCCAGCCCCATGCGCTCGAGGCGGTCGCCCAGGCGATCCGC
ACCTCGCGCGCGAAACTCACCGACCCGCGCAAGCCGATCGGCGTCTTCATGATGGTCGGCACCTCCGGCACCGGCAAGACCGAAACGGCG
CTTACCCTCGCCAACCTGCTCTATGGCGGCGAGCAGAACATGACCACCATCAACATGACCGAGTTCAAGGAAGAACATAAGGTCAGCCTG
CTGATGGGCAGCCCGCCCGGCTATGTCGGCTATGGCGAGGGCGGCGTGCTGACCGAGGCGGTGCGCCGGCGCCCCTATTCGGTGGTGCTG
CTCGACGAGATGGAAAAGGCCCATCCCGGCGTGCAGGACGTCTTCTTCCAGGTGTTCGACAAGGGCAACATGAAGGACGGCGAAGGCCGC
GACATCGACTTCAAGAACACCATCATCATCATGACGTCGAATGCCGGCACCGACCTGATCGAGAAACTCTTCGCCGATCCCGAAACCGCG
CCGGACGCCGCCGCCCTCGCCGAAGCGCTGCGGCCGGAGCTGCTGAAATATTTCAAGCCCGCCTTTCTCGGCCGCGTCACCATCGTCCCC
TATTTCCCGCTCTCCGACGAGGTGATCCGCCAGATCGTGGTGCTCCAGCTCAACAAGATCGCGAAGCGCGTGCGCGAGGGCTACAAGGCC
GAATTCACCTACGAGCCCGCGCTCGTCGAAACCATTGCCGCACGCTGCAAGGAATCGGCCTCGGGCGCGCGCAACGTGGAAAACATCCTC
TCCCGCACGCTGCTGCCCGAACTCTCGGCCGAGGTTCTGGCCAGGCTTGCCGAAGGCGAGATGATTTCCCGCGTGACGGTGGGCACATCC
GAAGACGGGATGTTCCGCTACACCATCAATCAGGCTTAA

Gene Information
Molecule Type genomic DNA
Bases 2649 Bases
Molecular Weight817010.52 Da
Coding Sequence complement(CP000697.1:3348860..3351508)
NCBI ReferenceABQ32257
NotesPFAM: AAA ATPase; central domain protein; Clp N

DOMAINS AND MOTIFS   Hide

Domains
Domain Position Length Description Feature Identifier
N Domain 24 to 76 53 Clp amino terminal domain PF02861
NBD_1 230 to 367 138 Nucleotide binding domain 1 PF00004
M Domain 467 to 499 33 Coiled coil Middle domain Manually identified
NBD_2 611 to 779 169 Nucleotide binding domain 2 PF07724
Clp_C 784 to 874 91 C terminal, D2 small domain PF10431

Graphical View

ONTOLOGIES   Show


LITERATURE REFERENCES   Show


ACCESSIONS   Show


OTHER LINKS   Show



Protein Card Information

Record Information

Entry Name

ClpB_ACCR2

Accession Number

HSP100_0007

Added on

2012-02-02

Updated On

2014-01-20