Help  Custom Search
 
ATPase AAA-2 domain protein. of Acidothermus cellulolyticus str. 11B

 Names & Lineage - Classification - Sequence - Domains - Ontologies - Refernces - X-Ref - External Links -Record Info 

NAMES AND LINEAGE

Protein names

Name

ATPase AAA-2 domain protein.

Synonymous Names

A0LRD4_ACIC1, Caseinolytic peptidase C, ClpC.

Taxon Information

Organism

Acidothermus cellulolyticus str. 11B

Organism ID

351607 [NCBI] [UNIPROT]

Lineage

Bacteria; Actinobacteria; Actinobacteridae; Actinomycetales; Frankineae; Acidothermaceae; Acidothermus; Acidothermus cellulolyticus 11B

CLASSIFICATION

Protein Classification

Protein Family

Belongs to Hsp100 Protein family

Protein Type

Class I Clp Proteins
(ClpC-Caseinolytic peptidase C)

SEQUENCE INFORMATION

Sequence
MFERFTDRARRVVVLAQEEARMLNHNYIGTEHILLGLIHEGEGVAAKALESLGISLEGVRQQVEEIIGQG QQAPSGHIPFTPRAKKVLELSLREALQLGHNYIGTEHILLGLIREGEGVAAQVLVKLGADLNRVRQQVIQ LLSGYQGKAEATAGGPAEGTQSTSLVLDQFGRNLTQAAREGKLDPVIGREKEIERVMQVLSRRTKNNPVL VGEPGVGKTAIVEGLAQAIVKGEVPETLKDKQLYTLDLGALVAGSRYRGDFEERLKKVLKEIRTRGDIVL FIDELHTLVGAGAAEGAIDAASILKPMLARGELQTIGATTLDEYRKHLEKDAALERRFQPIQVNEPSLAH TIEILKGLRDRYESHHKVTITDGALVAAAQLADRYISDRFLPDKAIDLIDEAGSRLRIRRMTAPPDLREF DEKIAAVRREKESAIDAQDFEKAAALRDKEKQLLAAKAQREREWKAGDMDVVAEVDEDLIAEVLSAATGI PVFKLTEEETARLLRMEEELHKRVIGQDEAIKALSQAIRRTRAGLKDPKRPGGSFIFAGPSGVGKTELSK ALAEFLFGDEDSLIQLDMSEFMEKHTVSRLFGSPPGYVGYDEGGQLTEKVRRKPFSVVLFDEVEKAHPDI FNSLLQILEDGRLTDAQGRVVDFKNTVIIMTTNLGTRDIAKGFNMGFSKENDTQGSYERMRAKVQDELKQ HFRPEFLNRVDDIIVFHQLTEDEIVKIVDLMIAKVDERLRDRDMSIELTPAAKALLAQKGYDPVLGARPL RRTIQRLVEDPVSEKILFGELRPGHIIVVDAENVGTPDARLTFRGEPKPAAVPETAVVGSEPEQPTTNR

FASTA Format Composition Secondary Structure

Nucleotide
 ATGTTCGAGCGGTTCACCGACCGAGCCCGGCGAGTGGTTGTGCTCGCGCAAGAAGAAGCGCGCATGCTCAACCACAATTACATCGGCAC
TGAACACATCCTTCTTGGCCTGATTCATGAAGGTGAGGGAGTCGCCGCAAAGGCGCTGGAGAGCCTCGGCATCTCCCTGGAGGGTGTCCG
CCAGCAGGTCGAAGAGATCATCGGCCAGGGGCAGCAGGCGCCTTCCGGTCACATCCCGTTCACGCCGAGAGCCAAGAAGGTTCTCGAACT
CAGCCTCCGCGAGGCGCTGCAGCTCGGCCACAACTACATCGGCACCGAGCACATCCTGCTCGGCCTGATCCGGGAAGGTGAGGGAGTCGC
CGCCCAGGTTCTGGTCAAGCTGGGCGCCGACCTCAACCGGGTCCGCCAGCAGGTCATCCAATTGCTCTCCGGTTACCAGGGCAAGGCGGA
GGCGACTGCCGGAGGCCCGGCGGAGGGCACTCAGTCGACGTCCCTCGTTCTCGATCAATTCGGCCGCAACCTCACGCAGGCGGCCCGCGA
GGGCAAGCTCGACCCGGTGATCGGACGGGAGAAGGAAATCGAGCGGGTCATGCAGGTGCTCTCCCGGCGTACCAAGAACAATCCGGTTCT
CGTCGGTGAGCCCGGCGTCGGCAAGACGGCCATCGTCGAGGGGCTGGCTCAGGCGATCGTCAAGGGCGAGGTGCCGGAGACCCTGAAAGA
CAAGCAGCTGTACACGCTCGACCTGGGTGCGCTGGTCGCCGGCAGCCGGTACCGCGGCGATTTCGAGGAGCGGCTCAAGAAGGTGCTCAA
AGAGATCCGCACCCGCGGCGACATCGTCCTGTTCATCGATGAGCTGCACACCCTCGTCGGCGCCGGCGCCGCCGAGGGTGCGATTGACGC
GGCCTCCATCCTCAAGCCGATGCTGGCCCGCGGCGAGCTGCAGACCATCGGGGCGACGACGCTCGATGAGTACCGCAAGCACCTGGAGAA
GGACGCCGCCCTCGAGCGGCGGTTCCAGCCGATCCAGGTCAATGAGCCGTCACTAGCGCACACCATCGAAATTCTGAAGGGCCTGCGTGA
TCGGTACGAGAGTCACCACAAGGTCACCATCACCGACGGCGCCCTGGTCGCGGCCGCGCAGTTGGCTGACCGGTACATCTCGGACCGCTT
CCTGCCGGACAAGGCCATCGACCTCATCGATGAGGCCGGGTCACGGCTGCGGATCCGCCGGATGACCGCGCCGCCGGACCTGCGGGAATT
CGACGAGAAAATCGCCGCGGTTCGGCGCGAGAAAGAGAGCGCGATCGACGCGCAGGATTTCGAGAAGGCCGCGGCGCTCCGCGACAAAGA
AAAGCAGCTCCTTGCCGCGAAGGCGCAGCGGGAGCGGGAGTGGAAAGCCGGTGACATGGACGTCGTCGCGGAGGTCGACGAGGACCTCAT
CGCCGAGGTTCTCTCCGCGGCCACCGGGATTCCGGTCTTCAAGCTCACCGAAGAGGAGACGGCCCGGCTCCTGCGGATGGAAGAAGAGCT
GCACAAGCGGGTGATCGGCCAAGACGAGGCCATCAAGGCGCTGTCGCAGGCGATCCGGCGTACCCGCGCCGGGTTGAAAGACCCGAAGCG
GCCGGGTGGTTCGTTCATCTTCGCCGGACCATCCGGGGTTGGGAAGACCGAGCTCTCCAAGGCGCTCGCCGAATTCCTGTTTGGTGACGA
GGATTCGCTCATCCAGTTGGACATGAGCGAATTCATGGAGAAGCACACGGTGTCGCGGCTCTTCGGTTCCCCGCCTGGATATGTCGGGTA
CGACGAGGGCGGGCAGCTGACCGAGAAGGTGCGCCGCAAGCCGTTCTCCGTCGTCCTCTTCGACGAGGTGGAGAAGGCCCACCCGGACAT
TTTCAATTCGCTCCTGCAAATTCTCGAGGACGGACGCCTCACCGACGCCCAGGGGCGGGTCGTCGATTTCAAGAACACGGTCATCATCAT
GACGACCAACCTCGGCACCCGCGACATCGCCAAGGGCTTCAACATGGGGTTCTCCAAGGAGAACGACACCCAAGGCAGTTACGAGCGGAT
GCGGGCCAAGGTGCAGGACGAATTGAAGCAGCACTTCCGGCCCGAATTCCTCAACCGGGTGGACGACATCATCGTCTTCCACCAGCTCAC
CGAGGACGAGATCGTCAAGATCGTCGACTTGATGATCGCGAAGGTGGACGAACGGCTGCGCGATCGCGACATGTCCATCGAGCTCACCCC
GGCGGCGAAGGCGCTCCTCGCCCAGAAGGGGTACGACCCGGTGCTCGGCGCACGGCCGTTGCGGCGGACCATCCAGCGCCTGGTCGAGGA
TCCGGTCAGCGAGAAGATTCTCTTCGGTGAATTGCGGCCCGGTCACATCATCGTGGTCGATGCGGAGAACGTGGGGACGCCGGACGCCCG
GCTCACCTTCCGCGGCGAACCGAAGCCGGCCGCCGTGCCGGAGACCGCGGTGGTTGGCTCCGAGCCGGAGCAACCGACGACGAACCGCTG
A 

Download Fasta Sequence

Composition
NucleotideCountPercentage
Adenine50420 %
Guanine83434 %
Thyamine38516 %
Cytosine79732 %
A+T88936 %
G+C163165 %

Total Bases :   2520 bases

Microsatellites
SequencePosition LengthRepeats
AAGAAGAAG4933
AGAGAG129223
TCTCTC144823
TCATCATCA196933
CGACGACGA250333

Microsatellite Map

ATGTTCGAGCGGTTCACCGACCGAGCCCGGCGAGTGGTTGTGCTCGCGCAAGAAGAAGCGCGCATGCTCAACCACAATTACATCGGCACT
GAACACATCCTTCTTGGCCTGATTCATGAAGGTGAGGGAGTCGCCGCAAAGGCGCTGGAGAGCCTCGGCATCTCCCTGGAGGGTGTCCGC
CAGCAGGTCGAAGAGATCATCGGCCAGGGGCAGCAGGCGCCTTCCGGTCACATCCCGTTCACGCCGAGAGCCAAGAAGGTTCTCGAACTC
AGCCTCCGCGAGGCGCTGCAGCTCGGCCACAACTACATCGGCACCGAGCACATCCTGCTCGGCCTGATCCGGGAAGGTGAGGGAGTCGCC
GCCCAGGTTCTGGTCAAGCTGGGCGCCGACCTCAACCGGGTCCGCCAGCAGGTCATCCAATTGCTCTCCGGTTACCAGGGCAAGGCGGAG
GCGACTGCCGGAGGCCCGGCGGAGGGCACTCAGTCGACGTCCCTCGTTCTCGATCAATTCGGCCGCAACCTCACGCAGGCGGCCCGCGAG
GGCAAGCTCGACCCGGTGATCGGACGGGAGAAGGAAATCGAGCGGGTCATGCAGGTGCTCTCCCGGCGTACCAAGAACAATCCGGTTCTC
GTCGGTGAGCCCGGCGTCGGCAAGACGGCCATCGTCGAGGGGCTGGCTCAGGCGATCGTCAAGGGCGAGGTGCCGGAGACCCTGAAAGAC
AAGCAGCTGTACACGCTCGACCTGGGTGCGCTGGTCGCCGGCAGCCGGTACCGCGGCGATTTCGAGGAGCGGCTCAAGAAGGTGCTCAAA
GAGATCCGCACCCGCGGCGACATCGTCCTGTTCATCGATGAGCTGCACACCCTCGTCGGCGCCGGCGCCGCCGAGGGTGCGATTGACGCG
GCCTCCATCCTCAAGCCGATGCTGGCCCGCGGCGAGCTGCAGACCATCGGGGCGACGACGCTCGATGAGTACCGCAAGCACCTGGAGAAG
GACGCCGCCCTCGAGCGGCGGTTCCAGCCGATCCAGGTCAATGAGCCGTCACTAGCGCACACCATCGAAATTCTGAAGGGCCTGCGTGAT
CGGTACGAGAGTCACCACAAGGTCACCATCACCGACGGCGCCCTGGTCGCGGCCGCGCAGTTGGCTGACCGGTACATCTCGGACCGCTTC
CTGCCGGACAAGGCCATCGACCTCATCGATGAGGCCGGGTCACGGCTGCGGATCCGCCGGATGACCGCGCCGCCGGACCTGCGGGAATTC
GACGAGAAAATCGCCGCGGTTCGGCGCGAGAAAGAGAGCGCGATCGACGCGCAGGATTTCGAGAAGGCCGCGGCGCTCCGCGACAAAGAA
AAGCAGCTCCTTGCCGCGAAGGCGCAGCGGGAGCGGGAGTGGAAAGCCGGTGACATGGACGTCGTCGCGGAGGTCGACGAGGACCTCATC
GCCGAGGTTCTCTCCGCGGCCACCGGGATTCCGGTCTTCAAGCTCACCGAAGAGGAGACGGCCCGGCTCCTGCGGATGGAAGAAGAGCTG
CACAAGCGGGTGATCGGCCAAGACGAGGCCATCAAGGCGCTGTCGCAGGCGATCCGGCGTACCCGCGCCGGGTTGAAAGACCCGAAGCGG
CCGGGTGGTTCGTTCATCTTCGCCGGACCATCCGGGGTTGGGAAGACCGAGCTCTCCAAGGCGCTCGCCGAATTCCTGTTTGGTGACGAG
GATTCGCTCATCCAGTTGGACATGAGCGAATTCATGGAGAAGCACACGGTGTCGCGGCTCTTCGGTTCCCCGCCTGGATATGTCGGGTAC
GACGAGGGCGGGCAGCTGACCGAGAAGGTGCGCCGCAAGCCGTTCTCCGTCGTCCTCTTCGACGAGGTGGAGAAGGCCCACCCGGACATT
TTCAATTCGCTCCTGCAAATTCTCGAGGACGGACGCCTCACCGACGCCCAGGGGCGGGTCGTCGATTTCAAGAACACGGTCATCATCATG
ACGACCAACCTCGGCACCCGCGACATCGCCAAGGGCTTCAACATGGGGTTCTCCAAGGAGAACGACACCCAAGGCAGTTACGAGCGGATG
CGGGCCAAGGTGCAGGACGAATTGAAGCAGCACTTCCGGCCCGAATTCCTCAACCGGGTGGACGACATCATCGTCTTCCACCAGCTCACC
GAGGACGAGATCGTCAAGATCGTCGACTTGATGATCGCGAAGGTGGACGAACGGCTGCGCGATCGCGACATGTCCATCGAGCTCACCCCG
GCGGCGAAGGCGCTCCTCGCCCAGAAGGGGTACGACCCGGTGCTCGGCGCACGGCCGTTGCGGCGGACCATCCAGCGCCTGGTCGAGGAT
CCGGTCAGCGAGAAGATTCTCTTCGGTGAATTGCGGCCCGGTCACATCATCGTGGTCGATGCGGAGAACGTGGGGACGCCGGACGCCCGG
CTCACCTTCCGCGGCGAACCGAAGCCGGCCGCCGTGCCGGAGACCGCGGTGGTTGGCTCCGAGCCGGAGCAACCGACGACGAACCGCTGA

Gene Information
Molecule Type genomic DNA
Bases 2520 Bases
Molecular Weight779950.48 Da
Coding Sequence CP000481.1:233834..236353
NCBI ReferenceABK51994
NotesPFAM: UvrB/UvrC protein; AAA ATPase; central domain

DOMAINS AND MOTIFS   Hide

Domains
Domain Position Length Description Feature Identifier
N Domain 16 to 68 53 Clp amino terminal domain PF02861
N Domain 91 to 143 53 Clp amino terminal domain PF02861
NBD_1 208 to 345 138 Nucleotide binding domain 1 PF00004
UVR 421 to 456 36 UvrB/UvrC motif PF02151
NBD_2 540 to 713 174 Nucleotide binding domain 2 PF07724
Clp_C 719 to 809 91 C terminal, D2 small domain PF10431

Graphical View

ONTOLOGIES   Show


LITERATURE REFERENCES   Show


ACCESSIONS   Show


OTHER LINKS   Show



Protein Card Information

Record Information

Entry Name

ClpC_ACCE1

Accession Number

HSP100_0020

Added on

2012-02-02

Updated On

2014-01-20