ATP-dependent Clp protease, ATP-binding subunit ClpC of Acidobacterium capsulatum str. ATCC 51196

NAMES AND LINEAGE

Protein names

Name

ATP-dependent Clp protease, ATP-binding subunit ClpC.

Synonymous Names

C1F373_ACIC5, Caseinolytic peptidase C, ClpC.

Gene names

Gene Name

clpC

Gene Locus Tag

ACP_2765

Gene Identifier

7699911

Taxon Information

Organism

Acidobacterium capsulatum str. ATCC 51196

Organism ID

240015 [NCBI] [UNIPROT]

Lineage

Bacteria; Acidobacteria; Acidobacteriales; Acidobacteriaceae; Acidobacterium; Acidobacterium capsulatum ATCC 51196

CLASSIFICATION

Protein Classification

Protein Family

Belongs to Hsp100 Protein family

Protein Type

Class I Clp Proteins
(ClpC-Caseinolytic peptidase C)

SEQUENCE INFORMATION

Sequence
MFERYTEKARRVIFFARYEASQFGSPYIETEHLLLGLLREDKALTNRFLRSHASVESIRKQIEGHTTIRE KVSTSVDLPLSNECKRVLAYAAEEAERLSHKHIGTEHLLLGLLREEKCFAAEILQERGLKLVAIREELAR ATQEKAPPAQRNRESSLLAEFSRDLTQAAADNTLDPLIGRDQELERVVQILCRRTKNNPVLIGEPGVGKT AIVEGLAQRIADGDVPSFLADKRVLALDLSLIVAGTKYRGQFEERLKTIMKELMENQNSIIFIDELHTLV GAGSAEGSLDAANILKPALSRGEIQCIGATTPGEYRKSIEKDRSLERRFQAVKVPPPNEEDAIKIIMGIK DRYEKFHAVSYTDDSIEFAVSHSNRYIPDRFLPDKAIDLIDEAGARVKLRQTSLPEEITEVQKRIKFIVH RMENAIANHEFEKARFYSDEERKERENLRALREKYHLDDSTAGIVSREDIEDVVSRWTGVPITSIKEEET QKLLRVEGELHKRVISQEKAISALARAIRRSRAGLKSPHRPIGSFLFLGPTGVGKTEVARTLAQFLFGSE KSIIRFDMSEFMEKHSVSKLIGSPPGYVGYEEGGQLTERVKRSPYSVVLLDEIEKAHPDVFNILLQVFED GQLTDGLGNTVDFKNTIIIMTSNIGARHLQRKQGLGFQSDREELVMDKVEDLVRNEVKRTFNPEFLNRID EIIIFQSLTDADLIQILELLVQQLNANLAQKAITISVNEEAKKWILEKTLIDRSYGARPLRRALQRYVED PLSEALIAGHISDRPAFLEVYLDNNQLFYRPVAREGEDTKPEGVLLYS

Nucleotide
 ATGTTCGAACGCTATACAGAAAAAGCACGGCGCGTGATCTTTTTCGCACGGTATGAGGCCAGCCAGTTCGGCTCGCCCTATATCGAGAC
CGAGCACCTGCTGCTGGGTCTGCTGCGCGAGGACAAAGCGTTGACCAATCGCTTCCTTCGCTCCCACGCGTCAGTGGAGTCGATTCGCAA
GCAGATTGAAGGCCACACCACCATTCGCGAAAAGGTCTCGACCTCCGTCGACCTGCCGCTCTCCAACGAATGCAAGCGCGTGCTCGCCTA
CGCCGCCGAAGAAGCCGAGCGGCTCTCGCACAAGCACATCGGCACCGAGCACCTGCTGCTCGGCCTGCTGCGTGAAGAAAAGTGCTTCGC
CGCTGAAATCCTCCAGGAGCGCGGCCTCAAGCTCGTCGCCATTCGCGAAGAGCTGGCGCGCGCCACGCAGGAGAAAGCGCCGCCCGCGCA
GCGCAACCGGGAATCCAGCCTGCTGGCCGAGTTTTCGCGCGACCTCACCCAGGCCGCGGCTGACAACACACTCGACCCGCTCATCGGCCG
CGATCAGGAGCTCGAACGCGTCGTCCAGATTCTCTGCCGCCGCACCAAAAACAATCCGGTGCTCATCGGCGAGCCAGGCGTCGGCAAAAC
CGCCATCGTTGAGGGGCTGGCCCAGCGCATCGCCGACGGCGACGTGCCCAGCTTCCTCGCCGACAAGCGCGTGCTCGCGCTCGACCTCTC
ATTGATCGTCGCGGGAACGAAATACCGCGGCCAGTTTGAAGAGCGCCTCAAGACCATCATGAAAGAGCTGATGGAAAATCAGAACTCCAT
CATCTTCATCGATGAGCTGCACACGCTGGTCGGCGCAGGCTCGGCCGAGGGCTCGCTCGACGCGGCCAACATCCTCAAGCCCGCCCTCTC
GCGCGGTGAAATCCAGTGCATCGGCGCCACCACGCCCGGCGAGTACCGCAAGTCCATCGAGAAAGACCGCTCCCTCGAGCGGCGCTTCCA
GGCCGTCAAGGTGCCGCCGCCCAATGAAGAGGATGCCATCAAGATCATCATGGGCATCAAAGACCGCTATGAGAAGTTCCACGCGGTCAG
CTACACCGATGATTCCATCGAGTTCGCCGTCTCGCACTCCAATCGCTACATTCCCGACCGCTTCCTGCCCGACAAGGCCATCGACCTCAT
CGACGAGGCCGGCGCGCGCGTCAAGCTGCGCCAGACCTCGCTGCCTGAGGAGATCACCGAGGTACAGAAGCGCATCAAGTTCATCGTCCA
CCGCATGGAGAACGCCATCGCGAACCACGAGTTCGAAAAGGCGCGCTTCTACTCCGACGAGGAACGCAAGGAGCGCGAAAACCTGCGCGC
CCTCCGCGAGAAATATCACCTCGATGACTCCACCGCCGGCATCGTAAGCCGCGAAGACATCGAAGACGTGGTCAGCCGCTGGACCGGCGT
GCCCATCACTTCGATCAAGGAAGAAGAGACCCAGAAGCTGCTGCGCGTCGAAGGCGAGCTGCACAAGCGCGTCATCTCGCAGGAGAAGGC
CATCTCGGCCCTCGCCCGCGCCATCCGCCGCTCCCGTGCGGGCCTCAAGTCGCCGCACCGGCCCATCGGCTCGTTCCTCTTCCTCGGCCC
CACCGGCGTTGGCAAAACCGAGGTCGCGCGCACCCTCGCGCAATTCCTTTTCGGCAGCGAGAAGTCGATCATCCGCTTCGATATGTCGGA
GTTCATGGAAAAGCACTCCGTCTCGAAGCTCATCGGTTCGCCTCCGGGCTACGTCGGCTATGAGGAAGGCGGCCAGCTCACCGAGCGCGT
CAAACGTTCGCCCTACTCGGTCGTGCTGCTCGACGAAATCGAAAAGGCGCACCCGGATGTCTTCAACATCCTGTTGCAGGTCTTTGAGGA
TGGCCAGTTGACCGACGGCCTCGGCAACACGGTCGACTTCAAGAACACCATCATCATCATGACCTCCAACATCGGCGCGCGGCACCTGCA
GCGCAAGCAGGGCCTCGGCTTCCAGAGCGACCGCGAAGAGCTCGTCATGGACAAGGTCGAAGATCTCGTGCGCAACGAGGTCAAGCGCAC
CTTCAATCCCGAGTTCCTCAACCGCATCGACGAGATCATCATCTTCCAGTCGCTCACCGACGCCGACCTCATCCAGATTCTTGAACTGCT
CGTGCAGCAGCTCAATGCCAATCTGGCGCAGAAGGCCATCACCATCTCGGTCAACGAAGAGGCCAAGAAGTGGATCCTCGAGAAGACGCT
CATCGACCGCAGCTACGGAGCGCGCCCCCTGCGCCGCGCACTGCAGCGCTACGTCGAAGACCCGCTCTCCGAGGCCCTCATCGCCGGCCA
CATCAGTGACCGTCCGGCCTTCCTTGAGGTCTACCTCGACAACAACCAGCTCTTCTACCGCCCCGTCGCGCGCGAGGGCGAAGACACCAA
ACCCGAAGGCGTCCTGCTCTACAGCTAA 

Composition
Nucleotide   Count   Percentage
Adenine      533     22 %
Guanine      666     28 %
Thymine      397     17 %
Cytosine     861     36 %
A+T          930     38 %
G+C          1527    63 %

Total Bases: 2457 bases
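
The composition table can be recomputed from the nucleotide sequence. The sketch below is a minimal illustration, not the database's own code: the names `ceil_percent` and `base_composition` are hypothetical, and rounding percentages up is an assumption inferred from the table, where every listed percentage matches the count divided by 2457 rounded up to the next integer.

```python
from collections import Counter


def ceil_percent(count, total):
    """Integer percentage rounded up (assumed rounding rule, inferred from the table)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return (100 * count + total - 1) // total


def base_composition(seq):
    """Return {row name: (count, percent)} for A, G, T, C, A+T and G+C."""
    seq = "".join(seq.split()).upper()  # drop line breaks and spaces
    counts = Counter(seq)
    rows = {base: counts.get(base, 0) for base in "AGTC"}
    rows["A+T"] = rows["A"] + rows["T"]
    rows["G+C"] = rows["G"] + rows["C"]
    total = len(seq)
    return {name: (n, ceil_percent(n, total)) for name, n in rows.items()}


if __name__ == "__main__":
    # Paste the full nucleotide sequence here; the first codons are used as a placeholder.
    fragment = "ATGTTCGAACGCTATACAGAAAAAGCACGG"
    for name, (n, pct) in base_composition(fragment).items():
        print(f"{name:<4} {n:>5} {pct:>3} %")
```

Integer arithmetic is used for the rounding so the result does not depend on floating-point behaviour.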

Microsatellites
Sequence       Position   Length   Repeats
CTGCTGCTG      96         3        3
GCGCGCGC       414        2        4
CGCGCG         484        2        3
ACACAC         514        2        3
CGCGCG         898        2        3
GCCGCCGCC      1001       3        3
GCGCGCGC       1180       2        4
GCGCGC         1299       2        3
GCGCGC         1343       2        3
CGCGCG         1643       2        3
CATCATCATCAT   1937       3        4
GCGCGC         1963       2        3
ATCATCATC      2103       3        3
GCGCGC         2268       2        3
CGCGCGCG       2405       2        4
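
A table like the one above can be approximated with a short tandem-repeat scan. The following is a sketch under stated assumptions, not the tool used to build this record: the function name `find_microsatellites` is hypothetical, the motif lengths (2 and 3) and the minimum of 3 repeats are read off the table, and positions are taken to be 0-based offsets, which is consistent with the first entry (CTGCTGCTG starts at offset 96 of the nucleotide sequence).

```python
import re


def find_microsatellites(seq, motif_lengths=(2, 3), min_repeats=3):
    """Return (sequence, position, motif_length, repeats) tuples, sorted by position.

    Positions are 0-based offsets (an assumption inferred from the record).
    """
    seq = "".join(seq.split()).upper()  # drop line breaks and spaces
    hits = []
    for k in motif_lengths:
        # A k-base motif followed by at least (min_repeats - 1) further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_repeats - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if len(set(motif)) == 1:
                continue  # skip homopolymer runs such as AAAAAA
            run = match.group(0)
            hits.append((run, match.start(), k, len(run) // k))
    return sorted(hits, key=lambda hit: hit[1])


if __name__ == "__main__":
    # Bases 91-108 of the nucleotide sequence (offset 90 onwards).
    fragment = "GAGCACCTGCTGCTGGGT"
    for run, pos, k, n in find_microsatellites(fragment):
        print(run, 90 + pos, k, n)
```

The scan is leftmost-first and non-overlapping per motif length, so it may not reproduce every row of the table for other inputs; treat it as a starting point.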

Gene Information
Molecule Type      genomic DNA
Bases              2457 bases
Molecular Weight   755884.21 Da
Coding Sequence    complement(CP001472.1:3220240..3222696)
NCBI Reference     ACO32257
Notes              identified by similarity to SP:P37571; similarity to

DOMAINS AND MOTIFS

Domains
Domain     Position     Length   Description                   Feature Identifier
N Domain   16 to 65     50       Clp amino terminal domain     PF02861
N Domain   91 to 143    53       Clp amino terminal domain     PF02861
NBD_1      199 to 334   136      Nucleotide binding domain 1   PF00004
UVR        412 to 446   35       UvrB/UvrC motif               PF02151
NBD_2      530 to 702   173      Nucleotide binding domain 2   PF07724
Clp_C      708 to 797   90       C terminal, D2 small domain   PF10431
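
The domain coordinates can be used to cut the corresponding segments out of the protein sequence. The sketch below assumes the positions are 1-based and inclusive, which agrees with the Length column (end - start + 1 for every row); the `Domain` type and the helper names are hypothetical and only illustrate the bookkeeping.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    name: str
    start: int  # 1-based, inclusive (assumed)
    end: int    # 1-based, inclusive (assumed)
    pfam: str


# Rows copied from the Domains table of this record.
DOMAINS = [
    Domain("N Domain", 16, 65, "PF02861"),
    Domain("N Domain", 91, 143, "PF02861"),
    Domain("NBD_1", 199, 334, "PF00004"),
    Domain("UVR", 412, 446, "PF02151"),
    Domain("NBD_2", 530, 702, "PF07724"),
    Domain("Clp_C", 708, 797, "PF10431"),
]


def domain_length(domain):
    """Length in residues for 1-based inclusive coordinates."""
    return domain.end - domain.start + 1


def slice_domain(protein, domain):
    """Return the residues of one domain from a protein sequence string."""
    protein = "".join(protein.split())  # drop the spaces between sequence blocks
    return protein[domain.start - 1:domain.end]


if __name__ == "__main__":
    for d in DOMAINS:
        print(f"{d.name:<9} {d.start:>4}-{d.end:<4} {domain_length(d):>4} {d.pfam}")
```

Passing the protein sequence from the Sequence section to `slice_domain` returns the residues for a given row, for example the UvrB/UvrC motif at 412 to 446.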

Protein Card Information

Record Information

Entry Name

ClpC_ACCA1

Accession Number

HSP100_0016

Added on

2012-02-02

Updated On

2014-01-20