Protein

UniProt accession
A8CGF4 [UniProt]
Protein name
Endolysin
PhaLP type
endolysin

evidence: UniProt function annotation

probability: 99 % (predicted by ML model)

Protein sequence
MMQISSNGITRLKREEGERLKAYSDSRGIPTIGVGHTGKVDGNSVASGMTITAEKSSELLKEDLQWVEDAISSLVRVPLNQNQYDALCSLIFNIGKSAFAGSTVLRQLNLKNYQAAADAFLLWKKAGKDPDILLPRRRRERALFLS
Physico‐chemical
properties
protein length:146 AA
molecular weight:16139,00000 Da
isoelectric point:9,51426
aromaticity:0,06164
hydropathy:-0,30137

Domains

Domains [InterPro]
Protein sequence: A8CGF4
1 146
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage P22 (Bacteriophage P22)
[NCBI]
10754 Lederbergvirus >
Host Salmonella enterica subsp. enterica serovar Typhimurium
[NCBI]
90371 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Salmonella

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
BAF80780.1 [NCBI]
Genbank nucleotide accession
AB362338 [NCBI]
CDS location
range 38488 -> 38928
strand +
CDS
ATGATGCAAATCAGCAGTAACGGAATCACCAGATTAAAACGTGAAGAAGGTGAGAGACTAAAAGCCTATTCAGATAGCAGGGGGATACCAACCATTGGGGTTGGGCATACCGGAAAAGTGGATGGTAATTCTGTCGCATCAGGGATGACAATCACCGCCGAAAAATCTTCTGAACTGCTTAAAGAGGATTTGCAGTGGGTTGAAGATGCGATAAGTAGTCTTGTTCGCGTCCCGCTAAATCAGAACCAGTATGATGCGCTATGTAGCCTGATATTCAACATAGGTAAATCAGCATTTGCCGGCTCTACCGTTCTTCGCCAGTTGAATTTAAAGAATTACCAGGCAGCAGCAGATGCTTTCCTGTTATGGAAAAAAGCTGGTAAAGACCCTGATATTCTCCTTCCACGGAGGCGGCGAGAAAGAGCGCTGTTCTTATCGTGA

Genbank protein accession
BAG12663.1 [NCBI]
Genbank nucleotide accession
AB426868 [NCBI]
CDS location
range 38488 -> 38928
strand +
CDS
ATGATGCAAATCAGCAGTAACGGAATCACCAGATTAAAACGTGAAGAAGGTGAGAGACTAAAAGCCTATTCAGATAGCAGGGGGATACCAACCATTGGGGTTGGGCATACCGGAAAAGTGGATGGTAATTCTGTCGCATCAGGGATGACAATCACCGCCGAAAAATCTTCTGAACTGCTTAAAGAGGATTTGCAGTGGGTTGAAGATGCGATAAGTAGTCTTGTTCGCGTCCCGCTAAATCAGAACCAGTATGATGCGCTATGTAGCCTGATATTCAACATAGGTAAATCAGCATTTGCCGGCTCTACCGTTCTTCGCCAGTTGAATTTAAAGAATTACCAGGCAGCAGCAGATGCTTTCCTGTTATGGAAAAAAGCTGGTAAAGACCCTGATATTCTCCTTCCACGGAGGCGGCGAGAAAGAGCGCTGTTCTTATCGTGA

Genbank protein accession
AAM81442.1 [NCBI]
Genbank nucleotide accession
AF527608 [NCBI]
CDS location
range 38551 -> 38991
strand +
CDS
ATGATGCAAATCAGCAGTAACGGAATCACCAGATTAAAACGTGAAGAAGGTGAGAGACTAAAAGCCTATTCAGATAGCAGGGGGATACCAACCATTGGGGTTGGGCATACCGGAAAAGTGGATGGTAATTCTGTCGCATCAGGGATGACAATCACCGCCGAAAAATCTTCTGAACTGCTTAAAGAGGATTTGCAGTGGGTTGAAGATGCGATAAGTAGTCTTGTTCGCGTCCCGCTAAATCAGAACCAGTATGATGCGCTATGTAGCCTGATATTCAACATAGGTAAATCAGCATTTGCCGGCTCTACCGTTCTTCGCCAGTTGAATTTAAAGAATTACCAGGCAGCAGCAGATGCTTTCCTGTTATGGAAAAAAGCTGGTAAAGACCCTGATATTCTCCTTCCACGGAGGCGGCGAGAAAGAGCGCTGTTCTTATCGTGA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity Molecular function Inferred from Electronic Annotation (UniProt)
GO:0009253 peptidoglycan catabolic process Biological process Inferred from Electronic Annotation (InterPro)
GO:0016998 cell wall macromolecule catabolic process Biological process Inferred from Electronic Annotation (InterPro)
GO:0030430 host cell cytoplasm Cellular component Inferred from Electronic Annotation (UniProt)
GO:0042742 defense response to bacterium Biological process Inferred from Electronic Annotation (UniProt)
GO:0044659 viral release from host cell by cytolysis Biological process Inferred from Electronic Annotation (InterPro)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 lysozyme
aka muramidase
D-glucosamine residues in chitodextrins
Hydrolases
Glycosylases
Glycosidases, i.e. enzymes hydrolyzing O- and S-glycosyl compounds
match to sequence model evidence used in automatic assertion
ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID: upi0000003a96_model

Method: AlphaFold3 Non-commercial

Resolution:

Chain position: A