Protein
- UniProt accession
- O03937 [UniProt]
- Protein name
- Minor capsid protein
- PhaLP type
-
VAL
evidence: GO annotation
probability: 99 % (predicted by ML model)
- Protein sequence
-
MADGTVTIDVLMGTKSFMSDRERVENLLKTLGADAGNQMDEAFTNNSNKVQKKARETKKKIKNEFDSPIIIKLEAKAKEAGVKDFRKILNQIPRNQLTRLKAKSERDEVIDWKKEISRIPEKKSTRLKVDKKQASDDLTALKKQSESTEHSFSHLKEIVVGTFLGGAIQAGVQGLVTGLKDAAKAGMEYNKQQDMMRMNWHNLTTEAPKDGEELLTYINHVSQHSIYAADTIDKMAQSFYHVHSSEKETKKWTDDFVALGSTLHVSNDALKESGEQFAKIVAGGKTSAEDMSVMISRFPMFGEALQKATGKSMSQLYAMSAAGKLTSKQFTEALDYLGKKYRGSTEEAMNSFQGMSMYIKSRWSMLTGNIMASSFKMSKGVAQDMRNLLSDNMMKKYADLASTAISHVTGWLVELIKYVNAHKNTIVDIIGNLGKILGIIGKTVWKTFSDIVYDIAKMFGLVGEKAQESKDPLDKIDDALKNLSKNQELIENLTKAFIAMFALKKGMEFIGMLASLRKSLIETAAVSKMVDLFGGSGVTSAGGKAVTQTVAKEAGGTAATAGSSKVLGRLFAKGGATSTAELEAASGLGGGKAMMAARGLTKAVPYMSIAASIPELFGTTQKTLGKHLGGFAGSAGGPAAGAAAGSAVMPVVGTAVGGVIGGLAGSKLGQSVGGSIQKGITKSFPKLTSKMSDLGHDMAKKFSGSFKPKPSLNDKQFSKSYTSLTKTLNKQAKIKIKTDTSGISKAQKLTDTTYGKMKKSVDKYYGHKRQMSIKDYATLVQNGSMTEKEANKLLNKAKENYNKQAKAQKDNIGKMKKDSDSYYSKLGKAESQKNKDLAAARKKDGKNHEKYLADKKKIEKDFQTKTAGDRKKYLAQLAKDENKSNDAVTKATKISSGKQLDILENLKDHKGKLSKQQMTETIKNSAQERDKTIDNADKQRDKSVSAAKKKYKETVDAADKERYENGTMSRKQYEEVVDKARQQRDDSIDAADAQKKKTVKKAEETHTKVVDEATKQAGEHKGAVDSETGDVITFWGTFISTLRGDWNDMTGGINSILHALNKNWGNIPTWKKHAAGLNGSMGEHTALVGEEGFEYMGTSNGSIMPIGVEGPEIRNIPAGASILPHGMSVEFAQMAKDLPGYKIGLPGWLTSTFSALKKGAEGAADLVSEGASGVVNKIANATGIGKLAKTLNDNTTAFGAIASGAKDSLIDNAVKYVQGFFDQFSDTSEDGAGSLAPHFGSPFKESSGYGPRAGGFHKGIDFAAPLGTPIPAQYGGTVVQAGPASGFGNWVVIKPSGASVDTIYGHMKRMKVKTGQHVKAGQIIAWVGSEGQSSGPHVHYELRAGLGGKSYNPMTYGASAGNPCGHSVNRWRPYVVRALKANGFAATDSQVAAWMKVIKRESNGDPSVINTWDRNAQLGHPSKGLVQTIQPTFDAYKFKGHNNPLNGYDDLLAGIHYMKAIYGSGPSAFARVSGPMGYDSGGRVMKKQLAWLAENNPEYVVNPERDSADSLIVEAARARAAKAPNGLVAKAMRVVGTAKAGIQRTAPSFASRGVAQAEGQVAGNQAISGDLTITVPLDSNVLAQAVYPKAKVMQQRDITIQAKKGGLH
- Physico‐chemical
properties -
protein length: 1608 AA molecular weight: 172852,00000 Da isoelectric point: 9,56583 aromaticity: 0,06592 hydropathy: -0,47289
Domains
Domains [InterPro]
Taxonomy
Name | Taxonomy ID | Lineage | |
---|---|---|---|
Phage |
Lactobacillus phage phig1e [NCBI] |
52979 | No lineage information |
Host |
Lactobacillus [NCBI] |
1578 | Bacteria > Firmicutes > Bacilli > Lactobacillales > Lactobacillaceae > |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
CAA66745.1
[NCBI]
Genbank nucleotide accession
X98106
[NCBI]
CDS location
range 15416 -> 20242
strand -
strand -
CDS
ATGGCAGACGGAACAGTAACAATTGATGTGTTAATGGGTACCAAGTCGTTTATGAGTGACCGTGAACGGGTCGAAAACCTACTTAAGACGCTTGGAGCCGATGCTGGTAACCAAATGGACGAAGCCTTTACTAACAACTCTAACAAGGTACAGAAGAAAGCTAGAGAAACTAAGAAGAAAATTAAGAATGAATTTGATTCCCCAATTATTATTAAGTTGGAAGCTAAGGCGAAAGAAGCTGGCGTAAAAGATTTTAGAAAGATACTCAACCAGATTCCTAGAAATCAGTTAACACGTTTGAAAGCTAAGTCGGAACGCGACGAGGTTATAGACTGGAAAAAAGAAATCAGTCGCATTCCTGAAAAGAAGTCTACACGATTAAAAGTAGATAAGAAACAAGCTTCTGATGATTTAACTGCTTTGAAGAAGCAGTCGGAATCAACCGAGCATAGTTTCTCACACCTCAAAGAGATTGTTGTGGGAACATTTCTTGGTGGCGCGATTCAGGCTGGTGTTCAAGGCCTAGTGACTGGGTTAAAAGATGCTGCTAAAGCTGGTATGGAATATAACAAGCAGCAGGATATGATGCGCATGAACTGGCATAATCTAACGACTGAAGCACCTAAGGATGGTGAGGAACTATTAACATACATCAATCATGTTTCACAGCACTCTATTTACGCTGCCGATACTATTGATAAGATGGCGCAAAGTTTTTATCATGTCCATTCAAGTGAAAAAGAGACTAAAAAGTGGACTGATGATTTTGTAGCATTGGGATCAACACTGCATGTTTCAAATGATGCGTTAAAAGAATCCGGTGAGCAATTCGCAAAAATTGTAGCTGGTGGGAAAACATCGGCTGAAGATATGTCTGTTATGATTAGTCGCTTTCCAATGTTTGGTGAAGCTTTACAAAAGGCAACAGGGAAGTCAATGAGTCAGCTTTATGCGATGTCAGCTGCTGGAAAATTGACCTCAAAACAATTTACTGAAGCGCTGGATTATTTAGGCAAAAAATATAGAGGCAGTACCGAAGAGGCAATGAATAGTTTCCAAGGTATGTCAATGTATATAAAGTCGCGATGGTCAATGCTGACTGGTAATATCATGGCTTCATCTTTCAAAATGAGTAAGGGCGTTGCCCAAGATATGAGAAATTTATTATCTGACAATATGATGAAAAAGTATGCTGATTTAGCATCTACTGCAATTTCACATGTTACTGGATGGTTGGTTGAACTCATTAAATACGTTAATGCTCATAAAAATACAATTGTCGACATTATCGGAAATCTTGGCAAAATACTAGGCATCATTGGTAAAACTGTCTGGAAAACATTTAGCGACATAGTCTATGACATTGCAAAGATGTTTGGGCTGGTGGGCGAAAAGGCACAAGAATCTAAAGATCCACTAGACAAGATTGATGATGCTTTAAAGAACTTATCCAAGAACCAAGAGTTGATCGAGAACTTGACCAAAGCATTTATTGCGATGTTTGCACTCAAAAAAGGTATGGAGTTTATTGGCATGCTGGCAAGTTTGCGTAAGTCACTTATCGAAACGGCTGCTGTGTCTAAGATGGTTGATTTGTTCGGTGGTAGTGGCGTTACTAGTGCGGGCGGTAAGGCCGTTACTCAGACGGTTGCTAAAGAAGCCGGTGGAACTGCTGCTACGGCTGGTAGTTCTAAAGTTCTTGGACGTCTGTTTGCAAAAGGTGGCGCTACTTCAACGGCAGAACTTGAAGCGGCTAGTGGCCTAGGCGGTGGCAAAGCCATGATGGCTGCTCGTGGGCTCACTAAAGCTGTTCCATATATGAGCATTGCCGCTTCAATACCAGAGCTGTTTGGCACGACTCAGAAGACACTAGGTAAGCACTTGGGTGGGTTCGCTGGTTCGGCTGGTGGGCCTGCCGCGGGTGCTGCTGCCGGCTCTGCAGTTATGCCGGTCGTTGGGACTGCTGTTGGTGGTGTAATCGGTGGATTAGCAGGTAGTAAGCTTGGCCAATCGGTGGGTGGCAGTATTCAAAAAGGCATTACCAAGAGCTTCCCTAAACTTACTAGTAAGATGTCTGATCTAGGCCATGATATGGCTAAGAAGTTCAGTGGTAGCTTCAAACCTAAGCCATCGCTAAATGATAAGCAATTTTCGAAATCATATACCTCACTGACGAAGACACTAAATAAACAGGCCAAAATAAAAATTAAGACCGACACTTCCGGCATCAGCAAGGCTCAGAAGCTCACTGATACAACGTATGGCAAGATGAAGAAGTCGGTCGACAAGTACTATGGTCACAAGCGTCAGATGTCTATCAAGGACTATGCAACGTTGGTTCAGAACGGTTCTATGACTGAAAAAGAGGCCAATAAGCTGCTAAACAAGGCCAAAGAGAACTACAACAAGCAGGCGAAAGCTCAGAAAGATAACATTGGGAAAATGAAAAAAGATTCCGATAGTTATTACTCGAAGCTTGGCAAGGCTGAATCACAAAAGAACAAAGACTTGGCTGCTGCCCGTAAGAAGGACGGCAAAAATCATGAAAAGTATTTAGCTGATAAAAAGAAAATCGAAAAGGACTTCCAAACCAAAACGGCCGGCGACCGTAAGAAGTATTTAGCTCAGCTAGCCAAGGATGAAAATAAATCGAATGATGCGGTTACAAAAGCAACTAAGATTTCATCTGGAAAGCAGCTCGATATTCTTGAAAACTTGAAAGACCACAAGGGCAAGCTGTCTAAGCAACAAATGACTGAAACAATTAAAAATTCAGCTCAAGAACGCGATAAGACCATTGATAACGCCGACAAGCAACGTGATAAGTCGGTTAGCGCGGCCAAAAAGAAGTACAAGGAAACAGTTGACGCTGCTGATAAGGAACGCTACGAGAACGGTACGATGAGCCGTAAGCAGTATGAAGAAGTTGTCGATAAAGCTAGACAACAACGCGACGACTCCATTGATGCTGCTGATGCTCAGAAGAAGAAGACCGTCAAGAAAGCGGAGGAAACGCACACTAAGGTCGTTGATGAAGCGACTAAGCAGGCTGGGGAGCATAAAGGTGCGGTTGATTCCGAAACCGGTGACGTCATTACTTTTTGGGGAACATTCATTTCCACCTTGCGTGGTGATTGGAATGATATGACGGGTGGCATTAACTCTATCTTGCATGCTTTAAATAAGAATTGGGGGAACATTCCTACTTGGAAAAAGCATGCCGCTGGTCTGAACGGTTCCATGGGCGAACATACGGCGCTCGTTGGTGAAGAAGGATTCGAATACATGGGAACGTCGAATGGTTCAATCATGCCAATTGGTGTCGAAGGACCTGAAATTCGTAACATTCCAGCGGGTGCGTCCATTTTGCCACATGGTATGTCCGTTGAGTTTGCTCAGATGGCTAAAGACTTGCCTGGGTACAAGATTGGATTGCCTGGTTGGTTAACCAGCACGTTCAGCGCTTTGAAGAAAGGTGCTGAGGGCGCTGCTGATCTTGTTAGCGAAGGTGCTAGTGGCGTGGTCAATAAGATTGCTAACGCAACTGGCATTGGTAAGCTTGCAAAGACGCTCAACGATAATACCACCGCGTTTGGCGCGATTGCGAGTGGGGCTAAGGACTCCTTGATTGATAATGCAGTCAAGTATGTACAAGGATTCTTTGATCAGTTCTCCGACACATCTGAAGATGGTGCTGGTTCATTAGCACCGCACTTTGGTTCACCGTTCAAGGAATCTTCGGGATATGGCCCACGTGCAGGTGGTTTCCACAAAGGTATCGACTTTGCGGCGCCATTAGGTACGCCGATCCCAGCTCAATATGGTGGTACTGTCGTGCAGGCAGGCCCAGCTAGTGGGTTCGGTAACTGGGTTGTTATCAAGCCGTCTGGTGCGTCCGTAGATACGATTTACGGACACATGAAACGAATGAAAGTGAAGACTGGTCAGCATGTCAAAGCCGGTCAAATTATTGCGTGGGTTGGTAGTGAAGGCCAATCAAGTGGCCCACACGTCCATTATGAGTTGCGTGCTGGTTTGGGTGGTAAGAGCTATAACCCAATGACTTATGGCGCTAGTGCGGGTAACCCGTGTGGTCATTCAGTTAATCGCTGGCGACCATATGTTGTACGTGCATTAAAGGCCAATGGGTTCGCTGCTACCGACAGTCAAGTGGCTGCTTGGATGAAGGTTATCAAACGCGAGTCAAACGGGGACCCATCGGTGATTAACACTTGGGACCGTAACGCTCAACTTGGGCACCCTTCTAAAGGGCTCGTTCAGACGATTCAGCCAACATTTGATGCGTATAAGTTCAAAGGTCACAACAATCCGCTCAACGGGTATGACGACCTGCTAGCTGGTATTCACTACATGAAGGCCATTTATGGTTCAGGTCCAAGCGCGTTTGCTCGCGTGAGTGGCCCAATGGGTTACGATTCGGGTGGCCGTGTCATGAAGAAACAGCTAGCATGGTTGGCTGAAAATAACCCAGAATACGTGGTTAACCCAGAACGCGATAGTGCTGACAGCCTGATTGTTGAGGCGGCACGGGCACGCGCTGCTAAAGCGCCTAATGGCTTAGTTGCTAAGGCTATGCGAGTAGTTGGAACTGCTAAGGCAGGTATTCAACGCACAGCGCCAAGCTTTGCATCACGGGGCGTGGCACAGGCAGAAGGCCAAGTTGCCGGTAACCAAGCAATCAGTGGCGATTTGACAATCACTGTGCCATTAGATAGCAATGTATTGGCACAGGCGGTATATCCTAAGGCCAAAGTTATGCAGCAACGTGATATTACGATTCAAGCTAAGAAGGGAGGTTTGCATTAG
Gene Ontology
Description | Category | Evidence (source) | |
---|---|---|---|
GO:0004222 | metalloendopeptidase activity | Molecular function | Inferred from Electronic Annotation (InterPro) |
GO:0031640 | killing of cells of another organism | Biological process | Inferred from Electronic Annotation (UniProt) |
GO:0042742 | defense response to bacterium | Biological process | Inferred from Electronic Annotation (UniProt) |
GO:0098003 | viral tail assembly | Biological process | Inferred from Electronic Annotation (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID: upi000009c1f4_model
Method: AlphaFold3 Non-commercial
Resolution: –
Chain position: A