Protein

Protein accession
Q6SE70 [UniProt]
Representative
70s7t
Source
UniProt (cluster: phalp2_11314)
Protein name
Putative putative minor tail protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MDLEELELRFRANYGDVIQKIDELTNLIVQKTGDMQYKMQNNLDKFQQSMNESTSRANENVKEEIHQRSEAEEAKQKLLEQTLNTQNEVTDKIVQGNKEQAESSKEAVNQSEKSLDSLTARLQEASNMQQRIAQQTSTAHDVIKDISRPKSQTQVKEPLKSTSQNDYSSFDNYQEKRIQSYMPKRPVDLGIDDEIQAEASRAKKEVDSLVTHINEKMEQARSMQRRIATLTANRDNLDMSKQGSKVKAMRLDDQIADAQIKMERYQTQAKALAQEMSQELDTVPNSLRRIEREMDQTEGKIERIRRSIAEMRDNDATLGRSSGTNKEIKNAEAEYRRLVARSNELAKAYTYVSARGDELRNSSSRINTELAEENRNVSNLGSKFSRLKNTISNVNSSLRRFGNSGNSSIRKAGSGVSVLSERLRGVRMAIRMLASQLIVFTLLYQGIMMLAQGMGAALMTNRQFASSFNAIKVNLLTAFYPIYSYVLPAINAFMSSLQKATAWIAQFTSALTGMSLSSARSGAHGLYDQVQAMNDTSKAASKANEAVKKQQQEQAKAVQRANQQIAQANRQGAAAVAAENERIKASNEQAKKAFEDTKKANEDLQASLMGFDELNVLDNSKNNNDNGSFEAQPLEKFTPQQKQDTPIFDDPGIDDGETGGNQGLDWNVPLVASQNAIDAANKVKKVLGEIFDPMKKAWDEKGQAVVDAAKYSWTEIRRLLGDVGNSFLHVWDNGTGQKVIENLLQLLADMLNVIGDISRAFAEAWEESGRGTKFIQTIFNSLNNVLETIHHIAQAFRDAWNTGDLGKRIFANLLDLATKLVGFVGDIAKSFDEAWQHGNAGTRLWQAWLNALNNILDTYKHIVTSIDEAWKHSNLGVSIWSHIIQIVTGVGNTIGNLAGQFDKAWQHGNVGTSIFKTLLGMVDDMLGSLGDMATYTANWAKKLDFTPLLQSIDNLLKAIRPVTKDVWDGLSWAYKNVLLPLARFTITKVIPEFLNALAAALKVVHSVIKAAEPVLEWFFDSFIKPLAKIAGFAIVEALKLLTKALEGLSDWIDHHQTAVKIMTATLLTLLGIKVAKATIAGIQSFTDTLKILAMLKFDKLKAAAKYADDLLGVAIEFAKHPITNIKELAKLSFENIKGGYNRIKDLWGEVNKGWQDSNLAKTDFLKSARSSIQSGEPMKLGQKLGTGLSGAMIAVTSGIDIYKGIKANNKEEKFADFGSGIGGAVGGAIGLWFGGPLGAAVGQQIGSLIGKWGGVGASKFGDGWSKYGKGKKPKDWVEAIGFKAHEILDNFTSWAKSVGRDINNNISKGKKDVQTASSNIHKWSTNFISGAKKDIKSWAQNIGSNIHKDVDKGKKLAKQAGDKVKEWSTSFISDAKKKVHDWSSQIGSNVNNSVENGQAMAKNAGTKIKNWTTDFRESASGLVRSWAERLGEHINNGSESSRSGATNAGNKISQWTRSFFGGANQSVSSWASGLGGHVNDGIGGAYNSAKNAGERLGSWVSSFRHDTSRKLGSWAGTLGSTIGNGITDGLHNISRAVRKVVNAIVKPVQNATDQVRKGINWVLGKLGGGSIGWGFFDWNSYANGTNSHPGGLALVNDQAGDTYRESYELPNGEQGLFPAERNLLTYLPAGTKVKTATDTANELAGMVPKYAGGIGNFNFDFSGIFSGISSALGNLNFGNIFDGVGNFVDGVMEELEKVTDDIAHPERLVNYIIDKFVTYDWSLGDASLKFAKGAVHQEKKGMMNWAKKVINQFGGATHQTGLGAEGWRSAVKKALRKNGLPATPAYVNAWVRQIQTESGGNEHAVQGGYTDINTLTGDLAKGLLQTISATFNAYKFPGHGNIFNGYDNMLAAINYAKHRYGSDMLAVIGHGHGYEDGGLIAKHGFYEISENDKPEMVVPLTNRELGMRRINEAIAFMNQNFGGGLQMPSSLSRRTAIDSSIYSDTQSNDSTSVQRGGFKEMSTELVNAIVQAIQMQNFNSNNGKPIDLHLSVKIGDESFGEHAIKGINTINQKNGRNMLNL
Physico‐chemical
properties
protein length:2021 AA
molecular weight:221385,9 Da
isoelectric point:9,00
hydropathy:-0,43
Representative Protein Details
Accession
70s7t
Protein name
70s7t
Sequence length
768 AA
Molecular weight
85476,72090 Da
Isoelectric point
6,08782
Sequence
MDLEELELRFTANYGDVMRKLDEFTNIISQKTNDMQYKIQDGLGKINQSMNDNVSKANESAKEEVRQRTEAENAKQKLLEQTLNTQNDVTDKVIQGNKEQAESSKEAVNQSEKSLDSLTARLQEASNMQQRIAQQTSAAREAVSDIAKPKEQLQRRDRVETRPQVQNSSFDDYQEKRIQSYMPKRPVDLGIDDEIQAEASRAKKEVDGLATHFNEKMEQARSMQRRIATLTANRDNLDMSKQGSKVKSMRLDDQIADAQIKMSRYQNQAKALAQEMSQELNSIPSSLKRIETEMDQTEGKIERIRRSIAEMRDNDATLGRSAGNDKELKQAEAEYKRLVARSDELAKAYSYVSSRGDELKASSSRINTELSEESKNVSGLSSKFNRLRNTISNVNSSLKRFGNSGSSSMRRAGGGASMLSERLKGVRMAMRMLASQLIVFTLLYQAIMMLAQGMGAALMTNKQFASSFNAIKVNLLTAFYPIYSYVLPAVNALMNSLRKATAWIAQFSSALTGMSLSSARSGAHGLYDQVQAMNDTSKAASKANEAVKKQQQEQAKAVQRANQQIAQANRQGAAAVAAENERIKASNEQAKKAFEDTKKANEDLQASLMGFDELNVLDNNKNNQDNGSFEAQPLEKFTPQQKQDSPIFDDAGLDDSGAGNGNQGLDWNVPLEASQNAIDAANKVKKVLGEIFDPMKKAWDEKGQAVVDAAKYSWQEIKRLLGDVGNSFLHVWDNGTGQKVMENLLQLLADMLDIIGDISRAFAEAWEE
Other Proteins in cluster: phalp2_11314
Total (incl. this protein): 6 Avg length: 855,0 Avg pI: 8,27

Protein ID Length (AA) pI
70s7t 768 6,08782
1FWBo 467 9,47493
71uhB 558 9,64042
7OnAb 768 6,08782
7XjW5 548 9,33323
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_37977
5oGw7
2 43,2% 592 8.322E-136
2 phalp2_36472
1bVAu
9 34,6% 618 9.170E-91

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Lactobacillus prophage Lj965
[NCBI]
139870 No lineage information
Host Lactobacillus johnsonii
[NCBI]
33959 Firmicutes > Bacilli > Lactobacillales > Lactobacillaceae > Lactobacillus >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
AY459535 [NCBI]
CDS location
range 25714 -> 31779
strand +
CDS
ATGGATCTAGAAGAACTTGAGTTAAGATTTAGAGCAAATTATGGGGATGTCATTCAAAAGATTGATGAATTAACTAATCTTATTGTGCAAAAAACAGGTGATATGCAATATAAGATGCAAAATAATTTAGATAAATTTCAGCAATCAATGAATGAGAGTACTTCAAGAGCTAATGAAAATGTTAAAGAAGAGATTCATCAACGTAGTGAAGCGGAAGAAGCTAAACAAAAATTACTTGAGCAGACTCTTAATACTCAAAATGAGGTAACAGATAAGATTGTTCAAGGAAATAAAGAACAAGCCGAAAGCTCTAAAGAAGCTGTTAATCAGTCAGAAAAGAGCCTGGATAGTTTAACAGCTCGCTTGCAAGAAGCTTCTAATATGCAACAACGAATTGCACAACAAACTAGTACAGCTCATGATGTAATAAAAGATATTTCTAGACCCAAATCACAAACGCAAGTAAAAGAACCATTAAAGTCCACTTCACAGAATGACTACTCAAGCTTTGATAACTATCAAGAAAAACGAATTCAGAGTTATATGCCAAAGCGTCCTGTTGATTTAGGTATTGATGATGAAATACAAGCTGAAGCATCCAGAGCTAAAAAAGAAGTTGATAGCTTAGTAACTCACATCAACGAGAAGATGGAACAGGCTCGATCAATGCAACGGAGAATTGCTACTTTAACAGCTAATCGAGACAATCTGGATATGAGCAAGCAGGGTAGTAAAGTTAAAGCAATGAGATTAGATGATCAAATTGCTGACGCTCAAATTAAGATGGAACGGTATCAGACTCAAGCTAAGGCTCTTGCACAGGAAATGTCGCAGGAGCTTGATACTGTTCCTAACTCATTAAGACGCATTGAGCGTGAAATGGATCAAACGGAAGGCAAGATTGAACGAATTAGACGTTCTATTGCTGAAATGAGGGATAATGATGCGACTCTTGGCCGATCCTCGGGCACTAATAAAGAGATTAAGAATGCTGAAGCTGAGTACAGAAGATTAGTTGCTAGAAGTAATGAGTTAGCAAAGGCATATACATACGTCAGTGCACGTGGAGATGAGTTAAGAAATAGTTCTTCACGAATTAATACTGAATTAGCTGAAGAAAACAGAAATGTCTCAAATCTTGGTTCTAAGTTTAGCAGGCTAAAAAATACTATTTCTAATGTTAATTCATCTCTTAGACGATTTGGTAATAGTGGGAATTCTTCTATACGGAAAGCTGGATCAGGCGTTTCAGTTTTAAGTGAACGACTAAGGGGCGTTCGAATGGCAATTCGAATGTTAGCCAGTCAATTAATTGTGTTTACTCTGCTCTATCAAGGAATTATGATGCTTGCCCAAGGTATGGGTGCAGCATTAATGACTAATCGACAATTTGCAAGTAGTTTTAACGCAATTAAGGTTAACTTGTTGACTGCTTTTTATCCAATTTATAGTTATGTTTTACCTGCTATAAATGCTTTTATGAGTTCATTGCAAAAAGCAACTGCATGGATTGCTCAATTTACTTCTGCGTTAACAGGAATGAGCTTGTCTAGTGCTAGAAGTGGTGCTCATGGTTTGTACGATCAAGTACAGGCAATGAATGATACTTCAAAGGCTGCAAGTAAAGCTAATGAAGCTGTTAAAAAACAACAGCAAGAACAAGCTAAAGCAGTTCAACGTGCTAATCAGCAAATTGCTCAAGCTAATCGTCAAGGAGCGGCAGCCGTTGCAGCAGAAAATGAAAGAATAAAAGCATCTAACGAACAAGCTAAAAAAGCTTTTGAAGATACAAAGAAAGCTAATGAAGACCTTCAAGCATCACTCATGGGCTTTGATGAGCTGAACGTTTTAGATAACAGCAAAAACAATAACGATAATGGTAGTTTTGAAGCTCAACCACTAGAAAAATTTACTCCACAGCAAAAGCAGGATACACCTATTTTTGATGATCCCGGAATAGATGATGGTGAGACAGGTGGAAATCAGGGGCTTGATTGGAATGTTCCATTAGTTGCATCCCAAAATGCTATTGATGCTGCTAATAAAGTTAAAAAGGTTTTAGGCGAAATCTTTGATCCAATGAAAAAAGCTTGGGATGAAAAAGGCCAAGCTGTTGTTGATGCAGCTAAGTATTCATGGACTGAAATTAGGCGCTTATTAGGGGATGTAGGTAATTCTTTCTTACATGTTTGGGACAATGGAACAGGACAAAAAGTAATAGAAAATCTACTTCAACTTTTAGCCGATATGCTGAACGTCATTGGCGACATTTCACGAGCTTTTGCAGAAGCATGGGAAGAGAGTGGTCGAGGGACCAAGTTTATTCAGACTATCTTCAATTCTCTCAATAATGTTCTTGAAACTATCCATCATATAGCTCAAGCGTTTCGTGATGCATGGAATACTGGTGATCTAGGCAAGAGAATTTTTGCTAATTTGCTTGATTTAGCTACTAAGTTAGTTGGGTTTGTTGGCGATATTGCTAAATCCTTTGATGAGGCATGGCAACATGGTAATGCCGGAACTAGACTATGGCAAGCTTGGCTAAATGCACTGAATAATATACTTGATACTTACAAGCATATAGTTACATCAATTGATGAAGCATGGAAACACTCCAACTTAGGCGTTTCAATTTGGAGTCATATTATTCAAATAGTAACTGGTGTGGGAAACACTATTGGCAATTTAGCTGGTCAATTTGATAAAGCGTGGCAACATGGAAATGTTGGAACTTCTATCTTTAAAACTTTGCTCGGTATGGTAGATGACATGCTTGGATCACTTGGTGATATGGCAACGTATACTGCAAACTGGGCTAAGAAGCTTGATTTTACCCCATTGCTTCAGTCAATTGATAATCTGCTAAAAGCTATTCGACCAGTAACCAAAGATGTTTGGGATGGATTATCGTGGGCATACAAGAATGTCTTGCTTCCACTCGCACGTTTTACAATTACTAAGGTAATCCCTGAGTTCTTAAATGCCTTAGCTGCTGCTCTTAAAGTCGTACATAGTGTAATCAAAGCGGCAGAACCCGTTCTAGAATGGTTCTTCGATAGTTTTATTAAACCATTGGCTAAAATAGCTGGTTTTGCAATTGTAGAGGCTTTAAAACTTCTAACTAAAGCACTTGAGGGCTTATCTGATTGGATAGATCATCATCAAACGGCCGTTAAGATCATGACTGCAACGTTGCTTACGCTTTTAGGTATTAAGGTTGCTAAAGCTACTATTGCTGGAATTCAAAGCTTTACTGATACTCTGAAGATCCTTGCAATGCTCAAATTTGATAAATTGAAAGCTGCTGCAAAGTATGCTGATGATTTATTAGGAGTAGCAATTGAATTTGCAAAGCACCCAATAACTAATATTAAAGAACTTGCAAAATTAAGTTTTGAAAATATCAAGGGTGGCTATAACCGCATCAAAGATCTTTGGGGTGAAGTAAATAAAGGCTGGCAGGATAGTAATCTTGCCAAGACCGATTTTCTTAAATCTGCTCGCTCTTCAATACAATCTGGCGAACCAATGAAGCTGGGGCAAAAACTTGGTACTGGCTTATCTGGTGCTATGATTGCTGTAACTTCAGGAATTGATATCTACAAAGGAATCAAAGCTAATAACAAAGAAGAAAAATTTGCAGATTTTGGTTCTGGAATTGGTGGTGCAGTTGGTGGTGCCATTGGACTCTGGTTCGGTGGCCCACTAGGGGCAGCCGTAGGACAACAGATTGGCTCATTAATTGGTAAATGGGGCGGTGTTGGTGCCTCTAAATTTGGTGATGGTTGGTCTAAGTATGGTAAAGGTAAGAAACCTAAAGACTGGGTTGAAGCCATAGGCTTTAAGGCCCATGAAATCTTAGATAACTTTACTTCTTGGGCTAAATCCGTTGGTAGAGATATTAATAACAATATTTCTAAAGGTAAAAAAGATGTTCAAACTGCTAGCTCTAATATCCATAAATGGTCCACTAATTTTATTTCTGGTGCAAAGAAAGATATTAAATCTTGGGCACAAAATATCGGTTCTAACATTCATAAAGATGTAGACAAAGGTAAGAAACTAGCTAAACAAGCTGGAGATAAAGTAAAAGAATGGTCTACTAGCTTTATTAGTGATGCTAAAAAGAAAGTTCATGATTGGTCTTCTCAAATTGGTTCTAATGTAAATAATAGCGTTGAAAATGGTCAAGCCATGGCTAAGAATGCTGGAACTAAAATTAAAAATTGGACTACCGACTTTAGAGAATCAGCCAGTGGACTAGTAAGATCATGGGCCGAAAGACTAGGAGAACACATTAATAATGGCTCTGAATCATCTCGCTCTGGAGCCACTAATGCCGGTAATAAAATTTCTCAATGGACAAGAAGTTTCTTTGGAGGAGCCAACCAAAGTGTTTCTAGTTGGGCCAGTGGTCTAGGCGGTCATGTTAATGACGGAATAGGTGGAGCTTATAACTCTGCTAAAAATGCTGGTGAACGTCTAGGAAGCTGGGTTTCAAGCTTTAGACATGATACTTCAAGAAAACTTGGTTCATGGGCTGGCACTCTTGGGAGCACAATTGGGAATGGAATTACTGATGGACTGCACAACATTAGTCGAGCCGTTCGAAAAGTGGTTAATGCAATTGTTAAACCGGTGCAAAATGCTACTGATCAAGTTAGAAAAGGTATTAACTGGGTACTTGGTAAACTTGGTGGTGGGTCAATTGGCTGGGGATTCTTTGATTGGAATTCTTATGCGAATGGGACGAATAGCCACCCAGGTGGACTAGCCTTGGTTAATGATCAAGCCGGAGACACTTATCGGGAAAGTTATGAATTACCTAACGGAGAACAAGGTTTATTCCCAGCTGAACGTAATTTGTTAACTTACTTGCCAGCTGGCACAAAAGTTAAAACTGCAACTGATACGGCTAATGAATTAGCAGGTATGGTTCCTAAATATGCTGGTGGTATAGGTAACTTTAATTTCGATTTTAGTGGTATCTTTAGCGGAATTAGTAGCGCTCTTGGAAACCTAAATTTTGGGAATATATTTGATGGGGTTGGCAATTTTGTTGATGGTGTAATGGAAGAACTTGAAAAAGTTACTGATGATATAGCTCATCCAGAACGATTAGTTAACTACATTATTGATAAATTTGTCACCTATGATTGGAGTTTAGGCGACGCTTCATTAAAGTTTGCTAAAGGCGCTGTTCATCAAGAAAAGAAAGGCATGATGAATTGGGCTAAGAAAGTTATCAATCAATTTGGTGGTGCAACTCATCAAACAGGGCTGGGTGCGGAAGGTTGGCGTAGTGCTGTTAAGAAAGCATTGCGCAAGAATGGTCTACCTGCAACACCAGCATATGTAAATGCATGGGTTCGTCAAATCCAAACTGAGTCTGGTGGTAATGAACATGCTGTACAAGGCGGTTATACGGATATCAACACCTTAACAGGAGACTTGGCAAAAGGATTGTTGCAAACTATCTCTGCAACTTTTAACGCTTATAAATTCCCTGGTCACGGTAATATCTTTAATGGTTACGACAACATGCTTGCAGCAATTAACTATGCTAAGCATCGCTACGGTTCAGACATGCTTGCTGTTATCGGTCACGGACATGGCTATGAAGACGGTGGCTTAATTGCTAAACACGGTTTCTATGAGATTAGCGAAAACGACAAGCCAGAAATGGTTGTTCCTTTAACTAATCGTGAGTTAGGTATGCGAAGAATTAATGAAGCTATTGCATTTATGAATCAGAATTTTGGTGGTGGCTTGCAAATGCCATCTTCCTTAAGCAGACGAACTGCTATTGATAGTTCAATTTATTCTGATACTCAATCTAATGACTCTACGTCAGTTCAGCGTGGAGGATTCAAAGAGATGAGTACAGAATTAGTAAATGCTATAGTTCAAGCAATTCAAATGCAAAACTTTAATAGTAATAACGGAAAACCTATTGATTTACACTTATCTGTCAAGATTGGTGATGAGTCATTTGGCGAACACGCTATTAAAGGTATTAATACTATCAATCAAAAGAATGGTAGAAACATGTTAAATCTTTAG

Gene Ontology

Description Category Evidence (source)
GO:0016020 membrane cellular component None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0000227b5d_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (70s7t) rather than this protein.
PDB ID
70s7t
Method AlphaFoldv2
Resolution 53.81
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50