Structure-to-Sequence (s2s) is a computing process based on RDkit and the characteristics of cyclic peptide sequences, which can convert cyclic peptide SMILES into sequence information. This process mainly relies on the completeness of the monomer reference library. You can access our default monomer reference library through download link. The details of s2s are available on dfwlab/cyclicpepedia on Github. And you can use this tool online on the cyclicpepedia.
Version : 1.0.1 (2023-12-26)
SMILES : CCC(C)C1NC(=O)C(C)NC(=O)C(CO)NC(=O)C(CO)NC(=O)C(C(C)CC)NC(=O)C2CSSCC3NC(=O)C(C(C)C)NC(=O)C(CCCCN)NC(=O)C(CO)NC(=O)C(CCCCN)NC(=O)C4CSSCC(NC(=O)C(CO)NC(=O)C(CCC(=O)O)NC(=O)CNC(=O)C(CSSCC(NC(=O)CNC1=O)C(=O)NC(CO)C(=O)N4)NC(=O)C1CCCN1C(=O)C(C(C)CC)NC(=O)CNC(=O)C(CC(N)=O)NC(=O)C(CCCNC(=N)N)NC(=O)C(Cc1ccc(O)cc1)NC3=O)C(=O)NC(C(C)C)C(=O)NC(Cc1c[nH]c3ccccc13)C(=O)NC(C(C)CC)C(=O)N1CCCC1C(=O)N2
SMILES is corrected!
Amino acid 1
Amino acid 2
Amino acid 3
Amino acid 4
Amino acid 5
Amino acid 6
Amino acid 7
Amino acid 8
Amino acid 9
Amino acid 10
Amino acid 11
Amino acid 12
Amino acid 13
Amino acid 14
Amino acid 15
Amino acid 16
Amino acid 17
Amino acid 18
Amino acid 19
Amino acid 20
Amino acid 21
Amino acid 22
Amino acid 23
Amino acid 24
Amino acid 25
Amino acid 26
Amino acid 27
Amino acid 28
Amino acid 29
Amino acid 30
Number of chain(s) identified from the structure: 1
Amino acid sequence : Ile(1)--Gly--Cys(2)--Ser--Cys(3)--Lys--Ser--Lys--Val--Cys(4)--Tyr--Orn--Asn--Gly--Ile--Pro--Cys(2)--Gly--Glu--Ser--Cys(3)--Val--Trp--Ile--Pro--Cys(4)--Ile--Ser--Ser--Ala(1)
Amino acid mapping
Amino acid location
Matched amino acid from monomer reference library
Amino acid 1: Ile
Amino acid 2: Gly
Amino acid 3: Cys
Amino acid 4: Ser
Amino acid 5: Cys
Amino acid 6: Lys
Amino acid 7: Ser
Amino acid 8: Lys
Amino acid 9: Val
Amino acid 10: Cys
Amino acid 11: Tyr
Amino acid 12: Orn
Amino acid 13: Asn
Amino acid 14: Gly
Amino acid 15: Ile
Amino acid 16: Pro
Amino acid 17: Cys
Amino acid 18: Gly
Amino acid 19: Glu
Amino acid 20: Ser
Amino acid 21: Cys
Amino acid 22: Val
Amino acid 23: Trp
Amino acid 24: Ile
Amino acid 25: Pro
Amino acid 26: Cys
Amino acid 27: Ile
Amino acid 28: Ser
Amino acid 29: Ser
Amino acid 30: Ala