Structure 2 Sequence Report

Structure-to-Sequence (s2s) is a computational workflow, built on RDKit and the characteristics of cyclic peptide sequences, that converts a cyclic peptide SMILES string into sequence information. Its results depend mainly on the completeness of the monomer reference library. The default monomer reference library is available through the download link. Details of s2s are available at dfwlab/cyclicpepedia on GitHub, and the tool can be used online on cyclicpepedia.


Version : 1.0.1 (2023-12-26)


Load SMILES :

SMILES : CCC(C)C1NC(=O)C(C)NC(=O)C(CO)NC(=O)C(CO)NC(=O)C(C(C)CC)NC(=O)C2CSSCC3NC(=O)C(C(C)C)NC(=O)C(CCCCN)NC(=O)C(CO)NC(=O)C(CCCCN)NC(=O)C4CSSCC(NC(=O)C(CO)NC(=O)C(CCC(=O)O)NC(=O)CNC(=O)C(CSSCC(NC(=O)CNC1=O)C(=O)NC(CO)C(=O)N4)NC(=O)C1CCCN1C(=O)C(C(C)CC)NC(=O)CNC(=O)C(CC(N)=O)NC(=O)C(CCCNC(=N)N)NC(=O)C(Cc1ccc(O)cc1)NC3=O)C(=O)NC(C(C)C)C(=O)NC(Cc1c[nH]c3ccccc13)C(=O)NC(C(C)CC)C(=O)N1CCCC1C(=O)N2

SMILES is corrected!
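
As a rough illustration of the kind of check that can precede the later steps, the sketch below tests only two syntactic properties of a SMILES string: balanced branch parentheses and paired ring-closure labels. It is not the s2s implementation. The function name `smiles_looks_balanced` is hypothetical, and a real validity check would parse the string with RDKit (`Chem.MolFromSmiles` returns `None` for input it cannot parse).

```python
def smiles_looks_balanced(smiles: str) -> bool:
    """Return True if branches and ring-closure labels are paired.

    Hypothetical helper for illustration only: it does not check atom
    symbols, valence or aromaticity, which a real parser such as RDKit does.
    """
    open_rings = set()  # ring-closure labels opened but not yet closed
    depth = 0           # current branch nesting depth
    i = 0
    while i < len(smiles):
        ch = smiles[i]
        if ch == "[":
            # Skip bracket atoms such as [nH]; digits inside are not ring labels.
            end = smiles.find("]", i)
            if end == -1:
                return False
            i = end + 1
            continue
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
            if depth < 0:
                return False
        elif ch == "%":
            # Two-digit ring label, written %nn.
            label = smiles[i + 1:i + 3]
            if len(label) != 2 or not (label.isascii() and label.isdigit()):
                return False
            open_rings ^= {int(label)}  # toggle: first use opens, second closes
            i += 3
            continue
        elif ch in "0123456789":
            open_rings ^= {int(ch)}
        i += 1
    return depth == 0 and not open_rings


# The SMILES string loaded in this report, split only for readability.
REPORT_SMILES = (
    "CCC(C)C1NC(=O)C(C)NC(=O)C(CO)NC(=O)C(CO)NC(=O)C(C(C)CC)NC(=O)C2CSSCC3NC(=O)"
    "C(C(C)C)NC(=O)C(CCCCN)NC(=O)C(CO)NC(=O)C(CCCCN)NC(=O)C4CSSCC(NC(=O)C(CO)"
    "NC(=O)C(CCC(=O)O)NC(=O)CNC(=O)C(CSSCC(NC(=O)CNC1=O)C(=O)NC(CO)C(=O)N4)"
    "NC(=O)C1CCCN1C(=O)C(C(C)CC)NC(=O)CNC(=O)C(CC(N)=O)NC(=O)C(CCCNC(=N)N)"
    "NC(=O)C(Cc1ccc(O)cc1)NC3=O)C(=O)NC(C(C)C)C(=O)NC(Cc1c[nH]c3ccccc13)"
    "C(=O)NC(C(C)CC)C(=O)N1CCCC1C(=O)N2"
)

print(smiles_looks_balanced(REPORT_SMILES))
print(smiles_looks_balanced("C1CCCCC"))  # ring label 1 is never closed
```

Ring labels may be reused once closed, which is why the string above can use the label 1 for several different rings; the toggle on `open_rings` handles this.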


Identify peptide skeleton and renumber atoms


Identify amino acid units


Obtain complete amino acid structures

Amino acids 1 to 30 (one structure image per amino acid unit)


Identify amino acids based on the monomer reference library

Number of chain(s) identified from the structure: 1

> Chain 1 :

Amino acid sequence : Ile(1)--Gly--Cys(2)--Ser--Cys(3)--Lys--Ser--Lys--Val--Cys(4)--Tyr--Orn--Asn--Gly--Ile--Pro--Cys(2)--Gly--Glu--Ser--Cys(3)--Val--Trp--Ile--Pro--Cys(4)--Ile--Ser--Ser--Ala(1)
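
The sequence line separates residues with `--` and places a number in parentheses after some of them. A minimal sketch of reading that notation follows, assuming (this report does not state it) that two residues sharing a number are bonded to each other, for example through a disulfide bridge or the ring-closing bond. The function name `parse_sequence` is hypothetical and not part of s2s.

```python
import re
from collections import defaultdict

# Sequence string copied from this report.
SEQUENCE = (
    "Ile(1)--Gly--Cys(2)--Ser--Cys(3)--Lys--Ser--Lys--Val--Cys(4)--"
    "Tyr--Orn--Asn--Gly--Ile--Pro--Cys(2)--Gly--Glu--Ser--Cys(3)--"
    "Val--Trp--Ile--Pro--Cys(4)--Ile--Ser--Ser--Ala(1)"
)

# A residue name, optionally followed by a link label in parentheses.
TOKEN = re.compile(r"(\w+)(?:\((\d+)\))?")


def parse_sequence(sequence):
    """Split the sequence into residue names and labelled link groups.

    Returns (residues, links) where links maps each label to the
    1-based positions of the residues that carry it.
    """
    residues = []
    links = defaultdict(list)
    for position, token in enumerate(sequence.split("--"), start=1):
        match = TOKEN.fullmatch(token.strip())
        if match is None:
            raise ValueError(f"unrecognised token: {token!r}")
        name, label = match.groups()
        residues.append(name)
        if label is not None:
            links[int(label)].append(position)
    return residues, dict(links)


residues, links = parse_sequence(SEQUENCE)
print(len(residues))
for label, positions in sorted(links.items()):
    names = [residues[p - 1] for p in positions]
    print(label, positions, names)
```

For the chain above this groups positions 1 and 30 (Ile, Ala), 3 and 17, 5 and 21, and 10 and 26 (all Cys).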

Amino acid mapping

Amino acid location


Matched amino acid from monomer reference library

Amino acid 1: Ile

Amino acid 2: Gly

Amino acid 3: Cys

Amino acid 4: Ser

Amino acid 5: Cys

Amino acid 6: Lys

Amino acid 7: Ser

Amino acid 8: Lys

Amino acid 9: Val

Amino acid 10: Cys

Amino acid 11: Tyr

Amino acid 12: Orn

Amino acid 13: Asn

Amino acid 14: Gly

Amino acid 15: Ile

Amino acid 16: Pro

Amino acid 17: Cys

Amino acid 18: Gly

Amino acid 19: Glu

Amino acid 20: Ser

Amino acid 21: Cys

Amino acid 22: Val

Amino acid 23: Trp

Amino acid 24: Ile

Amino acid 25: Pro

Amino acid 26: Cys

Amino acid 27: Ile

Amino acid 28: Ser

Amino acid 29: Ser

Amino acid 30: Ala
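
The per-position matches can be tallied into a residue composition. The sketch below parses lines of the form `Amino acid N: Xxx` as they appear in this report; `parse_matches` is a hypothetical helper for illustration, not an s2s function.

```python
import re
from collections import Counter

# Match lines copied from this report.
REPORT_LINES = """\
Amino acid 1: Ile
Amino acid 2: Gly
Amino acid 3: Cys
Amino acid 4: Ser
Amino acid 5: Cys
Amino acid 6: Lys
Amino acid 7: Ser
Amino acid 8: Lys
Amino acid 9: Val
Amino acid 10: Cys
Amino acid 11: Tyr
Amino acid 12: Orn
Amino acid 13: Asn
Amino acid 14: Gly
Amino acid 15: Ile
Amino acid 16: Pro
Amino acid 17: Cys
Amino acid 18: Gly
Amino acid 19: Glu
Amino acid 20: Ser
Amino acid 21: Cys
Amino acid 22: Val
Amino acid 23: Trp
Amino acid 24: Ile
Amino acid 25: Pro
Amino acid 26: Cys
Amino acid 27: Ile
Amino acid 28: Ser
Amino acid 29: Ser
Amino acid 30: Ala
"""

LINE = re.compile(r"Amino acid (\d+): (\S+)")


def parse_matches(text):
    """Map each amino acid position to its matched monomer name."""
    matches = {}
    for line in text.splitlines():
        found = LINE.fullmatch(line.strip())
        if found:  # blank or unrelated lines are ignored
            matches[int(found.group(1))] = found.group(2)
    return matches


matches = parse_matches(REPORT_LINES)
composition = Counter(matches.values())

# Print monomers from most to least frequent.
for name, count in composition.most_common():
    print(f"{name}: {count}")
```

Cys (6), Ser (5) and Ile (4) are the most frequent monomers among the 30 positions.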