Structure 2 Sequence Report

Structure-to-Sequence (s2s) is a computing process based on RDkit and the characteristics of cyclic peptide sequences, which can convert cyclic peptide SMILES into sequence information. This process mainly relies on the completeness of the monomer reference library. You can access our default monomer reference library through download link. The details of s2s are available on dfwlab/cyclicpepedia on Github. And you can use this tool online on the cyclicpepedia.


Version : 1.0.1 (2023-12-26)


Load SMILES :

SMILES : CC[C@H](C)[C@@H]1NC(=O)CNC(=O)[C@H](CCCNC(N)=[NH2+])NC(=O)[C@H](Cc2ccc(O)cc2)NC(=O)[C@@H]2CSSC[C@H](NC1=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCCNC(N)=[NH2+])C(=O)N[C@@H](CCCNC(N)=[NH2+])C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=[NH2+])C(N)=O)CSSC[C@H](NC(=O)[C@H](Cc1c[nH]c3ccccc13)NC(=O)[C@@H]([NH3+])CCCC[NH3+])C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCCNC(N)=[NH2+])C(=O)N[C@@H](C(C)C)C(=O)N2

SMILES is corrected!


Identify peptide skeleton and renumber atoms


Identify amino acid units


Obtain complete amino acid structures

Amino acid 1

Amino acid 2

Amino acid 3

Amino acid 4

Amino acid 5

Amino acid 6

Amino acid 7

Amino acid 8

Amino acid 9

Amino acid 10

Amino acid 11

Amino acid 12

Amino acid 13

Amino acid 14

Amino acid 15

Amino acid 16

Amino acid 17


Identify amino acids based on the monomer reference library

Number of chain(s) identified from the structure: 1

> Chain 1 :

Amino acid sequence : Lys--Trp--Cys(1)--Phe--Orn--Val--Cys(2)--Tyr--Orn--Gly--Ile--Cys(2)--Tyr--Orn--Orn--Cys(1)--Put

Amino acid mapping

Amino acid location

2024-01-15T14:07:40.157517 image/svg+xml Matplotlib v3.7.4, https://matplotlib.org/

Matched amino acid from monomer reference library

Amino acid 1: Lys

Amino acid 2: Trp

Amino acid 3: Cys

Amino acid 4: Phe

Amino acid 5: Orn

Amino acid 6: Val

Amino acid 7: Cys

Amino acid 8: Tyr

Amino acid 9: Orn

Amino acid 10: Gly

Amino acid 11: Ile

Amino acid 12: Cys

Amino acid 13: Tyr

Amino acid 14: Orn

Amino acid 15: Orn

Amino acid 16: Cys

Amino acid 17: Put