Structure 2 Sequence Report

Structure-to-Sequence (s2s) is a computing process based on RDkit and the characteristics of cyclic peptide sequences, which can convert cyclic peptide SMILES into sequence information. This process mainly relies on the completeness of the monomer reference library. You can access our default monomer reference library through download link. The details of s2s are available on dfwlab/cyclicpepedia on Github. And you can use this tool online on the cyclicpepedia.

Version : 1.0.1 (2023-12-26)

Load SMILES :

SMILES : CC(C)C[C@@H]1NC(=O)[C@H](C)NC(=O)[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](CCCCN)NC(=O)[C@@H](CC(C)C)NC(=O)[C@@H](N)Cc2ccccc2)[C@H](C(N)=O)OC(=O)[C@H](CC(C)C)NC1=O

SMILES is corrected!

Identify peptide skeleton and renumber atoms

Identify amino acid units

Obtain complete amino acid structures

Amino acid 1

Amino acid 2

Amino acid 3

Amino acid 4

Amino acid 5

Amino acid 6

Amino acid 7

Amino acid 8

Identify amino acids based on the monomer reference library

Number of chain(s) identified from the structure: 1

> Chain 1 :

Amino acid sequence : Phe--Leu--Lys--Ser--Asn(1)--Ala--Leu--Leu(1)

Amino acid mapping

Amino acid location

Matched amino acid from monomer reference library

Amino acid 1: Phe

Amino acid 2: Leu

Amino acid 3: Lys

Amino acid 4: Ser

Amino acid 5: Asn

Amino acid 6: Ala

Amino acid 7: Leu

Amino acid 8: Leu