smiles_to_structure

This script reads a list of SMILES (Simplified Molecular Input Line Entry System) strings from a file, converts them into chemical structures using RDKit, and displays the structures as images directly within a Jupyter notebook.

https://github.com/quantaosun/smiles_to_structure

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.4%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

This script reads a list of SMILES (Simplified Molecular Input Line Entry System) strings from a file, converts them into chemical structures using RDKit, and displays the structures as images directly within a Jupyter notebook.

Basic Info
  • Host: GitHub
  • Owner: quantaosun
  • Language: Jupyter Notebook
  • Default Branch: main
  • Homepage:
  • Size: 4.8 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created almost 2 years ago · Last pushed over 1 year ago
Metadata Files
Readme Citation

README.md

SmilestoStructure

This script reads a list of SMILES (Simplified Molecular Input Line Entry System) strings from a raw SMILES file, converts them into chemical structures using RDKit, and displays the resulting structures as images directly within a Jupyter notebook. Each structure is labeled with its corresponding entry number for easy identification.

Open In Colab

Tutorial: 1. Generate a csv file, for example, for a specific protein target, you can download a csv file from Chembl database

  1. Open this csv file with your excel or Mac number software, export it to csv again, since the downloaded csv may have some minor format issue not recognised by this code.

  2. Upload this new csv file to the code via the Colab budge link, and it is ready to process through the SMILES inside the csv you provided.

  3. read the file by modifying the file name df = pd.read_csv('11.csv') df

  4. Change the column number based on where is your SMILES in the csv file, if they are stored at 9th column, it should be !awk -F "\"*,\"*" '{print $8}' 11.csv > smile.smi

(6. Delete rows that not a SMILES, and export the new smiles string to a new csv file.)

Display the molecules as images inside Jupyter Notebook

image

Owner

  • Name: Sun
  • Login: quantaosun
  • Kind: user
  • Location: China

Medicinal Chemistry/Modelling

Citation (CITATION.cff)

cff-version: 0.0.1
message: "If you use this software, please cite it as below."
authors:
- family-names: "Sun"
  given-names: "Quantao"
title: "smiles_to_structures"
version: 0.0.1
date-released: 2024-7-24
repository-code: https://github.com/quantaosun/Smiles_to_Structure
keywords:
  - "rdkit"
  - "jupyter notebook"
  - "smiles structure"
license: MIT
url: https://github.com/quantaosun/Smiles_to_Structure

GitHub Events

Total
  • Push event: 4
Last Year
  • Push event: 4