smiles_to_structure
This script reads a list of SMILES (Simplified Molecular Input Line Entry System) strings from a file, converts them into chemical structures using RDKit, and displays the structures as images directly within a Jupyter notebook.
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.4%) to scientific vocabulary
Repository
This script reads a list of SMILES (Simplified Molecular Input Line Entry System) strings from a file, converts them into chemical structures using RDKit, and displays the structures as images directly within a Jupyter notebook.
Basic Info
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
SmilestoStructure
This script reads a list of SMILES (Simplified Molecular Input Line Entry System) strings from a raw SMILES file, converts them into chemical structures using RDKit, and displays the resulting structures as images directly within a Jupyter notebook. Each structure is labeled with its corresponding entry number for easy identification.
Tutorial: 1. Generate a csv file, for example, for a specific protein target, you can download a csv file from Chembl database
Open this csv file with your excel or Mac number software, export it to csv again, since the downloaded csv may have some minor format issue not recognised by this code.
Upload this new csv file to the code via the Colab budge link, and it is ready to process through the SMILES inside the csv you provided.
read the file by modifying the file name
df = pd.read_csv('11.csv') dfChange the column number based on where is your SMILES in the csv file, if they are stored at 9th column, it should be
!awk -F "\"*,\"*" '{print $8}' 11.csv > smile.smi
(6. Delete rows that not a SMILES, and export the new smiles string to a new csv file.)
Display the molecules as images inside Jupyter Notebook
Owner
- Name: Sun
- Login: quantaosun
- Kind: user
- Location: China
- Repositories: 218
- Profile: https://github.com/quantaosun
Medicinal Chemistry/Modelling
Citation (CITATION.cff)
cff-version: 0.0.1 message: "If you use this software, please cite it as below." authors: - family-names: "Sun" given-names: "Quantao" title: "smiles_to_structures" version: 0.0.1 date-released: 2024-7-24 repository-code: https://github.com/quantaosun/Smiles_to_Structure keywords: - "rdkit" - "jupyter notebook" - "smiles structure" license: MIT url: https://github.com/quantaosun/Smiles_to_Structure
GitHub Events
Total
- Push event: 4
Last Year
- Push event: 4