ic50dnatop

https://github.com/rahul1roshan/ic50dnatop

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (14.7%) to scientific vocabulary

Last synced: 10 months ago · JSON representation ·

Repository

Basic Info

Host: GitHub
Owner: rahul1roshan
Language: Python
Default Branch: main
Size: 310 KB

Statistics

Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created about 1 year ago · Last pushed about 1 year ago

Metadata Files

Readme Citation Authors

IC50 Prediction Web Application This project is a comprehensive web application for performing Quantitative Structure-Activity Relationship (QSAR) analysis using Streamlit. The application allows users to fetch bioactivity data from a ChEMBL database, compute molecular fingerprints, and train machine learning models for regression and classification tasks. It supports SMILES input files and outputs predictions based on trained models.

Importance and Advantages of the Solution Importance:

Efficient Drug Discovery: QSAR analysis is crucial in predicting the bioactivity of chemical compounds, significantly reducing the time and cost associated with drug discovery.

Data-Driven Decisions: By leveraging bioactivity data and molecular fingerprints, researchers can make more informed decisions about which compounds to prioritize for further testing.

Advantages: Streamlined Workflow: The web app integrates data fetching, fingerprint computation, and model training into a single platform, simplifying the workflow for users.

User-Friendly Interface: With Streamlit, the application provides an intuitive interface for users to interact with, making complex processes accessible to those with limited programming knowledge.

Scalable and Customizable: The modular design allows users to easily update the models and datasets, ensuring the tool remains relevant and effective as new data becomes available.

Reproducibility: By automating the QSAR process, the web app ensures that results are reproducible and consistent, which is critical for scientific research.

Accessibility: Researchers and developers can access and use the app from any location, facilitating collaboration and data sharing.

Features Fetch bioactivity data from ChEMBL using ChEMBL IDs. Compute molecular fingerprints using RDKit and PaDEL-Descriptor. Train and save the best regression and classification models. Process SMILES files to predict bioactivity classes and pIC50 values. Provide downloadable results as CSV files. Installation Prerequisites Python 3.7 or higher Required Python packages (listed in requirements.txt) Steps Clone the repository:

git clone https://github.com/yourusername/QSARWebApp.git cd QSARWebApp Install the required Python packages:

pip install -r requirements.txt Ensure you have the ChEMBL database (chembl_34.db) in the specified directory.

Run the Streamlit application:

streamlit run app.py Usage Fetching Bioactivity Data Enter the ChEMBL ID of the target. Click "Fetch Data" to retrieve bioactivity data for the specified target. The data will be displayed in the app and saved as CSV and SMI files. Computing Molecular Fingerprints Ensure the SMI file is saved in the specified directory. Click "Compute Fingerprints" to generate PaDEL and RDKit fingerprints. The computed fingerprints will be saved as CSV files. Training Models Train the best regression model: Click "Train Regression Model". The best model will be trained and saved based on R² score. Train the best classification model: Click "Train Classification Model". The best model will be trained and saved based on accuracy. Processing New SMILES Files Upload a SMILES file in CSV or SMI format. Click "Process SMILES File". The app will compute fingerprints, apply trained models, and output predictions. Directory Structure bioactivitydata/: Directory for storing fetched bioactivity data CSV files. smifolder/: Directory for storing SMILES files. fingerprints/: Directory for storing computed fingerprint CSV files. regressionmodels/: Directory for storing trained regression models. classificationmodels/: Directory for storing trained classification models. predictions/: Directory for storing prediction results. Dependencies pandas numpy sqlite3 glob zipfile padelpy tempfile shutil rdkit sklearn joblib streamlit Contributing Contributions are welcome! Please fork the repository and submit a pull request with your changes.

License This project is licensed under the (https://www.richwebs.com/team/).

Feel free to reach out for any questions or support. Happy QSAR analysis!

Note: Ensure all paths and directories are correctly set up in the code before running the application.

Owner

Login: rahul1roshan
Kind: user

Repositories: 1
Profile: https://github.com/rahul1roshan

Citation (citation.txt)

( To cite his work: Rahul Gupta, https://ic50activdomain.streamlit.app/ )

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

ic50dnatop

Science Score: 44.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

Owner

Citation (citation.txt)

GitHub Events

Total

Last Year