Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
✓Committers with academic emails
1 of 3 committers (33.3%) from academic institutions -
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.5%) to scientific vocabulary
Keywords
Repository
Auto Data Science - Python Library.
Basic Info
Statistics
- Stars: 138
- Watchers: 14
- Forks: 32
- Open Issues: 1
- Releases: 0
Topics
Metadata Files
README.md
GML
Creators
Contributors
Mehran Kamal
Rafey Iqbal Rahman
Tired of doing Data Science manually? GML is here for you!
GML is an automatic data science library in python built on top of multiple Python packages. Complete features which we offer are listed as:
Installation:
python
pip install GML
https://pypi.org/project/GML
If you are facing any pytorch related issue during installation, kindly refer to following solution: https://github.com/Muhammad4hmed/GML/issues/6#issuecomment-735912557
Features:
Auto Feature Engineering
```python from GML import FeatureEngineering
fe = FeatureEngineering(Data, 'target', fillmissingdata=True, encodedata=True, normalize=True, removeoutliers=True, newfeatures=True, featengsteps=2 ) # feateng_steps = 0 for features selection without feature creation
Xnew, y, test = fe.getnew_data() ```
Click Here for complete DEMO
Auto EDA (Powered by Sweetviz)
```python from GML import sweetviz
result1 = sweetviz.compare([train,'train'],[test,'test'],'target') result2 = sweetviz.analyze([train,'train'])
result.showhtml() result2.showhtml() ```
Click Here for complete DEMO
Auto Machine Learning
```python from GML import AutoML
gml_ml = AutoML()
gmlml.GMLClassifier(X, y, metric = accuracyscore, folds = 10) ```
Click Here for complete DEMO
Auto Text Cleaning
```python from GML import AutoNLP
nlp = AutoNLP()
cleanX = X.apply(lambda x: nlp.clean(x)) ```
Click Here for complete DEMO
Auto Text Classification using transformers
```python from GML import AutoNLP
nlp = AutoNLP()
nlp.setparams(cleanX, tokenizername='roberta-large-mnli', BATCHSIZE=4, modelname='roberta-large-mnli', MAX_LEN=200)
model = nlp.train_model(tokenizedX, y) ```
Click Here for complete DEMO
Auto Image Classification with Augmentation
```python from GML import AutoImageProcessing
gmlimageprocessing = AutoImageProcessing()
model = gmlimageprocessing.imgClassificationcsv(imgpath = './covidimagedata/train', trainpath = './covidimagedata/Trainingsetcovid.csv', modellist = models, tfms = True, advanceaugmentation = True, epochs=1) ```
Click Here for complete DEMO
Text Augmentation using transformers: GPT-2
```python from GML import AutoNLP
nlp = AutoNLP()
nlp.augmentation_train('./data.csv')
nlp.set_params(X['Text'])
newText = nlp.augmentationgenerate(y = y, SENTENCES = 100) ```
Click Here for complete DEMO
More cool features and handling of different data types like audio data etc will be added in future.
Feel free to give suggestions, report bugs and contribute.
Owner
- Name: Muhammad Ahmed
- Login: Muhammad4hmed
- Kind: user
- Location: Pakistan
- Website: http://muhammad-ahmed.com
- Twitter: muhammad4hmed
- Repositories: 23
- Profile: https://github.com/Muhammad4hmed
GitHub Events
Total
Last Year
Committers
Last synced: over 2 years ago
Top Committers
| Name | Commits | |
|---|---|---|
| Muhammad Ahmed | k****1@n****k | 9 |
| Naman Tuli | 6****8 | 2 |
| Mehran Kamal | m****h@g****m | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 11
- Total pull requests: 3
- Average time to close issues: 20 days
- Average time to close pull requests: 2 days
- Total issue authors: 7
- Total pull request authors: 3
- Average comments per issue: 5.36
- Average comments per pull request: 0.0
- Merged pull requests: 3
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- prashanthGit945 (5)
- Rubix982 (1)
- Vaderv (1)
- harry418 (1)
- chitown88 (1)
- aadilganigaie (1)
- SharjeelAbbas014 (1)
Pull Request Authors
- namantuli18 (1)
- mehrankamal (1)
- RafeyIqbalRahman (1)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- Keras *
- Pint *
- albumentations *
- beautifulsoup4 *
- catboost *
- category_encoders *
- efficientnet_pytorch *
- fastai ==1.0.61
- ftfy *
- lightgbm *
- matplotlib *
- numpy *
- pandas *
- requests *
- scikit-learn *
- seaborn *
- sympy *
- tensorflow *
- torch *
- torchvision *
- tqdm *
- transformers *
- xgboost *
- Keras *
- Pint *
- albumentations *
- beautifulsoup4 *
- catboost *
- category_encoders *
- efficientnet_pytorch *
- fastai ==1.0.61
- ftfy *
- lightgbm *
- matplotlib *
- numpy *
- pandas *
- requests *
- scikit-learn *
- seaborn *
- sympy *
- tensorflow *
- torch *
- torchvision *
- tqdm *
- transformers *
- xgboost *