GML

Auto Data Science - Python Library.

https://github.com/Muhammad4hmed/GML

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    1 of 3 committers (33.3%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.5%) to scientific vocabulary

Keywords

gml pypi python
Last synced: 6 months ago · JSON representation

Repository

Auto Data Science - Python Library.

Basic Info
  • Host: GitHub
  • Owner: Muhammad4hmed
  • License: mit
  • Language: HTML
  • Default Branch: master
  • Homepage:
  • Size: 43.4 MB
Statistics
  • Stars: 138
  • Watchers: 14
  • Forks: 32
  • Open Issues: 1
  • Releases: 0
Topics
gml pypi python
Created about 6 years ago · Last pushed about 5 years ago
Metadata Files
Readme License

README.md

GML Brain+Machine Adding AI Revolution

Generic badge Generic badge Generic badge Generic badge
PyPI version PyPI license PyPI pyversions GitHub issues

Creators

Muhammad Ahmed
Naman Tuli

Contributors

Mehran Kamal
Rafey Iqbal Rahman

Tired of doing Data Science manually? GML is here for you!

GML is an automatic data science library in python built on top of multiple Python packages. Complete features which we offer are listed as:


Installation:


python pip install GML


https://pypi.org/project/GML
If you are facing any pytorch related issue during installation, kindly refer to following solution: https://github.com/Muhammad4hmed/GML/issues/6#issuecomment-735912557

Features:


Auto Feature Engineering



```python from GML import FeatureEngineering

fe = FeatureEngineering(Data, 'target', fillmissingdata=True, encodedata=True, normalize=True, removeoutliers=True, newfeatures=True, featengsteps=2 ) # feateng_steps = 0 for features selection without feature creation

Xnew, y, test = fe.getnew_data() ```

Click Here for complete DEMO


Auto EDA (Powered by Sweetviz)



```python from GML import sweetviz

result1 = sweetviz.compare([train,'train'],[test,'test'],'target') result2 = sweetviz.analyze([train,'train'])

result.showhtml() result2.showhtml() ```

Click Here for complete DEMO



Auto Machine Learning



```python from GML import AutoML

gml_ml = AutoML()

gmlml.GMLClassifier(X, y, metric = accuracyscore, folds = 10) ```


Click Here for complete DEMO

Auto Text Cleaning



```python from GML import AutoNLP

nlp = AutoNLP()

cleanX = X.apply(lambda x: nlp.clean(x)) ```

Click Here for complete DEMO


Auto Text Classification using transformers



```python from GML import AutoNLP

nlp = AutoNLP()

nlp.setparams(cleanX, tokenizername='roberta-large-mnli', BATCHSIZE=4, modelname='roberta-large-mnli', MAX_LEN=200)

model = nlp.train_model(tokenizedX, y) ```

Click Here for complete DEMO


Auto Image Classification with Augmentation



```python from GML import AutoImageProcessing

gmlimageprocessing = AutoImageProcessing()

model = gmlimageprocessing.imgClassificationcsv(imgpath = './covidimagedata/train', trainpath = './covidimagedata/Trainingsetcovid.csv', modellist = models, tfms = True, advanceaugmentation = True, epochs=1) ```

Click Here for complete DEMO


Text Augmentation using transformers: GPT-2



```python from GML import AutoNLP

nlp = AutoNLP()

nlp.augmentation_train('./data.csv')

nlp.set_params(X['Text'])

newText = nlp.augmentationgenerate(y = y, SENTENCES = 100) ```

Click Here for complete DEMO



More cool features and handling of different data types like audio data etc will be added in future.
Feel free to give suggestions, report bugs and contribute.

Owner

  • Name: Muhammad Ahmed
  • Login: Muhammad4hmed
  • Kind: user
  • Location: Pakistan

GitHub Events

Total
Last Year

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 12
  • Total Committers: 3
  • Avg Commits per committer: 4.0
  • Development Distribution Score (DDS): 0.25
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Muhammad Ahmed k****1@n****k 9
Naman Tuli 6****8 2
Mehran Kamal m****h@g****m 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 11
  • Total pull requests: 3
  • Average time to close issues: 20 days
  • Average time to close pull requests: 2 days
  • Total issue authors: 7
  • Total pull request authors: 3
  • Average comments per issue: 5.36
  • Average comments per pull request: 0.0
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • prashanthGit945 (5)
  • Rubix982 (1)
  • Vaderv (1)
  • harry418 (1)
  • chitown88 (1)
  • aadilganigaie (1)
  • SharjeelAbbas014 (1)
Pull Request Authors
  • namantuli18 (1)
  • mehrankamal (1)
  • RafeyIqbalRahman (1)
Top Labels
Issue Labels
documentation (1) good first issue (1)
Pull Request Labels

Dependencies

GML.egg-info/requires.txt pypi
  • Keras *
  • Pint *
  • albumentations *
  • beautifulsoup4 *
  • catboost *
  • category_encoders *
  • efficientnet_pytorch *
  • fastai ==1.0.61
  • ftfy *
  • lightgbm *
  • matplotlib *
  • numpy *
  • pandas *
  • requests *
  • scikit-learn *
  • seaborn *
  • sympy *
  • tensorflow *
  • torch *
  • torchvision *
  • tqdm *
  • transformers *
  • xgboost *
setup.py pypi
  • Keras *
  • Pint *
  • albumentations *
  • beautifulsoup4 *
  • catboost *
  • category_encoders *
  • efficientnet_pytorch *
  • fastai ==1.0.61
  • ftfy *
  • lightgbm *
  • matplotlib *
  • numpy *
  • pandas *
  • requests *
  • scikit-learn *
  • seaborn *
  • sympy *
  • tensorflow *
  • torch *
  • torchvision *
  • tqdm *
  • transformers *
  • xgboost *