https://github.com/darkstarstrix/auto_api

My personal ML infrastructure

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (12.0%) to scientific vocabulary

Keywords

infrastructure infrastructure-as-code machine-learning python

Last synced: 10 months ago · JSON representation

Repository

My personal ML infrastructure

Basic Info

Host: GitHub
Owner: DarkStarStrix
License: mit
Language: Python
Default Branch: master
Homepage:
Size: 14.6 MB

Statistics

Stars: 13
Watchers: 2
Forks: 3
Open Issues: 0
Releases: 4

Topics

infrastructure infrastructure-as-code machine-learning python

Created over 1 year ago · Last pushed about 1 year ago

Metadata Files

Readme Contributing Funding License Code of conduct

AutoML Pipeline

A simplified machine learning automation framework that handles the complexities of ML workflows through configuration. The project provides an easy-to-use interface for training machine learning models while automatically managing optimization, visualization, and training processes.

[!IMPORTANT] This framework is designed for rapid ML prototyping and experimentation. For production deployments, please review the performance metrics and model validation sections carefully.

🚀 Quick Start

```python from lightningauto import AutoML from config import getclassification_config

Get configuration

config = getclassificationconfig()

Initialize and train

automl = AutoML(config) automl.fit(traindata, valdata) ```

[!TIP] Check the examples/ directory for complete notebook demonstrations of common use cases.

✨ Key Features

Automated Training: Simplified training process with minimal user intervention
Configuration Templates: Easy-to-use configuration templates for common ML tasks
Visualization: Built-in visualization tools for performance analysis

📊 Performance Metrics

[!NOTE] These metrics represent baseline performance and may vary based on your dataset and configuration.

Training Loss: 1.6422
Validation Loss: 1.6169
Learning Rate: 0.000896

📈 Visualization Analysis

1. Class Distribution

Class Distribution

[!NOTE] The distribution plot shows: - Class 2 dominates with ~50 samples - Class 0 follows with ~27 samples - Class 4 has the least representation with ~20 samples - Clear sign of class imbalance that may need addressing

2. Classification Results

Classification Matrix

[!NOTE] Key observations from the confusion matrix: - Strong diagonal pattern indicates good overall classification - Class 2 shows the highest confidence with 10–14 correct predictions - Some classifications between neighboring classes - Class 3 shows room for improvement in discrimination

3. Training Dynamics

Loss During Training

[!NOTE] The loss plot reveals: - Training loss (blue) shows healthy fluctuation between 1.54–1.70 - Validation loss (orange) maintains stability around 1.62 - No significant overfitting as validation loss remains stable - Good model convergence with occasional exploration spikes

4. Learning Rate Schedule

Learning Rate Schedule

[!NOTE] The learning rate schedule demonstrates: - Smooth cosine decay from 2e-3 to 9e-4 - Gradual learning rate reduction for fine-tuning - Proper annealing behavior for optimization stability

📁 Project Structure

automl/ ├── lightning_auto.py # Core AutoML engine ├── config.py # Configuration templates ├── train.py # Training script ├── WriterSide/ # Documentation └── examples/ # Example notebooks (Coming soon!)

⚙️ Configuration Templates

[!WARNING] Always validate configuration parameters against your specific use case before training.

Linear Classification Example

python config = { "model": { "type": "classification", "input_dim": 10, "output_dim": 5, "task": "classification" }, "training": { "learning_rate": 0.002, "epochs": 30 } # ... other parameters }

🤝 Contributing

[!CAUTION] Before submitting large changes, please open an issue to discuss the proposed modifications.

Adding New Configuration Templates

Create a new function in config.py: python def get_custom_config(): return { "model": { # model specifications }, "training": { # training parameters } }
Add documentation and example usage
Submit a pull request

Backend Development

For new features:
- Fork the repository
- Create feature branch
- Add tests
- Submit pull request

📝 License

This project is licensed under the MIT License—see the LICENSE file for details.

Owner

Name: Allan Murimi Wandia
Login: DarkStarStrix
Kind: user
Location: U.S.A
Company: Freelance

Website: https://www.kaggle.com/allanwandia
Repositories: 1
Profile: https://github.com/DarkStarStrix

Full stack Dev Turning ideas into projects

GitHub Events

Total

Create event: 24
Issues event: 17
Release event: 4
Watch event: 9
Delete event: 19
Issue comment event: 12
Public event: 1
Push event: 80
Pull request event: 38
Fork event: 4

Last Year

Create event: 24
Issues event: 17
Release event: 4
Watch event: 9
Delete event: 19
Issue comment event: 12
Public event: 1
Push event: 80
Pull request event: 38
Fork event: 4

Issues and Pull Requests

Last synced: 11 months ago

All Time

Total issues: 2
Total pull requests: 12
Average time to close issues: 14 days
Average time to close pull requests: 10 minutes
Total issue authors: 1
Total pull request authors: 2
Average comments per issue: 1.0
Average comments per pull request: 0.0
Merged pull requests: 11
Bot issues: 0
Bot pull requests: 12

Past Year

Issues: 2
Pull requests: 12
Average time to close issues: 14 days
Average time to close pull requests: 10 minutes
Issue authors: 1
Pull request authors: 2
Average comments per issue: 1.0
Average comments per pull request: 0.0
Merged pull requests: 11
Bot issues: 0
Bot pull requests: 12

View more stats

Top Authors

Issue Authors

DarkStarStrix (9)
dependabot[bot] (1)

Pull Request Authors

dependabot[bot] (15)
imgbot[bot] (4)

Top Labels

Issue Labels

enhancement (9) dependencies (1)

Pull Request Labels

dependencies (15) python (6)

Dependencies

requirements.txt pypi

FastAPI *
matplotlib *
numpy *
pandas *
pytorch-lightning *
requests *
setuptools *
torch *
transformers *
uvicorn *

setup.py pypi

feature-engine >=1.0.0
numpy >=1.21.0
optuna >=2.10.0
pandas >=1.3.0
pytorch-lightning >=2.0.0
scikit-learn >=1.0.0
tensorboard >=2.12.0
torch >=2.0.0

https://github.com/darkstarstrix/auto_api

Science Score: 26.0%

Keywords

Repository

Basic Info

Statistics

Topics

Metadata Files

README.Md

AutoML Pipeline

🚀 Quick Start

Get configuration

Initialize and train

✨ Key Features

📊 Performance Metrics

📈 Visualization Analysis

1. Class Distribution

2. Classification Results

3. Training Dynamics

4. Learning Rate Schedule

📁 Project Structure

⚙️ Configuration Templates

Linear Classification Example

🤝 Contributing

Adding New Configuration Templates

Backend Development

📝 License

Owner

GitHub Events

Total

Last Year

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Dependencies