https://github.com/darkstarstrix/auto_api

My personal ML infrastructure

https://github.com/darkstarstrix/auto_api

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.0%) to scientific vocabulary

Keywords

infrastructure infrastructure-as-code machine-learning python
Last synced: 5 months ago · JSON representation

Repository

My personal ML infrastructure

Basic Info
  • Host: GitHub
  • Owner: DarkStarStrix
  • License: mit
  • Language: Python
  • Default Branch: master
  • Homepage:
  • Size: 14.6 MB
Statistics
  • Stars: 13
  • Watchers: 2
  • Forks: 3
  • Open Issues: 0
  • Releases: 4
Topics
infrastructure infrastructure-as-code machine-learning python
Created over 1 year ago · Last pushed 9 months ago
Metadata Files
Readme Contributing Funding License Code of conduct

README.Md

AutoML Pipeline

A simplified machine learning automation framework that handles the complexities of ML workflows through configuration. The project provides an easy-to-use interface for training machine learning models while automatically managing optimization, visualization, and training processes.

[!IMPORTANT] This framework is designed for rapid ML prototyping and experimentation. For production deployments, please review the performance metrics and model validation sections carefully.

🚀 Quick Start

```python from lightningauto import AutoML from config import getclassification_config

Get configuration

config = getclassificationconfig()

Initialize and train

automl = AutoML(config) automl.fit(traindata, valdata) ```

[!TIP] Check the examples/ directory for complete notebook demonstrations of common use cases.

✨ Key Features

  • Automated Training: Simplified training process with minimal user intervention
  • Configuration Templates: Easy-to-use configuration templates for common ML tasks
  • Visualization: Built-in visualization tools for performance analysis

📊 Performance Metrics

[!NOTE] These metrics represent baseline performance and may vary based on your dataset and configuration.

  • Training Loss: 1.6422
  • Validation Loss: 1.6169
  • Learning Rate: 0.000896

📈 Visualization Analysis

1. Class Distribution

Class Distribution

[!NOTE] The distribution plot shows: - Class 2 dominates with ~50 samples - Class 0 follows with ~27 samples - Class 4 has the least representation with ~20 samples - Clear sign of class imbalance that may need addressing

2. Classification Results

Classification Matrix

[!NOTE] Key observations from the confusion matrix: - Strong diagonal pattern indicates good overall classification - Class 2 shows the highest confidence with 10–14 correct predictions - Some classifications between neighboring classes - Class 3 shows room for improvement in discrimination

3. Training Dynamics

Loss During Training

[!NOTE] The loss plot reveals: - Training loss (blue) shows healthy fluctuation between 1.54–1.70 - Validation loss (orange) maintains stability around 1.62 - No significant overfitting as validation loss remains stable - Good model convergence with occasional exploration spikes

4. Learning Rate Schedule

Learning Rate Schedule

[!NOTE] The learning rate schedule demonstrates: - Smooth cosine decay from 2e-3 to 9e-4 - Gradual learning rate reduction for fine-tuning - Proper annealing behavior for optimization stability

📁 Project Structure

automl/ ├── lightning_auto.py # Core AutoML engine ├── config.py # Configuration templates ├── train.py # Training script ├── WriterSide/ # Documentation └── examples/ # Example notebooks (Coming soon!)

⚙️ Configuration Templates

[!WARNING] Always validate configuration parameters against your specific use case before training.

Linear Classification Example

python config = { "model": { "type": "classification", "input_dim": 10, "output_dim": 5, "task": "classification" }, "training": { "learning_rate": 0.002, "epochs": 30 } # ... other parameters }

🤝 Contributing

[!CAUTION] Before submitting large changes, please open an issue to discuss the proposed modifications.

Adding New Configuration Templates

  1. Create a new function in config.py: python def get_custom_config(): return { "model": { # model specifications }, "training": { # training parameters } }
  2. Add documentation and example usage
  3. Submit a pull request

Backend Development

  1. For new features:
    • Fork the repository
    • Create feature branch
    • Add tests
    • Submit pull request

📝 License

This project is licensed under the MIT License—see the LICENSE file for details.

Owner

  • Name: Allan Murimi Wandia
  • Login: DarkStarStrix
  • Kind: user
  • Location: U.S.A
  • Company: Freelance

Full stack Dev Turning ideas into projects

GitHub Events

Total
  • Create event: 24
  • Issues event: 17
  • Release event: 4
  • Watch event: 9
  • Delete event: 19
  • Issue comment event: 12
  • Public event: 1
  • Push event: 80
  • Pull request event: 38
  • Fork event: 4
Last Year
  • Create event: 24
  • Issues event: 17
  • Release event: 4
  • Watch event: 9
  • Delete event: 19
  • Issue comment event: 12
  • Public event: 1
  • Push event: 80
  • Pull request event: 38
  • Fork event: 4

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 2
  • Total pull requests: 12
  • Average time to close issues: 14 days
  • Average time to close pull requests: 10 minutes
  • Total issue authors: 1
  • Total pull request authors: 2
  • Average comments per issue: 1.0
  • Average comments per pull request: 0.0
  • Merged pull requests: 11
  • Bot issues: 0
  • Bot pull requests: 12
Past Year
  • Issues: 2
  • Pull requests: 12
  • Average time to close issues: 14 days
  • Average time to close pull requests: 10 minutes
  • Issue authors: 1
  • Pull request authors: 2
  • Average comments per issue: 1.0
  • Average comments per pull request: 0.0
  • Merged pull requests: 11
  • Bot issues: 0
  • Bot pull requests: 12
Top Authors
Issue Authors
  • DarkStarStrix (9)
  • dependabot[bot] (1)
Pull Request Authors
  • dependabot[bot] (15)
  • imgbot[bot] (4)
Top Labels
Issue Labels
enhancement (9) dependencies (1)
Pull Request Labels
dependencies (15) python (6)

Dependencies

requirements.txt pypi
  • FastAPI *
  • matplotlib *
  • numpy *
  • pandas *
  • pytorch-lightning *
  • requests *
  • setuptools *
  • torch *
  • transformers *
  • uvicorn *
setup.py pypi
  • feature-engine >=1.0.0
  • numpy >=1.21.0
  • optuna >=2.10.0
  • pandas >=1.3.0
  • pytorch-lightning >=2.0.0
  • scikit-learn >=1.0.0
  • tensorboard >=2.12.0
  • torch >=2.0.0