lightgbm
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
upgini
Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs
rgf
Home repository for the Regularized Greedy Forest (RGF) library. It includes original implementation from the paper and multithreaded one written in C++, along with various language-specific wrappers.
https://github.com/catalyst-cooperative/pudl-examples
Example Jupyter notebooks hosted on Kaggle that demonstrate how to work with US energy data from PUDL.
NYISOToolkit
Access data, statistics, and visualizations for New York's electricity grid.
https://github.com/aisuko/notebooks
Implementation for the different ML tasks on Kaggle platform with GPUs.
https://github.com/lromul/argus-tgs-salt
Kaggle | 14th place solution for TGS Salt Identification Challenge
https://github.com/lromul/argus-alaska
Kaggle | Part of 25th place solution for ALASKA2 Image Steganalysis kaggle competition.
https://github.com/aryashah2k/kaggle-profile-of-arya
Repository Hosting Files and Media Of My Work Published On Kaggle (Competitions, Notebooks, Discussions & Dataset Links)
https://github.com/agrover112/fasttext-real-or-not-nlp-with-disaster-tweets
Kaggle real or not disaster tweets classification using FastText.
https://github.com/ahmedshahriar/customer-churn-prediction
Extensive EDA of the IBM telco customer churn dataset, implemented various statistical hypotheses tests and Performed single-level Stacking Ensemble and tuned hyperparameters using Optuna.
goto-conversion
goto_conversion - Powered $47,000 of prize money, over 10 Gold Medals and 100 Medals on Kaggle
https://github.com/albertgallegojimenez/kaggle-projects
This repository showcases my Kaggle projects in Data Science and Machine Learning fields, offering valuable resources and practical examples for the community. Feel free to explore and leverage the shared content.
https://github.com/ahmedshahriar/kaggle-notebooks
Collection of my competition notebooks on Kaggle
https://github.com/aisuko/generative-ai
The notebooks for generative AI by using PyTorch, Huggingface/diffusers, transforms. And the implementing of the algorithms in paper
https://github.com/amr-yasser226/intrusion-detection-kaggle
End-to-end pipeline for multi-class cyber-attack detection using per-flow network features: data profiling, deduplication, skew-correction, outlier treatment, feature engineering, imbalance handling, and tree-based modeling (XGBoost, LightGBM, CatBoost, stacking), with a final Kaggle submission scoring 0.9146 public / 0.9163 private.
https://github.com/cedrickchee/data-science-notebooks
Data science Python notebooks—a collection of Jupyter notebooks on machine learning, deep learning, statistical inference, data analysis and visualization.
https://github.com/arvkevi/clinvar-kaggle
Scripts used to generate the ClinVar conflicting classifications dataset on Kaggle
https://github.com/aidinhamedi/pneumonia-detection-ai
This project uses a deep learning model built with the TensorFlow Library to detect pneumonia in X-ray images. The model architecture is based on the EfficientNetB7 model, which has achieved an accuracy of approximately 97.12% (97.11538%) on our test data. This high accuracy rate is one of the strengths of our AI model.
https://github.com/fralfaro/kaggle-competitions
Notebooks - kaggle competitions
vesuvius-challenge
Unveiling the secrets of an ancient library buried by Mount Vesuvius, this Kaggle competition, supported by the Vesuvius Challenge organization, tasked participants with detecting ink from 3D X-ray scans of charred scrolls preserved in a Roman villa in Herculaneum.