Updated 9 months ago

lightgbm • Rank 33.1 • Science 46%

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

Updated 9 months ago

upgini • Rank 17.5 • Science 26%

Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs

Updated 9 months ago

rgf • Rank 19.0 • Science 23%

Home repository for the Regularized Greedy Forest (RGF) library. It includes original implementation from the paper and multithreaded one written in C++, along with various language-specific wrappers.

Updated 8 months ago

https://github.com/lromul/argus-alaska • Rank 2.6 • Science 13%

Kaggle | Part of 25th place solution for ALASKA2 Image Steganalysis kaggle competition.

Updated 9 months ago

https://github.com/aryashah2k/kaggle-profile-of-arya • Rank 2.2 • Science 10%

Repository Hosting Files and Media Of My Work Published On Kaggle (Competitions, Notebooks, Discussions & Dataset Links)

Updated 9 months ago

https://github.com/ahmedshahriar/customer-churn-prediction • Science 13%

Extensive EDA of the IBM telco customer churn dataset, implemented various statistical hypotheses tests and Performed single-level Stacking Ensemble and tuned hyperparameters using Optuna.

Updated 9 months ago

https://github.com/albertgallegojimenez/kaggle-projects • Science 13%

This repository showcases my Kaggle projects in Data Science and Machine Learning fields, offering valuable resources and practical examples for the community. Feel free to explore and leverage the shared content.

Updated 9 months ago

https://github.com/aisuko/generative-ai • Science 23%

The notebooks for generative AI by using PyTorch, Huggingface/diffusers, transforms. And the implementing of the algorithms in paper

Updated 9 months ago

https://github.com/amr-yasser226/intrusion-detection-kaggle • Science 26%

End-to-end pipeline for multi-class cyber-attack detection using per-flow network features: data profiling, deduplication, skew-correction, outlier treatment, feature engineering, imbalance handling, and tree-based modeling (XGBoost, LightGBM, CatBoost, stacking), with a final Kaggle submission scoring 0.9146 public / 0.9163 private.

Updated 9 months ago

kyara • Science 67%

[Kaggle-2nd] Lightweight yet Effective Chinese LLM.

Updated 9 months ago

https://github.com/cedrickchee/data-science-notebooks • Science 36%

Data science Python notebooks—a collection of Jupyter notebooks on machine learning, deep learning, statistical inference, data analysis and visualization.

Updated 9 months ago

https://github.com/arvkevi/clinvar-kaggle • Science 10%

Scripts used to generate the ClinVar conflicting classifications dataset on Kaggle

Updated 9 months ago

https://github.com/aidinhamedi/pneumonia-detection-ai • Science 26%

This project uses a deep learning model built with the TensorFlow Library to detect pneumonia in X-ray images. The model architecture is based on the EfficientNetB7 model, which has achieved an accuracy of approximately 97.12% (97.11538%) on our test data. This high accuracy rate is one of the strengths of our AI model.

Updated 9 months ago

vesuvius-challenge • Science 44%

Unveiling the secrets of an ancient library buried by Mount Vesuvius, this Kaggle competition, supported by the Vesuvius Challenge organization, tasked participants with detecting ink from 3D X-ray scans of charred scrolls preserved in a Roman villa in Herculaneum.