Segmenteverygrain

Segmenteverygrain: A Python module for segmentation of grains in images - Published in JOSS (2025)

https://github.com/zsylvester/segmenteverygrain

Science Score: 93.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 4 DOI reference(s) in README and JOSS metadata
✓
Academic publication links
Links to: joss.theoj.org
○
Committers with academic emails
○
Institutional organization owner
✓
JOSS paper metadata
Published in Journal of Open Source Software

Scientific Fields

Artificial Intelligence and Machine Learning Computer Science - 71% confidence

Sociology Social Sciences - 64% confidence

Last synced: 6 months ago · JSON representation

Repository

A SAM-based model for instance segmentation of images of grains

Basic Info

Host: GitHub
Owner: zsylvester
License: apache-2.0
Language: Jupyter Notebook
Default Branch: main
Homepage: https://zsylvester.github.io/segmenteverygrain/index.html
Size: 559 MB

Statistics

Stars: 507
Watchers: 17
Forks: 65
Open Issues: 7
Releases: 8

Created over 2 years ago · Last pushed 6 months ago

Metadata Files

Readme License

segmenteverygrain

Description

'segmenteverygrain' is a Python package that aims to detect grains (or grain-like objects) in images. The goal is to develop an ML model that does a reasonably good job at detecting most of the grains in a photo, so that it will be useful for determining grain size and grain shape, a common task in geomorphology and sedimentary geology. 'segmenteverygrain' relies on the Segment Anything Model (SAM), developed by Meta, for getting high-quality outlines of the grains. However, SAM requires prompts for every object detected and, when used in 'everything' mode, it tends to be slow and results in many overlapping masks and non-grain (background) objects. To deal with these issues, 'segmenteverygrain' relies on a Unet-style, patch-based convolutional neural network to create a first-pass segmentation which is then used to generate prompts for the SAM-based segmentation. Some of the grains will be missed with this approach, but the segmentations that are created tend to be of high quality.

'segmenteverygrain' also includes a set of functions that make it possible to clean up the segmentation results: delete and merge objects by clicking on them, and adding grains that were not segmented automatically. The QC-d masks can be saved and added to a dataset of grain images. These images then can be used to improve the Unet model. Many of the images used in the dataset are from the sedinet project.

Requirements

numpy
matplotlib
scipy
pandas
pillow
scikit-image
opencv-python
networkx
rasterio
shapely
tensorflow
pytorch
segment-anything
rtree
tqdm

Documentation

More documentation is available at https://zsylvester.github.io/segmenteverygrain/index.html.

Installation

'segmenteverygrain' is available through pypi.org, so you can install it by running: pip install segmenteverygrain If you are using 'pip', you need to make sure that the Python version is 3.9 (and not higher), so that all dependencies work correctly.

Note that you need to clone the repository to get the U-Net model file and the example images in one go; otherwise you need to download these manually and place them in the right folders.

The easiest way of creating a Python environment in which 'segmenteverygrain' works well is to use the 'environment.yml' file with conda (or mamba).

We recommend that you install conda or mamba to manage your Python environments. Due to licensing restrictions on Aanconda, miniforge might be the best option as there are no strings attached and it allows you to rely on the mamba package solver, which is faster than conda. If you are using mamba, you can simply replace conda with mamba in the commands below.

In Anaconda Prompt (Windows), or Terminal (Mac), enter the following to install pip and git packages:

conda install pip git

Download the segmenteverygrain files.

Windows: git clone --depth 1 https://github.com/zsylvester/segmenteverygrain.git Linux/Mac: git clone --depth 1 git@github.com:zsylvester/segmenteverygrain.git

Set up the segmenteverygrain environment with conda (Windows): conda env create -f segmenteverygrain/environment.yml

Set up the segmenteverygrain environment with conda (Mac): conda env create -f segmenteverygrain/environment_macos.yml

Activate environment: conda activate segmenteverygrain

Getting started

See the Segmenteverygrain.ipynb notebook for an example of how the models can be loaded and used for segmenting an image and QC-ing the result. The notebook goes through the steps of loading the models, running the segmentation, interactively updating the result, and saving the grain data and the mask. The last section of the notebook illustrates the use of the 'predictlargeimage' function that is recommended for large images (e.g., larger than 2000x3000 pixels). The images below illustrate how a relatively large thin-section image of a sandstone can be segmented using this approach. Image from Digital Rocks Portal.

If the base Unet model does not work well on a specific type of image, it is a good idea to generate some new training data (a few small images are usually enough) and to fine tune the base model so that it works better on the new image type. This can be done by running the cells in the last section ('Finetuning the base model') of the Segmenteverygrain.ipynb notebook.

The Segmenteverygrain_colab.ipynb has been adjusted so that the segmentation can be tested in Google Colab. That said, the interactivity in Colab is not as smooth as in a local notebook.

Running times

It takes 2 minutes and 40 seconds to run the full segmentation on a 3 megapixel (e.g., 1500x2000 pixels) image, on an Apple M2 Max laptop with 96 GB RAM. The same image takes the same amount of time to segment using Google Colab with a Nvidia A100 GPU.

Obviously, large images take longer to process. The segmentation of the ~20 megapixel example image that is provided in the repository ('mairetalL2DJI0382image.jpg') takes ~20 minutes with both hardware configurations mentioned before. As the processing of large images is done in patches, the increase in computational time is roughly linear.

Contributing

We welcome contributions from anyone interested in improving the project. To contribute to the model use the following steps:

Fork the repository.
Create a new branch for your changes:

bash git checkout -b feature/my-feature

Make your changes and commit them:

bash git add . git commit -m "Add my feature"

Push your changes to your forked repository:

bash git push origin feature/my-feature

Create a pull request from your forked repository back to the original repository.

Reporting Issues

If you encounter any issues or problems while using segmentanygrain, we encourage you to report them to us. This helps us identify and address any bugs or areas for improvement.

To report an issue, please follow these steps:

Check the Existing Issues: Before submitting a new issue, search our issue tracker to see if the problem you're experiencing has already been reported. If you find a similar issue, you can add any additional information or comments to that existing issue.
Create a New Issue: If you don't find an existing issue that matches your problem, create a new issue by clicking the "New issue" button on the issues page. Provide a clear and descriptive title for your issue, and include the following information in the description:
- A detailed description of the problem you're experiencing, including any error messages or unexpected behavior.
- The steps to reproduce the issue, if possible.
- Your operating system and the version of the software you're using.
- Any relevant logs or screenshots that could help us understand the problem.
Submit the Issue: Once you've provided all the necessary information, click the "Submit new issue" button to create the issue. Our team will review the issue and respond as soon as possible.

We appreciate you taking the time to report any issues you encounter. Your feedback helps us improve.

Acknowledgements

Thanks to Danny Stockli, Nick Howes, Kalinda Roberts, Jake Covault, Matt Malkowski, Raymond Luong, Wilson Bai, Rowan Martindale, and Sergey Fomel for discussions and/or helping with generating training data. Funding for this work came from the Quantitative Clastics Laboratory industrial consortium at the Bureau of Economic Geology, The University of Texas at Austin.

License

segmenteverygrain is licensed under the Apache License 2.0.

Owner

Name: Zoltán Sylvester
Login: zsylvester
Kind: user
Location: Austin, TX
Company: Bureau of Economic Geology, UT Austin

Website: https://zsylvester.github.io
Repositories: 3
Profile: https://github.com/zsylvester

Geologist interested in analyzing, modeling, and visualizing clastic sedimentary systems

JOSS Publication

Segmenteverygrain: A Python module for segmentation of grains in images

Published

August 11, 2025

DOI

10.21105/joss.07953

Volume 10, Issue 112, Page 7953

Authors

Zoltán Sylvester

Bureau of Economic Geology, Jackson School of Geosciences, The University of Texas at Austin, Austin, TX, USA, Department of Earth and Planetary Sciences, Jackson School of Geosciences, The University of Texas at Austin, Austin, TX, USA

Daniel F. Stockli

Department of Earth and Planetary Sciences, Jackson School of Geosciences, The University of Texas at Austin, Austin, TX, USA

Nick Howes
The Water Institute, New Orleans, LA, USA

Kalinda Roberts
BSC Group, Boston, MA, USA

Matthew A. Malkowski

Department of Earth and Planetary Sciences, Jackson School of Geosciences, The University of Texas at Austin, Austin, TX, USA

Zsófia Poros
ConocoPhillips, Houston, TX, USA

Rowan C. Martindale

Department of Earth and Planetary Sciences, Jackson School of Geosciences, The University of Texas at Austin, Austin, TX, USA

Wilson Bai
Department of Earth and Planetary Sciences, Jackson School of Geosciences, The University of Texas at Austin, Austin, TX, USA

Editor

Anastassia Vybornova

GitHub Events

Total

Create event: 3
Release event: 4
Issues event: 27
Watch event: 55
Issue comment event: 41
Push event: 129
Pull request event: 13
Fork event: 19

Last Year

Create event: 3
Release event: 4
Issues event: 27
Watch event: 55
Issue comment event: 41
Push event: 129
Pull request event: 13
Fork event: 19

Committers

Last synced: about 2 years ago

All Time

Total Commits: 38
Total Committers: 1
Avg Commits per committer: 38.0
Development Distribution Score (DDS): 0.0

Past Year

Commits: 38
Committers: 1
Avg Commits per committer: 38.0
Development Distribution Score (DDS): 0.0

Top Committers

Name	Email	Commits
Zoltan Sylvester	z**r@g**m	38

Issues and Pull Requests

Last synced: 6 months ago

All Time

Total issues: 30
Total pull requests: 14
Average time to close issues: 2 months
Average time to close pull requests: 9 days
Total issue authors: 18
Total pull request authors: 3
Average comments per issue: 1.07
Average comments per pull request: 0.5
Merged pull requests: 7
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 19
Pull requests: 14
Average time to close issues: 12 days
Average time to close pull requests: 9 days
Issue authors: 10
Pull request authors: 3
Average comments per issue: 0.68
Average comments per pull request: 0.5
Merged pull requests: 7
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

pr4deepr (8)
jessepisel (3)
GitHamza0206 (2)
ThomasMGeo (2)
panovr (1)
dirtbirb (1)
SuroshAhmadZobair (1)
lonelywanderer8 (1)
louisejuliedelhaye (1)
LigoMan99 (1)
giselerudderham (1)
haydenclose (1)
Dongwoo-Im (1)
Ronda-00 (1)
h-m-088 (1)

Pull Request Authors

dirtbirb (8)
jessepisel (4)
kalir44 (2)

Top Labels

Issue Labels

Pull Request Labels

Packages

Total packages: 1
Total downloads:
- pypi 403 last-month

Total dependent packages: 0
Total dependent repositories: 0
Total versions: 18
Total maintainers: 1

pypi.org: segmenteverygrain

a SAM-based model for segmenting grains in images of grains

Homepage: https://github.com/zsylvester/segmenteverygrain
Documentation: https://segmenteverygrain.readthedocs.io/
License: Apache Software License
Latest release: 0.2.3
published 7 months ago

Versions: 18
Dependent Packages: 0
Dependent Repositories: 0
Downloads: 403 Last month

Rankings

Dependent packages count: 7.3%

Average: 26.9%

Stargazers count: 28.3%

Forks count: 30.5%

Dependent repos count: 41.4%

Maintainers (1)

zsylvester

Last synced: 6 months ago

Dependencies

setup.py pypi

matplotlib *
networkx *
numpy *
opencv-python *
pillow *
rasterio *
scikit-image *
scipy *
segment-anything *
shapely *
tensorflow *
tqdm *

docs/requirements.txt pypi

itertools *
matplotlib ==3.7.1
networkx ==3.1
numpy ==1.24.3
pandas ==2.0.3
rasterio ==1.3.9
rtree *
scikit-image ==0.21.0
scikit-learn ==1.3.2
scipy ==1.10.1
segment-anything ==1.0
shapely ==2.0.3
sphinx *
sphinx-rtd-theme *
tensorflow ==2.13.0
tqdm *
warnings *

.github/workflows/ci.yaml actions

actions/checkout v4 composite
actions/setup-python v5 composite

.github/workflows/draft-pdf.yml actions

actions/checkout v4 composite
actions/upload-artifact v4 composite
openjournals/openjournals-draft-action master composite

requirements.txt pypi

jupyterlab *
matplotlib *
networkx *
numpy *
opencv-python *
pandas *
pillow *
pip *
rasterio *
rtree *
scikit-image *
scikit-learn *
scipy *
segment-anything *
segmenteverygrain *
shapely *
tensorflow *
torch *
torchvision *
tqdm *

environment.yml conda

jupyterlab
matplotlib
networkx
numpy
pandas
pip
pyqt
python 3.9.*
scikit-image
scikit-learn
scipy
shapely
tqdm

Segmenteverygrain

Science Score: 93.0%

Scientific Fields

Repository

Basic Info

Statistics

Metadata Files

README.md

segmenteverygrain

Description

Requirements

Documentation

Installation

Getting started

Running times

Contributing

Reporting Issues

Acknowledgements

License

Owner

JOSS Publication

Segmenteverygrain: A Python module for segmentation of grains in images

Authors

Editor

Tags

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Packages

pypi.org: segmenteverygrain

Rankings

Maintainers (1)

Dependencies