Introduction

Extracting quantitative information from image data is a major step in many fields of research. Prior to the last decade, state-of-the-art algorithms typically focused on highly specific use cases, such as tracking spherical particles1 or identifying astronomical light sources2. These algorithms were typically task specific—aiming to identify predefined features—as opposed to machine learning algorithms, which are more adaptive. In fact, reviews as late as 2015 did not even mention machine learning (ML)3, and progress is still being made in this ___domain today4. Since the introduction of AlexNet5 in 2012, the capability of ML methods in this arena has advanced at a breathtaking pace, fueled largely by the success of convolutional neural networks (CNNs)6. This class of techniques allows a more general approach to the quantification of image data, including addressing more nuanced and harder-to-formulate questions, because it requires only correct examples as training data. More specifically, the task of segmenting an image—identifying the pixels that comprise one or more objects or regions of interest—has become a large focus7, as it allows researchers to rapidly and deeply analyze complex data. While state-of-the-art benchmarks in this ___domain (e.g. ML Commons) require enormous computation and are thus out of reach for even a skilled individual user, software tools like Keras8, an Application Programming Interface (API) for Python, greatly simplify the process of creating smaller, custom neural-network solutions, in principle in just a few lines of code. However, in practice the process is rarely that simple, and for those unfamiliar with deep neural networks, many pieces of the process become daunting: optimizing the many user-defined “hyper-parameters” of the algorithm, picking the right network, cleaning the data, and possibly learning a new programming language can each require substantial additional effort.

As a result, a large and recent body of work has focused on methods and software packages for simplifying this process. The majority targets biological research, specifically the tracking of cells from microscopy data9,10,11,12,13,14,15,16, but similar works tackle goals ranging from identifying and tracking 2D materials like graphene17 to segmenting other medical or biological imaging data18,19,20,21, images of flora and fauna22, scanning electron microscopy images for materials science23,24, astronomical data25,26, particle physics data27, and more. Typically these works compete for the highest accuracy on benchmark data sets11 or for ease of use in pre-specified domains (very often biological data)9,10. While many of these methods are likely applicable to tasks outside of their intended application, e.g.15,21, few are explicitly designed for general use, and they often require preexisting image analysis software such as ImageJ and present a menu of options that can be intimidating to those unfamiliar with machine learning methods. At the opposite end of the spectrum, packages designed for ‘zero-shot’ (no user-input) segmentation have become increasingly powerful28, but they lack the malleability needed in many custom research scenarios.

Here we introduce an easy-to-use segmentation solution aimed at a broad array of research applications, named “Bellybutton.” Bellybutton uses a 15-layer convolutional neural network that can be trained on as little as one image (or a sub-sampling of one image) with user-defined segmentation, and it can account for variations in size, lighting, rotation, focus, or shape of the desired segmentation regions, as is common in research applications. The algorithm operates on a pixel-by-pixel basis, determining whether each pixel is inside or outside of a segmentation (‘innies’ or ‘outies,’ hence the name Bellybutton). The algorithm can analyze input images of varying shape and size, and it automatically performs a variety of data augmentations, including flipping and rotating images, normalizing brightness across images, and evenly sampling innies and outies. Bellybutton requires no coding knowledge, and it can be trained and run on a laptop. We detail its performance and flexibility through several use cases: segmenting bubbles imaged with poor lighting and focus, segmenting semi-transparent, tightly packed particles with intricate birefringence patterns, and tracking a thin, clear lattice of material as it fractures over time. Each of these data sets is available online, along with a guide for Bellybutton’s use on new data sets.

Method

Bellybutton operates on a pixel-by-pixel basis, scanning images and using the neighborhood around a given point in an image to determine whether a pixel is inside or outside of a segment, as well as how far it is from that segment’s edge. It uses a deep convolutional neural network (CNN), whose structure is shown schematically in Fig. 1A. The CNN consists of \(3\times 3\) convolutional layers, \(2\times 2\) max pooling layers, and skip connections inspired by ResNet29, and it ends with four dense layers feeding into two outputs—a classification of pixel type (inside or outside a region), and a distance-from-region-edge scalar value, which is used to separate distinct regions in contact. The scalar value is trained to vary from 0 (for all outside pixels) to a maximum value set by the user (typically 10), allowing the system to localize region edges while easily satisfying this output where it is unimportant, for example in the center of a 100-pixel-wide region. We use a binary cross-entropy loss for the classification output and a mean absolute error loss for the scalar distance output, with equal weighting between the two. Because the distance map is ultimately used to separate contacting regions, accuracy at low distances is most important, making mean absolute error a superior choice to the more standard mean squared error. Bellybutton is built on Tensorflow30 and uses the Adam optimizer for training with a learning rate of 0.001.
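
For illustration, a minimal Keras sketch of such a two-headed network is shown below. The filter counts, dense-layer widths, and exact skip-connection placement are illustrative assumptions rather than the precise Bellybutton architecture; only the overall pattern (paired \(3\times 3\) convolutions, \(2\times 2\) pooling, four dense layers, two outputs, and the losses and optimizer named above) follows the text.

```python
# A minimal sketch of a two-headed, Bellybutton-like network in Keras. Filter
# counts, dense-layer widths, and skip-connection placement are illustrative
# assumptions; only the overall pattern (3x3 convs, 2x2 pooling, four dense
# layers, two outputs, BCE + MAE losses, Adam at 1e-3) follows the text.
import tensorflow as tf
from tensorflow.keras import layers, Model

n_scales = 4  # e.g. 1, 3, 9, and 27x windows, each resized to 25x25 pixels
inputs = layers.Input(shape=(25, 25, n_scales))

x = inputs
for filters in (24, 48, 96):                              # three conv-conv-pool blocks
    y = layers.Conv2D(filters, 3, padding="same", activation="relu")(x)
    y = layers.Conv2D(filters, 3, padding="same", activation="relu")(y)
    skip = layers.Conv2D(filters, 1, padding="same")(x)   # 1x1 conv to match channels for the skip
    x = layers.Add()([y, skip])
    x = layers.MaxPooling2D(2)(x)                         # spatial size 25 -> 12 -> 6 -> 3

x = layers.Flatten()(x)
for units in (128, 64, 32, 16):                           # four dense layers (widths assumed)
    x = layers.Dense(units, activation="relu")(x)

classification = layers.Dense(1, activation="sigmoid", name="innie")(x)  # inside/outside a region
distance = layers.Dense(1, activation="relu", name="dist")(x)            # capped distance to region edge

model = Model(inputs, [classification, distance])
model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=1e-3),
    loss={"innie": "binary_crossentropy", "dist": "mean_absolute_error"},
    loss_weights={"innie": 1.0, "dist": 1.0},
)
```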

The chosen network architecture strikes a balance between being small enough to train rapidly from scratch on a laptop and large enough to generate valid segmentations of nontrivial problems. The choice of a CNN has been the standard for segmentation problems6,12,14,18,20,22,23,24,25,26, as it gives the network natural access to spatial information. The decreasing layer size is also standard, and gives the network sufficient flexibility to hierarchically analyze spatial patterns without superfluous parameters. The network itself takes multiple differently sized subsets of an image as input, each centered on the pixel in question and down-sampled to \(25\times 25\) pixels. This sampling process is performed automatically during training and prediction, and gives the network the ability to analyze multiple length scales while keeping the input size minimal. A typical example using 1, 3, 9, and 27x scales is shown in Fig. 1A, B.
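
As a concrete illustration of this multi-scale sampling, a minimal sketch is given below; the function name, the reflective edge padding, and the use of scikit-image’s resize for down-sampling are assumptions for illustration, not the package’s exact choices.

```python
# Illustrative sketch of multi-scale sampling: windows of 1, 3, 9, and 27x the
# base size are cut around the target pixel, each resized to 25x25, and stacked
# along the channel axis. Padding mode and interpolation are assumed choices.
import numpy as np
from skimage.transform import resize

def multiscale_patch(image, row, col, base=25, scales=(1, 3, 9, 27)):
    channels = []
    for s in scales:
        half = (base * s) // 2
        padded = np.pad(image, half, mode="reflect")      # keep pixels near the image edge usable
        window = padded[row:row + 2 * half + 1, col:col + 2 * half + 1]
        channels.append(resize(window, (base, base)))     # down-sample to 25x25
    return np.stack(channels, axis=-1)                    # shape (25, 25, len(scales))
```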

For training, a user may provide individually labeled segmentation maps; that is, every pixel in a particular segment contains the same number, unique to that segment. Alternatively, if no segments are in contact, a user-provided binary mask is sufficient. In either case, Bellybutton generates two labels for each pixel: a binary classification label that corresponds to ‘innie’ or ‘outie’, which does not distinguish between uniquely labeled regions, and a scalar label, the distance (in pixels) to the nearest edge of a region. It is these two labels that the CNN is trained to reproduce. Optionally, the user may exclude regions of an image using a binary Area of Interest (AOI) mask, as indicated by the excluded gray area in Fig. 1C.
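
For the simple case of a binary mask with no touching segments, these two per-pixel labels can be generated with SciPy’s Euclidean distance transform, as in the following illustrative sketch (the cap of 10 pixels mirrors the typical user setting mentioned above; this is not the package’s internal code):

```python
# Sketch of the two per-pixel training labels for a binary mask with no
# touching segments: a 0/1 innie-outie label and a capped distance to the
# nearest region edge. The 10-pixel cap is the typical user setting.
import numpy as np
from scipy import ndimage

def make_labels(mask, max_dist=10):
    innie = (mask > 0).astype(np.float32)          # 1 inside a region, 0 outside
    dist = ndimage.distance_transform_edt(innie)   # pixels to the nearest outside pixel
    dist = np.minimum(dist, max_dist)              # cap so deep interiors are easy to satisfy
    return innie, dist.astype(np.float32)
```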

To avoid prolonged training, the user may choose to train using only a fraction of the available training data. We find that near-optimal results are often reached without using all available pixels (see Fig. 2E). Furthermore, rotated and flipped images are (optionally) used in training to prevent overfitting. Once trained, Bellybutton produces a classification score ranging from 0 (outside) to 1 (inside a region) for each pixel (trained on the binary label), shown in Fig. 1D. This score is thresholded to produce a binary innie-vs-outie map. Finally, the scalar distance output, shown in Fig. 1E, is used to watershed the ‘innie’-classified pixels into distinct regions to produce a segmented map, as in Fig. 1F. The data used in this figure, images of aqueous foams in microgravity, come from Ref. 31, which was the first work to utilize Bellybutton.
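
A minimal sketch of this post-processing step, using scikit-image’s watershed and peak finding with assumed threshold and seed-spacing values, might read:

```python
# Post-processing sketch: threshold the class score, then watershed the
# predicted distance map to split touching regions. The 0.5 threshold and the
# peak-finding parameters are illustrative choices.
import numpy as np
from skimage.segmentation import watershed
from skimage.feature import peak_local_max

def segment(class_score, dist_map, threshold=0.5):
    innie = class_score > threshold                            # binary innie/outie map
    peaks = peak_local_max(dist_map, labels=innie.astype(int), min_distance=3)
    markers = np.zeros(dist_map.shape, dtype=int)
    markers[tuple(peaks.T)] = np.arange(1, len(peaks) + 1)     # one seed per candidate region
    return watershed(-dist_map, markers=markers, mask=innie)   # labeled segmentation map
```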

Figure 1

The Bellybutton Method (A) Architecture of the 15-layer convolutional neural network. Multiple scales of an experimental image, each reduced to \(25\times 25\) pixels, are simultaneously taken as a single input. The network consists of two \(3\times 3\) convolutional layers followed by a \(2\times 2\) max pooling layer. This pattern is repeated twice more, each time with skip connections as shown. The final \(2\times 2\times 96\) layer is flattened, fed through four dense layers, and produces two output scalars, one signifying the class of the pixel (inside or outside of a region), the other the distance to the nearest region edge. (B) An example experimental image, overlaid with the chosen input scales 1, 3, 9, and 27x. (C) User-defined mask, in this case binary as no segments are in contact. The user may also define an Area of Interest (AOI), which in this example removes the edges of the image (gray) from training. (D) Class probability output after training. The network generates a prediction score on a pixel-by-pixel basis. (E) Distance map to the outside of a particle (scalar output of the network). Training values are capped at a user-specified value, in this case 10 pixels, so much of the image appears binary. The zoomed-in region highlights the gray-scale output near the edges of the bubbles. (F) The final segmentation is produced by watershedding the binarized classification probability (D) using the distance map (E). (D) and (E) are also saved if desired.

Example uses

Bellybutton is effective for a variety of purposes. Here we use the example of segmenting a 3D-printed photoelastic material in the shape of a granular packing. This material is illuminated between crossed polarizers such that it develops a birefringence pattern when under mechanical stress. This lighting is useful experimentally, but it complicates the tracking process; previous experiments using photoelastic granular disks have required two sets of images, one with regular lighting to track particles and a second with the birefringence pattern to analyze forces32. Bellybutton was trained on two of the four sections of each of three images of this system, under low, medium, and high stress, and tested on the remaining two sections of each image, shaded purple in Fig. 2A. While remaining roughly the same shape, the particles present a wide variety of patterns as the stress changes. Furthermore, a variety of confounding factors make this segmentation more difficult: a substantial portion of the image (the left and right edges) is out of focus; the camera is close enough to the sample that only particles in the center are imaged head-on, leading to different viewing angles for particles near the edges of the system; and particles near the left and right edges are tilted sufficiently that their edges are exposed to the camera.

The input scales used are shown in Fig. 2B, overlaid on zoomed-in data. Segmentation is successful, with the majority of errors concentrated at the bottom of the leftmost image, where contrast and focus are worst. Typical regions are successfully segmented, as seen by comparing Fig. 2C, D, taken from the test set.

For quantitative analysis of these results, we utilize the SEG score from Ref. 11, which compares each true region with the identified region of highest overlap. We find this metric to be the most indicative of performance by eye, although many others are commonly used7,11. For each true region \(R_i\), a ‘Jaccard index’ is calculated with the Bellybutton-generated region \(B_i\) of highest overlap, by dividing the area of their intersection by the area of their union. True regions whose intersection with that region covers less than half of their area are given a score of 0. The reported SEG score is the average of all such scores for a given dataset, with a perfect score being 1. A detailed explanation of the calculation can be found at celltrackingchallenge.net. Bellybutton reliably exceeded a SEG score of 0.9 on the test set for this data.
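
A minimal sketch of this calculation, assuming integer-labeled maps with 0 as background, is given below; it follows the description above rather than the reference implementation at celltrackingchallenge.net.

```python
# Sketch of the SEG score as described in the text: each true region is matched
# to the predicted label it overlaps most, scored by Jaccard index, and scored
# 0 unless that overlap covers at least half the true region's area.
import numpy as np

def seg_score(true_labels, pred_labels):
    scores = []
    for r in np.unique(true_labels):
        if r == 0:                                   # skip background
            continue
        region = true_labels == r
        ids, counts = np.unique(pred_labels[region], return_counts=True)
        keep = ids != 0                              # ignore overlap with background
        if not keep.any():
            scores.append(0.0)
            continue
        best = ids[keep][np.argmax(counts[keep])]    # generated region of highest overlap
        inter = counts[keep].max()
        if inter < 0.5 * region.sum():               # must cover at least half the true region
            scores.append(0.0)
            continue
        union = region.sum() + (pred_labels == best).sum() - inter
        scores.append(inter / union)
    return float(np.mean(scores)) if scores else 0.0
```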

In the highlighted example the entire training set was used, and the network was trained for \(E=2\) epochs (each training data point was shown to the network twice). For practical use, however, it may not be necessary to use even this much data (half of each of three images), as shown in Fig. 2E. A sub-sampling option is provided as a parameter in the Bellybutton package, named ‘fraction.’ This value indicates the fraction, in the range (0, 1], of available training pixels that the algorithm will use to train the neural network. For values below 1, individual pixels are chosen randomly, but at a rate that ensures innies and outies are equally represented. (This can also be modified easily via the parameters of the algorithm, to instead represent innies and outies in the ratio in which they appear in the images.) We find that accuracy for a variety of problems depends on the quantity \(EF = T/M\) being sufficiently high, where E is the number of epochs in training, M is the size of the total training set, F is the fraction of the training set that is used, and \(T=EFM\) is the total number of training steps. This dependence is shown by the data collapse in Fig. 2E. As a result, smaller data fractions F can be used to quickly assess the tractability of a problem. In this example, even tiny fractions of the training data still yield passable results, including a total training set corresponding to only 0.5% of each training image (F = 0.01, as half of each image is reserved for testing), as seen by the modest dependence of SEG on data fraction in Fig. 2F. Further, the small variance in test results for F = 0.01 suggests that for this dataset our algorithm is robust to variation in training data and thus to the various sources of noise that plague these images: lighting, focus, particle-size variation, etc. However, for optimal results, a larger fraction of the data must be used, to give the network access to a wider variety of examples. Overall, more data is typically better, but we often find that \(F\ge 0.1\) gives reasonable results for systems with many repeated particles, like the one shown in Fig. 2. An important caveat is that these training data should be taken from a sufficiently varied set of images and locations within those images to encompass the range of the desired data set.
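
For illustration, such class-balanced sub-sampling might be implemented as in the sketch below; the function and variable names are hypothetical and not the package’s internals. Note that in this scheme, F = 0.01 with E = 3 and F = 0.03 with E = 1 correspond to the same training budget \(T = EFM\).

```python
# Illustrative class-balanced sub-sampling of training pixels: pick a fraction
# F of pixels at random while keeping innies and outies equally represented.
# Names are hypothetical, not the Bellybutton package's internals.
import numpy as np

def subsample_pixels(innie_label, fraction, seed=0):
    rng = np.random.default_rng(seed)
    inside = np.argwhere(innie_label == 1)
    outside = np.argwhere(innie_label == 0)
    n_each = int(fraction * innie_label.size) // 2   # equal innie/outie representation
    pick_in = inside[rng.choice(len(inside), size=min(n_each, len(inside)), replace=False)]
    pick_out = outside[rng.choice(len(outside), size=min(n_each, len(outside)), replace=False)]
    return np.vstack([pick_in, pick_out])            # (row, col) pairs used for training
```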

Bellybutton is also useful for structure-finding. In the following example, a lattice of laser-cut acrylic (polymethyl methacrylate, or PMMA) is slowly fractured while lit between crossed polarizers to reveal changes in internal stress. These changes to the material’s structure affect its brightness, as shown in Fig. 3A, E, making it difficult to track algorithmically. Using just three training images with human-generated masks (Fig. 3B), Bellybutton is capable of tracking the fracturing structure on unseen test images (Fig. 3C, D) through time, despite lighting and focus changes, as shown for a zoomed-in portion in Fig. 3E, F. While all images in this example are broadly similar, the important information about the system—the lattice structure—changes significantly, indicated only by subtle shifts in edge locations and lighting (e.g. the breakage at the bottom of the images in Fig. 3E, F between t=300 and t=400). This makes hard-coded algorithmic detection difficult and extensive human labeling time-consuming. We note that our package includes options for saving a binarized innie-vs-outie output or a scalar distance-to-edge output, the latter of which is shown in Fig. 3F. This option can be helpful for skeletonizing a structure and for suppressing noise and error.
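
For example, one might threshold the saved distance-map output and skeletonize the result with scikit-image to recover the lattice network; the 1-pixel threshold below is an assumed, illustrative value.

```python
# One possible use of the saved distance-map output for structure finding:
# threshold it and skeletonize the result. The threshold is illustrative.
from skimage.morphology import skeletonize

def lattice_skeleton(dist_map, threshold=1.0):
    structure = dist_map > threshold     # keep pixels confidently inside the lattice
    return skeletonize(structure)        # one-pixel-wide skeleton of the network
```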

Figure 2

3D Printed Photoelastic Disks (A) Images of a 3D-printed photoelastic material in the shape of a granular packing under three stress states (high, medium, low). Each was divided into four sections, two of which (gray) were used for training, and two (purple) for evaluation. A single network was trained using all six training regions and tested on all six test regions. (B) Zoom-in on the orange-framed region in (A). Note the variety of lighting patterns on each disk. Teal and blue superimposed squares are the image scales fed into the network for this task. (C) User-generated masks for these zoomed-in regions (which are part of the test set). (D) Final segmentation output for the zoomed-in region. Note that the colors serve only to differentiate regions; there is no attempt to match the colors between (C) and (D). (E) SEG score for the test set as a function of Epochs times Training Fraction EF. Training fraction F is denoted by color, and is the portion of the training data used in training the network, with each data point shown to the network once per epoch. SEG score is an indicator of segmentation quality, and is calculated by dividing the intersection of generated regions and their corresponding true regions by their union, averaged over all true regions (see text for further explanation). (F) SEG score for all runs with \(EF\ge 3\) as a function of data fraction F. Note the diminishing returns on this task for high F.

Conclusion: how and when to use Bellybutton

In summary, to use Bellybutton a user supplies images and labels (masks). Bellybutton then automatically converts these into a format digestible by its CNN, including augmenting the data to aid training (e.g. adding rotated and flipped versions) and balancing classes, and trains the network using user-defined parameters. The algorithm then produces pixel-level predictions for each image, which are (automatically) spatially assembled and converted into segmented maps through a watershedding algorithm. Users may elect to have the algorithm save the CNN outputs themselves, the ‘innie’-vs-‘outie’ classification and/or the distance map, as is helpful in some cases (such as Fig. 3).
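
Tying the earlier sketches together, a hypothetical per-image pass mirroring this workflow might read as follows; it reuses the multiscale_patch and segment helpers sketched in the Method section and is not the actual Bellybutton code or its command-line interface.

```python
# Hypothetical end-to-end pass over one image, mirroring the workflow described
# in the text. Assumes the multiscale_patch and segment helpers sketched above
# and a trained two-output Keras model.
import numpy as np

def predict_image(model, image, base=25, scales=(1, 3, 9, 27)):
    rows, cols = image.shape
    class_score = np.zeros((rows, cols), dtype=np.float32)
    dist_map = np.zeros((rows, cols), dtype=np.float32)
    for r in range(rows):                                  # predict one row of pixels at a time
        patches = np.stack([multiscale_patch(image, r, c, base, scales)
                            for c in range(cols)])
        p_class, p_dist = model.predict(patches, verbose=0)
        class_score[r] = p_class[:, 0]
        dist_map[r] = p_dist[:, 0]
    return segment(class_score, dist_map)                  # watershed into labeled regions
```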

Figure 3

Tracking a changing structure with Bellybutton (A) Training images of a fracturing lattice. Image contrast and brightness have been enhanced, and the top 2/3 of each image is shown. Note that these are the only training images, but that we have spread them out in time to encompass a wide range of situations. (B) Binary mask for the third training image with superimposed area of interest (gray). (C) Example test image and (D) accompanying Bellybutton-generated distance map output. The orange square denotes the ___location of the zoomed regions in (E) and (F). (E) Zoomed-in (enhanced) images with (F) the corresponding Bellybutton-generated distance maps for many time steps.

We have tried to make Bellybutton as accessible as possible. It is downloadable as a Python package that can be installed with a single command, and utilizing Bellybutton requires no coding. Instructions for use, details on how to customize training and hyper-parameters, and much more can be found at pypi.org/project/bellybuttonseg. For Python-savvy users, the code itself and a Jupyter Notebook version are also available at github.com/sdillavou/bellybuttonseg. Starting a project is as simple as running a single command, after which Bellybutton creates a folder structure into which the user adds images, masks, and areas of interest. Adjusting the parameters of training and testing is done by editing an automatically generated text file. Furthermore, we have provided the three data sets used in the figures of this work as example projects that can be downloaded in one command, set up, and run on a laptop. Deploying one of these example projects takes under a minute, plus training time (computer dependent).

While just three examples of Bellybutton’s potential uses are shown here, its flexibility should make it useful in a wide variety of situations. For example, regions are not limited to single particles; masks might specify the two connected regions of a dimer, or a disk and a mark on its surface indicating its rotational position, as separate regions, allowing both to be segmented simultaneously. The same approach could be applied to a cell and its nucleus, an insect and its head or feet, or a particle and its previous position, allowing velocity to be approximated from single images. Regions can be used to identify particle classes as well: training masks that segment only particles of a given shape, size, or orientation will prompt Bellybutton to do the same. A broad rule of thumb is that if a region is easily identifiable by eye, it is a good candidate for Bellybutton. This class of image segmentation problems is both frustrating and common in research, and we believe that giving users an easy-to-use yet flexible method like Bellybutton will save countless hours in the lab.