圖像閾值全域分割: 高斯混合模型 (GMM)，Python舉例－Rocky的部落格

高斯混合模型 (GMM) 是一種機器學習方法，用於估計數據的概率分佈。在圖像閾值分割的上下文中，GMM 可用於將像素值的直方圖建模為多個高斯分佈的混合。當您認為圖像有多於兩個的亮度級別（例如，前景、背景和陰影等），GMM 是一個好選擇。

首先，將圖像像素值重新整形為一個列表。
使用 sklearn 的 GaussianMixture 來擬合 GMM。您可以選擇要擬合的高斯分佈數量（例如，2 表示前景和背景）。
使用擬合的模型預測每個像素的標籤。
重新整形預測的標籤，使其具有與圖像相同的形狀。
選擇一個閾值（例如，根據預測的標籤或 GMM 的平均值）將圖像二值化。

以下是如何在 Python 中使用 GMM 進行圖像閾值分割的範例：

import cv2
import numpy as np
import matplotlib.pyplot as plt
from sklearn.mixture import GaussianMixture

# Read the newly provided image
new_provided_img = cv2.imread('/mnt/data/Camera MV-GEC2000C#C0A800C9-Snapshot-20230316161733-1979637792559_5.jpg', cv2.IMREAD_GRAYSCALE)

def gmm_threshold(image, n_components=2):
"""
Segment an image using Gaussian Mixture Model.

Parameters:
- image: 2D numpy array representing the grayscale image
- n_components: number of Gaussian distributions to fit

Returns:
- segmented image
"""
# Reshape the image to be a list of pixels
pixels = image.reshape(-1, 1)

# Fit the Gaussian Mixture Model
gmm = GaussianMixture(n_components=n_components, random_state=0).fit(pixels)

# Predict the labels for each pixel
labels = gmm.predict(pixels)

# Reshape the labels to have the same shape as the image
segmented = labels.reshape(image.shape)

# Convert segmented image to binary where the largest component is considered as foreground
binary_img = np.where(segmented == segmented.max(), 255, 0).astype(np.uint8)

return binary_img