R imager isoblur：它如何计算边界？是否可以根据其公式构建神经网络？

Question

我正在尝试通过图像在 R 中制作类似 Word Spotting 的东西。现在，我已经能够使用 imager 包和 isoblur 函数在单词周围设置一些界限：

document=imager::load.image("image.jpg")
plot(document)

document1=document<0.8
plot(document1)

plot(document1)
px=(isoblur(document1,1)>.3)
highlight(px)

Document

Document1

Result

这个想法来自于这项工作： https://cran.r-project.org/web/packages/imager/vignettes/pixsets.html

Isoblur 描述对理解函数背后的过程不是很有帮助，我想知道：

背后的计算是什么？
是否有可能构建一个神经网络来获得或多或少相同的结果？

Answer 1

What are the calculations behind it?

Isoblur 函数在图像上传递 Gaussian filter，使其模糊并去除一些噪点。

It is possible to construct a neural network to achieve the same result, more or less?

是的，但这是一种矫枉过正。如果你的大部分图片与显示的图片相似，你可以使用阈值和 select 每个连接的组件来分割单词，你应该看看 Otsu method。

如果你的目标是识别每个字符，我认为最好使用已经建立的OCR工具，比如Tesseract。但是你的字看起来和平常的笔迹相去甚远，你可能需要训练自己的分类器（使用神经网络来完成这个任务可能是一个不错的选择）。

R imager isoblur：它如何计算边界？是否可以根据其公式构建神经网络？

R imager isoblur: how it computes the boundries? It is possible to build a neural network based on its formula?

ocr

r

image-segmentation

conv-neural-network