123,123

2022最新計(jì)算機(jī)視覺學(xué)習(xí)路線（入門篇）

2022-03-07 15:13

磐創(chuàng)AI

關(guān)注

圖像變換

圖像變換包括平移、旋轉(zhuǎn)、縮放、裁剪和翻轉(zhuǎn)圖像。

img＝cv．imread（＇．．／input／images－for－computer－vision／tiger1．jpg＇）

width， height，＿＝img．shape

＃ Translating

M＿translate＝np．float32（［［1，0，200］，［0，1，100］］）＃ 200＝＞ Translation along x－axis and 100＝＞translation along y－axis

img＿translate＝cv．warpAffine（img，M＿translate，（height，width））

＃ Rotating

center＝（width／2，height／2）

M＿rotate＝cv．getRotationMatrix2D（center， angle＝90， scale＝1）

img＿rotate＝cv．warpAffine（img，M＿rotate，（width，height））

＃ Scaling

scale＿percent ＝ 50

width ＝ int（img．shape［1］＊ scale＿percent ／ 100）

height ＝ int（img．shape［0］＊ scale＿percent ／ 100）

dim ＝（width， height）

img＿scale ＝ cv．resize（img， dim， interpolation ＝ cv．INTER＿AREA）

＃ Flipping

img＿flip＝cv．flip（img，1）＃ 0：Along horizontal axis， 1：Along verticle axis，－1： first along verticle then horizontal

＃ Shearing

srcTri ＝ np．a(chǎn)rray（［［0， 0］，［img．shape［1］－ 1， 0］，［0，img．shape［0］－ 1］］）．a(chǎn)stype（np．float32）

dstTri ＝ np．a(chǎn)rray（［［0， img．shape［1］＊0．33］，［img．shape［1］＊0．85， img．shape［0］＊0．25］，［img．shape［1］＊0．15， img．shape［0］＊0．7］］）．a(chǎn)stype（np．float32）

warp＿mat ＝ cv．getAffineTransform（srcTri， dstTri）

img＿warp ＝ cv．warpAffine（img， warp＿mat，（height， width））

myplot（［img， img＿translate， img＿rotate， img＿scale， img＿flip， img＿warp］，

［＇Original Image＇，＇Translated Image＇，＇Rotated Image＇，＇Scaled Image＇，＇Flipped Image＇，＇Sheared Image＇］）

圖像預(yù)處理

閾值處理：在閾值處理中，小于閾值的像素值變?yōu)?0（黑色），大于閾值的像素值變?yōu)?255（白色）。

我將閾值設(shè)為 150，但你也可以選擇任何其他數(shù)字。

＃ For visualising the filters

import plotly．graph＿objects as go

from plotly．subplots import make＿subplots

def plot＿3d（img1， img2， titles）：

fig ＝ make＿subplots（rows＝1， cols＝2，
specs＝［［｛＇is＿3d＇： True｝，｛＇is＿3d＇： True｝］］，
subplot＿titles＝［titles［0］， titles［1］］，
）

x， y＝np．mgrid［0：img1．shape［0］， 0：img1．shape［1］］

fig．a(chǎn)dd＿trace（go．Surface（x＝x， y＝y(tǒng)， z＝img1［：，：，0］）， row＝1， col＝1）

fig．a(chǎn)dd＿trace（go．Surface（x＝x， y＝y(tǒng)， z＝img2［：，：，0］）， row＝1， col＝2）

fig．update＿traces（contours＿z＝dict（show＝True， usecolormap＝True，
highlightcolor＝＂limegreen＂， project＿z＝True））

fig．show（）

img＝cv．imread（＇．．／input／images－for－computer－vision／simple＿shapes．png＇）

＃ Pixel value less than threshold becomes 0 and more than threshold becomes 255

＿，img＿threshold＝cv．threshold（img，150，255，cv．THRESH＿BINARY）

plot＿3d（img， img＿threshold，［＇Original Image＇，＇Threshold Image＝150＇］）

應(yīng)用閾值后，150 的值變?yōu)榈扔?255

過(guò)濾： 圖像過(guò)濾是通過(guò)改變像素的值來(lái)改變圖像的外觀。每種類型的過(guò)濾器都會(huì)根據(jù)相應(yīng)的數(shù)學(xué)公式更改像素值。我不會(huì)在這里詳細(xì)介紹數(shù)學(xué)，但我將通過(guò)在 3D 中可視化它們來(lái)展示每個(gè)過(guò)濾器的工作原理。

limg＝cv．imread（＇．．／input／images－for－computer－vision／simple＿shapes．png＇）

＃ Gaussian Filter

ksize＝（11，11）＃ Both should be odd numbers

img＿guassian＝cv．GaussianBlur（img， ksize，0）

plot＿3d（img， img＿guassian，［＇Original Image＇，＇Guassian Image＇］）

＃ Median Filter

ksize＝11

img＿medianblur＝cv．medianBlur（img，ksize）

plot＿3d（img， img＿medianblur，［＇Original Image＇，＇Median blur＇］）

＃ Bilateral Filter

img＿bilateralblur＝cv．bilateralFilter（img，d＝5， sigmaColor＝50， sigmaSpace＝5）

myplot（［img， img＿bilateralblur］，［＇Original Image＇，＇Bilateral blur Image＇］）

plot＿3d（img， img＿bilateralblur，［＇Original Image＇，＇Bilateral blur＇］）

高斯濾波器：通過(guò)去除細(xì)節(jié)和噪聲來(lái)模糊圖像。

中值濾波器：非線性過(guò)程可用于減少脈沖噪聲或椒鹽噪聲

雙邊濾波器：邊緣保留和降噪平滑。

簡(jiǎn)單來(lái)說(shuō)，過(guò)濾器有助于減少或去除亮度或顏色隨機(jī)變化的噪聲，這稱為平滑

特征檢測(cè)

特征檢測(cè)是一種通過(guò)計(jì)算圖像信息的抽象，在每個(gè)圖像點(diǎn)上做出局部決策的方法。例如，對(duì)于一張臉的圖像，特征是眼睛、鼻子、嘴唇、耳朵等，我們嘗試識(shí)別這些特征。

讓我們首先嘗試識(shí)別圖像的邊緣。

邊緣檢測(cè)

img＝cv．imread（＇．．／input／images－for－computer－vision／simple＿shapes．png＇）

img＿canny1＝cv．Canny（img，50， 200）

＃ Smoothing the img before feeding it to canny

filter＿img＝cv．GaussianBlur（img，（7，7）， 0）

img＿canny2＝cv．Canny（filter＿img，50， 200）

myplot（［img， img＿canny1， img＿canny2］，

［＇Original Image＇，＇Canny Edge Detector（Without Smoothing）＇，＇Canny Edge Detector（With Smoothing）＇］）

這里我們使用 Canny 邊緣檢測(cè)器，它是一種邊緣檢測(cè)算子，它使用多階段算法來(lái)檢測(cè)圖像中的各種邊緣。它由 John F． Canny 于 1986 年開發(fā)。我不會(huì)詳細(xì)介紹 Canny 的工作原理，但這里的關(guān)鍵點(diǎn)是它用于提取邊緣。

在使用 Canny 邊緣檢測(cè)方法檢測(cè)邊緣之前，我們平滑圖像以去除噪聲。正如你從圖像中看到的，平滑后我們得到清晰的邊緣。

輪廓

img＝cv．imread（＇．．／input／images－for－computer－vision／simple＿shapes．png＇）

img＿copy＝img．copy（）

img＿gray＝cv．cvtColor（img，cv．COLOR＿BGR2GRAY）

＿，img＿binary＝cv．threshold（img＿gray，50，200，cv．THRESH＿BINARY）

＃Edroing and Dilating for smooth contours

img＿binary＿erode＝cv．erode（img＿binary，（10，10）， iterations＝5）

img＿binary＿dilate＝cv．dilate（img＿binary，（10，10）， iterations＝5）

contours，hierarchy＝cv．findContours（img＿binary，cv．RETR＿TREE， cv．CHAIN＿APPROX＿SIMPLE）

cv．drawContours（img， contours，－1，（0，0，255），3）＃ Draws the contours on the original image just like draw function

myplot（［img＿copy， img］，［＇Original Image＇，＇Contours in the Image＇］）