openCV 机器学习算法配合使用效果更好

2020-05-08 本文已影响0人小小英语Merce

前面配合我司自然拼读phonics产品做了52个字母的定位识别，后面看vuforia AR里有根据图片制作特征图，试了下，用生成特征矩阵的方式更简单高效（无需生成训练图集）。当然能使用特征矩阵的图片是有限制的就是特征足够多，像我们这种色块/字母等的就不行了。

回忆起之前可以根据绘本封面图片找绘本的应用，觉得值得研究下opencv特征提取的方法，和机器学习的方法互为补充。

用了3种方法：

第一种是Hessian Affine detector with SIFT descriptor(https://github.com/perdoch/hesaff) 这个用的opencv太老了，一个开发者（https://github.com/ray0809/Hessian_sift）更新了下，可以在opencv 3.x上跑了，可以直接用后者提供的代码（demo_python.py）做对比图实验。

hesaff

第二种方法用opencv自带的surf方法：

import numpy as np

import cv2 as cv

import time

path1 = '../Hessian_sift-master/test_pics/205.jpg'

path2 = '../Hessian_sift-master/test_pics/206.jpg'

img1 = cv.imread(path1,0)

img2 = cv.imread(path2,0)

minHessian = 400

detector = cv.xfeatures2d_SURF.create(hessianThreshold=minHessian)

start_time = time.time()

keypoints1, descriptors1 = detector.detectAndCompute(img1, None)

count = len(keypoints1)

print("Detected {:.5f} points, in {:.5f} sec.".format(count, time.time() - start_time))

keypoints2, descriptors2 = detector.detectAndCompute(img2, None)

count = len(keypoints2)

print("Detected {:.5f} points, in {:.5f} sec.".format(count, time.time() - start_time))

FLANN_INDEX_KDTREE = 0

index_params = dict(algorithm = FLANN_INDEX_KDTREE, trees = 5)

search_params = dict(checks = 20) #指定递归次数

begin = time.time()

flann = cv.FlannBasedMatcher(index_params, search_params)

matches = flann.knnMatch(descriptors1, descriptors2, k=2)

# Need to draw only good matches, so create a mask

matchesMask = [[0, 0] for i in range(len(matches))]

del_same = []

mat = 0

for i, (m, n) in enumerate(matches):

# ratio test as per Lowe's paper

if m.distance < 0.5 * n.distance:

mat += 1

matchesMask[i] = [1, 0]

del_same.append(m.trainIdx)

count = len(set(del_same))

print('匹配耗时：{:.5f}, 匹配到的点个数：{}'.format(time.time() - begin, count))

draw_params = dict(#matchColor = (0, 255, 0),

#singlePointColor = (255, 0, 0),

matchesMask = matchesMask,

flags = cv.DrawMatchesFlags_NOT_DRAW_SINGLE_POINTS #0

)

img3 = cv.drawMatchesKnn(img1, keypoints1, img2, keypoints2, matches, None, **draw_params)

cv.namedWindow('img',cv.WINDOW_AUTOSIZE)

cv.imshow('img',img3)

cv.waitKey(0)

cv.destroyAllWindows()

看性能和结果，和第一种方法基本一致（minHessian取400），特征提取大概在100-200毫秒级,

(base) panyongengde:orb_demo panyongfeng$ python demo_surf.py

Detected 827.00000 points, in 0.23713 sec.

Detected 434.00000 points, in 0.42285 sec.

匹配耗时：0.02463, 匹配到的点个数：37

下图为minHessian = 5000的结果。

surf+hess

第三种方法 orb的方法，特征提取快一个量级：

import numpy as np

import cv2 as cv

import time

path1 = '../Hessian_sift-master/test_pics/204.jpg'

path2 = '../Hessian_sift-master/test_pics/205.jpg'

img1 = cv.imread(path1,1)

img2 = cv.imread(path2,1)

# Initiate ORB detector

orb = cv.ORB_create()

# find the keypoints with ORB

start_time = time.time()

kp1, des1 = orb.detectAndCompute(img1,None)

count = len(kp1)

print("Detected {:.5f} points, in {:.5f} sec.".format(count, time.time() - start_time))

start_time = time.time()

kp2, des2 = orb.detectAndCompute(img2,None)

count = len(kp2)

print("Detected {:.5f} points, in {:.5f} sec.".format(count, time.time() - start_time))

start_time = time.time()

bf = cv.BFMatcher(cv.NORM_HAMMING, crossCheck=True)

matches = bf.match(des1,des2)

print("Matches in {:.5f} sec.".format(time.time() - start_time))

matches = sorted(matches, key = lambda x:x.distance)

good = []

for match in matches:

if match.distance < 40:

good.append(match)

print(len(good))

#img3 = cv.drawMatches(img1,kp1,img2,kp2,good,None,flags=cv.DrawMatchesFlags_NOT_DRAW_SINGLE_POINTS)

img3 = cv.drawMatches(img1,kp1,img2,kp2,good,None,flags=0)

cv.namedWindow('img',cv.WINDOW_AUTOSIZE)

cv.imshow('img',img3)

cv.waitKey(0)

cv.destroyAllWindows()

基本上满足实时要求，效果比不上上面两种，但是在特定领域用起来应该蛮6⃣️的。

(base) panyongengde:orb_demo panyongfeng$ python demo_orb.py

Detected 500.00000 points, in 0.05798 sec.

Detected 500.00000 points, in 0.02815 sec.

Matches in 0.00221 sec.

orb

openCV 机器学习算法配合使用效果更好

猜你喜欢

热点阅读