
Introduction to Image Processing, Computer Vision and Deep Learning

Table of Contents


1. Find Contour

1) Draw Contour


Follow the steps:

  1. RGB -> Grayscale -> Binary
# Grayscale
grayImage = cv2.cvtColor(originalImage, cv2.COLOR_BGR2GRAY)
# Binary
(thresh, blackAndWhiteImage) = cv2.threshold(grayImage, 127, 255, cv2.THRESH_BINARY)
  2. Remember to use a Gaussian blur to remove noise.
blur = cv2.GaussianBlur(gray, (kernel_size, kernel_size), 0)
  3. Use an edge detection function (e.g. cv2.Canny) to get better results.
edges = cv2.Canny(blur_gray, low_threshold, high_threshold)
  4. Use cv2.findContours.
image, contours, hierarchy = cv2.findContours(image, mode, method[, contours[, hierarchy[, offset]]])
  5. Use cv2.drawContours.
cv2.drawContours(image, contours, contourIdx, color[, thickness[, lineType[, hierarchy[, maxLevel[, offset]]]]])

Example code:

image = cv2.imread('./Datasets/Q1_Image/coin01.jpg')
gray_image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
(thresh, binary) = cv2.threshold(gray_image, 127, 255, cv2.THRESH_BINARY)
gaussian = cv2.GaussianBlur(binary, (11, 11), 0)
edge_image = cv2.Canny(gaussian, 127, 127)
edge_image, contours, hierarchy = cv2.findContours(edge_image, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
image_copy = image.copy()
cv2.drawContours(image_copy, contours, -1, (0, 0, 255), 2)
# image_copy[edge_image > 0.01 * edge_image.max()] = [0, 0, 255]
cv2.namedWindow('coin01')
cv2.imshow('coin01', image_copy)

2) Count Coins


count = len(contours)

2. Camera Calibration

Relationship: 2D image point = Intrinsic matrix x Extrinsic matrix x 3D world point

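The projection relationship above can be sketched in plain NumPy. All numbers here (focal lengths, principal point, translation, the 3D point) are illustrative assumptions, not values from this assignment:

```python
import numpy as np

# Intrinsic matrix K (illustrative focal lengths and principal point)
K = np.array([[1000.0, 0.0, 512.0],
              [0.0, 1000.0, 512.0],
              [0.0, 0.0, 1.0]])

# Extrinsic matrix [R | t]: identity rotation, camera 5 units from the plane
R = np.eye(3)
t = np.array([[0.0], [0.0], [5.0]])
Rt = np.hstack((R, t))  # 3x4

# A 3D world point in homogeneous coordinates
X = np.array([1.0, 2.0, 0.0, 1.0])

# s * [u, v, 1]^T = K [R | t] X; divide by the last component to get pixels
p = K @ Rt @ X
u, v = p[0] / p[2], p[1] / p[2]
print(u, v)  # 712.0 912.0
```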

1) Corner detection


Follow the steps:

  1. Given 15 chessboard images
  2. Read the images and convert to grayscale
# Grayscale
grayImage = cv2.cvtColor(chess_board_image, cv2.COLOR_BGR2GRAY)
  3. Find the chessboard corners
# Enter the number of inside corners as (x, y) = (nx, ny)
ret, corners = cv2.findChessboardCorners(grayImage, (nx, ny), None)
  4. Draw and display the corners
cv2.drawChessboardCorners(chess_board_image, (nx, ny), corners, ret)

Example code:

chess_images = glob.glob('./Datasets/Q2_Image/*.bmp')
# Select any index to grab an image from the list
for i in range(len(chess_images)):
    # Read in the image
    chess_board_image = cv2.imread(chess_images[i])
    # Convert to grayscale
    gray = cv2.cvtColor(chess_board_image, cv2.COLOR_BGR2GRAY)
    # Find the chessboard corners
    ny = 8
    nx = 11
    ret, corners = cv2.findChessboardCorners(gray, (nx, ny), None)
    # If found, draw corners
    if ret == True:
        # Draw and display the corners
        cv2.drawChessboardCorners(chess_board_image, (nx, ny), corners, ret)
        result_name = 'board' + str(i+1) + '.bmp'
        cv2.imwrite(result_name, chess_board_image)
        cv2.namedWindow("%s" % (i+1))
        cv2.imshow("%s" % (i+1), chess_board_image)
cv2.waitKey(0)
cv2.destroyAllWindows()

2) Find the intrinsic matrix

Find the intrinsic matrix:


Follow the steps:

  1. Set the termination criteria
criteria = (cv2.TERM_CRITERIA_EPS + cv2.TERM_CRITERIA_MAX_ITER, 30, 0.001)
  2. Prepare object points, like (0,0,0), (1,0,0), (2,0,0), ..., (6,5,0)
objp = np.zeros((11 * 8, 3), np.float32)
objp[:, :2] = np.mgrid[0:8, 0:11].T.reshape(-1, 2)
  3. Arrays to store object points and image points from all the images
objpoints = []  # 3d points in real world space
imgpoints = []  # 2d points in image plane
  4. Read every image, convert to grayscale, and find the chessboard corners
for i in range(len(chess_images)):
    # Read in the image
    image = cv2.imread(chess_images[i])
    # Convert to grayscale
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    # Find the chessboard corners
    ret, corners = cv2.findChessboardCorners(gray, (8, 11), None)
  5. Refine the corner locations to sub-pixel accuracy
corners2 = cv2.cornerSubPix(gray, corners, (7, 7), (-1, -1), criteria)
  6. Use cv2.calibrateCamera
ret, mtx, dist, rvecs, tvecs = cv2.calibrateCamera(objpoints, imgpoints, gray.shape[::-1], None, None)

Example code:

# termination criteria
criteria = (cv2.TERM_CRITERIA_EPS + cv2.TERM_CRITERIA_MAX_ITER, 30, 0.001)
# prepare object points, like (0,0,0), (1,0,0), (2,0,0) ...., (6,5,0)
objp = np.zeros((11 * 8, 3), np.float32)
objp[:, :2] = np.mgrid[0:8, 0:11].T.reshape(-1, 2)
# Arrays to store object points and image points from all the images.
objpoints = []  # 3d points in real world space
imgpoints = []  # 2d points in image plane
chess_images = glob.glob('./Datasets/Q2_Image/*.bmp')
# Select any index to grab an image from the list
for i in range(len(chess_images)):
    # Read in the image
    image = cv2.imread(chess_images[i])
    # Convert to grayscale
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    # Find the chessboard corners
    ret, corners = cv2.findChessboardCorners(gray, (8, 11), None)
    if ret == True:
        objpoints.append(objp)
        corners2 = cv2.cornerSubPix(gray, corners, (7, 7), (-1, -1), criteria)
        imgpoints.append(corners2)
# gray.shape[::-1] = (2048, 2048)
ret, mtx, dist, rvecs, tvecs = cv2.calibrateCamera(objpoints, imgpoints, (2048, 2048), None, None)
print(mtx)
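The `np.mgrid` expression used to build `objp` can be non-obvious, so here is a small standalone check of the layout it produces: x runs 0..7 across each row of 8 corners, then y increments, with z fixed at 0 (the board plane):

```python
import numpy as np

# Object points for an 8x11 grid of inner corners, all on the z = 0 plane
objp = np.zeros((11 * 8, 3), np.float32)
objp[:, :2] = np.mgrid[0:8, 0:11].T.reshape(-1, 2)

print(objp[:3])  # (0,0,0), (1,0,0), (2,0,0), ...
print(objp[8])   # (0,1,0): x wraps around after 8 corners
```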

3) Find the extrinsic matrix

Extrinsic matrix:

Follow the steps:

  1. Follow 2) Find the intrinsic matrix
  2. Use cv2.Rodrigues to convert between the rotation vector and the rotation matrix
  3. Use np.hstack to stack the arrays horizontally

Bonus: np.vstack() stacks arrays vertically.

Example code:

# gray.shape[::-1] = (2048, 2048)
ret, mtx, dist, rvecs, tvecs = cv2.calibrateCamera(objpoints, imgpoints, (2048, 2048), None, None)
# num is the 1-based index of the selected image
R = cv2.Rodrigues(rvecs[num-1])  # R[0] is the 3x3 rotation matrix
ext = np.hstack((R[0], tvecs[num-1]))  # 3x4 extrinsic matrix [R | t]
print(ext)
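To see what `cv2.Rodrigues` is doing, the rotation-vector-to-matrix direction can be written out with the Rodrigues formula in plain NumPy. This is a sketch for intuition, not the OpenCV implementation, and the rotation/translation values are made up for the example:

```python
import numpy as np

def rodrigues(rvec):
    """Rotation vector -> 3x3 rotation matrix via the Rodrigues formula."""
    theta = np.linalg.norm(rvec)  # rotation angle = vector magnitude
    if theta < 1e-12:
        return np.eye(3)
    k = (np.asarray(rvec, dtype=float) / theta).ravel()  # unit rotation axis
    # Skew-symmetric cross-product matrix of k
    K = np.array([[0.0, -k[2], k[1]],
                  [k[2], 0.0, -k[0]],
                  [-k[1], k[0], 0.0]])
    return np.eye(3) + np.sin(theta) * K + (1 - np.cos(theta)) * (K @ K)

# Illustrative pose: rotate pi/2 about the z-axis, translate by (10, 20, 30)
rvec = np.array([0.0, 0.0, np.pi / 2])
tvec = np.array([[10.0], [20.0], [30.0]])

R = rodrigues(rvec)
ext = np.hstack((R, tvec))  # 3x4 extrinsic matrix [R | t]
print(ext.shape)  # (3, 4)
```

A quick sanity check: a pi/2 rotation about z should send the x-axis to the y-axis, i.e. `R @ [1, 0, 0]` is approximately `[0, 1, 0]`.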

4) Find the distortion matrix

Distortion coefficients (in OpenCV's order): k1, k2, p1, p2, k3

Follow the steps:

  1. Follow 2) Find the intrinsic matrix

Example code:

# gray.shape[::-1] = (2048, 2048)
ret, mtx, dist, rvecs, tvecs = cv2.calibrateCamera(objpoints, imgpoints, (2048, 2048), None, None)
print(dist)

References:

  1. https://docs.opencv.org/3.4/d9/d0c/group__calib3d.html#ga687a1ab946686f0d85ae0363b5af1d7b

3. Augmented Reality

Follow the steps:

  1. Follow 2) Find the intrinsic matrix
  2. Prepare the object points you want to draw, like (3,3,-3), (1,1,0), (3,5,0), (5,1,0)
  3. Use cv2.projectPoints to project the 3D points onto the image plane
imgpts, jac = cv2.projectPoints(axis, rvecs[i], tvecs[i], mtx, dist)
  4. Use cv2.line to draw the lines

Example code:

# termination criteria
criteria = (cv2.TERM_CRITERIA_EPS + cv2.TERM_CRITERIA_MAX_ITER, 30, 0.001)
# prepare object points, like (0,0,0), (1,0,0), (2,0,0) ...., (6,5,0)
objp = np.zeros((11 * 8, 3), np.float32)
objp[:, :2] = np.mgrid[0:8, 0:11].T.reshape(-1, 2)
# axis = np.float32([[3, 3, -3], [1, 1, 0], [3, 5, 0], [5, 1, 0]]).reshape(-1, 3)
axis = np.float32([[5, 3, -3], [7, 1, 0], [3, 3, 0], [7, 5, 0]]).reshape(-1, 3)
# Arrays to store object points and image points from all the images.
objpoints = []  # 3d points in real world space
imgpoints = []  # 2d points in image plane
chess_images = glob.glob('./Datasets/Q3_Image/*.bmp')

def draw(image, imgpts):
    image = cv2.line(image, tuple(imgpts[0].ravel()), tuple(imgpts[1].ravel()), (0, 0, 255), 5)
    image = cv2.line(image, tuple(imgpts[0].ravel()), tuple(imgpts[2].ravel()), (0, 0, 255), 5)
    image = cv2.line(image, tuple(imgpts[0].ravel()), tuple(imgpts[3].ravel()), (0, 0, 255), 5)
    image = cv2.line(image, tuple(imgpts[1].ravel()), tuple(imgpts[2].ravel()), (0, 0, 255), 5)
    image = cv2.line(image, tuple(imgpts[1].ravel()), tuple(imgpts[3].ravel()), (0, 0, 255), 5)
    image = cv2.line(image, tuple(imgpts[2].ravel()), tuple(imgpts[3].ravel()), (0, 0, 255), 5)
    return image

# Select any index to grab an image from the list
for i in range(len(chess_images)):
    # Read in the image
    image = cv2.imread(chess_images[i])
    # Convert to grayscale
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    # Find the chessboard corners
    ret, corners = cv2.findChessboardCorners(gray, (8, 11), None)
    if ret == True:
        objpoints.append(objp)  # objp = 8 * 11 object points (x, y, z)
        corners2 = cv2.cornerSubPix(gray, corners, (7, 7), (-1, -1), criteria)
        imgpoints.append(corners2)  # corners2 = each object point on the 2D image (x, y)
        # gray.shape[::-1] = (2048, 2048)
        ret, mtx, dist, rvecs, tvecs = cv2.calibrateCamera(objpoints, imgpoints, (2048, 2048), None, None)
        # project 3D points to image plane
        imgpts, jac = cv2.projectPoints(axis, rvecs[i], tvecs[i], mtx, dist)
        img = draw(image, imgpts)
        cv2.imwrite('%s_v.jpg' % i, img)
        img = cv2.resize(img, (1024, 1024), interpolation=cv2.INTER_AREA)
        cv2.namedWindow('img')
        cv2.imshow('img', img)
        cv2.waitKey(5)

References:

  1. https://docs.opencv.org/3.4/d9/d0c/group__calib3d.html#ga1019495a2c8d1743ed5cc23fa0daff8c
  2. https://opencv-python-tutroals.readthedocs.io/en/latest/py_tutorials/py_calib3d/py_pose/py_pose.html

4. Stereo Disparity Map

Original images (left & right):

Stereo Disparity map:

1) Compute disparity map

Follow the steps:

  1. open image left and right
  2. use cv2.StereoBM_create, stereo.compute

2) Calculate the depth

I ran into a problem with this part: disparity[y][x] was always 0, and I never managed to get the real value out.
So treat the code below as a reference only; it can certainly be improved.

Follow the steps:

  1. Given:
     focal_len = 2826 (pixels)
     baseline = 178 (mm)
     Cx = c_x^right - c_x^left = 123
     We know that:
     d = (disparity at your point) - Cx
     Z (depth) = focal_len * baseline / d
  2. Use cv2.setMouseCallback to capture the points you click
  3. Use cv2.rectangle to draw the clicked point
  4. Use cv2.putText to show the disparity and depth
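The arithmetic in step 1 can be checked on its own before wiring it into the mouse callback. The disparity value of 300 pixels below is made up purely for illustration; only the three constants come from the assignment:

```python
focal_len = 2826   # focal length in pixels
baseline = 178     # baseline in mm
Cx = 123           # c_x^right - c_x^left, in pixels

def depth_from_disparity(disp):
    """Z = focal_len * baseline / d, where d = disparity - Cx."""
    d = disp - Cx
    return focal_len * baseline / abs(d)

# Illustrative disparity of 300 pixels at the clicked point:
# d = 300 - 123 = 177, so Z = 2826 * 178 / 177
print(int(depth_from_disparity(300)))  # 2841
```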

Example code:

imgL = cv2.imread('./Datasets/Q4_Image/imgL.png', 0)
imgR = cv2.imread('./Datasets/Q4_Image/imgR.png', 0)
stereo = cv2.StereoBM_create(numDisparities=256, blockSize=25)
disparity = stereo.compute(imgL, imgR).astype(np.float32) / 16.0
disparity = cv2.resize(disparity, (1400, 950), interpolation=cv2.INTER_AREA)
# cv2.namedWindow('disparity')
# cv2.imshow('disparity', disparity)
# cv2.imwrite('disparity.jpg', disparity)

focal_len = 2826
baseline = 178
Cx = 123

# mouse callback function
def draw_circle(event, x, y, flags, param):
    if event == cv2.EVENT_LBUTTONDOWN:
        # print(x, ",", y)
        cv2.rectangle(disparity, (x-3, y-3), (x+3, y+3), (0, 0, 255), -1)
        dist = disparity[y][x] - Cx
        depth = int(focal_len * baseline / abs(dist))
        # print("Disparity: " + str(disparity[x][y]) + " pixels")
        # print("Depth: " + str(depth) + " mm")
        # text = disparity.copy()
        cv2.rectangle(disparity, (1100, 850), (1390, 940), (255, 255, 255), -1)
        cv2.putText(disparity, "Disparity: " + str(int(disparity[y][x])) + " pixels", (1120, 890),
                    cv2.FONT_HERSHEY_SIMPLEX, 0.7, (0, 0, 0), 2, cv2.LINE_AA)
        cv2.putText(disparity, "Depth: " + str(depth) + " mm", (1120, 930),
                    cv2.FONT_HERSHEY_COMPLEX, 0.7, (0, 0, 0), 2, cv2.LINE_AA)
        # image = np.hstack([disparity, text])
        cv2.imshow('image', disparity)

cv2.namedWindow('image')
cv2.setMouseCallback('image', draw_circle)
while(1):
    cv2.imshow('image', disparity)
    if cv2.waitKey(20) & 0xFF == 27:
        break
cv2.waitKey(0)
cv2.destroyAllWindows()