Vision Pt. 3
In this blog we’ll take a look at the opencv code that’ll be at the heart of the RB3 Robotic Arm, we’ll traverse block by block and see how it all works.
All the code in this blog is available at : https://github.com/ric96/RB3-RoboticArm
And once the project is finalized will be pushed to : https://github.com/96boards-projects/RB3-RoboticArm
To install all the dependencies on a Debian Buster build for RB3, run the following script: install-opencv.sh
Import all the things
We start with importing the following libraries:
import cv2 import imutils from imutils.video import WebcamVideoStream
cv2imports opencv for Python
imutilsis a wrapper around cv2 video i/o libraries to make life easier.
WebcamVideoStreamis a specific function within imutils to get frames from webcam.
Declare all the variables
Some quick global variable declaration:
# WebCam Streaming using imutils vs = WebcamVideoStream(src=0).start() # define the lower and upper boundaries of the "green" "red" "blue" # ball in the HSV color space, then initialize the # list of tracked points greenLower = (10, 140, 100) greenUpper = (30, 255, 255) redLower = (0, 80, 80) redUpper = (10, 255, 190) blueLower = (100, 30, 0) blueUpper = (120, 255, 128)
vs = WebcamVideoStream(src=0).start(): start webcam streaming with
WebcamVideoStreamfrom source /dev/video0.
greenLower = (10, 140, 100)
greenUpper = (30, 255, 255): Set HSV color space upper and lower limits for contour of green colored objects. And repeat the same for every shade you want to detect.
All the functions
class ShapeDetector: def __init__(self): pass def detect_shape(self, c): # initialize the shape name and approximate the contour shape = "unidentified" peri = cv2.arcLength(c, True) approx = cv2.approxPolyDP(c, 0.04 * peri, True) # if the shape is a triangle, it will have 3 vertices if len(approx) == 3: shape = "triangle" # if the shape has 4 vertices, it is either a square or # a rectangle elif len(approx) == 4: # compute the bounding box of the contour and use the # bounding box to compute the aspect ratio (x, y, w, h) = cv2.boundingRect(approx) ar = w / float(h) # a square will have an aspect ratio that is approximately # equal to one, otherwise, the shape is a rectangle shape = "square" if ar >= 0.95 and ar <= 1.05 else "rectangle" # if the shape is a pentagon, it will have 5 vertices elif len(approx) == 5: shape = "pentagon" # otherwise, we assume the shape is a circle else: shape = "circle" # return the name of the shape return shape
detect_shapefunction is responsible to take in a specific HSV color contour, map out the edges and approximate the number of vertices.
- Then depending on the number of vertices, return the value as
def detect_hsv(frame, lower, upper): # resize the frame, blur it, and convert it to the HSV # color space blurred = cv2.GaussianBlur(frame, (11, 11), 0) hsv = cv2.cvtColor(blurred, cv2.COLOR_BGR2HSV) # construct a mask for the color "green", then perform # a series of dilations and erosions to remove any small # blobs left in the mask mask = cv2.inRange(hsv, lower, upper) mask = cv2.erode(mask, None, iterations=2) mask = cv2.dilate(mask, None, iterations=2) # find contours in the mask and initialize the current # (x, y) center of the ball cnts = cv2.findContours(mask.copy(), cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE) cnts = imutils.grab_contours(cnts) data_arr= if len(cnts) > 0: c = max(cnts, key=cv2.contourArea) sd = ShapeDetector() i = 0 for c in cnts: ((x, y), radius) = cv2.minEnclosingCircle(c) M = cv2.moments(c) cX = int((M["m10"] / M["m00"])) cY = int((M["m01"] / M["m00"])) shape = sd.detect_shape(c) data = [c, cX, cY, shape] data_arr.insert(i, data) i = i + 1 return data_arr
- The detect_hsv function takes in three parameters. The current frame, hsv upper limit and hsv lower limit.
- It then proceeds to create contours for a specific hsv color space, and stores them in the
cntsarray as individual contours.
- After that it traverses through all the contours, first calculating the center XY co-ordinates in the frame.
- Then calls the
detect_shapefunction and saves the returned shape value in
- All the value for a specific contour is saved in the
data_arrysaves all the
datavalues for multiple contours that were detected.
- This entire exercise allows us to detect multiple contours of the same color and multiple contours of different color within the same frame.
- This also gives us the XY coordinates for each object detected with will help us guide the Robotic Arm.
def overlay(frame, data, overlay_col, num): try: cv2.drawContours(frame, [data], -1, overlay_col, 2) cv2.putText(frame, str(data) + " " + str(num) + ": " + str(data) + "x" + str(data), (data, data), cv2.FONT_HERSHEY_SIMPLEX, 0.5, (0, 255, 0), 2) except: print("An exception occurred")
- This function takes in data returned by the
detect_hsvfunction and lays it over the frame as a text, this includes;
- XY Co-ordinates
- Shape of the object
- Outline of the contour
- Numbering of the object.
Main: one loop to rule them all:
I honestly couldn’t find a meme for this :/
def main(): while True: # compute the center of the contour, then detect the name of the # shape using only the contour # grab the current frame # Incase of stream using cv2.VideoCapture #ret, frame = vs.read() # incase of stream using imutils frame = vs.read() shape_blue = detect_hsv(frame, blueLower, blueUpper) shape_green = detect_hsv(frame, greenLower, greenUpper) shape_red = detect_hsv(frame, redLower, redUpper) for i in range(len(shape_blue)-1): overlay(frame, shape_blue[i], (0,0,255), i) for i in range(len(shape_green)-1): overlay(frame, shape_green[i], (255,0,0), i) for i in range(len(shape_red)-1): overlay(frame, shape_red[i], (0,255,0), i) cv2.imshow("Frame", frame) #cv2.imshow("Frame1", framer) key = cv2.waitKey(1) & 0xFF # if the 'q' key is pressed, stop the loop if key == ord("q"): break vs.stop() cv2.destroyAllWindows()
main does the following things in a loop:
- Reads frame from the video stream.
- Passes said frame and hsv values to
- Repeats for the other two colors
- Saves the return array to
- For each contour in each of the color space, calls the overlay function to add appropriate overlay to the frame.
- Output the processed from using
This is more like a show-off of the object detection working in real time. For the actual Robotic Arm to work all we need is the XY Co-ordinates for the object in order to aim the claw at it.
This code was presented during OpenHours Episode 154.
This article is Part 4 in a 7-Part Series.
- Part 1 - Qualcomm RB3 Robotic Arm Project | Introduction
- Part 2 - OpenCV on RB3 Pt. 1 | Qualcomm RB3 Robotic Arm Project
- Part 3 - Threads got complicated | OpenCV on RB3 Pt. 2 | Qualcomm RB3 Robotic Arm Project
- Part 4 - Gimme Code | OpenCV on RB3 Pt. 3 | Qualcomm RB3 Robotic Arm Project
- Part 5 - My experiments with 1080p | OpenCV on RB3 Pt. 4 | Qualcomm RB3 Robotic Arm Project
- Part 6 - Setup RB3 for the Robotic Arm Project | OpenCV on RB3 Pt. 0 | Qualcomm RB3 Robotic Arm Project
- Part 7 - Speech Recognition and Combining Voice with Vision | OpenCV on RB3 Pt. 5 | Qualcomm RB3 Robotic Arm Project