Zookeeper

Details

I was discussing with a friend a pet door that would only let certain animals out. We had a few basic ideas (RFID, magnets, etc.), but a big issue with a dumb sensor is the wrong animal sneaking out while the door is open.

Maybe a camera would work? Sounds like a good use case for machine learning. This project distinguishes between three pets.


Cody: Corgi/Monster mix
Malloc: Nimble hunter
Strcat: Loves strings, overflows buffers

This is my first attempt at applying machine learning to a real problem beyond coursework and MNIST/CIFAR. If you stumble on this and can recommend better ways to do things, please reach out!

Collecting Data

I hooked a webcam to a Raspberry Pi and mounted it with a view of the back door. Every minute it takes a picture and uploads it to Amazon S3.
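A capture loop along these lines could look like the sketch below. It is only an illustration: the write-up does not say which libraries ran on the Pi, so the use of OpenCV (`cv2`) and `boto3`, the bucket name, the key layout, and the temporary file path are all assumptions.

```python
import time
from datetime import datetime, timezone

BUCKET = "zookeeper-training-images"  # hypothetical bucket name
INTERVAL_SECONDS = 60                 # one picture per minute


def make_key(now):
    """Build a sortable S3 object key from the capture time (hypothetical layout)."""
    return now.strftime("captures/%Y-%m-%d/%H-%M-%S.jpg")


def capture_frame(path):
    """Grab one frame from the first attached webcam and save it as a JPEG."""
    import cv2  # assumption: OpenCV is installed on the Pi

    camera = cv2.VideoCapture(0)
    try:
        ok, frame = camera.read()
        if ok:
            cv2.imwrite(path, frame)
        return ok
    finally:
        camera.release()


def upload(path, key):
    """Upload a local file to S3."""
    import boto3  # assumption: AWS credentials are already configured

    boto3.client("s3").upload_file(path, BUCKET, key)


def run_forever(local_path="/tmp/latest.jpg"):
    """Capture and upload one image per interval until interrupted."""
    while True:
        now = datetime.now(timezone.utc)
        if capture_frame(local_path):
            upload(local_path, make_key(now))
        time.sleep(INTERVAL_SECONDS)
```

Calling `run_forever()` on the Pi would start the loop. Timestamped keys keep the images in capture order, which helps when labeling them later.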

I wrote some tools to quickly classify training images by hand and output a CSV.
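The labeling tools themselves are not shown here, but the CSV they produce could be written and read back roughly as follows. The column names and the `empty` label are assumptions made for illustration; only the three pet names come from the project.

```python
import csv

LABELS = ("empty", "cody", "malloc", "strcat")  # hypothetical label set


def write_labels(rows, csv_path):
    """Write (filename, label) pairs to a CSV file with a header row."""
    with open(csv_path, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["filename", "label"])
        for filename, label in rows:
            if label not in LABELS:
                raise ValueError(f"unknown label: {label}")
            writer.writerow([filename, label])


def read_labels(csv_path):
    """Read the CSV back into a {filename: label} dictionary."""
    with open(csv_path, newline="") as handle:
        return {row["filename"]: row["label"] for row in csv.DictReader(handle)}
```

Rejecting unknown labels at write time catches typos while labeling, before they can leak into the training set.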

The Software

The software is written with TensorFlow (with TFLearn) and scikit-learn.

It seemed like I could end up with a network that always predicts an empty image. Since the scene is usually empty, always predicting "empty" would be right most of the time, so a network could reach a low cost without learning anything useful. To mitigate this, I decided to break the problem into three parts:

Determine Night/Day
Determine if Anything is Happening
Determine the Specific Animal
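The three stages can be chained so that later stages only run when an earlier one calls for it. The sketch below uses placeholder callables for the trained models. Passing the night/day result on to the later stages is an assumption, not something the project states.

```python
def classify(image, is_night, has_activity, which_animal):
    """Run the three stages in order, stopping once the scene is known to be empty.

    The three callables are stand-ins for the trained models.
    """
    night = is_night(image)
    if not has_activity(image, night):
        return {"night": night, "animal": None}
    return {"night": night, "animal": which_animal(image, night)}


# Example with stub models in place of the real classifiers.
result = classify(
    image=[[0, 0], [0, 255]],
    is_night=lambda img: False,
    has_activity=lambda img, night: True,
    which_animal=lambda img, night: "Malloc",
)
```

Because empty frames are filtered out by the second stage, the animal classifier only ever has to choose between animals.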

Determining Night/Day

This uses logistic regression on the average brightness of the image, which handled the task with ease.

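import numpy as np
from scipy import misc  # misc.imread was removed in SciPy 1.2; newer versions need e.g. imageio
from sklearn import linear_model
# constants, filenames, and Y come from elsewhere in the project
Y = np.array(Y)  # array of training answers, 1 or 0
brightness = []  # average brightness of each image
for filename in filenames:
    # load the image as greyscale
    image = misc.imread(constants.IMAGE_64_PATH + '/' + filename, mode='L')
    brightness.append(image.mean())
# scikit-learn expects one row per sample and one column per feature
X = np.array(brightness).reshape(-1, 1)
clf = linear_model.LogisticRegression(C=1e5)
clf.fit(X, Y)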
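Once fitted, the classifier can be applied to a single brightness value. The following self-contained sketch uses made-up brightness numbers rather than data from the project. It also assumes that label 1 means day, and it scales brightness to [0, 1], which the code above does not do.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Made-up average brightness values (0-255), not measurements from the project.
night_brightness = np.array([12.0, 18.0, 25.0, 31.0])
day_brightness = np.array([110.0, 135.0, 150.0, 172.0])

# Scale to [0, 1] and shape as one feature per row, as scikit-learn expects.
X = np.concatenate([night_brightness, day_brightness]).reshape(-1, 1) / 255.0
Y = np.array([0] * 4 + [1] * 4)  # assumption: 0 = night, 1 = day

clf = LogisticRegression(C=1e5, max_iter=1000)
clf.fit(X, Y)


def is_day(avg_brightness):
    """Classify a single average-brightness value with the fitted model."""
    return bool(clf.predict(np.array([[avg_brightness / 255.0]]))[0] == 1)
```

With a single feature, the model amounts to a learned brightness threshold, which is why such a simple approach is enough for this stage.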

Determining if Anything is Happening

The next step identifies if anyone is present in the image, or if it’s just an empty scene.

After a few attempts, I realized that this is a fixed-position camera on a pretty limited scene. The scene can change a lot between night and day, but animals will always be in the same places: table, windowsill, floor. They won't just be floating in midair.

Given this, a fully connected neural network worked fine. It doesn't need to know what a cat looks like, just that a given pixel may correspond to activity.

import tflearn
from tflearn.data_preprocessing import ImagePreprocessing
from tflearn.data_augmentation import ImageAugmentation
from tflearn.layers.core import input_data, fully_connected
from tflearn.layers.estimator import regression
img_prep = ImagePreprocessing()
img_prep.add_featurewise_zero_center()
img_prep.add_featurewise_stdnorm()
img_aug = ImageAugmentation()
img_aug.add_random_flip_leftright()
# Specify shape of the data, image prep
network = input_data(shape=[None, 52, 64],
                     data_preprocessing=img_prep,
                     data_augmentation=img_aug)
# The camera is fixed and the frames are fairly similar, so the network can rely on pixel position.
# A fully connected network is used directly; no convolution is needed.
network = fully_connected(network, 2048, activation='relu')
network = fully_connected(network, 2, activation='softmax')
network = regression(network, optimizer='adam',
                     loss='categorical_crossentropy',
                     learning_rate=0.00003)
model = tflearn.DNN(network, tensorboard_verbose=0)
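To make the architecture concrete, here is a NumPy sketch of the forward pass such a network computes, assuming the 52×64 input is flattened before the first fully connected layer. The weights are random and untrained, so the outputs are meaningless; only the shapes and the parameter count are of interest.

```python
import numpy as np

HEIGHT, WIDTH, HIDDEN, CLASSES = 52, 64, 2048, 2

rng = np.random.default_rng(0)
# Random, untrained weights: only the shapes matter here.
w1 = rng.normal(scale=0.01, size=(HEIGHT * WIDTH, HIDDEN))
b1 = np.zeros(HIDDEN)
w2 = rng.normal(scale=0.01, size=(HIDDEN, CLASSES))
b2 = np.zeros(CLASSES)


def forward(images):
    """Map a batch of (52, 64) images to per-class probabilities."""
    flat = images.reshape(len(images), -1)       # each pixel is its own input
    hidden = np.maximum(flat @ w1 + b1, 0.0)     # fully connected + ReLU
    logits = hidden @ w2 + b2                    # fully connected, 2 outputs
    logits -= logits.max(axis=1, keepdims=True)  # for numerical stability
    exp = np.exp(logits)
    return exp / exp.sum(axis=1, keepdims=True)  # softmax


parameter_count = w1.size + b1.size + w2.size + b2.size
```

Under these assumptions the network has about 6.8 million parameters, nearly all of them in the first layer, where every pixel gets its own weight for each hidden unit.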

To handle issues of lighting and shadows changing throughout the day, I created an average image, which is the average of all daytime images. I subtracted this image from all training images, like zeroing a scale before weighing something. It’s not perfect, but it brings out a little more contrast in the photos. Notice the cat on the table becomes more visible when the average (center) is subtracted.
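The averaging and subtraction can be sketched in a few lines of NumPy. The function names are hypothetical and the data is synthetic. Casting to float before subtracting is a choice made here so that pixels darker than the average go negative instead of wrapping around as unsigned bytes.

```python
import numpy as np


def mean_image(images):
    """Pixel-wise average of a stack of images shaped (count, height, width)."""
    return np.asarray(images, dtype=np.float32).mean(axis=0)


def subtract_mean(image, average):
    """Remove the static background; the result is a signed float image."""
    return np.asarray(image, dtype=np.float32) - average


# Synthetic example: a flat grey scene, then the same scene with one bright patch.
daytime = np.full((10, 52, 64), 100, dtype=np.uint8)
average = mean_image(daytime)

frame = daytime[0].copy()
frame[20:25, 30:35] = 220  # stand-in for a cat
difference = subtract_mean(frame, average)
```

In `difference`, the unchanged background is zero and only the patch stands out, which is the "zeroing the scale" effect described above.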


Original image with a cat	Average of all daytime...

Discussions

ahmadhassanawan404 wrote 07/23/2023 at 12:47

You have done nice work. I have also worked on machine learning for animals; for more detail, you can see https://atozanimalszoo.com/


JOhn gado wrote 09/10/2016 at 13:18

This is pretty awesome, good job! Which school are you studying at? I would like to get started with image recognition via deep learning. Do you have any tutorial/book/video/MOOC to recommend?

Is a Raspberry Pi powerful enough for all the processing, or did you do it on a computer?

Thanks :D


Zach wrote 09/12/2016 at 02:26

Thanks! Out of school for a while, just trying to keep my skills fresh.

I took Andrew Ng's machine learning course on Coursera: https://www.coursera.org/learn/machine-learning

It's more theory, with exercises in Octave/Matlab, but it made TensorFlow a lot more accessible.

The Raspberry Pi was just used for image capture; I ran TensorFlow on a laptop and a cloud server. I could see the Raspberry Pi being powerful enough to analyze a single image, but not to do the training. It also sounds like TensorFlow is difficult to compile on the Raspberry Pi.


Greg Kennedy wrote 08/05/2016 at 23:33

Just commenting to say I love your cats' names.

