Self-Driving Car Nanodegree Summary

Summary

I recommend this program for aspiring early- to mid-career engineers interested in self-driving vehicle technology. Some things to keep in mind about the program:

Reasonable time to complete: about 9 months
High value, High cost
The first term was much more rigorous than the subsequent two
Entire technology stack, from high-level planning to low-level controls, was covered in adequate depth
Knowledge checks in between concepts via quizzes were very useful
Some inconsistency in application across terms: multiple partners seemed to have led to multiple lecture/quiz formats
Culminated in programming Carla, Udacity’s self-driving car

Introduction

I had been working for Ford’s autonomous vehicles unit for a about a year when Udacity launched the Self-Driving Car Nanodegree, now rebranded as the Self-Driving Car Engineer Nanodegree. I was on the object detection and tracking team which, admittedly, I did not find as fascinating as the other technology areas adjacent; these included path planning, motion controls, and computer vision. I wanted to learn more about these other topics so that, one day, I could move to another group doing work within Ford in one or more of these areas.

Udacity has the deserved reputation of being more expensive than competitors when comparing similar offerings. I took this into account before considering enrolling in the program. The nanodegree was broken out into 3 terms, each costing a whopping $800. Wow… $2400 for an unaccredited online education credential?! To Udacity’s credit: this program has no competition among the other big MOOC platforms; Coursera and edX have nothing comparable. The affiliation with industry partners like Nvidia and Mercedes-Benz made this program very appealing. I ultimately decided to commit the time and capital resources to the program and I estimated that each term would take about 3 months to complete.

I enrolled in the second-ever cohort, experienced some of the growing pains of early adoption, and had a blast working through the material. I will go through each term, presenting some interesting results and my opinions along the way.

Term 1:

The first term was dedicated to computer vision, a field in which I was quite inexperienced. I had some experience throughout the years working with imaging, e.g. IR cameras, and non-imaging, e.g. radar, sensors, however I had not done much work with image processing. I was really looking forward to this term because it gave me a chance to learn methods of image processing, like convolutional neural networks and gradient-based methods, and apply them to real-world problems encountered in self-driving cars.

The video below shows the output of the project where I used OpenCV filtering and processing methods to do lane finding on a video stream captured from a moving car:

I hadn’t thought about it before, but image processing could be used to develop steering commands for a car! Using a convolutional neural net architecture, I was able to train a model to steer a simulated car around a test track. I trained the model by driving the car around the test track many different times and many different ways: counterclockwise, clockwise, weaving about the track, etc… The data captured provided the neural net with the appropriate steering commands given a picture of the world at any given time. This was super cool. Check out the video below:

This term, aside from the the really cool projects, had a high-level of academic rigor. The project output was only a part of the criteria for successful completion. The other part, which really impressed me, was a detailed analysis of the methods used in the context of the problem. I think that it is important to do complete analyses when learning new material. Otherwise, with emphases placed only on production, engineers are turned into technicians capable of nothing more than reiteration of previous results.

In this term, I completed projects that classified traffic signs from a database of images, detected vehicles in a video stream, drove a simulated car around a test track, and identified lanes from a video stream in real-time. Below, you’ll see the output of the vehicle detection project:

I liked the vehicle detection project so much that I extended it with a tracker to smooth out the observed measurement transients. I used this project for a few job interview presentations with favorable feedback.

This term was awesome! In a period of only a few months, I completed five projects, each relevant to real-world perception problems in self-driving cars and, most importantly, using real data.

Term 2:

Term 1 set the bar very high, so I was really looking forward to Term 2. The scope of this term was image processing and control; with emphases placed on tracking, localization, and control. The first few projects didn’t seem that interesting to me; they were concerned with Kalman filters, and I had had much experience with implementing and tuning these in practice. Even with my experience with Kalman filtering, I still found some useful tips and tricks. Below, you’ll find the link to the unscented Kalman filter project results:

One departure from the first term I observed quickly: this term’s projects did not require the same level of effort as those of the first. Granted, the projects were done in C++, as opposed to Python, and there isn’t an integrated platform for code and analysis like Jupyter notebooks. I was a little disappointed that there was no longer an analytical component to the projects.

There really were some interesting projects here. My favorite was, perhaps, the particle filter landmark-based localization project:

The term ended with a comparison of PID and model-predictive control methods for error-correction and stability. You can see the MPC results in the video below:

Although much of the material was not new to me, I found some interesting aspects that I had overlooked previously. The simulation platform for testing algorithms was very good and allowed for rapid algorithm development and testing. I would have liked there to have been more of a written component for each project. Performing rigorous post-mortem analyses of projects helps to cement difficult concepts for me.

Term 3

The third term was a culmination of the previous two. Techniques presented previously were further explored in greater depth. The first project was a high-level path planner that fused sensor data with vehicle telemetry to move about in traffic:

The second project was interesting; there was a choice between two options. I chose the option for advanced deep learning applied to semantic segmentation of camera images. Semantic segmentation means a pixel-by-pixel classification of content within images by training a convolutional neural network to identify which pixels belong to certain objects. The project had only two objects: roads and non-roads, but the work could be extended by adding more classes. The image below shows a sample segmented image, with the road pixels highlighted in green:

The final project was incredible. We got to organize teams and write code that ran on Carla, Udacity’s self-driving car. I wrote up a more in-depth project report for those interested. This experience was very rewarding, and also very challenging.

I go more in-depth in the project page, so I’ll only mention a point here. There was an apparent disconnect between the simulator used to verify the ROS code before running on Carla and the actual car, itself. Twice, I pushed submissions that worked well in simulation only to have subpar results from running the code on the car. Two weeks, or more, between submission and results being made available made debugging issues on Carla very difficult.

Conclusion

The problems I encountered throughout all three terms were not academic; they were real industry problems along with a discussion of how industry approaches them. I believe that Udacity’s approach is unique among other platforms is this regard; neither edX nor Coursera have much regard for industry in their courses or specializations. While the cost was relatively high, the return on investment was much more tangible. I learned a lot in a relatively short period of time, and the knowledge was directly applicable to robotics, in general, and self-driving cars, in particular.