RoadBotics’ Research and Development: Making SLAM Work, Part 2

So far in our 3D reconstruction series, we’ve covered an introduction to 3D reconstruction, how to create one, and part 1 of the SLAM methodology.

Next, we’re going to show you the last of the major obstacles the R&D team has been tackling to make SLAM methods work at RoadBotics. As a bonus, you’ll learn a bit about how modern smartphone cameras work!

Rolling Shutter

An intriguing problem that spans almost all of modern digital image and video capture is the rolling shutter problem. It is relatively simple to describe, but a bit of background about how cameras work is helpful.

Before the age of ubiquitous digital cameras, analog film cameras captured images by exposing a photosensitive material (the film) to light. Incidentally, this is why analog film requires a dark room to be developed: the medium itself is sensitive to light. Inside the camera, the loaded film is protected from light by a device called a shutter. When the button to take a picture is pressed, the shutter, which is initially closed to block light, opens briefly to expose the film to the light coming in through the lens, and then closes again. The shutter opening and closing rapidly is what makes the distinctive “taking a photograph” sound of a handheld camera, and the sound most often played by digital cameras to indicate a photo is being taken. This method of capturing the light in a scene is effectively uniform and instantaneous across the film, which is why this type of shutter is often referred to as a global shutter, in the sense of “global” as “complete.”

Most digital cameras do not work the same way. Modern image capture devices (i.e., cameras on our smartphones) are based on integrated semiconductor chips called CMOS sensors. These CMOS-based sensors replace traditional film with a grid of pixels, capturing the light that hits each pixel in a row-by-row manner. This is called a rolling shutter, as opposed to the global shutter.

The following animation shows the difference. The timing in the rolling shutter animation is exaggerated for effect, but the capture delay between individual rows is a real consequence of this technology.

Source: https://www.baumer.com/us/en/service-support/function-principle/function-principle-and-applications-of-rolling-shutter-cmos-cameras/a/CMOS-rolling-shutter-cameras

Though the difference in operation is simple, a rolling shutter turns out to have many profound side effects in digital image capture when either the camera is moving (as with dashboard collection) or objects in the frame are moving. This is appropriately called the rolling shutter effect.

The effect appears when objects in the frame, or the camera itself, move at a speed comparable to the speed of the rolling shutter: because the rows are not captured simultaneously, the tiny delay between them means each row records a slightly different scene.
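If you’re curious, here’s a tiny simulation of the idea, a sketch in Python with NumPy rather than anything we actually ship. It renders a synthetic scene (a vertical bar moving left to right) and captures it once with a global shutter and once row by row with a small per-row delay; the scene size, bar speed, and delay are made-up numbers chosen so the skew is visible.

    import numpy as np

    def render_scene(t, height=100, width=200, bar_width=10, speed=150.0):
        """Synthetic scene at time t: a vertical bar moving left-to-right at `speed` px/s."""
        frame = np.zeros((height, width), dtype=np.uint8)
        x = int(speed * t) % width
        frame[:, x:x + bar_width] = 255
        return frame

    def global_shutter(t0, height=100, width=200):
        """Every row is captured at the same instant t0."""
        return render_scene(t0, height, width)

    def rolling_shutter(t0, row_delay=0.0005, height=100, width=200):
        """Row r is captured at t0 + r * row_delay, so a moving object appears skewed."""
        image = np.zeros((height, width), dtype=np.uint8)
        for r in range(height):
            image[r, :] = render_scene(t0 + r * row_delay, height, width)[r, :]
        return image

    if __name__ == "__main__":
        g = global_shutter(0.0)
        r = rolling_shutter(0.0)
        # In the global-shutter image the bar is perfectly vertical; in the
        # rolling-shutter image its left edge drifts to the right as the row
        # index (and therefore the capture time) increases.
        print("bar start column, top row:   ", np.argmax(g[0] > 0), np.argmax(r[0] > 0))
        print("bar start column, bottom row:", np.argmax(g[-1] > 0), np.argmax(r[-1] > 0))

In the global-shutter image the bar’s edge sits in the same column in every row; in the rolling-shutter image the edge shifts a few columns between the top and bottom rows, which is exactly the slant you see in real rolling shutter footage.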

The following animation illustrates the problem:

Source: https://upload.wikimedia.org/wikipedia/commons/1/15/Rolling_shutter_effect_animation.gif

The above animation may seem dramatic, but below is a real-life image of a spinning aircraft propeller that exhibits the same dramatic warping:

warped airplane propeller

Unfortunately for us, and for anyone using a smartphone camera, the rolling shutter effect is not limited to such obvious circumstances. The effect can be more subtle and difficult to notice.

For instance, you can notice warping in some parts of the following image, but without forewarning, you could easily assume this fence was simply bent in places. Even after we’ve told you, you may still find it hard to accept that this image does not accurately capture the scene.

Source: https://en.wikipedia.org/wiki/Rolling_shutter#/media/File:Rolling_Shutter_Effect_at_Afton_Down,_21_August_2018.jpg

Why Do Cameras Have Rolling Shutters?

So if analog cameras use global shutters and rolling shutters produce warped images, why do digital cameras use rolling shutters at all?

Because rolling shutters are far cheaper and simpler to build than global shutters. We can see why with some simple math.

Many smartphones (including the ones we use for data collection at RoadBotics) capture 1080p HD video (1920 × 1080 pixels) at 30 frames per second. That’s 30 images per second, each with 2,073,600 pixels at 24 bits per pixel.

(30 frames x 2,073,600 pixels x 24 bits) / (8 bits per byte) = 186,624,000 bytes per second

Or about 180 megabytes per second. That’s a literal firehose of data!

Every frame is about 6 megabytes, and if your smartphone had a global shutter, it would have to record that 6 megabytes instantaneously between frame exposures. With a rolling shutter, your smartphone still has to record that firehose of data, but it can record it row-by-row, moving the data from one row of pixels into memory while the other rows are being exposed to light. This trade-off dramatically reduces the cost of the camera in your smartphone while introducing some warping that humans don’t really notice when playing back the video at 30 fps. It’s only when we pull individual frames out of the video and look at them for a second or longer that we notice the warp.
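As a sanity check, here is the same arithmetic as a short script, plus a rough per-row time budget. The assumption that the readout is spread evenly across the whole 1/30-second frame period is only for illustration; real sensors read out faster than that, but the order of magnitude is the point.

    # Back-of-the-envelope numbers for 1080p (1920x1080) video at 30 fps, 24 bits/pixel.
    WIDTH, HEIGHT = 1920, 1080
    FPS = 30
    BITS_PER_PIXEL = 24

    pixels_per_frame = WIDTH * HEIGHT                      # 2,073,600
    bytes_per_frame = pixels_per_frame * BITS_PER_PIXEL // 8
    bytes_per_second = bytes_per_frame * FPS

    print(f"bytes per frame:  {bytes_per_frame:,}")        # 6,220,800 (~6 MB)
    print(f"bytes per second: {bytes_per_second:,}")       # 186,624,000 (~180 MB/s)

    # If the rolling-shutter readout were spread over the whole 1/30 s frame
    # period (an assumption for illustration only), each of the 1080 rows
    # would get roughly this much time:
    frame_period = 1 / FPS
    per_row_time = frame_period / HEIGHT
    print(f"time budget per row: ~{per_row_time * 1e6:.1f} microseconds")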

How Rolling Shutter Affects 3D Reconstructions

This warping ends up having drastic consequences for reconstruction, given that 1) the effect grows stronger as the relative motion between the camera and the scene gets faster (RoadBotics collects data at road speeds, which can be quite fast), and 2) we routinely capture rapidly changing background imagery (for example, trees and buildings).

As an example, consider the following angled view of one of the reconstructions we looked at in our last blog post:

The top part of the reconstruction looks normal, but there are a lot of points that seem to be beneath the ground!

None of the images being fed into the reconstruction had subterranean data, so how and why would a reconstruction process produce this?

The answer is that the input images were collected using a smartphone mounted as a dashcam, creating many instances of rolling shutter artifacts as the car traveled down the road.

Even though many of these artifacts are hard or practically impossible for humans to notice in a picture (they occur in the foliage, in the clouds, on the road surface), the computer, which uses the precise pixel location of features to determine their location in 3D space, takes all of them into account. While localizing the warped features, the algorithm tries to reconcile their apparent position differences and ends up deciding that parts of the scene lie in places that don’t make sense (such as below the surface of the road) in order to make them fit the warped images.
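To get a feel for how sensitive this is, here is a toy triangulation example with made-up numbers (this is not our actual camera setup). For a simplified rectified two-view case, depth is Z = f·b / d, where f is the focal length in pixels, b the distance between the two camera positions, and d the disparity (the pixel shift of the feature between views), so a warp of just a few pixels moves the reconstructed point by meters.

    # Illustrative only: simplified two-view triangulation with assumed numbers.
    f = 1400.0   # focal length in pixels (assumed)
    b = 2.0      # baseline in meters, e.g. distance driven between two frames (assumed)

    def depth(disparity_px: float) -> float:
        """Depth from disparity for a rectified pair: Z = f * b / d."""
        return f * b / disparity_px

    true_disparity = 40.0                  # what a static, undistorted feature would give
    warped_disparity = true_disparity - 3  # the same feature, shifted ~3 px by rolling shutter

    print(f"depth from true observation:   {depth(true_disparity):.1f} m")    # 70.0 m
    print(f"depth from warped observation: {depth(warped_disparity):.1f} m")  # ~75.7 m
    # A shift of only a few pixels moves the reconstructed point several meters;
    # shifts in the vertical image direction likewise move the point up or down,
    # which is how scene points can end up "underground" in the reconstruction.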

Readers of our previous post might ask: can’t RANSAC exclude these anomalies? Unfortunately, the warped features created by a rolling shutter can be subtle enough that they still mathematically fit the rest of the scene (which is all RANSAC checks when deciding what is “good enough to keep”), even though we can visually see that they do not fit.

The rolling shutter problem is an open problem in computer vision. There are two general approaches to solving it.

One is a preprocessing correction: train a neural network to recognize the distortions in the input data and correct them before the data is used for reconstruction. There is a lot of ongoing research in this domain (for an example, see http://cseweb.ucsd.edu/~mkchandraker/pdf/cvpr19_rollingshutter.pdf, a paper presented at CVPR, one of the most prestigious academic conferences in computer vision), and some of the latest results are adaptable to our needs.

The second is a post-processing correction: automatically recognize rolling shutter distortions after the fact and excise them from the reconstruction. RoadBotics’ reconstructions are generally aimed at civil infrastructure (roads, sidewalks, etc.), and these are generally planar assets. (You should be very worried about roads that do not fit a planar surface! It usually means something like this has happened.) Therefore, we can apply standard curve-fitting techniques to the raw 3D data in the reconstruction and then use statistical heuristics to cut out data that lies below the plane or is otherwise implausibly far from the surfaces we care about.
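As a rough illustration of that idea (a minimal sketch, not our production pipeline; on real data a robust fit such as RANSAC would be more appropriate than plain least squares, and the 0.5 m threshold and synthetic points are made up), here is how one might fit a plane to the reconstructed points and drop anything implausibly far below it:

    import numpy as np

    def fit_plane(points: np.ndarray):
        """Least-squares plane z = a*x + b*y + c through an (N, 3) array of points."""
        A = np.c_[points[:, 0], points[:, 1], np.ones(len(points))]
        coeffs, *_ = np.linalg.lstsq(A, points[:, 2], rcond=None)
        return coeffs  # (a, b, c)

    def filter_below_plane(points: np.ndarray, coeffs, max_depth_below: float = 0.5):
        """Keep points no more than `max_depth_below` meters under the fitted plane."""
        a, b, c = coeffs
        plane_z = a * points[:, 0] + b * points[:, 1] + c
        keep = points[:, 2] >= plane_z - max_depth_below
        return points[keep]

    if __name__ == "__main__":
        rng = np.random.default_rng(0)
        # Synthetic "road" points near z = 0, plus a handful of rolling-shutter
        # artifacts that ended up well below the surface.
        road = np.c_[rng.uniform(0, 50, 500), rng.uniform(-3, 3, 500), rng.normal(0, 0.05, 500)]
        artifacts = np.c_[rng.uniform(0, 50, 20), rng.uniform(-3, 3, 20), rng.uniform(-8, -2, 20)]
        cloud = np.vstack([road, artifacts])

        coeffs = fit_plane(cloud)
        cleaned = filter_below_plane(cloud, coeffs, max_depth_below=0.5)
        print(f"kept {len(cleaned)} of {len(cloud)} points")

On the synthetic cloud above, the fit recovers a plane close to the road surface and the filter discards the “subterranean” points while keeping the road itself.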

For an optimal result, we’ve found that it’s good to use both approaches. The preprocessing step tends to improve the overall accuracy of the reconstruction, and the post-processing step simplifies the reconstruction to focus on what we really care about.

Stay tuned for our final blog post about reconstruction, which will showcase some of our most impressive results so far!
