R-3000 “Spider tank” vision


Section 6 stations a spider tank, hidden under thermoptic camouflage, to guard Project 2501. When Kunasagi confronts the tank, we see a glimpse of the video feed from its creepy, metal, recessed eye. This view is a screen green image, overlaid with two reticles. The larger one with radial ticks shows where the weapon is pointing while the smaller one tracks the target.

I have often used the discrepancy between a weapon- and target-reticle to point out how far behind Hollywood is on the notion of agentive systems in the real world, but for the spider tank it’s very appropriate.The image processing is likely to be much faster than the actuators controlling the tank’s position and orientation. The two reticles illustrate what the tank’s AI is working on. This said, I cannot work out why there is only one weapon reticle when the tank has two barrels that move independently.



When the spider tank expends all of its ammunition, Kunasagi activates her thermoptic camouflage, and the tank begins to search for her. It switches from its protected white camera to a big-lens blue camera. On its processing screen, the targeting reticle disappears, and a smaller reticle appears with concentric, blinking white arcs. As Kunasagi strains to wrench open plating on the tank, her camouflage is compromised, allowing the tank to focus on her (though curiously, not to do anything like try and shake her off or slam her into the wall or something). As its confidence grows, more arcs appear, become thicker, and circle the center, indicating its confidence.

The amount of information on the augmentation layer is arbitrary, since it’s a machine using it and there are certainly other processes going on than what is visualized. If this was for a human user, there might be more or less augmentation necessary, depending on the amount of training they have and the goal awareness of the system. Certainly an actual crosshairs in the weapon reticle would help aim it very precisely.



Section No9’s crappy security


The heavily-mulleted Togusa is heading to a company car when he sees two suspicious cars in the parking basement. After sizing them up for a moment, he gets into his car and without doing anything else, says,

"Security, whose official vehicles are parked in the basement garage?"

It seems the cabin of the car is equipped to continuously monitor for sound, and either an agent from security is always waiting, listening at the other end, or by addressing a particular department by name, a voice recognition system instantly routs him to an operator in that department, who is able to immediately respond:

"They belong to Chief Nakamura of the treaties bureau and a Dr. Willis."

"Give me the video record of their entering the building."

In response, a panel automatically flips out of the dashboard to reveal a monitor, where he can watch the the security footage. He watches it, and says,

"Replay, infrared view"

After watching the replay, he says,

"Send me the pressure sensor records for basement garage spaces B-7 and 8."

The screen then does several things at once. It shows a login screen, for which his username is already supplied. He mentally supplies his password. Next a menu appears on a green background with five options: NET-WORK [sic], OPTICAL, PRESSURE, THERMO, and SOUND. "PRESSURE" highlights twice with two beeps. Then after a screen-green 3D rendering of Section 9 headquarters builds, the camera zooms around the building and through floorplans to the parking lot to focus on the spaces, labeled appropriately. Togusa watches as pea green bars on radial dials bounce clockwise, twice, with a few seconds between.

The login

Sci-fi logins often fail for basic multifactor authentication, and at first it appears that this screen only has two parts: a username and password. But given that Togusa connects to the system first vocally and then mentally, it’s likely that one of these other channels supplies a third level of authentication. Also it seems odd to have him supply a set of characters as the mental input. Requiring Togusa to think a certain concept might make more sense, like a mental captcha.

The zoom

Given that seconds can make a life-or-death difference and that the stakes at Section 9 are so high, the time that the system spends zooming a camera around the building all the way to the locations is a waste. It should be faster. It does provide context to the information, but it doesn’t have to be distributed in time. Remove the meaningless and unlabeled dial in the lower right to gain real estate, and replace it with a small version of the map that highlights the area of detail. Since Togusa requested this information, the system should jump here immediately and let him zoom out for more detail only if he wants it or if the system wants him to see suspect information.

The radial graphs

The radial graphs imply some maximum to the data, and that Nakamura’s contingent hits some 75% of it. What happens if the pressure exceeds 37 ticks? Does the floor break? (If so, it should have sent off structural warning alarms at the gate independently of the security question.) But presumably Section 9 is made of stronger stuff than this, and so a different style of diagram is called for. Perhaps remove the dial entirely and just leave the parking spot labels and the weight. Admittedly, the radial dial is unusual and might be there for consistency with other, unseen parts of the system.

Moreover, Togusa is interested in several things: how the data has changed over time, when it surpassed an expected maximum, and by how much. This diagram only addresses one of them, and requires Togusa to notice and remember it himself. A better diagram would trace this pressure reading across time, highlighting the moments when it passed a threshold. (This parallels the issues of medical monitoring highlighted in the book, Chapter 12, Medicine.)


Even better would be to show this data over time alongside or overlaid with any of the other feeds, like a video feed, such that Togusa doesn’t have to make correlations between different feeds in his head. (I’d have added it to the comp but didn’t have source video from the movie.)

The ultimately crappy Section No9 security system

Aside from all these details of the interface and interaction design, I have to marvel at the broader failings of the system. This is meant to be the same bleeding-edge bureau that creates cyborgs and transfers consciousnesses between them? If the security system is recording all of this information, why is it not being analyzed continuously, automatically? We can presume that object recognition is common in the world from a later scene in which a spider tank is able to track Kunasagi. So as the security system was humming along, recording everything, it should have also been analyzing that data, noting the discrepancy between of the number of people it counted in any of the video feeds, the number of people it counted passing through the door, and the unusual weight of these "two" people. It should have sent a warning to security at the gate of the garage, not relied on the happenstance of Togusa’s hunch and good timing.

This points to a larger problem that Hollywood has with technology being part of its stories. It needs heroes to be smart and heroic, and having them simply respond to warnings passed along by smart system can seem pointedly unheroic. But as technology gets smarter and more agentive, these kinds of discrepancies are going to break believability and get embarassing.

Roach Cam


To learn the plans of the President, Zorg’s flunky named Right Arm infiltrates the briefing room via a remote-controlled cockroach. This adorable insect has a small parabolic receiver antenna on its back. Right Arm can watch what it sees with its eyes and listen to what it can hear through its… cerci?


The screen he uses is mostly full of the roachcam video. But it is unfortunately surrounded by some screen-green sciencey nonsense. A row across the top is headed “001” with rectangles labeled “MOVE”, “METHOD”, “CHECK”, and “SYSTEM.” A row just below is headed “A-B” with rectangles labeled “SPEED”, “TIMER”, “EXIT”, and “FILTERS.” A column of screen-green nonsense shapes fills the left side. A small butterfly-shaped graph at the bottom-left is labeled “CHECK.” A small box labeled “CALCULATING” is in the lower middle, which occasionally fills with scrolling nonsense. The right side of the screen is full of a circular graph showing a seizure-inducing flashing pair of green concentric circles. A green 8×8 grid on the right stays empty the whole time, though it is arguably the most likely useful thing, i.e. an overhead view of threats, say, like presidents brandishing cockroach-smashing shoes. Below the unused grid is a diagram of the roach itself, probably useful for understanding the health of the vehicle. Below that is a bit of unintelligible text reversed out of a gray background. When the roach nears the President, a bit of green nonsense text appears overlaid on the video feed, though it never changes.

I think this screen would have been less distracting and more helpful for Right Arm if you stripped away all the gunk at the top, the nonsensical overlay, trashed the column of hastily-drawn icons, saved him the constant distraction of the seizure circle by removing it, and leaving him with the two things that would actually be useful: the map (populated of course with some useful information), and the roach health diagram. Even though this screen is seen only for a few seconds at most, it reads as if it was hastily put together, unlike most other things seen in the film.



He is able to control the roach’s movement by means of a joystick with a rotating head. In contrast to the screen, this provides exactly what Right Arm needs to control the roach, and no more. He can move it forward and back by pushing the joystick forward and back. He can have it strafe right or left by moving the joystick appropriately. And when he wants to have it turn right or left, he can twist the joystick head in that direction. Pushing or twisting farther results in more motion. All told, a perfect input control for the task at hand. At least until you ask the roach.


Profiling “CAT” scan


After her escape from the nucleolab, Leeloo ends up on a thin ledge of a building, unsure where to go or what to do. As a police car hovers nearby, the officers use an onboard computer to try and match her identity against their database. One officer taps a few keys into an unseen keyboard, her photograph is taken, and the results displays in about 8 seconds. Not surprisingly, it fails to find a match, and the user is told so with an unambiguous, red “NO FILE” banner across the screen.


This interface flies by very quickly, so it’s not meant to be read screen by screen. Still, the wireframes present a clear illustration of what the system doing, and what the results are.

The system shouldn’t just provide dead ends like this, though. Any such system has to account for human faces changing over the time since the last capture: aging, plastic surgery, makeup, and disfiguring accidents, to name a few. Since Leeloo isn’t inhuman, it could provide some results of “closest matches,” perhaps with a confidence percentage alongside individual results. Even if the confidence number was very low, that output would help the officers understand it was an issue with the subject, and not an issue of an incomplete database or weak algorithm.

One subtle element is that we don’t see or hear the officer telling the system where the perp is, or pointing a camera. He doesn’t even have to identify her face. It automatically finds her in the camera few, identifies her face, and starts scanning. The sliding green lines tell the officer what it’s finding, giving him confidence in its process, and offering an opportunity to intervene if it’s getting things wrong.

Bridge VP: Mapping

The main interface on the bridge is the volumetric projection display. This device takes up the center of the bridge and is the size of a long billiards table. It serves multiple purposes for the crew. Its later use is to display the real-time map of the alien complex.

Map of the alien complex

The redshirt geologist named Chance in the landing party uses some nifty tools to initiate mapping of the alien complex. The information is sent from these floating sensors back to the ship, which displays the results in real time.

The display of this information is rich with a saturated-color, color-coded, edge-opacity style, leaving outer surfaces rendered in a gossamer cyan, and internal features rendered in an edge-lit green wireframe. In the area above the VP surface, other arbitrary rectangles of data can be summoned for particular tasks, including in-air volumetric keyboards. The flat base of the bridge VP is mirrored, which given the complex 3D nature of the information, causes a bit of visual confusion. (Am I seeing two diamonds reflected or four on two levels?)

Later in the film, Janek tells Ravel to modify the display; specifically, to “strip away the dome” and “isolate that area, bring it up.” He is even to enlarge and rotate the alien spaceship when they find it. Ravel does these modifications this through a touch screen panel at his station, though he routes the results to the “table.” We don’t see the controls in use so can’t evaluate them. But being able to modify displays are one of the ways that people look for patterns and make sense of such information.

A major question about this interface is why this information is not routed back to the people who can use it the most, i.e. the landing party. Chance has to speak to Janek over their intercom and figure out his cardinal directions in one scene. I know they’re redshirts, but they’re already wearing high tech spacesuits. And in the image below we see that this diegesis has handheld volumetric projections. They couldn’t integrate one of those to a sleeve to help life-critical wayfinding?