My Research Theme [2002〜2004]
Japanese / English



AirGrabber: Virtual Keyboard using a Miniature Camera and a Tilt Sensor




1. Abstruct

The recent developments of computer hardware have stimulated a reduction in the size and weight of personal computers and their peripheral units. While Wearable Computers have been put to practical use, user interfaces are also required for portability in stead of carrying a conventional keyboard to type characters. Some devices have been proposed and provided for this demand. However, they are hard to use because of their restrictions for functionalities and circumstances to use or extremely special operational method.

This paper presents AirGrabber a virtual keyboard suitable for Wearable Computers, which can enter characters by moving fingers like typing an existing keyboard in the air. And our experimental results show the effective performance in terms of actual typing situations.

AirGrabber detects key strokes and selected keys by monitoring the motions of the fingers and hand by a miniature camera and a tilt sensor equipped on the user's wrist. Therefore, this system is easy to carry and enables the user to enter characters intuitively.



2. AirGrabber

2.1 Character input by virtual typing

Character input interfaces for Wearable Computers should restrict the body as less as possible not to interrupt other work other than a character input operation.

In order to realize character input by typing movement in the air without a physical keyboard, virtual keyboard system must recognize what key is chosen and typed. So this system divides these recongitions into following two distinct process.

i. What keys are available to type.

ii. Was the key typed or not typed.

First, the candidates of the keys which can be typed change by where the hand exists on the keyboard. Therefore it can be referred to as matching a motion of an actual hand with position change of the hand on a virtual keyboard to recognize the above-mentioned i.

Next, above-mentioned ii. is recognizemd by sensing movements of fingertips. This sensing should be executed without restraining the body as much as possible.

AirGrabber装着図

In order to make the minimum composition the equipments needed for distinction processing of above-mentioned i. and ii., the technique proposed by this research makes use of a miniature camera and a tilt sensor. These equipments are had on a wrist.

According to the output of the tilt sensor, the positional relationship between a hand and a virtual keyboard is matched, and motions of the fingertips are constantly caught from the images taken by the miniature camera. This composition enables us to input keys by moving hand and fingers like typing an existing keyboard in the air. Additionally, this composition also enables us to shift to other works other than a key input operation easily because equipments are collected around only a wrist.


2.2 Recognition of the keys which can be typed

The tilt sensor of three axes recognizes tilt by front and rear, right and left of an arm. The degree of the tilt is considered as an amount of movements of the hand from a home position to choose the keys which can be typed. Concretely speaking, start with an assumption that a home position is made into the center of a virtual keyboard. If the wrist is leaned forward or backward, the above or bottom region of the keyboard will be chosen, and if the wrist is leaned to right or left, the right side or left side region of the keyboard will be chosen.

キー候補選択の動き

2.3 Recognition of typing movement

Typing movement is recognized through the following process by analyzing the images obtained from the miniature camera equipped under a wrist.

  1. Extraction of hand region
  2. Extraction of fingertips
  3. Detections of the positions of each fingertip
  4. Recognition of typing movement

The miniature camera has a built-in fish-eye lens to bring all fingers into view. In addition, there is an infrared light source to illuminate a hand below the miniature camera, and the miniature camera also has an infrared pass filter. Therefore this system have only to manipulate threshold image processing to extract hand region illuminated by infrared light. Next, this system manipulates labeling processing to divide hand region by considering the widest region as a hand region.

Below is a process flowchart to extract a hand region and detect fingertips.

手領域を抽出し指先を検出するための処理の流れ
指先検出
process flowchart to extract a hand region and detect fingertips
image of detection of fingertips


Below is a process flowchart to recognize typing movement from the motions of fingers.

指先の動きからタイピング動作を認識するための処理の流れ
人差し指のタイピング動作イメージ
process flowchart to recognize typing movement from the motions of fingers
image of key stroke of forefinger

3. Prototype system of AirGrabber

We made a prototype system of AirGrabber. The system consists of some parts below indicates. A notebook computer (CPU: Pentium Celeron 1GHz, Memory: 256MB, OS: WindowsXP) controls the miniature camera and the tilt sensor. The size of captured image is 180*120. A rate of system operation is 7 to 10 fps.

試作システム構成


Below is a screenshot. We can input characters in the any text field of Windows environment.

試作システム動作画面

4. Evaluation experiments and result

We conducted the following three experiments. We got 10 trial subjects to use the prototype system of AirGrabber.

  1. Accuracy evaluation of key selection operation on a virtual keyboard
  2. Accuracy evaluation of typing movement recognition
  3. Evaluation by the text input

These experimental results show that users could understand the fundamental operation method of AirGrabber easily and they could input characters using the trial production system. Therefore we could receive confirmation that AirGrabber has enough ability to input characters in the environment of Wearable Computer.


5. Summary

We have proposed and made AirGrabber system that enables us to input characters by moving fingers like typing an existing keyboard in the air, and implemented and conducted experiments to show the effective performance in terms of actual typing situations.






Back to Top Page




...