HOME PUBLICATIONS SMART SURVEILLANCE 3D FACE RECOTNIGION 3D OBJECT RECOGNITION 3D MODELING INTERNET KEY EXCHANGE PICTURES

3D Model-based Object Recognition and Segmentation in Cluttered Scenes


Abstract
Viewpoint independent recognition of free-form objects and their segmentation in the presence of clutter and occlusions is a challenging task. We present a novel 3D model-based algorithm which performs this task automatically and efficiently. A 3D model of an object is automatically constructed offline from its multiple unordered range images (views). These views are converted into multidimensional table representations (which we refer to as tensors). Correspondences are automatically established between these views by simultaneously matching the tensors of a view with those of the remaining views using a hash table based voting scheme. This results in a graph of relative transformations used to register the views before they are integrated into a seamless 3D model. These models and their tensor representations constitute the model library. During online recognition, a tensor from the scene is simultaneously matched with those in the library by casting votes. Similarity measures are calculated for the model tensors which receive the most votes. The model with the highest similarity is transformed to the scene and if it aligns accurately with an object in the scene, that object is declared as recognized and is segmented. This process is repeated until the scene is completely segmented. Experiments were performed on real and synthetic data comprised of 55 models and 610 scenes and an overall recognition rate of 95 percent was achieved. Comparison with the spin images revealed that our algorithm is superior in terms of recognition rate and efficiency.

Ajmal S. Mian, M. Bennamoun and R. Owens, "3D Model-based Object Recognition and Segmentation in Cluttered Scenes", to appear in IEEE Transactions in Pattern Analysis and Machine Intelligence (PAMI), 2006. [pdfCopyright © IEEE.

The data we used in our experiments is also available for download below. Please cite the above paper if you use the data.

3D Models in PLY format

Chef [download 3.6MB]    Parasaurolophus [download 3.6MB]    T-rex [download 3.5MB]    Chicken [download 2.7MB]    Rhino [download 1.6MB]

The above objects were placed in a scene causing occlusions and clutter. The scene was then scanned with the Minolta Vivid 910 scanner to get a 2.5D view of the scene. The scenes are available for download below. File sizes vary from 1.5 to 2.2 MB.

2.5D Scenes in PLY format (zip files)
scene1 scene2  scene3 scene4 scene5 scene6 scene7 scene8 scene9 scene10
scene11 scene12 scene13 scene14 scene15 scene16 scene17 scene18 scene19 scene20
scene21 scene22 scene23 scene24 scene25 scene26 scene27 scene28 scene29 scene30
scene31 scene32 scene33 scene34 scene35 scene36 scene37 scene38 scene39 scene40
scene41 scene42 scene43 scene44 scene45 scene46 scene47 scene48 scene49 scene50