Dismount Detection, Tracking, and Activity Understanding

 

Homeland security and military applications such as urban area surveillance (UAS) require multiple-sensors to locate, detect, and identify dismounts and deduce adversary intents. UAS is a coupled two-level problem: dismount detection and tracking (lower-level) and dismount intent inference (high level).  For dismount detection and tracking, we have developed a new context-based technique for robust detection and tracking multiple targets (including vehicles and people) from distance.  The tracker can track objects under partial occlusion, complex background with clutters, and under variable illumination conditions (IR and Visible lights).   

 

For dismount activity understanding, we propose to use the Mixed Probabilistic Networks (MPNs).  A MPN is a mixed graph consisting of both directed and undirected links.  MPNs, therefore, represent an extension to the directed graphical models such as Bayesian Networks and HMM as well as to the undirected graphical models such as Markov Random Fields.   Like other probabilistic graphical models, a MPN represents a joint probability distribution among a set of variables. In a MPN, nodes denote random variables, the links among nodes represent the spatial and temporal dependencies among the variables, and the model follows local Markov condition.


We propose a hierarchical framework based on MPNs to model activities, the related actions, and the image observations as well as their relationships.  Besides being able to capture both causal and non-casual relationships, the framework also allows events from different activities to be shared and reused. In addition, we propose to develop advanced machine learning methods to automatically learn both the model structure and parameters.  These will set our approach significantly apart from the current approaches in activity modeling and recognition.

Our research so far has produced software and algorithms that can recognize from low resolution image sequences simple individual actions including walking, running, standing, throwing things, jumping, bending down, getting off cars and getting on cars.  Our next step is to implement the proposed MPN models to recognize more complex high level activities.

 

Below are video demos of some of these algorithms:

1. Activity recognition results on CAVIER dataset

The basic activities include "walking", "resting", "meeting together", "leaving bag" and "browsing". Here, we give example of activity "leaving bag"

                   


2. Activity recognition results on ISL-Activity dataset

The basic activities of ISL-Activity dataset include "walking", "running", "get off car", "enter car", "bend", "look around" and "throw".  Bellowing are the activity recognition result on the testing activity video.



The following demo shows different stages of the activity recognition process, including motion detection, object tracking and recognition.