Real-time processing the information coming form video, infra-red or electro-optical sources is a challenging task due the uncertainties such as noise and clutter, but also due to the large dimensionalities of the problem and the demand for fast and efficient algorithms. This paper details an approach for automatic detection, single and multiple objects identification and tracking in video streams with applications to surveillance, security and autonomous systems. It is based on a method that provides recursive density estimation (RDE) using a Cauchy type of kernel. The main advantage of the RDE approach as compared to other traditional methods (e.g. KDE) is the low computational and memory storage cost since it works on a frame-by-frame basis; the lack of thresholds, and applicability to multiple objects identification and tracking. A robust to noise and clutter technique based on spatial density is also proposed to autonomously identify the targets location in the frame.