Publications - AImageLab

A Multi-Camera Vision System for Fall Detection and Alarm Generation

Authors: Cucchiara, Rita; Prati, Andrea; Vezzani, Roberto

Published in: EXPERT SYSTEMS

In-house video surveillance can represent an excellent support for people with some difficulties (e.g. elderly or disabled people) living alone and with a limited autonomy. New hardware technologies and in particular digital cameras are now affordable and they have recently gained credit as tools for (semi-)automatically assuring people's safety. In this paper a multi-camera vision system for detecting and tracking people and recognizing dangerous behaviours and events such as a fall is presented. In such a situation a suitable alarm can be sent, e.g. by means of an SMS. A novel technique of warping people's silhouette is proposed to exchange visual information between partially overlapped cameras whenever a camera handover occurs. Finally, a multi-client and multi-threaded transcoding video server delivers live video streams to operators/remote users in order to check the validity of a received alarm. Semantic and event-based transcoding algorithms are used to optimize the bandwidth usage. A two-room setup has been created in our laboratory to test the performance of the overall system and some of the results obtained are reported.

2007 Journal article

Compressed Domain Features Extraction for Shot Characterization

Authors: Grana, Costantino; Vezzani, Roberto; Borghesani, Daniele; Cucchiara, Rita

Published in: CEUR WORKSHOP PROCEEDINGS

In this work, we propose a system for shot comparison that works directly on the MPEG-1 stream in the compressed domain, extracting color, texture and motion features from all frames at a reasonable computational cost, with results comparable to those obtained on uncompressed keyframes. In particular, a summary descriptor for each Group Of Pictures (GOP) is computed and employed for shot characterization and comparison. The Mallows distance allows clips of different lengths to be matched in a unified framework.
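To illustrate why the Mallows distance can compare clips of different lengths, the sketch below computes it for the simplified case where each clip is reduced to one scalar value per GOP. This is an assumption made for illustration only: the abstract does not specify the descriptor layout, and the function name `mallows_distance_1d` and the order `p` are hypothetical. In one dimension the distance reduces to comparing the quantile functions of the two empirical distributions.

```python
def mallows_distance_1d(xs, ys, p=2):
    """Mallows (Wasserstein) distance of order p between two 1-D samples.

    Each sample is treated as an empirical distribution with uniform
    weights, so the two samples may have different lengths.
    """
    a, b = sorted(xs), sorted(ys)
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        raise ValueError("both samples must be non-empty")

    # Walk both quantile functions on a common integer grid of n*m cells
    # to avoid floating-point comparisons of the breakpoints i/n and j/m.
    i = j = 0
    pos = 0
    total = 0.0
    while i < n and j < m:
        end_a = (i + 1) * m   # where a[i]'s quantile step ends
        end_b = (j + 1) * n   # where b[j]'s quantile step ends
        nxt = min(end_a, end_b)
        total += (nxt - pos) * abs(a[i] - b[j]) ** p
        pos = nxt
        if end_a == nxt:
            i += 1
        if end_b == nxt:
            j += 1
    return (total / (n * m)) ** (1.0 / p)
```

Two clips whose per-GOP values follow the same distribution get distance zero even when one clip has twice as many GOPs. Multi-dimensional descriptors would require solving a transportation problem instead of this sorting shortcut.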

2007 Conference proceedings paper

Enhancing HSV Histograms with Achromatic Points Detection for Video Retrieval

Authors: Grana, Costantino; Vezzani, Roberto; Cucchiara, Rita

Color is one of the most meaningful features used in content based retrieval of visual data. In video content based retrieval, color features computed on selected frames are integrated with other low-level features concerning texture, shape and motion in order to find clip similarities. For example, the Scalable Color feature defined in the MPEG-7 standard exploits HSV histograms to create color feature vectors. HSV is a widely adopted space in image and video retrieval, but its quantization for histogram generation can create misleading errors in classification of achromatic and low saturated colors. In this paper we propose an Enhanced HSV Histogram with achromatic point detection based on a single Hue and Saturation parameter that can correct this limitation. The enhanced histograms have proven to be effective in color analysis and they have been used in a system for automatic clip annotation called PEANO, where pictorial concepts are extracted by a clip clustering and used for similarity based automatic annotation.
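The quantization problem described above can be illustrated with a minimal sketch. It is not the authors' method: the abstract only states that achromatic points are detected from a single Hue and Saturation parameter, so the rule used here (a pixel is achromatic when its saturation or value falls below one hypothetical `threshold`) and all bin counts are assumptions. Achromatic pixels are counted in separate gray-level bins instead of being assigned to an arbitrary hue bin.

```python
import colorsys


def enhanced_hsv_histogram(pixels, h_bins=8, s_bins=4, v_bins=4,
                           gray_bins=4, threshold=0.2):
    """Return an HSV histogram followed by separate achromatic bins.

    pixels: iterable of (r, g, b) tuples with components in 0..255.
    threshold: hypothetical cut-off below which saturation or value
               marks a pixel as achromatic.
    """
    chromatic = [0] * (h_bins * s_bins * v_bins)
    achromatic = [0] * gray_bins
    for r, g, b in pixels:
        h, s, v = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
        if s < threshold or v < threshold:
            # Hue is unreliable here, so bin by brightness only.
            k = min(int(v * gray_bins), gray_bins - 1)
            achromatic[k] += 1
        else:
            hi = min(int(h * h_bins), h_bins - 1)
            si = min(int(s * s_bins), s_bins - 1)
            vi = min(int(v * v_bins), v_bins - 1)
            chromatic[(hi * s_bins + si) * v_bins + vi] += 1
    return chromatic + achromatic
```

With a plain HSV histogram, a mid-gray pixel has an undefined hue and would land in the first hue bin alongside reds; here it is counted in the gray bins at the end of the vector.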

2007 Conference proceedings paper

Prototypes Selection with Context Based Intra-class Clustering for Video Annotation with Mpeg7 Features

Authors: Grana, Costantino; Vezzani, Roberto; Cucchiara, Rita

Published in: LECTURE NOTES IN COMPUTER SCIENCE

In this work, we analyze the effectiveness of perceptual features to automatically annotate video clips in domain-specific video digital libraries. Typically, automatic annotation is provided by computing clip similarity with respect to given examples, which constitute the knowledge base, in accordance with a given ontology or a classification scheme. Since the number of training clips is normally very large, we propose to automatically extract some prototypes, or visual concepts, for each class instead of using the whole knowledge base. The prototypes are generated after a Complete Link clustering based on perceptual features with an automatic selection of the number of clusters. Context-based information is used in an intra-class clustering framework to select more discriminative clips. Reducing the number of samples makes the matching process faster and lessens the storage requirements. Clips are annotated following the MPEG-7 directives to provide easier portability. Results are provided on videos taken from sports and news digital libraries.
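The prototype extraction step can be sketched as follows, under stated assumptions. Complete Link clustering merges the two clusters whose farthest members are closest. The abstract does not say how the number of clusters is selected or how a prototype is chosen, so the stopping rule (a hypothetical `max_diameter`) and the use of the cluster medoid as prototype are illustrative choices, and the context-based intra-class step is not modeled.

```python
def complete_link_clusters(points, dist, max_diameter):
    """Agglomerative Complete Link clustering over one class.

    Merging stops when the closest pair of clusters would exceed
    max_diameter (an assumed stopping rule). Returns lists of indices.
    """
    clusters = [[i] for i in range(len(points))]

    def linkage(c1, c2):
        # Complete link: distance between the farthest members.
        return max(dist(points[i], points[j]) for i in c1 for j in c2)

    while len(clusters) > 1:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = linkage(clusters[a], clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        if d > max_diameter:
            break
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return clusters


def select_prototypes(points, dist, clusters):
    """Pick the medoid of each cluster as its prototype (an assumption)."""
    prototypes = []
    for c in clusters:
        medoid = min(c, key=lambda i: sum(dist(points[i], points[j]) for j in c))
        prototypes.append(points[medoid])
    return prototypes
```

Matching a new clip against the prototypes only, rather than against every training clip, is what reduces matching time and storage as the abstract notes.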

2007 Conference proceedings paper

Semi-automatic Video Digital Library Annotation Tools

Authors: Cucchiara, Rita; Grana, Costantino; Vezzani, Roberto

In this work, we present a general purpose system for hierarchical structural segmentation and automatic annotation of video clips, by means of standardized low level features. We propose to automatically extract some prototypes for each class with a context based intra-class clustering. Clips are annotated following the MPEG-7 standard directives to provide easier portability. Results of automatic annotation and semi-automatic metadata creation are provided.

2007 Conference proceedings paper

Sports Video Annotation Using Enhanced HSV Histograms in Multimedia Ontologies

Authors: M., Bertini; A., Del Bimbo; C., Torniai; Grana, Costantino; Vezzani, Roberto; Cucchiara, Rita

This paper presents multimedia ontologies, where multimedia data and traditional textual ontologies are merged. A solution for their implementation for the soccer video domain and a method to perform automatic soccer video annotation using these extended ontologies are shown. HSV is a widely adopted space in image and video retrieval, but its quantization for histogram generation can create misleading errors in classification of achromatic and low saturated colors. In this paper we propose an Enhanced HSV Histogram with achromatic point detection based on a single Hue and Saturation parameter that can correct this limitation. The more general concepts of the sport domain (e.g. play/break, crowd, etc.) are put in correspondence with the more general visual features of the video like color and texture, while the more specific concepts of the soccer domain (e.g. highlights such as attack actions) are put in correspondence with domain specific visual features like the soccer playfield and the players. Experimental results for annotation of soccer videos using generic concepts are presented.

2007 Conference proceedings paper

Using a Wireless Sensor Network to Enhance Video Surveillance

Authors: Cucchiara, Rita; Prati, Andrea; Vezzani, Roberto; L., Benini; E., Farella; P., Zappi

Published in: JOURNAL OF UBIQUITOUS COMPUTING AND INTELLIGENCE

To enhance video surveillance systems, multi-modal sensor integration can be a successful strategy. In this work, a computer vision system able to detect and track people from multiple cameras is integrated with a wireless sensor network mounting passive Pyroelectric InfraRed sensors. The two subsystems are briefly described and possible cases in which computer vision algorithms are likely to fail are discussed. Then, simple but reliable outputs from the sensor nodes are exploited to improve the accuracy of the vision system. In particular, two case studies are reported: the first uses the presence detection of sensors to disambiguate between an open door and a moving person, while the second handles motion direction changes during occlusions. Preliminary results are reported and demonstrate the usefulness of the integration of the two subsystems.

2007 Journal article

Visor: Video Surveillance Online Repository

Authors: Vezzani, Roberto; Cucchiara, Rita

The aim of the Visor Project [1] is to gather and make freely available a repository of surveillance and video footage for the research community on pattern recognition and multimedia retrieval. The goal is to create an open forum and a free repository to exchange, compare and discuss results of many problems in video surveillance and retrieval. Together with the videos, the repository contains metadata annotation, both manually annotated as ground-truth and automatically obtained by video surveillance systems. Annotation refers to a large ontology of concepts on surveillance and security-related objects and events. The ontology has been defined including concepts from LSCOM and MediaMill ontologies. As well as videos and annotations, Visor provides tools for enriching the ontology, annotating new videos, searching by textual queries, composing and downloading videos.

2007 Conference proceedings paper

3-D Virtual Environments on Mobile Devices for Remote Surveillance

Authors: Vezzani, Roberto; Cucchiara, Rita; A., Malizia; L., Cinque

In this paper we present a distributed video surveillance framework. Our aim is the remote monitoring of the behavior of people moving in a scene, exploiting a virtual reconstruction on low-capability devices, like PDAs and cell phones. The main novelty of this system is the effective integration of the computer vision and computer graphics modules. The first, using a probabilistic framework, can detect the position, the trajectory and the posture of people moving in the scene. The second exploits both the standard 3D graphics libraries newly available on mobile devices (namely JSR184 and the M3G graphic format) and the processing capability of new PDAs in order to reconstruct the remote surveillance data in real-time.

2006 Conference proceedings paper

A Distributed Domotic Surveillance System

Authors: Cucchiara, Rita; Grana, Costantino; Prati, Andrea; Vezzani, Roberto

Distributed video surveillance has a direct application in intelligent home automation or domotics (from the Latin word domus, which means “home”, and informatics); in particular, in-house video surveillance can provide good support for people with some difficulties (e.g., elderly or disabled people) living alone and with a limited autonomy. New hardware technologies for surveillance are now affordable and provide high reliability. Problems related to reliable software solutions are not completely solved, especially concerning the application of general-purpose computer vision techniques in indoor environments. Indeed, assuming the objective is to detect the presence of people, track them, and recognize dangerous behaviours by means of abrupt changes in their posture, robust techniques must cope with non-trivial difficulties. In particular, luminance changes and shadows must be taken into account, frequent posture changes must be faced, and large and long-lasting occlusions are common due to the vicinity of the cameras and the presence of furniture and doors that can often hide parts of the person’s body. These problems are analyzed and solutions based on background suppression, appearance-based probabilistic tracking, and probabilistic reasoning for posture recognition are described.

2006 Book chapter/Essay
