Publications by Roberto Vezzani

Explore our research publications: papers, articles, and conference proceedings from AImageLab.

Tip: type @ to pick an author and # to pick a keyword.

Active filters (Clear): Author: Roberto Vezzani

Benchmarking for Person Re-identification

Authors: Vezzani, Roberto; Cucchiara, Rita

Published in: ADVANCES IN COMPUTER VISION AND PATTERN RECOGNITION

The evaluation of computer vision and pattern recognition systems is usually a burdensome and time-consuming activity. In this chapter all … (Read full abstract)

The evaluation of computer vision and pattern recognition systems is usually a burdensome and time-consuming activity. In this chapter all the benchmarks publicly available for re-identification will be reviewed and compared, starting from the ancestors VIPeR and Caviar to the most recent datasets for 3D modeling such as SARC3d (with calibrated cameras) and RGBD-ID (with range sensors). Specific requirements and constraints are highlighted and reported for each of the described collections. In addition, details on the metrics that are mostly used to test and evaluate the re-identification systems are provided.

2014 Capitolo/Saggio

Detection of static groups and crowds gathered in open spaces by texture classification

Authors: Manfredi, Marco; Vezzani, Roberto; Calderara, Simone; Cucchiara, Rita

Published in: PATTERN RECOGNITION LETTERS

A surveillance system specifically developed to manage crowded scenes is described in this paper. In particular we focused on static … (Read full abstract)

A surveillance system specifically developed to manage crowded scenes is described in this paper. In particular we focused on static crowds, composed by groups of people gathered and stayed in the same place for a while. The detection and spatial localization of static crowd situations is performed by means of a One Class Support Vector Machine, working on texture features extracted at patch level. Spatial regions containing crowds are identified and filtered using motion information to prevent noise and false alarms due to moving flows of people. By means of one class classification and inner texture descriptors, we are able to obtain, from a single training set, a sufficiently general crowd model that can be used for all the scenarios that shares a similar viewpoint. Tests on public datasets and real setups validate the proposed system.

2014 Articolo su rivista

Welcome message from the technical program committee chairs

Authors: Micheloni, C.; Velipasalar, S.; Vezzani, R.

2014 Relazione in Atti di Convegno

Editorial to the 'pattern recognition and artificial intelligence for human behaviour analysis' special section

Authors: Iocchi, L.; Prati, A.; Vezzani, R.

Published in: EXPERT SYSTEMS

2013 Articolo su rivista

Human Behavior Understanding with Wide Area Sensing Floors

Authors: Lombardi, Martino; Pieracci, Augusto; Santinelli, Paolo; Vezzani, Roberto; Cucchiara, Rita

Published in: LECTURE NOTES IN COMPUTER SCIENCE

The research on innovative and natural interfaces aims at developing devices able to capture and understand the human behavior without … (Read full abstract)

The research on innovative and natural interfaces aims at developing devices able to capture and understand the human behavior without the need of a direct interaction. In this paper we propose and describe a framework based on a sensing floor device. The pressure field generated by people or objects standing on the floor is captured and analyzed. Local and global features are computed by a low level processing unit and sent to high level interfaces. The framework can be used in different applications, such as entertainment, education or surveillance. A detailed description of the sensing element and the processing architectures is provided, together with some sample applications developed to test the device capabilities.

2013 Relazione in Atti di Convegno

Intelligent video surveillance as a service

Authors: Prati, A.; Vezzani, R.; Fornaciari, M.; Cucchiara, R.

Nowadays, intelligent video surveillance has become an essential tool of the greatest importance for several security-related applications. With the growth … (Read full abstract)

Nowadays, intelligent video surveillance has become an essential tool of the greatest importance for several security-related applications. With the growth of installed cameras and the increasing complexity of required algorithms, in-house self-contained video surveillance systems become a chimera for most institutions and (small) companies. The paradigm of Video Surveillance as a Service (VSaaS) helps distributing not only storage space in the cloud (necessary for handling large amounts of video data), but also infrastructures and computational power. This chapter will briefly introduce the motivations and the main characteristics of a VSaaS system, providing a case study where research-lab computer vision algorithms are integrated in a VSaaS platform. The lessons learnt and some future directions on this topic will be also highlighted.

2013 Capitolo/Saggio

Learning articulated body models for people re-identification

Authors: Baltieri, Davide; Vezzani, Roberto; Cucchiara, Rita

People re-identification is a challenging problem in surveillance and forensics and it aims at associating multiple instances of the same … (Read full abstract)

People re-identification is a challenging problem in surveillance and forensics and it aims at associating multiple instances of the same person which have been acquired from different points of view and after a temporal gap. Image-based appearance features are usually adopted but, in addition to their intrinsically low discriminability, they are subject to perspective and view-point issues. We propose to completely change the approach by mapping local descriptors extracted from RGB-D sensors on a 3D body model for creating a view-independent signature. An original bone-wise color descriptor is generated and reduced with PCA to compute the person signature. The virtual bone set used to map appearance features is learned using a recursive splitting approach. Finally, people matching for re-identification is performed using the Relaxed Pairwise Metric Learning, which simultaneously provides feature reduction and weighting. Experiments on a specific dataset created with the Microsoft Kinect sensor and the OpenNi libraries prove the advantages of the proposed technique with respect to state of the art methods based on 2D or non-articulated 3D body models.

2013 Relazione in Atti di Convegno

People reidentification in surveillance and forensics: a Survey

Authors: Vezzani, Roberto; Baltieri, Davide; Cucchiara, Rita

Published in: ACM COMPUTING SURVEYS

The field of surveillance and forensics research is currently shifting focus and is now showing an ever increasing interest in … (Read full abstract)

The field of surveillance and forensics research is currently shifting focus and is now showing an ever increasing interest in the task of people reidentification. This is the task of assigning the same identifier to all instances of a particular individual captured in a series of images or videos, even after the occurrence of significant gaps over time or space. People reidentification can be a useful tool for people analysis in security as a data association method for long-term tracking in surveillance. However, current identification techniques being utilized present many difficulties and shortcomings. For instance, they rely solely on the exploitation of visual cues such as color, texture, and the object's shape. Despite the many advances in this field, reidentification is still an open problem. This survey aims to tackle all the issues and challenging aspects of people reidentification while simultaneously describing the previously proposed solutions for the encountered problems. This begins with the first attempts of holistic descriptors and progresses to the more recently adopted 2D and 3D model-based approaches. The survey also includes an exhaustive treatise of all the aspects of people reidentification, including available datasets, evaluation metrics, and benchmarking.

2013 Articolo su rivista

Sensing floors for privacy-compliant surveillance of wide areas

Authors: Lombardi, Martino; Pieracci, Augusto; Santinelli, Paolo; Vezzani, Roberto; Cucchiara, Rita

Surveillance systems can really benefit from the integration of multiple and heterogeneous sensors. In this paper we describe an innovative … (Read full abstract)

Surveillance systems can really benefit from the integration of multiple and heterogeneous sensors. In this paper we describe an innovative sensing floor. Thanks to its low cost and ease of installation, the floor is suitable for both private and public environments, from narrow zones to wide areas. The floor is made adding a sensing layer below commercial floating tiles. The sensor is scalable, reliable, and completely invisible to the users. The temporal and spatial resolutions of the data are high enough to identify the presence of people, to recognize their behavior and to detect events in a privacy compliant way. Experimental results on a real prototype implementation confirm the potentiality of the framework.

2013 Relazione in Atti di Convegno

Video surveillance online repository (ViSOR)

Authors: Vezzani, Roberto; Cucchiara, Rita

This paper describe the ViSOR (Video Surveillance Online Repository) repository, designed with the aim of establishing an open platform for … (Read full abstract)

This paper describe the ViSOR (Video Surveillance Online Repository) repository, designed with the aim of establishing an open platform for collecting, annotating, retrieving, and sharing surveillance videos, as well as evaluating the performance of automatic surveillance systems. The repository is free and researchers can collaborate sharing their own videos or datasets. Most of the included videos are annotated. Annotations are based on a reference ontology which has been defined integrating hundreds of concepts, some of them coming from the LSCOM and MediaMill ontologies. A new annotation classification schema is also provided, which is aimed at identifying the spatial, temporal and domain detail level used. The web interface allows video browsing, querying by annotated concepts or by keywords, compressed video previewing, media downloading and uploading. Finally, ViSOR includes a performance evaluation desk which can be used to compare different annotations.

2013 Relazione in Atti di Convegno

Page 7 of 13 • Total publications: 124