Publications by Rita Cucchiara

Explore our research publications: papers, articles, and conference proceedings from AImageLab.

Tip: type @ to pick an author and # to pick a keyword.

Active filters (Clear): Author: Rita Cucchiara

Towards Artistic Collections Navigation Tools based on Relevance Feedback

Authors: Borghesani, Daniele; Grana, Costantino; Cucchiara, Rita

Published in: COMMUNICATIONS IN COMPUTER AND INFORMATION SCIENCE

Artistic image collections are usually managed via textual metadata into standard content management systems. More sophisticated searches can be performed … (Read full abstract)

Artistic image collections are usually managed via textual metadata into standard content management systems. More sophisticated searches can be performed using image retrieval technologies based on visual content. Nevertheless, the problem of the information presentation remains. In this paper we try to move beyond the classic grid-styled presentation model, suggesting a novel use of relevance feedback as a navigation tool. Relevance feedback is therefore used to warp the view and allow the user to spatially navigate the image collection, and at the same time focus on his retrieval aim. This is obtained exploiting a distance based space warping on the 2D projection of the distance matrix. Multitouch gestures are employed to provide feedbacks by natural interaction with the system.

2012 Relazione in Atti di Convegno

Understanding dyadic interactions applying proxemic theory on videosurveillance trajectories

Authors: Calderara, Simone; Cucchiara, Rita

Published in: IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS

Understanding social and collective people behaviour in open spaces is one of the frontier of modern video surveillance. Many sociological … (Read full abstract)

Understanding social and collective people behaviour in open spaces is one of the frontier of modern video surveillance. Many sociological theories, and proxemics in particular, have been proved their validity as a support for classifying and interpreting human behaviour. Proxemics suggest some simple but effective behavioural rules, useful to understand what people are doing and their social involvement with other individuals. In this paper we propose to extend the proxemics analysis along the time and provide a solution for analysing sequences of proxemic states computed between trajectories of people pairs (dyads). Trajectories, computed from videosurveillance videos, are first analysed and converted to a sequence of symbols according to proxemic theory. Then an elastic measure for comparing those sequences is introduced. Finally, interactions are classified both in an off-line unsupervised way and in an on-line fashion. Results on videosurveillance data, demonstrate that sequences of proxemic states can be effective in characterizing mutual interactions and experiments in capturing the most frequent dyads interactions and on-line classifying them when a labelled training set is available are proposed.

2012 Relazione in Atti di Convegno

Veiling Luminance estimation on FPGA-based embedded smart camera

Authors: Grana, Costantino; Borghesani, Daniele; Santinelli, Paolo; Cucchiara, Rita

This paper describes the design and development of a Veiling Luminance estimation system based on the use of a CMOS … (Read full abstract)

This paper describes the design and development of a Veiling Luminance estimation system based on the use of a CMOS image sensor, fully implemented on FPGA. The system is composed of the CMOS Image sensor, FPGA, DDR SDRAM, USB controller and SPI (Serial Peripheral Interface) Flash. The FPGA is used to build a system-on-chip integrating a soft processor (Xilinx MicroBlaze) and all the hardware blocks needed to handle the external peripherals and memory. The soft processor is used to handle image acquisition and all computational tasks need to compute the Veiling Luminance value. The advantages of this single chip FPGA implementation include the reduction of the hardware requirements, power consumption, and system complexity. The problem of the high dynamic range images have been addressed with multiple acquisitions at different exposure times. Vignetting, radial distortion and angular weighting, as required by veiling luminance definition, are handled by a single integer look-up table (LUT) access. Results are compared with a state of the art certified instrument.

2012 Relazione in Atti di Convegno

3DPes: 3D People Dataset for Surveillance and Forensics

Authors: Baltieri, Davide; Vezzani, Roberto; Cucchiara, Rita

The interest of the research community in creating reference datasets for performance analysis is always very high. Although new datasets, … (Read full abstract)

The interest of the research community in creating reference datasets for performance analysis is always very high. Although new datasets, collecting large amounts of video footage are spreading in surveillance and forensics, few bench-marks with annotation data are available for testing specific tasks and especially for 3D/multi-view analysis. In this paper we present 3DPeS, a new dataset for 3D/multi- view surveillance and forensic applications. This has been designed for discussing and evaluating research results in people re-identification and other related activities (people detection, people segmentation and people tracking). The new assessed version of the dataset contains hundreds of video sequences of 200 people taken from a multi-camera distributed surveillance system over several days, with different light conditions; each person is detected multiple times and from different points of view. In surveillance scenarios, the dataset can be exploited to evaluate people reacquisition, 3D body models and people activity reconstruction algorithms. In forensics it can be adopted too, by relaxing some constraints (e.g. real time) and neglecting some information (e.g. calibration). Some results on this new dataset are presented using state of the art methods for people re-identification as a benchmark for future comparisons.

2011 Relazione in Atti di Convegno

A low-cost system and calibration method for veiling luminance measurement

Authors: Cattini, Stefano; Grana, Costantino; Cucchiara, Rita; Rovati, Luigi

Published in: CONFERENCE PROCEEDINGS - IEEE INSTRUMENTATION/MEASUREMENT TECHNOLOGY CONFERENCE

A CCD-based measuring instrument aimed at the veiling luminance estimation and the relative low-cost calibration method are described. The system … (Read full abstract)

A CCD-based measuring instrument aimed at the veiling luminance estimation and the relative low-cost calibration method are described. The system may allow the estimation of the optimum luminance levels in road-tunnels lighting, thus both increasing the drivers safety and avoiding energy wasting hence unjustified higher lighting-costs.

2011 Relazione in Atti di Convegno

A multi-stage pedestrian detection using monolithic classifiers

Authors: Gualdi, G.; Prati, A.; Cucchiara, R.

Despite the many efforts in finding effective feature sets or accurate classifiers for people detection, few works have addressed ways … (Read full abstract)

Despite the many efforts in finding effective feature sets or accurate classifiers for people detection, few works have addressed ways for reducing the computational burden introduced by the sliding window paradigm. This paper proposes a multi-stage procedure for refining the search for pedestrians using the HOG features and the monolithic SVM classifier. The multi-stage procedure is based on particle-based estimation of pdfs and exploits the margin provided by the classifier to draw more particles on the areas where the classifier's response is higher. This iterative algorithm achieves the same accuracy than sliding window using less particles (and thus being more efficient) and, conversely, is more accurate when configured to work at the same computational load. Experimental results on publicly available datasets demonstrate that this method, previously proposed for boosted classifiers only, can be successfully applied to monolithic classifiers. © 2011 IEEE.

2011 Relazione in Atti di Convegno

A Real-Time Embedded Solution for Skew Correction in Banknote Analysis

Authors: Rashid, Adnan; Prati, Andrea; Cucchiara, Rita

Several industrial applications do require embedded solutionsboth for compacting the hardware occupation and reducing energy consumption, and for achieving high … (Read full abstract)

Several industrial applications do require embedded solutionsboth for compacting the hardware occupation and reducing energy consumption, and for achieving high speed performance. This paper presents a computer vision system developed for correcting image skew in applications for banknote analysis and classification. The system must be very efficient and run on a fixed-point DSP with limited computational resources. Consequently, we propose three innovative improvements to basic and general-purpose image processing techniques that can be helpful in other computer vision applications on embedded devices. In particular, we address: a) an efficient labeling with an unionfind approach for hole filling, b) a fast Hough transform implementation, and c) a very high-speed estimation of affinetransformation for skew correction. The reported results demonstrate both the accuracy and the efficiency of the system,also in presence of severe skew. In terms of efficiency, the computational time is reduced of about two orders of magnitude.

2011 Relazione in Atti di Convegno

A Reasoning Engine for Intruders' Localization in Wide Open Areas using a Network of Cameras and RFIDs

Authors: Cucchiara, Rita; Fornaciari, Michele; Haider, Razia; Mandreoli, Federica; Martoglia, Riccardo; Prati, Andrea; Sassatelli, Simona

Published in: IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS

Wide open areas represent challenging scenarios forsurveillance systems, since sensory data can be affected bynoise, uncertainty, and distractors. Therefore, the … (Read full abstract)

Wide open areas represent challenging scenarios forsurveillance systems, since sensory data can be affected bynoise, uncertainty, and distractors. Therefore, the tasks oflocalizing and identifying targets (e.g., people) in such environmentssuggest to go beyond the use of camera-only deployments.In this paper, we propose an innovative systemrelying on the joint use of cameras and RFIDs, allowing usto “map” RFID tags to people detected by cameras and,thus, highlighting potential intruders. To this end, sophisticatedfiltering techniques preserve the uncertainty of dataand overcome the heterogeneity of sensors, while an evidentialfusion architecture, based on Transferable Belief Model,combines the two sources of information and manages conflictbetween them. The conducted experimental evaluationshows very promising results.

2011 Relazione in Atti di Convegno

An evidential fusion architecture for people surveillance in wide open areas

Authors: Fornaciari, M.; Sottara, D.; Prati, A.; Mello, P.; Cucchiara, R.

Published in: LECTURE NOTES IN ARTIFICIAL INTELLIGENCE

A new evidential fusion architecture is proposed to build anhybrid articial intelligent system for people surveillance in wide open areas. … (Read full abstract)

A new evidential fusion architecture is proposed to build anhybrid articial intelligent system for people surveillance in wide open areas. Authorized people and intruders are identied and localized thanks to the joint employment of cameras and RFID tags. Complex Event Processing and Transferable Belief Model are exploited for handling noisy data and uncertainty propagation. Experimental results on complex synthetic scenarios demonstrate the accuracy of the proposed solution.

2011 Relazione in Atti di Convegno

Appearance tracking by transduction in surveillance scenarios

Authors: Coppi, Dalia; Calderara, Simone; Cucchiara, Rita

We propose a formulation of people tracking problem as a Transductive Learning (TL) problem. TL is an effective semi-supervised learning … (Read full abstract)

We propose a formulation of people tracking problem as a Transductive Learning (TL) problem. TL is an effective semi-supervised learning technique by which many classification problems have been recently reinterpreted as learning labels from incomplete datasets. In our proposal the joint exploitation of spectral graph theory and Riemannian manifold learning tools leads to the formulation of a robust approach for appearance based tracking in Video Surveillance scenarios. The key advantage of the presented method is a continuously updated model of the tracked target, used in the TL process, that allows to on-line learn the target visual appearance and consequently to improve the tracker accuracy. Experiments on public datasets show an encouraging advancement over alternative state-of the-art techniques.

2011 Relazione in Atti di Convegno

Page 35 of 51 • Total publications: 505