Publications
Explore our research publications: papers, articles, and conference proceedings from AImageLab.
PEANO: Pictorial Enriched Annotation of Video
Authors: Grana, Costantino; Vezzani, Roberto; Bulgarelli, Daniele; Gualdi, Giovanni; Cucchiara, Rita; M., Bertini; C., Torniai; A., Del Bimbo
In this demo, we present a tool set for video digital library management that allows: i) structural annotation of edited videos in MPEG-7 by automatically extracting shots and clips; ii) automatic semantic annotation based on perceptual similarity against a taxonomy enriched with pictorial concepts; iii) video clip access and hierarchical summarization with stand-alone and web interfaces; iv) access to clips from mobile platforms via GPRS-UMTS video streaming. The tools can be applied in different domain-specific video digital libraries. The main novelty is the possibility of enriching the annotation with pictorial concepts that are added to a textual taxonomy in order to make the automatic annotation process faster and often more effective. The resulting multimedia ontology is described in the MPEG-7 framework. The PEANO (Perceptual Annotation of Video) tool has been tested on video art, sport (Soccer, Olympic Games 2006, Formula 1) and news clips.
Performance of the MPEG-7 Shape Spectrum Descriptor for 3D objects retrieval
Authors: Grana, Costantino; Cucchiara, Rita
In this work, we describe in detail the MPEG-7 Shape Spectrum Descriptor and provide a set of tests with different 3D object databases. To verify whether the low performance of this descriptor reported in the literature was due to the comparison employed, we also used the Earth Mover's Distance, which allows much more detailed histogram comparisons. Finally, we compare our outcomes with the best results in related work.
Practical Color Calibration for Dermatoscopic Images
Authors: Grana, Costantino; Pellacani, Giovanni; Seidenari, Stefania
In this paper, a practical color calibration procedure for dermatoscopic image acquisition is illustrated, with details on the algorithms employed and results on real data.
Recognition of articulated robots in the RoboCup domain
Authors: L., Cinque; Sangineto, E; S., Tanimoto
Published in: MACHINE GRAPHICS & VISION
Reliable background suppression for complex scenes
Authors: Calderara, Simone; Melli, Rudy Mirko; Prati, Andrea; Cucchiara, Rita
This paper describes a system for motion detection based on background suppression, specifically conceived for working in complex scenes with vacillating background, camouflage, illumination changes, etc. The system contains proper techniques for background bootstrapping, shadow removal, ghost suppression and selective updating of the background model. The results on the challenging videos provided in the VSSN '06 Open Source Algorithm Competition dataset demonstrate that the proposed system outperforms the widely used mixture-of-Gaussians approach.
Semantic adaptation of sport videos with user-centred performance analysis
Authors: M., Bertini; Cucchiara, Rita; A., Del Bimbo; Prati, Andrea
Published in: IEEE TRANSACTIONS ON MULTIMEDIA
In semantic video adaptation, measures of performance must consider the impact of errors in the automatic annotation on the adaptation, in relation to the preferences and expectations of the user. In this paper, we define two new performance measures, Viewing Quality Loss and Bit-rate Cost Increase, that are obtained from the classical peak signal-to-noise ratio (PSNR) and bit rate, and relate the results of semantic adaptation to the errors in the annotation of events and objects and to the user's preferences and expectations. We present and discuss results obtained with a system that performs automatic annotation of soccer video highlights and applies different coding strategies to different parts of the video according to their relative importance for the end user. With reference to this framework, we analyze how the highlights' statistics and the errors of the annotation engine influence the performance of semantic adaptation and are reflected in the quality of the video displayed at the user's client and in the increase of transmission costs.
Semantic Annotation and Adaptation of Live Sports Videos
Authors: M., Bertini; Cucchiara, Rita; A., Del Bimbo; Prati, Andrea
This paper addresses multimedia tools for universal multimedia access to sports videos by means of automatic annotation and content-based adaptation. The goal is to provide boosting technologies to allow the new generations of mobile devices (phones and PDAs) to better exploit the available bandwidth and to achieve a reasonable cost/quality trade-off in remote access to long-lasting live events, such as sport competitions. Although the available bandwidth for mobile communication has increased thanks to new telecommunication standards such as GPRS and UMTS, it is still insufficient for high-quality video transmission. The limited resources of low-cost terminals and the high costs of data transfer de facto hinder many possible multimedia services. First, the quality is limited by the small display size and memory available on many mobile devices. Second, the limited bandwidth may affect user satisfaction, either because of the time spent waiting for the download or because of the latency in streaming a live video. Moreover, even if the user is willing to wait for the download or accepts frame dropping, a reduction of the data to send would be unavoidable in order to bring down the costs of the service. As a matter of fact, most telecommunication companies charge a fee proportional to the number of bytes transferred. Hence, the cost of accessing a long-lasting live video, such as a 90-minute soccer competition, is still too high for most users.
Special Issue on Multimedia Surveillance Systems: Guest Editorial
Authors: Aggarwal, Jk; Cucchiara, Rita
Published in: MULTIMEDIA SYSTEMS
It is with considerable pride that we present this special issue of ACM multimedia based on the presentations at the third Video Surveillance and Sensor Network workshop, in conjunction with the ACM conference in Singapore 2005. The papers were thoroughly reviewed independently of the review process for the workshop. This special issue consists of eight papers drawn from a number of areas. It appears that we are breaking new ground, as explained in this issue. Whenever we say multimedia, we think of systems and services that manage heterogeneous data for human-oriented applications; human users are normally the subjects who access and use multimedia data, multimedia streams, multimedia content, and multimedia interfaces in many different application contexts. Following this abstraction, a multimedia surveillance system would be only a surveillance system able to produce the output of its task in a multimedia format, providing distilled video, images and sounds of the monitored environment, which would possibly be annotated in an efficient and standard way or possibly transcoded into another medium such as text or animation, to improve further querying of stored surveillance data.
Sub-Shot Summarization for MPEG-7 based Fast Browsing
Authors: Grana, Costantino; Cucchiara, Rita
In this paper, we propose a system for automatic video summarization at sub-shot level. Our work covers two main aspects: the first is the sub-shot detection, which is performed without a priori constraints on the number or length of the shots. The algorithm is based on color histograms and motion features, and employs fuzzy c-means with a variable number of clusters. The second aspect is an in-depth discussion of the annotation of summaries with the MPEG-7 standard. Results on mixed-genre TV material, from TRECVID videos, are reported.