Program on Thursday (Dec. 1, 2011)

Keynote 3: Digital Media Distribution – The Future

Speaker: Arnaud Robert, Disney

Abstract: Consumer devices and technologies are evolving faster than ever, allowing for rich, interactive user experiences. More importantly, media consumption behavior and expectations are changing, and the promise of digital media remains to be fulfilled. In this presentation, we will explore major technology trends and consumer trends, and how they intersect with future digital media experiences and business models.

Bio: Arnaud Robert is a renowned leader in digital media strategies, and in delivering innovative, cross-platform consumer products for Disney, Technicolor, Microsoft, Nagravision-Kudelski and various start-ups.
As Senior Vice President of Technology at The Walt Disney Studios, Arnaud guides the Studio through the rapidly changing digital media ecosystem and manages the engineering, product development and user experience teams, as well as the piracy management efforts.
Arnaud received his PhD from the Swiss Institute of Technology and now also serves on the Media Council of the World Economic Forum and is a member of the National Academy of Engineering. Arnaud has 35 issued or pending patents, has published over 30 broadly distributed scientific papers, and is a sought-after speaker at corporate retreats and public events alike on a wide range of topics, from the future of digital entertainment to advances in signal processing.


Oral Session Day 3

Research 9: Media analysis and search
Session Chair: Lexing Xie, ANU
Bilinear Deep Learning for Image Classification Shenghua ZHONG*, HK Polytechnic University; Yan LIU, HK Polytechnic University; Yang Liu, The HK Polytechnic University
Retrieval Based Facial Image Annotation with Weak Label Regularized Local Coordinate Coding Dayong Wang*, NTU, Singapore; Steven C.H. Hoi, NTU, Singapore; Ying He, Nanyang Technological Universi
Learning to Judge Image Search Results Xinmei Tian*, Texas State University; Yijuan Lu, Texas State University; Linjun Yang, Microsoft Research Asia; Qi Tian, UTSA
Prediction of the Inter-Observer Visual Congruency (IOVC) and application to image ranking Olivier Le Meur*, University of Rennes 1; Thierry Baccino, University of Paris VIII; Aline Roumy, INRIA

Research 10: Multimedia systems and middleware 1
Session Chair: Pal Halvorsen, Simula Research
Modeling 3D Articulated Motions with Conformal Geometry Videos (CGVs) Dao Thi Phuong Quynh, Nanyang Technological University; Ying He*, Nanyang Technological Universi; Jiazhi Xia, Nanyang Technological University; Xiaoming Chen, Nanyang Technological University; Qian Sun, Nanyang Technological University; Steven C.H. Hoi, NTU, Singapore
Random Partial Paired Comparison for Subjective Video Quality Assessment via HodgeRank Qianqian Xu*, Graduate University of Chinese; Yuan Yao, ; Tingting Jiang, ; Qingming Huang, ; Bowei Yan, ; Weisi Lin,
Saving Bitrate vs. Pleasing Users: Where's the Break-Even Point of Mobile Video Quality? Wei Song*, Queensland University of Techn
Implementation of the Discrete Wavelet Transform and Multiresolution Analysis in the Encrypted Domain Peijia Zheng, ; Jiwu Huang*, Sun Yat-sen University

Research 11: Media analysis and search
Session Chair: Marcel Worring, University of Amsterdam
Multiple Feature Hashing for Real-time Large Scale Near-duplicate Video Retrieval Jingkuan Song*, University of Queensland; Yi Yang, The University of Queensland; Helen Huang, ; Heng Tao Shen, "The University of Queensland, Australia"; Richang Hong, Hefei University of Technology
Learning Heterogeneous Data for Hierarchical Web Video Classification Xianming Liu*, Harbin Institute of Technology; Hongxun Yao, Harbin Institute of Technology; Rongrong Ji, Harbin Institute of Technology; Pengfei Xu, Harbin Institute of Technology; Sun Xiaoshuai, Harbin Institute of Technology; Qi Tian, UTSA
Coached Active Learning for Interactive Video Search Xiao-Yong Wei*, Sichuan University of China; Zhen-Qun Yang, Sichuan University of China
Learning Concept Bundles for Video Search with Complex Queries Jin Yuan*, National University of Singapo; Zheng-jun Zha, ; Yan-tao Zheng, ; Meng Wang, ; Xiangdong Zhou, Fudan University; Tat-Seng Chua, National Univ of Singapore

Research 12: Multimedia systems and middleware 2
Session Chair: Wei-Tsang Ooi, NUS
Flicker Effect in Scalable Lossless Video Streaming to Handheld Devices Pengpeng Ni*, Simula Research Laboratory; Ragnhild Eg, Simula Research Laboratory; Carsten Griwodz, Simula Research Laboratory; Pal Halvorsen, ; Alexander Eichhorn, Simula Research Laboratory
An Empirical Evaluation of Battery Power Consumption for Streaming Data Transmission to Mobile Devices Yao Liu*, George Mason University; Lei Guo, ; Fei Li, ; Songqing Chen, George Mason University
Tennis Real Play: an Interactive Tennis Game with Models from Real Videos Jui-Hsin Lai*, National Taiwan University; Chieh-Li Chen, ; Po-Chen Wu, ; Chieh-Chi Kao, National Taiwan University; Shao-Yi Chien, National Taiwan University
Celerity: A Low-Delay Multi-Party Conferencing Solution Chen Xiangwen*, CUHK; Minghua Chen, The Chinese University of Hong Kong; Baochun Li, University of Toronto; Zhao Yao, ; Wu Yunnan, ; Jin Li, Microsoft Research

Research 13: Media analysis and search
Session Chair: Shinichi Satoh
Contextual Synonym Dictionary for Visual Object Retrieval Wenbin Tang*, Tsinghua University; Rui Cai, ; Zhiwei Li, ; Lei Zhang, Microsoft Research Asia
Contextual image search Wenhao Lu, Tsinghua University; Jingdong Wang*, Microsoft Research Asia; Xian-Sheng Hua, Microsoft; Shengjin Wang, Tsinghua University; Shipeng Li, Microsoft Research Asia
MUSIZ: A Generic Framework for Music Resizing with Stretching and Cropping Zhang Liu, Tsinghua University; Chaokun Wang*, Tsinghua University
Robust Visual Reranking via Sparsity and Ranking Constraints Nobuyuki Morioka*, University of New South Wales; Jingdong Wang, Microsoft Research Asia

Research 14: Applications
Session Chair: Ichiro Ide
Motor Learning using a Kinematic-Vibrotactile Mapping targeting Fundamental Movements Troy McDaniel*, Arizona State University; Morris Goldberg, ; Daniel Villanueva, ; Lakshmie Narayan Viswanathan, ; Sethuraman Panchanathan, Arizona State University
Affect-based Adaptive Presentation of Home Videos Xiaohong Xiang*, NUS; Mohan Kankanhalli, NUS
Example-based Home Video Remixing Support System Naoko Nitta*, Osaka University; Noboru Babaguchi, Osaka University
Towards Low Bit Rate Mobile Visual Search with Multiple-Channel Coding Rongrong Ji, Harbin Institute of Technology; Ling-Yu Duan*, Peking University; Jie Chen, Peking University; Hongxun Yao, Harbin Institute of Technology; Yong Rui, Microsoft Asia; Shih-Fu Chang, "Columbia University, USA"; Wen Gao, Peking University

Poster Day 3

Session Chair: Bernard Merialdo

All papers, including short papers, long papers, workshop papers, and others, will also be presented in the poster plenary sessions on the same day as their regular presentations.

Short Papers  
Explicit Instance Ranking Based Multi-Instance Learning for Cost Effective Video Classification Hua Wang*, Univ. of Texas at Arlington; Feiping Nie, ; Heng Huang, University of Texas at Arlington, Department of Computer Science and Engineering
SIFT Match Verification by Geometric Coding for Large Scale Image Search Wengang Zhou*, USTC; Houqiang Li, USTC; Yijuan Lu, USTC; Qi Tian, UTSA
Clothes Search in Consumer Photos via Color Matching and Attribute Learning Xianwang Wang*, Hewlett-Packard Labs; Tong Zhang, Hewlett-Packard Labs
A Fast MAP Adaptation Technique for GMM-supervector-based Video Semantic Indexing Systems Nakamasa Inoue*, Tokyo Institute of Technology; Koichi Shinoda, Tokyo Institute of Technology
Compositional object pattern: a new model for album event recognition Shen-Fu Tsai*, UIUC; Liangliang Cao, ; Feng Tang, HP Labs.; Thomas Huang,
Prototype-based Image Search Reranking Linjun Yang*, Microsoft Research Asia; Alan Hanjalic, TU Delft
Semi-Supervised Face Image Retrieval Using Sparse Coding with Identity Constraint Bor-Chun Chen*, National Taiwan University; Yin-Hsi Kuo, ; Yan-Ying Chen, ; Kuan-Yu Chu, ; Winston Hsu, National Taiwan U
Level Influence of Spatial Pyramid Matching in Object Classification Renzhong Wei, Fudan University; Hong Lu*, Fudan University; Yanran Shen, Fudan University; Xiangyang Xue, Fudan University
Ensemble Approach based on Conditional Random Field for Multi-Label Image and video Annotation Xin-Shun Xu*, Nanjing University
Refining Local Descriptors by Embedding Semantic Information for Visual Categorization Yingbin Zheng, Fudan University; Renzhong Wei, Fudan University; Hong Lu*, Fudan University
Common Visual Pattern Discovery via Graph Matching Hongtao Xie*, ICT; Ke Gao, ; Yongdong Zhang, ; Jintao Li, Institute of Computing Technology, Chinese Academy of Sciences
Environmental Sound Classification for Scene Recognition Using Local Discriminant Bases and HMM Feng Su*, Nanjing University; Li Yang, Nanjing University; Tong Lu, Nanjing University; Gongyou Wang, Nanjing University
Manifold Ordinal Regression for Image Ranking Yang Liu*, The HK Polytechnic University; Yan LIU, HK Polytechnic University; Shenghua ZHONG, HK Polytechnic University; Keith C.C. Chan, The HK Polytechnic University
Blurred Image Region Detection and Classification Bolan Su*, National University of Singapo; Shijian Lu, Institute for Infocomm Research, Singapore; Chew Lim Tan, School of Computing, National University of Singapore
Human group activity analysis with fusion of motion and appearance context information Zhongwei Cheng*, GUCAS; Lei Qin, ICT; Qingming Huang,
Exploring Latent Class Information for Image Retrieval using the Bag-of-Feature Model Lingqiao Liu*, Australian National University; Lei Wang, University of Wollongong
Combining Latent Semantic Learning and Reduced Hypergraph Learning for Semi-Supervised Image Categorization Zhiwu Lu, Peking University; Yuxin Peng*, Peking University
Optimal Batch Selection for Active Learning in Multi-label Classification Shayok Chakraborty*, Arizona State University; Vineeth Balasubramanian, ; Sethuraman Panchanathan, Arizona State University
Extracting Intentionally Captured Regions Using Point Trajectories Yuta Nakashima*, Osaka University; Noboru Babaguchi, Osaka University
Exploring Self-Similarity of Bag-of-Features for Image Classification Chih-Fan Chen, Academia Sinica; Yu-Chiang Frank Wang*, Academia Sinica
Non-rigid 3D Model Retrieval Using Multi-Scale Local Features Pengjie Li*, Beijing University of Posts an; Ma Huadong, ; Anlong Ming,
Marginals-based Visual Alphabets for Local Image Descriptors Aggregation Miriam Redi*, Eurecom; Bernard Merialdo,
Modeling Multimedia Contents through Probabilistic Feature Signatures Christian Beecks*, RWTH Aachen University; Anca Ivanescu, RWTH Aachen University; Steffen Kirchhoff, RWTH Aachen University; Thomas Seidl, RWTH Aachen University
Bag of colors for improved image search Christian Wengert, Kooaba; Matthijs Douze, INRIA; Herve Jegou*, INRIA
Asymmetric Hamming Embedding Mihir Jain, INRIA; Hervé Jégou*, INRIA; Patrick Gros, INRIA
Spatially-coherent Pyramid Matching Based on Max-pooling Xiangang Cheng*, Nanyang Tech.l Univ.; Liang-Tien Chia,
Text and Image Subject Classifiers: Dense Works Better Daan Vreeswijk*, Universiteit van Amsterdam; Bouke Huurnink, University of Amsterdam; Arnold Smeulders, University of Amsterdam
Automatic Concept-to-Query Mapping for Web-based Concept Detector Training Damian Borth*, University of Kaiserslautern; Adrian Ulges, DFKI (Germany); Thomas Breuel,
Hypergraph Spectral Hashing for Similarity Search of Social Image Yueting Zhuang, Zhejiang University; Yang Liu*, Zhejiang University; Fei Wu, Zhejiang University; Jian Shao, Zhejiang University; Yin Zhang, Zhejiang University
Selecting the Best Faces to Index Presentation Videos Michele Merler*, Columbia University; John Kender,
A Biologically Inspired Computational Model for Image Saliency Detection Sheng He*, Northwest Polytechnical Univer; Junwei Han, NWPU; Xintao Hu, ; Ming Xu, Nwpu; Lei Guo, ; Tianming Liu,
Unsupervised Fast Anomaly Detection in Crowds Sun Xiaoshuai*, Harbin Institute of Technology; Hongxun Yao, Harbin Institute of Technology; Rongrong Ji, Harbin Institute of Technology; Xianming Liu, Harbin Institute of Technology; Pengfei Xu, Harbin Institute of Technology
Video Indexing and Recommendation Based on Affective Analysis of Viewers Zhao Sicheng*, Harbin Institute of Technology; Hongxun Yao, Harbin Institute of Technology; Sun Xiaoshuai, Harbin Institute of Technology; Pengfei Xu, Harbin Institute of Technology; Rongrong Ji, Harbin Institute of Technology; Xianming Liu, Harbin Institute of Technology
Eventscapes: Visualizing events over time with emotive facets Brett Adams*, Curtin University; Dinh Phung, Curtin University; Svetha Venkatesh, Curtin University
Signature-Image-Based Event Analysis for Personal Photo Albums Minh-Son DAO*, University of Trento, Italy; Duc-Tien Dang-Nguyen, DISI - University of Trento, Italy; Francesco De Natale,
Leveraging Collective Wisdom for Web Video Retrieval through Heterogeneous Community Discovery Lin Pang*, Chinese Academy of Sciences; Juan Cao, Chinese Academy of Sciences; Yongdong Zhang, ; Shouxun Lin,
Digitized Comic Search System with Automatic Preview Generation of Comic Episodes Keiichiro Hoashi*, KDDI R&D Laboratories; Chihiro Ono, KDDI R&D Laboratories; Daisuke Ishii, Waseda University; Hiroshi Watanabe, Waseda University
2D Geon Based Generic Object Recognition Xiangqian Yu*, New Jersey Institute of Tech; Vincent Oria, New Jersey Institute of Tech; Pierre Gouton, Universite de Bourgogne; Genevieve Jomier, Universite de Paris-Dauphine
Image Annotation by Composite Kernel Learning with Group Structure Yueting Zhuang, ; Ying Yuan*, ZheJiang University; Fei Wu,
Video-to-Shot Tag Allocation by Weighted Sparse Group Lasso Xiaofeng ZHU, University of Queensland; Helen Huang, ; Heng Tao Shen*, "The University of Queensland, Australia"
Extracting Key Frames from Consumer Videos Using Bi-layer Group Sparsity Zheshen Wang*, Arizona State University; Mrityunjay Kumar, Kodak Research Laboratories; Jiebo Luo, Kodak; Baoxin Li, Arizona State University
Spatial Pooling For Transformation Invariant Image Representation Xia Li*, UTSA; Yan Song, Univ of Sci. and Tech of China; Yijuan Lu, Texas State University; Qi Tian, UTSA
Multi-Feature pLSA for Combining Visual Features in Image Annotation Rui Zhang*, Ryerson Multimedia Laboratory; Lei Zhang, Microsoft Research Asia; Xin-Jing Wang, Microsoft Research Asia; Ling Guan, Ryerson University
Tag-Based Social Image Search with Visual-Text Joint Hypergraph Learning Yue Gao*, Tsinghua University; Meng Wang, ; Jialie Shen, "Singapore Management University, Singapore"; Dacheng Tao, ; Qionghai Dai,
Integrating Rich Information for Video Recommendation with Multi-Task Rank Aggregation Xiaojian ZHAO*, NUS; Guangda LI, ; Meng Wang, ; Zheng-Jun Zha, ; Jin Yuan, National University of Singapo; Zhoujun LI, ; Tat-Seng Chua, National Univ of Singapore
Quantifying Human Reconstruction Accuracy for Voxel Carving in a Sporting Environment David Monaghan*, Dublin City University; Philip Kelly, Dublin City University; Noel O'Connor, CLARITY: Centre for Sensor Web Technologies
Time-aware Co-Training for Indoors Localization in Visual Lifelogs Vladislavs Dovgalecs*, University of Bordeaux, France; Remi Megret, IMS Lab, University of Bordeau; Yannick Berthoumieu, University of Bordeaux, France
Automatic Sentence Generation from Images Yoshitaka Ushiku*, University of Tokyo; Tatsuya Harada, The Univ. of Tokyo; Yasuo Kuniyoshi, University of Tokyo
Audio-Visual Fusion Using Bayesian Model Combination for Web Video Retrieval Vasant Manohar*, Raytheon BBN Technologies; Stavros Tsakalidis, Raytheon BBN Technologies; Pradeep Natarajan, Raytheon BBN Technologies; Rohit Prasad, Raytheon BBN Technologies; Prem Natarajan, Raytheon BBN Technologies
Enriching and Localizing Semantic Tags in Internet Videos Lamberto Ballan, Università di Firenze - MICC; Marco Bertini*, ; Alberto Del Bimbo, Università di Firenze - MICC; Giuseppe Serra, Università di Firenze - MICC
Scene Segmentation of Wedding Party Videos by Scenario-based Matching with Example Videos Kazuki Sawai, Nagoya University; Tomokazu Takahashi*, Gifu Shotoku Gakuen University; Daisuke Deguchi, Nagoya University; Ichiro IDE, ; Hiroshi Murase, Nagoya University
Content-aware Photo-on-photo Composition For Consumer Photos Aibo Tian*, University of Texas at Austin; Xuemei Zhang, ; Daniel Tretter, HP Labs
PhotoFeel: Feeling Your Photo Collection by Graph-based Audiovisual Flocking Simulation Cheng-Te Li*, National Taiwan University; Hsieh Hsun-Ping,
Learning to Produce 3D Media from a Captured 2D Video Minwoo Park*,; Jiebo Luo, Kodak; Andrew Gallagher, Eastman KODAK; Majid Rabbani, Eastman KODAK
A Tool for Authoring Unambiguous Links from Printed Content to Digital Media Andreas Girgensohn*, FX Palo Alto Laboratory; Lynn Wilcox, FX PAL; Frank Shipman, Texas A&M University; Qiong Liu, FX Palo Alto Laboratory; Chunyuan Liao, FXPAL; Yuichi Oneda, Fuji Xerox
Interactive Digital Scrapbook Generation for Travel Photos Based on Design Principles of Typography Jung-Yu Yeh, National Taiwan University; Min-Chun Hu*, National Taiwan University; Wen-Huang Cheng, Research Center for Information Technology Innovation (CITI), Academia Sinica; Ja-Ling Wu, National Taiwan University
Avatar Personalization and Augmentation: Prominent Structure Analysis of 3D Morphable Models Peng Wang*, Intel Labs China
Static and Dynamic Video Summaries Yingbo Li*, EURECOM; Bernard Merialdo, EURECOM; Mickael Rouvier, University of Avignon; Georges Linares, University of Avignon

Industrial Exhibit / Technical Demos 2

Session chair: Roger Zimmermann (National University of Singapore)

Industrial Exhibits 2
Towards A Real Time Public Transport Awareness System: Case Study In Dublin Eric Bouillet*, IBM Research; Luca Gasparini, IBM Research; Olivier Verscheure, IBM Research
ARA: The Active Reading Application Gene Golovchinsky, FX Palo Alto Laboratory; Scott Carter*, FX Palo Alto Laboratory; Tony Dunnigan, FX Palo Alto Laboratory
myUnity: A new platform to support communication in the modern workplace Jacob Biehl*, FX Palo Alto Laboratory, Inc.; Thea Turner, FX Palo Alto Laboratory, Inc.; William van Melle, FX Palo Alto Laboratory, Inc.; Andreas Girgensohn, FX Palo Alto Laboratory, Inc.
Technical Demos 2
Galaxy Browser: Exploratory Search of Web Videos Lei Pang, ; Song Tan*, City University of Hong Kong; Hung-Khoon Tan, ; Chong-Wah Ngo,
A Mobile Location Search System with Active Query Sensing Felix X. Yu, Columbia University; Rongrong Ji*, Columbia University; Tongtao Zhang, Columbia University; Shih-Fu Chang, Columbia University
Collaborative Authoring of Serious Games for Health Florian Mehm*, TU Darmstadt; Stefan Göbel, TU Darmstadt, KOM; Sandro Hardy, TU Darmstadt, KOM
PhotoReference: Towards Capturing a Great Photo via Learning from Community-Contributed Photo Collections Haojie Li*, Dalian University of Technolog; Lei Yi, ; Jinhui Tang, ; Xiaohui Wang,
Changing Video Arrangement for Constructing Alternative Stories Alberto Piacenza, University of Brescia; Fabrizio Guerrini*, University of Brescia; Nicola Adami, University of Brescia; Riccardo Leonardi, University of Brescia; Jonathan Teutenberg, Teesside University; Julie Porteous, Teesside University; Marc Cavazza, Teesside University
Visual-based plant species identification from crowdsourced data Hervé Goëau*, Inria; Alexis Joly, Inria, Imedia Team; Souheil Selmi, Inria, Imedia Team; Pierre Bonnet, CIRAD, UMR AMAP; Elise Mouysset, Tela Botanica Association; Laurent Joyeux, Inria, Imedia Team; Jean-François Molino, IRD, UMR AMAP; Philippe Birnbaum, CIRAD, UMR AMAP; Daniel Barthélémy, CIRAD; Nozha Boujemaa, Inria
Dynamic Media Show Drivable by Semantics Vivek Singh*, Univ. of California, Irvine; Jiebo Luo, Kodak Research Labs; Dhiraj Joshi, Kodak Research Labs; Madirakshi Das, Kodak Research Labs; Phoury Lei, Kodak Research Labs; Peter Stubler, Kodak Research Labs
SIRE: A Social Image Retrieval Engine Steven C.H. Hoi*, NTU, Singapore; Pengcheng Wu, NTU, Singapore
Processing of Multimedia Data using the P2G Framework Håkon Stensland*, Simula Research Laboratory; Paul Beskow, University of Oslo; Håvard Espeland, University of Oslo; Espen Kristiansen, University of Oslo; Preben Olsen, University of Oslo; Ståle Kristoffersen, ; Carsten Griwodz, University of Oslo; Pål Halvorsen, University of Oslo
PositionIt – An Image-based Remote Target Localization System on Smartphones Qia Wang*, University of Missouri; Alex Lobzhanidze, University of Missouri; Suman Roy, University of Missouri; Wenjun Zeng, University of Missouri; Yi Shang, University of Missouri
Low-cost Creation of a 3D Interactive Museum Exhibition David Monaghan*, Dublin City University; Noel O'Connor,
Interactive Rich Reading: Enhanced Book Reading Experience with a Conversational Agent Koichi Mori*, Nokia Research Center; Rafael Ballagas, Nokia Research Center; Glenda Revelle, University of Arkansas; Hayes Raffle, Nokia Research Center; Hiroshi Horii, Nokia Research Center; Mirjana Spasojevic, Nokia Research Center
Hierarchical Video Browsing with a 3D Carousel Klaus Schoeffmann*, Klagenfurt University; Manfred del Fabro, Klagenfurt University
SACOZI: Crowdsourced and Content-based Zoomable Video Player Axel Carlier*, IRIT - University of Toulouse; Arash Shafiei, IRIT - University of Toulouse; Julien Badie, IRIT - University of Toulouse; Salim Bensiali, IRIT - University of Toulouse; Vincent Charvillat, ENSEEIHT/IRIT; Wei Tsang Ooi, National University of Singapore
The Director's Lens: An Intelligent Interface for Composing Virtual 3D Cinema William Bares, Millsaps College; Christophe Lino, INRIA Rennes; Roberto Ranon, University of Udine; Marc Christie*, IRISA / INRIA Rennes
Video2GPS: A Demo of Multimodal Location Estimation on Flickr Videos Gerald Friedland*, ICSI; Jaeyoung Choi, ICSI; Adam Janin, ICSI
Towards a Privacy Preserving Personal Photo Album Manager with Semantic classification, Indexing and Querying Capabilities Jichao Sun*, NJIT; Alexis Fesnin, CNAM/CEDRIC; Vallerie Gouet-Brunet, CNAM/CEDRIC; Scott Kominen, New Jersey Institute of Technology; Vincent Oria, NJIT
Million-scale Near-duplicate Video Retrieval System Yang Cai*, Zhejiang University; Linjun Yang, Microsoft Research Asia; Wei Ping, ; Fei Wang, ; Tao Mei, Microsoft Research Asia; Xian-Sheng Hua, Microsoft Research Asia; Shipeng Li, Microsoft Research Asia
Mobile Product Search with Bag of Hash Bits Junfeng He*, Columbia University; Tai-Hsu Lin, Columbia University; Shih-Fu Chang, Columbia University


International ACM Workshop On Ubiquitous Meta User Interfaces (Ubi-MUI'11)
Thursday morning Dec. 1, 2011
Ali Asghar Nazari Shirehjini (DISCOVERLab, University of Ottawa)
Sahin Albayrak (DAI-Labor, Technical University Berlin)
Abdulsalam Yassin (Alcatel Lucent, Ottawa)

Session Schedule
10:30-10:35 Opening Remarks
10:35-11:00 Control smart homes easily with simple touch Sokratis Kartakis; Margherita Antona; Constantine Stephanidis
11:00-11:25 Gamifying Intelligent Environments Yefeng Liu; Todorka Alexandrova; Tatsuo Nakajima
11:25-11:50 Color-based object detection using homogeneous probability redistribution of histograms Mehdi Madani; Mahdi Bagheri
11:50-12:15 An Intelligent Multimedia system for Nutrient Intake Monitoring Gregorio Villalobos; Rana Almaghrabi; Behnoosh Hariri; Shervin Shirmohammadi


International ACM Workshop on Multimedia Technologies for Distance Learning (MTDL'11)
Thursday Dec. 1, 2011
Rynson Lau (CUHK)
Timothy Shih (NCU, Taiwan)
Frederick Li (Durham University)
Neil Yen (Waseda University)

1. Regular Paper Session

Session Chair: Neil Yen, Waseda University, Japan
Time: 2:00PM - 4:05PM (25 minutes for each presentation)

2:00-2:25 Interactive Mixed Reality for Collaborative Remote Laboratories Benjamin Jailly, Christophe Gravier, Marius Preda, and Jacques Fayolle
2:25-2:50 A Novel Framework for Distance Education using Asynchronous Interaction Hiranmay Ghosh, Sujal Wattamwar, Tavleen Oberoi, Gautam Shroff, Arpan Pal, Ranjan Dasgupta, and Kingshuk Chakravarty
2:50-3:15 A-VIEW: Real Time Collaborative Multimedia e-Learning K.R. Jayahari, Kamal Bijlani, and Ancy Mathew
3:15-3:40 Discovering Small-World in Association Link Networks For Web-based Learning Shunxiang Zhang and Xiangfeng Luo
3:40-4:05 A User Evaluation Framework for Web-Based Learning Systems Ke Niu, Wei Chen, Zhendong Niu, Qing Li, Peipei Gu, Yi Li, and Zhilei Huang
2. Invited Paper Session

Session Chair: Wei Chen, Beijing Institute of Technology, China
Time: 4:30PM – 5:45PM (25 minutes for each presentation)

4:30-4:55 Interactive Tutoring of Cooking Activities with Personalized Multimedia Recipe Search Qing Li, Xudong Mao, and Haoran Xie
4:55-5:20 Game-Based Concept Visualization for Learning Programming Frederick W. B. Li and Chris Watson
5:20-5:45 Discovery of Implicit Correlation between Shared Information in an Open Environment Neil Y. Yen and Qun Jin


ACM Workshop on Social and Behavioural Networked Media Access (SBNMA'11)
Thursday Dec. 1, 2011
Naeem Ramzan (Queen Mary University of London, UK)
Fei Wang (IBM T.J. Watson Research Center)
Charalampos Z. Patrikakis (Technological Educational Institute of Piraeus, Greece)
Peng Cui (Tsinghua University)
Nikos Doulamis (National Technical University of Athens, Greece)
Shiqiang Yang (Tsinghua University)
Gordon Sun (Tencent Technologies)

10:30-10:40 Welcome
(10:40-12:30) SBNMA paper session 1
Session Chair: Peng Cui (Tsinghua University, China)
10:40-11:00 Graph-based recognition in Photo Collections using Social Semantics,
Markus Brenner, Ebroul Izquierdo
11:00-11:20 Event Analytics via Social Media,
Yuheng Hu, Ajita John, Doree Seligmann
11:20-11:40 Pursuing the Holy Grail by interrelating user intentions and Bag of Visual Words to perform retrieval adaptation,
Marian Kogler, Mathias Lux
11:40-12:00 Using media related user profiles to personalize multimedia access over social networks,
Lemonia Argyriou, Charalampos Z Patrikakis, Stuart CM Porter, Nikolaos Papaoulakis, Christina Androulaki
12:00-12:20 Measuring Media-based Social Interactions Provided by Smartphones Applications in Social Networks,
Alan Gomes, Maria da Graca Pimentel
12:20-12:30 Discussion
12:30-14:00 Lunch break
(14:00-15:20) SBNMA paper session 2
Session Chair: Charalampos Z. Patrikakis (Technological Educational Institute of Piraeus, Greece)
14:00-14:20 A Hierarchical, Multi-modal Approach for Placing Videos on the Map using Millions of Flickr Photographs,
Pascal Kelm, Sebastian Schmiedeke, Thomas Sikora
14:20-14:40 Improving Video Classification via YouTube Video Co-Watch Data,
John Zhang, Yang Song, Thomas Leung
14:40-15:00 Gaze Movement Inference for User Adapted Image Annotation and Retrieval,
S. Navid Haji Mirza, Ebroul Izquierdo
15:00-15:20 Employing clustering algorithms to create user groups for personalized context aware services provision,
Athanasios S. Voulodimos, Anastasios D. Doulamis, Charalampos Z. Patrikakis, Emmanuel S. Sardis, Pantelis N. Karamolegkos
(15:20-16:00) Keynote Address: Prof Tat-Seng Chua (National University of Singapore),
"Learning the Social Pulses of a City from User-Generated Information Sources"
Session Chair: Fei Wang (IBM T.J. Watson Research Center, USA)
16:00-16:30 Coffee break
(16:30-17:10) SBNMA paper session 3
Session Chair: Shiqiang Yang (Tsinghua University, China)
16:30-16:50 Adaptive Multi-view Video Streaming over P2P Networks Considering Quality of Experience,
Saadet Sedef Savas, Cihat Goktug Gurler, A. Murat Tekalp
16:50-17:10 Social and Behavioural Media Access: A Survey,
Naeem Ramzan, Peng Cui, Fei Wang, Shiqiang Yang
(17:10-17:50) Keynote Address: Prof Ebroul Izquierdo (Queen Mary University of London, UK),
"Social Networked Media: Advances and Trends"
Session Chair: Naeem Ramzan (Queen Mary University of London, UK)
17:50-18:00 Closing remarks


Joint ACM Workshop on Human Gesture and Behavior Understanding (J-HGBU'11)
Thursday Dec. 1, 2011
Part 1: International ACM Workshop on Social Signal Processing (SSPW’11)
Maja Pantic (Imperial College London)
Alex Pentland (MIT, Media Lab, Cambridge, MA)
Alessandro Vinciarelli (University of Glasgow)
Part 2: International ACM Workshop on Multimedia access to 3D Human Objects (MA3HO'11)
Rita Cucchiara (Università di Modena e Reggio Emilia, Italy)
Mohamed Daoudi (TELECOM Lille, France)
Alberto del Bimbo (Università di Firenze, Italy)

Regular Person authentication using 3D human motion Felipe Gomez-Caballero; Takahiro Shinozaki; Sadaoki Furui; Koichi Shinoda
Regular Estimation and utilization of articulations in recovering non-rigid structure from motion using motion subspaces Rohith MV; Chandra Kambhamettu
Regular Human Activity Recognition using Multiple Views: A Comparative Perspective on Recent Developments Michael Holte; Cuong Tran; Mohan Trivedi; Thomas Moeslund
Invited Fully Automatic 3D Facial Expression Recognition using a Region-Based Approach Pierre Lemaire, Liris; Mohsen Ardabilian; Mohamed Daoudi; Boulbaba Ben Amor; Liming Chen
Invited 3DPeS: 3D People Dataset for Surveillance and Forensics Davide Baltieri; Rita Cucchiara; Roberto Vezzani
Invited 3D Partial Face Matching using Local Shape Descriptors Stefano Berretti; Alberto Del Bimbo; Pietro Pala
Poster Multi-stage Feature Point Detection for 3D Human Data Xiang Pan, Zhejiang; Alexander Agathos
Poster Human Motion Classification and Management Based on Mocap Data Analysis Harshad Kadu; C.-C. Jay Kuo; Maychen Kuo
Poster 3D Perceptual Shape Feature-based Body Parts Classification and Pose Estimation Gang Hu; Qigang Gao
Poster Landmark Recognition and Retrieval: From 2D to 3D Xian Xiao; Changsheng Xu; Jinqiao Wang; Min Xu
Invited Poster The Florence 2D/3D Hybrid Face Dataset Andrew Bagdanov; Alberto Del Bimbo; Iacopo Masi
Regular The Sounds of Social Life: Naturalistic (Acoustic) Observation Sampling Matthias Mehl; Fenne Deters
Regular Computational Study of Human Communication Dynamics Louis-Philippe Morency
Regular A Survey of Perception and Computation of Human Beauty Hatice Gunes
Regular Behavioral signal processing for understanding (distressed) dyadic interactions: Some recent developments Panayiotis Georgiou; Matthew Black; Shrikanth Narayanan
Poster Incorporating Uncertainty in a Layered HMM Architecture for Human Activity Recognition Michael Glodek; Lutz Bigalke; Martin Schels; Friedhelm Schwenker
Poster Multimodal Recognition of Personality during Short Self-Presentations Ligia Batrinca; Bruno Lepri; Fabio Pianesi
Poster Automatic recognition of coordination level in an imitation task Emilie Delaherche; Mohamed Chetouani


International ACM Workshop on Automated Media Analysis and Production for Novel TV Services (AIEMPro 2011)
Thursday Dec. 1, 2011
Sid-Ahmed Berrani (Orange Labs - France Telecom, France)
Alberto Messina (RAI - Centre for Research and Technological Innovation, Italy)
Robbie De Sutter (VRT-medialab, Belgium)
Jean-Pierre Evain (European Broadcasting Union, Switzerland)
Gerald Friedland (International Computer Science Institute, USA)
Masanori Sano (Japan Broadcasting Corporation, Japan)
Patrick Gros (INRIA, France)
Shin'ichi Satoh (National Institute of Informatics, Japan)

Regular Speech recognition tools in a media retrieval system Mike Matton; Karel Braeckman
Regular Sequence-based Kernels for Online Concept Detection in Video Werner Bailer
Regular Picture-in-Picture copy detection using spatial coding techniques Sanjay Purushotham; Qi Tian; C.-C. Jay Kuo
Regular News Story Clustering from Both What and How Aspects: Using Bag of Word Model and Affinity Propagation Wei-Ta Chu; Chao-Chin Huang; Wen-Fang Cheng
Invited From audio recurrences to TV program structuring Alina Abduraman; Sid-Ahmed Berrani; Jean-Bernard Rault; Olivier Le Blouch
Regular Produce. Annotate. Archive. Repurpose – Accelerating the Composition and Metadata Accumulation of TV Content Robert Knauf; Jens Kürsten; Albrecht Kurze; Marc Ritter; Arne Berger; Stephan Heinich; Maximilian Eibl

Industry Panel

Innovating the Multimedia Experience
Thursday December 1st, 2:00PM - 4:00PM, 2011
Organizers & Moderators:

Khaled El-Maleh, Qualcomm Inc.
Haohong Wang, TCL Research America

1. Susie Wee, Vice President and Chief Technology and Experience Officer of Collaboration and Communication at Cisco Systems.
2. Heather Yu, Director, Media Networking Lab, Huawei Tech USA.
3. James D. Johnston, IEEE Distinguished Lecturer, Audio Technology Pioneer.
4. Zhengyou Zhang, Principal Researcher at Microsoft, Affiliate Professor at University of Washington.

In this panel, each panelist will present their view of the current state of the art in research and product innovation in the three major areas of multimedia experience: visual, auditory and gaming. We will discuss examples of innovation that enhance the consumption and sharing of multimedia (video, audio, graphics, etc.) and thus increase the quality of the user experience. Another major focus of this panel is to open the discussion on how to innovate new multimedia user experiences.

Categories and Subject Descriptors
H.1.2 [User/Machine Systems]: Human factors; H.5.1 [Information Interfaces and Presentation]: Multimedia Information Systems

General Terms
Human Factors

Keywords
Multimedia experience, Visual media, Auditory media, Gaming, Sharing, User experience


Khaled El-Maleh (Qualcomm Inc.) received his M.Eng. and Ph.D. from McGill University, Canada, and a B.Sc. in EE and a B.Sc. in Applied Math from King Fahd University (KFUPM), Saudi Arabia. He has been with Qualcomm, San Diego, since 2000. Dr. El-Maleh’s areas of expertise and interest include the design, implementation and quality evaluation of mobile multimedia systems, with a focus on speech and video compression/processing algorithms, innovation and technology transfer. He was a key member of the team that designed the first generation of mobile camcorder and video telephony solutions for both UMTS and CDMA networks. He was the video quality lead for the Qualcomm chip unit and contributed to the design of media quality assessment tools, with a focus on automated on-chip quality testing. He currently leads the multimedia engineering team in the Strategic IP Department of Qualcomm, with a focus on developing innovative technologies to enhance the mobile multimedia user experience. In addition to his technical publications, he has filed more than 200 US and international patents covering video/image/speech processing and compression technologies, mobile learning and wireless power charging. He is a founding member of the International Workshop on Quality of Multimedia Experience (QoMEX) and served as the General Co-Chair for QoMEX 2009. He has served as TPC Vice Chair (Tutorial Co-Chair) for Globecom 2010, TPC member for ACM Multimedia 2010, Steering Committee member for QoMEX 2010 and QoMEX 2011, Industry Co-Chair for ICME 2011, Industry Chair for SiPS 2011, and Advisory Committee member of IEEE DSP/SPE 2011. He is currently serving as a member of the steering committee of ICME 2012-2014, Industry Panel Co-Chair of ACM Multimedia 2011, Industry Co-Chair for ICIP 2012, Special Sessions Co-chair for MMSP 2012, and Tutorial Co-Chair for ICASSP 2013.
He is a guest editor of two upcoming special journal issues on Quality of Multimedia User Experience (IEEE Signal Processing Magazine and European Journal of Signal and Image Processing), to be published in 2011. Khaled is the recipient of Qualcomm's first Career Thought Leader Award.


Haohong Wang (TCL Research America) received the B.S. degree in computer science and the M.Eng. degree in computer applications, both from Nanjing University, China, the M.S. degree in computer science from the University of New Mexico, and the Ph.D. degree in electrical and computer engineering from Northwestern University, Evanston, USA. He is currently the General Manager of TCL Research America, TCL Corporation, in Santa Clara, California, in charge of the overall corporate research activities in North America, including 5 research labs at multiple locations. Prior to that he held various technical and management positions at AT&T, Catapult Communications, Qualcomm, Marvell, TTE and Cisco. Dr. Wang's research involves the areas of multimedia communications, 3D graphics, video analysis and processing, and 3DTV systems. He has published more than 50 articles in peer-reviewed journals and international conferences. He is the inventor of more than 40 U.S. patents and pending applications. He is the co-author of 4G Wireless Video Communications (John Wiley & Sons, 2009), and Computer Graphics (1997).

Dr. Wang is the Editor-in-Chief of the Journal of Communications (2008-), the Technical Program Chair of IEEE GLOBECOM 2010 (Miami) and the General Chair of IEEE ICME 2011 (Barcelona) and IEEE ICCCN 2011 (Maui). He chairs the IEEE Multimedia Communications Technical Committee (2010-2012), and is an elected member of the IEEE Visual Signal Processing and Communications Technical Committee (2005-), and IEEE Multimedia and Systems Applications Technical Committee (2006-). He is a member of the Steering Committee of IEEE Transactions on Multimedia. He has served as the Editor-in-Chief of the IEEE MMTC E-Letter (2009) and as an Editor (or Guest Editor) of IEEE Transactions on Multimedia, IEEE Communications Surveys & Tutorials, IEEE Communications Magazine, ACM Multimedia Systems Journal and many others. He served as the General Chair of the 17th IEEE International Conference on Computer Communications and Networks (ICCCN 2008) (US Virgin Islands).

Susie Wee (Vice President and Chief Technology and Experience Officer of Collaboration and Communication at Cisco Systems) is the Vice President and Chief Technology and Experience Officer of Collaboration and Communication at Cisco Systems, where she is responsible for driving innovation and experience design in Cisco's collaboration products and software services. Prior to this, she was at Hewlett Packard in the roles of founding Vice President of the Experience Software Business and Chief Technology Officer of Client Cloud Services in HP’s Personal Systems Group and Lab Director of the HP Labs Mobile and Media Systems Lab. Susie was the co-editor of the JPSEC standard for the security of JPEG-2000 images and the editor of the JPSEC amendment on File Format Security. She was formerly an associate editor for the IEEE Transactions on Circuits and Systems for Video Technology and for the IEEE Transactions on Image Processing. While at HP Labs, Susie was a consulting assistant professor at Stanford University, where she co-taught a graduate-level course on digital video processing. Susie received Technology Review’s Top 100 Young Innovators award, ComputerWorld's Top 40 Innovators under 40, the INCITS Technical Excellence award, and the Women In Technology International Hall of Fame award. She is an IEEE Fellow for her contributions in multimedia technology and has over 50 international publications and over 40 granted patents. Susie received her B.S., M.S., and Ph.D. degrees from the Massachusetts Institute of Technology.

Heather Yu (Director, Media Networking Lab, Huawei Tech USA) received her Ph.D. from Princeton University in 1998. Currently, she is the Director of the Huawei Media Networking Lab located in Bridgewater, NJ. With the mission of establishing a world-class R&D team and leading key multimedia technology innovations, she led the NJ team in successfully developing several new media technology research areas and a series of new technology innovations that offer competitive-edge capabilities and support various functionalities for Huawei’s products. Before joining Huawei, she was with Panasonic Princeton Lab, working on media communication, media processing, media security, and P2P technology research. Since graduating from Princeton, Heather has served in numerous positions in related associations, such as Chair of the IEEE Multimedia Communications Tech Committee, IEEE Communications Society Strategic Planning Committee member, IEEE Human Centric Communications emerging technology committee chair, Associate Editor in Chief for PPNA journal, AE of several IEEE journals/magazines, and Conference chair and TPC chair for many conferences in the field. She holds 23 granted US patents and has many pending. She has authored 70+ publications, including 4 books: P2P Networking and Applications, Semantic Computing, P2P Handbooks, and Multimedia Security Technologies for Digital Rights Management.

James D. Johnston (IEEE Distinguished Lecturer, Audio Technology Pioneer) received his BSEE and MSEE from Carnegie-Mellon University. Mr. Johnston has been called the father of perceptual audio coding for his pioneering contributions that revolutionized digital audio. His accomplishments during a 26-year career at AT&T Bell Labs have, among other achievements, allowed for the distribution of digital music and digital radio over the Internet. He is presently an independent consultant having retired from the position of Chief Scientist at DTS, Inc. where he was working on a variety of acoustical modeling, preprocessing and postprocessing algorithms for audio capture, analysis, control, and presentation.

Dr. Johnston joined DTS Inc. from his position at Neural Audio. Prior to that, he worked for 5 years at Microsoft Corporation in the "Codecs", "Core Media Processing" and finally the video services groups as Audio Architect. Dr. Johnston retired from AT&T Labs - Research, quartered at Florham Park, NJ, Speech Processing Software and Technology Research Department. Before that, he was employed by AT&T Bell Laboratories, in the Acoustics Research Department under Dr. J. L. Flanagan, and in the Signal Processing Research Department.

Dr. Johnston was the primary researcher and inventor of the MPEG-2 AAC audio coding algorithm, and a principal contributor to the "MP3" algorithm. He also represented AT&T in the ANSI-accredited group X3L3.1, and X3L3.1 in the ISO-MPEG-AUDIO (MP3, AAC) arena.

Dr. Johnston was awarded the IEEE James L. Flanagan Signal Processing Field Award (2006); elected Fellow, Audio Engineering Society (1997); received AT&T Technology Medal and AT&T Standards Award (1998); received a New Jersey Inventor of the Year Award (2001); elected IEEE Fellow (2002).

Dr. Johnston’s current research interests include acoustic scene modelling, loudspeaker design, loudspeaker pattern control, cochlear modelling, masking threshold models, stereo imaging models and stereo imaging sensitivity models, methods of reproducing soundfields either literally or perceptually, microphone and soundfield capture techniques, both actively steered and time-invariant, and speech and audio coding methods in general.

Zhengyou Zhang (Principal Researcher at Microsoft, Affiliate Professor at University of Washington) is a Fellow of the Institute of Electrical and Electronics Engineers (IEEE). He is the Founding Editor-in-Chief of the newly established IEEE Transactions on Autonomous Mental Development (IEEE T-AMD), and is on the Editorial Board of the International Journal of Computer Vision (IJCV), Machine Vision and Applications, and the Journal of Computer Science and Technology (JCST). He was on the Editorial Board of the IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE T-PAMI) from 1999 to 2005, the IEEE Transactions on Multimedia (IEEE T-MM) from 2004 to 2009, and the International Journal of Pattern Recognition and Artificial Intelligence (IJPRAI) from 1997 to 2008, among others. He is listed in Who's Who in the World, Who's Who in America and Who's Who in Science and Engineering.

Before joining Microsoft, Zhengyou worked at INRIA (French National Institute for Research in Computer Science and Control) for 11 years, where he worked in the Computer Vision and Robotics group and was a Senior Research Scientist from 1991. In 1996-1997, he spent a one-year sabbatical as an Invited Researcher at the Advanced Telecommunications Research Institute International (ATR), Kyoto, Japan.

He holds more than 80 US patents and has about 20 patents pending. He has published over 200 papers in refereed international journals and conferences, and is the author of several books.



ACM Multimedia 2011

Nov 28th - Dec 1st, 2011 Scottsdale, Arizona, USA
