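Start time: Thursday, April 10, 08:30 End time: Thursday, April 10, 17:00
Organizer: Computer and Communication Research Center, Tsing Hua University
Venue: International Conference Hall, B1, EECS Building (資訊電機館), Tsing Hua University
Contact person: Contact phone: 03-5742847 ext. 2847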

Modern advances in information technology have enabled the pervasive use of digital multimedia data in a variety of business, scientific, government, and consumer applications. The explosive growth in the generation, storage, distribution, and consumption of multimedia data is accompanied by emerging requirements for indexing content, building standard exchange formats, and ensuring a trustworthy framework between users.

Recent work on multimedia indexing technologies has struggled to keep pace with these challenges. While feature-based indexing techniques satisfy some of the requirements, a need to understand the semantic meaning of multimedia data is foreseen and is currently driving the research paradigm to a new level. Multimedia understanding draws on techniques from disparate disciplines, including signal processing, machine learning, computer vision, and domain-specific recognition techniques. Although advances in speech, text, and face recognition have appeared in recent applications, a generic framework that recognizes thousands of visual objects and acoustic information has not yet been reported in the literature. In the first two lectures, I will introduce our current effort in developing frameworks for generic audio-visual object recognition, video structure understanding, and learning semantics from object recognition and video structures. In addition, a tutorial on machine learning and on proven statistical discriminant techniques such as Support Vector Machines, Gaussian Mixture Models, and Hidden Markov Models will be provided.

While the first generation of multimedia standards, such as MPEG-1/2/4 and H.26x, focuses on coding techniques for multimedia transmission, later multimedia standards aim at building metadata frameworks for data exchange, retrieval, storage, personalization, and security. Among the emerging standards, MPEG-7 targets standardized descriptions of multimedia content, and MPEG-21 intends to provide common exchange description formats across systems. In the third lecture, I will introduce these two standards and provide some example descriptions. Application examples, such as our pervasive video summarization and personalization system, will also be demonstrated.

The digital nature of multimedia information allows individuals to manipulate, duplicate, or access media information beyond the terms and conditions agreed upon in a given transaction. The large-scale acceptance of digital media distribution rests on its ability to provide legitimate services to all parties in the information chain, such as content creators, providers, agents, and consumers. The last lecture of this course will emphasize the concept of multimedia security and provide comprehensive details on some of its core technologies, such as digital signatures, watermarking, key management, and video encryption. Various examples of image/video authentication and ownership verification will be provided.

1. Registration and payment by April 1: students NT$500, faculty NT$1,000, others NT$1,500.

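2. Registration after April 1: students NT$800, faculty NT$1,500, others NT$2,000. (The fees above include lecture notes, a boxed lunch, and refreshments.)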


