The Semantic Web has become reality over the past couple of years. While certain practical topics—such as interoperability, etc.—have at least partially been addressed, scalability and expressivity issues regarding the utilisation of multimedia metadata on the Semantic Web are still widely neglected. However, existing Web (2.0) applications handling millions of multimedia assets are starting to take advantage of Semantic Web technologies.

This work contributes to design decisions regarding scalable and smart multimedia applications on the Semantic Web. Based on an analysis of practical issues—stemming from diverse projects and activities the author has participated in over the past four years—three areas have been identified, namely (i) performance and scalability issues on the data access level, (ii) the effective and efficient representation of multimedia content descriptions, and (iii) the deployment of multimedia metadata on the Semantic Web. The three research areas have as its common base the trade-off between expressivity and scalability.

We present our ﬁndings regarding scalable, yet expressive Semantic Web multimedia applications in a number of practical settings and discuss future directions, such as interlinking multimedia.

Excerpt

1. Introduction

1.1 Motivation

1.2 Problem Definition

1.2.1 Performance and Scalability Issues in Distributed Metadata Sources

1.2.2 Efficient and Effective Representation of Multimedia Metadata

1.2.3 Scaleable Multimedia Metadata Deployment on the Semantic Web

1.3 Reader’s Guide

1.4 What this work is NOT about

2. Related and Existing Work

2.1 Semantic Web Applications

2.1.1 Projects & Activities

2.2 Multimedia Applications

2.2.1 Smart Multimedia Content

2.2.2 Multimedia Metadata Deployment

2.2.3 Semantic Web Multimedia Applications—SWMA

2.2.4 Projects & Activities

2.3 Scalability and Expressivity

2.3.1 Infrastructure Level

2.3.2 Application Level

2.3.3 Projects & Activities

3. Multimedia Metadata

3.1 Multimedia Container Formats

3.1.1 eXtensible HyperText Markup Language–(X)HTML

3.1.2 Scalable Vector Graphics–SVG

3.1.3 Synchronized Multimedia Integration Language–SMIL

3.1.4 eXtensible 3D—X3D

3.2 Aspects of Multimedia Metadata

3.2.1 An Attempt of A Definition

3.2.2 Types of Metadata

3.2.3 Scope of Metadata

3.3 Multimedia Metadata Formats

3.3.1 Metadata for Still Images

3.3.2 Metadata for describing Audio Content

3.3.3 Metadata for describing Audio-Visual Content

3.3.4 Multimedia Content Description Interface–MPEG-7

3.3.5 Formats For Describing Specific Domains Or Workflows

3.3.6 Interoperability

4. Semantic Web

4.1 Logic and the Semantic Web

4.1.1 Knowledge Representation

4.1.2 Description Logics (DL)

4.1.3 Logic Programming (LP)

4.1.4 Integrating DL and LP

4.2 Semantic Web Vision

4.2.1 Synopsis

4.2.2 Current State

4.2.3 Future

4.2.4 Related Fields

4.3 Semantic Web Stack

4.3.1 Encoding & Addressing

4.3.2 Data Structure and Exchange

4.3.3 Data Model

4.3.4 Ontologies, Rules & Query

4.3.5 Trust & Data Provenance

4.3.6 Semantic Web Issues

4.4 Semantic Web Vocabularies

4.4.1 Generic Vocabularies

4.4.2 Social Vocabularies

4.4.3 Spatio-temporal Vocabularies

4.4.4 Other Vocabularies

4.5 Linked Data

4.6 Web 3.0

4.6.1 Web 2.0: Ajax & Mashups

4.6.2 Metadata in HTML

4.6.3 Web 2.0 + Semantic Web = Web 3.0?

4.7 Conclusion

5. Creating Smart Content Descriptions

5.1 Information Flow and Media Semantic Web Stack

5.2 Extraction vs. Annotation

5.2.1 Extraction

5.2.2 Annotation

5.3 How To Deal with the Semantic Gap

5.3.1 Low-level Feature Based Approach

5.3.2 Model-based Approach

5.3.3 Semantic Web Approach

5.3.4 Hybrid Approach

5.4 Multimedia Ontology Engineering

5.4.1 Methodologies

5.4.2 Ontology Engineering Tools

5.4.3 Review of Existing Multimedia Ontologies

6. Scaleable yet Expressive Content Descriptions

6.1 Introduction

6.2 Motivation and Scenarios

6.3 Requirements for the Description of Multimedia Assets

6.4 Environment Analysis: The Semantic Web

6.5 Multimedia Assets on the Semantic Web

6.6 Formal Descriptions of Multimedia Assets

6.6.1 Ontology Languages

6.6.2 Rules

6.6.3 Comparing Formal Descriptions Regarding the Requirements

6.7 Conclusions

7. Rational & Common Concepts

7.1 The Semantic Web Stack regarding SWMA

7.2 Design Principles and Common Concepts

7.2.1 Occam’s Razor

7.2.2 Follow-your-nose

7.2.3 Reuse & Layering

7.3 Expressivity on the Semantic Web

7.4 Scalability on the Semantic Web

7.5 Conclusion

8. A Performance and Scalability Metric for Virtual RDF Graphs

8.1 Motivation

8.2 Related and Existing Work

8.3 Virtual RDF Graphs

8.3.1 Types Of Sources

8.3.2 Characteristics Of Sources

8.4 A Metric for Virtual RDF Graphs

8.5 Conclusion

8.6 Acknowledgements

9. Media Semantics Mapping

9.1 Environment

9.2 Related Work

9.3 Media Semantics Mapping

9.3.1 Data and Metadata

9.3.2 Media Semantics

9.3.3 Spaces of Abstraction

9.3.4 Built-in rules

9.3.5 User-defined rules

9.3.6 The MSM Knowledge Base

9.4 Applying the Media Semantics Mapping

9.5 Mapping the NM2 Workflow to the Canonical Model

9.5.1 The NM2 Workflow

9.5.2 Authoring Of Non-linear Stories

9.5.3 Example NM2 Productions

9.5.4 Lessons Learned

9.5.5 The NM2 Workflow in Terms of Canonical Processes

9.6 Discussion

10. Efficient Multimedia Metadata Deployment

10.1 Motivation

10.1.1 Last Mile of Multimedia Metadata Deployment

10.1.2 Related Work

10.1.3 Design Principles

10.2 Use Cases

10.2.1 Use Case: Annotate and Share Photos Online

10.2.2 Use Case: Purchasing Music Online

10.2.3 Use Case: Describing the Structure of a Video

10.2.4 Use Case: Publishing Professional Content with Metadata

10.2.5 Use Case: Expressing and Using Complex Rights Information

10.2.6 Use Case: Detailed Description of Large Media Assets

10.2.7 Use Case: Cultural Heritage

10.2.8 Derived Requirements from the Use Cases

10.3 RDFa-deployed Multimedia Metadata

10.3.1 ramm.x Vocabulary

10.3.2 ramm.x extensions

10.3.3 Processing ramm.x Descriptions

10.4 Examples

10.4.1 Deploying a Still Image along with Exif Metadata

10.4.2 An Example from Cultural Heritage

10.5 Conclusion and Future Work

11. Conclusions

12. Outlook

12.1 Semantic Web multimedia applications now and in 10 years time

12.1.1 Emerging Metadata

12.1.2 Advanced Annotation Techniques

12.1.3 Interactive Media

12.2 Future Work

12.2.1 Meshups and More

12.2.2 Multimedia and the Web of Data

Objectives and Topics

This thesis aims to address the challenges of building scalable and expressive Semantic Web multimedia applications. The primary research focus lies on solving practical problems associated with the efficient access, representation, and deployment of multimedia metadata on a Web-scale, specifically targeting the trade-off between expressivity and scalability in various application scenarios.

Performance and scalability in distributed multimedia metadata sources.
Effective and efficient representation of multimedia content and metadata vocabularies.
Deployment of multimedia metadata on the Semantic Web, particularly bridging the gap between existing formats and RDF.
Engineering principles for designing Semantic Web multimedia applications.
Evaluation of formal description paradigms and ontology engineering methodologies for media assets.

Excerpt from the Book

Example 1.1 (Low-level feature description of a media asset with RDF).

A video clip with a duration of one hour is described with MPEG-7. Several visual low-level features (F) as colour, shape, texture, etc. are extracted for a number of spatial segments (S) per key frame (K). A multimedia ontology is then used to represent the MPEG-7 descriptors formally (on basis of RDF); an average number of RDF triples is assumed for each descriptor (TD). An estimation of the resulting RDF graph size then is F·K·S·TD. Let us assume that we want to capture 10 features, some 1000 key frames may exist, 10 spatial segments are marked up, and finally 10 triples are required per descriptor. This yields a total RDF graph size of 1 million triples—just for describing the low-level features of an hour of video footage.

Summary of Chapters

1. Introduction: This chapter provides motivation, defines the core research questions regarding the expressivity-scalability trade-off, and introduces the problem scope.

2. Related and Existing Work: This chapter reviews the current landscape of Semantic Web and multimedia applications, critically examining projects that have addressed similar scalability and expressivity issues.

3. Multimedia Metadata: This chapter provides a comprehensive overview of existing container formats and metadata standards, discussing their suitability for the Semantic Web.

4. Semantic Web: This chapter covers the logical foundations of the Semantic Web, detailing knowledge representation, description logics, and the semantic stack.

5. Creating Smart Content Descriptions: This chapter elaborates on the information flow in multimedia applications, exploring extraction versus annotation and multimedia ontology engineering.

6. Scaleable yet Expressive Content Descriptions: This chapter analyzes the requirements for multimedia assets and provides a comparison of formal description paradigms like OWL vs. WSML.

7. Rational & Common Concepts: This chapter defines design principles for engineering Semantic Web multimedia applications, such as Occam's Razor and the Follow-your-nose principle.

8. A Performance and Scalability Metric for Virtual RDF Graphs: This chapter defines a performance metric for accessing non-native RDF sources and demonstrates its application.

9. Media Semantics Mapping: This chapter discusses bridging the Semantic Gap through mapping low-level MPEG-7 features to formal semantic concepts in a non-linear production environment.

10. Efficient Multimedia Metadata Deployment: This chapter introduces the ramm.x framework for deploying existing multimedia metadata formats on the Semantic Web using RDFa.

11. Conclusions: This chapter summarizes the contributions of the thesis, reviewing the developed metrics, ontologies, and deployment frameworks.

12. Outlook: This chapter discusses future trends in Semantic Web multimedia applications and identifies open research areas.

Keywords

Semantic Web, Multimedia Applications, Scalability, Expressivity, Metadata, RDF, MPEG-7, Ontology Engineering, Multimedia Metadata Deployment, Semantic Gap, RDFa, Linked Data, Web 3.0, Knowledge Representation, Multimedia Semantics

Frequently Asked Questions

What is the core focus of this doctoral thesis?

The thesis focuses on addressing the expressivity-scalability trade-off when building smart and scalable multimedia applications on the Semantic Web.

What are the three main research areas identified?

The research addresses (i) performance and scalability in distributed data access, (ii) efficient representation of multimedia descriptions, and (iii) the deployment of multimedia metadata on the Semantic Web.

What is the primary objective of the work?

The goal is to provide design decisions and frameworks to enable effective utilization of multimedia assets on the Semantic Web while ensuring they remain scalable.

Which scientific methods are primarily employed?

The work utilizes a multidimensional analysis of practical project requirements, formal analysis of ontology languages, and experimental evaluation through metrics and prototypes (e.g., PSIMeter).

What does the main body of the work cover?

The main body systematically covers the fundamentals of the Semantic Web and multimedia metadata, defines engineering principles, introduces a scalability metric for virtual RDF graphs, describes media semantics mapping, and presents the ramm.x framework for metadata deployment.

Which keywords best characterize the research?

Core keywords include Semantic Web, Multimedia Metadata, Scalability, Expressivity, RDF, Ontology Engineering, and Multimedia Metadata Deployment.

What is the "Semantic Gap" in this context?

It refers to the lack of coincidence between automatically extracted low-level features from multimedia content and the high-level semantic interpretation required by human users.

How does the author propose bridging the "last mile" of metadata deployment?

The author proposes the ramm.x framework, which uses RDFa to embed or reference existing multimedia metadata formats (like Exif or MPEG-7) in Web documents, making them accessible to Semantic Web agents without inventing entirely new formats.

Excerpt out of 277 pages - scroll top

Details

Title: Building Scalable and Smart Multimedia Applications on the Semantic Web
College: University of Graz
Grade: 1,0
Author: Dr. Michael Hausenblas (Author)
Publication Year: 2009
Pages: 277
Catalog Number: V126801
ISBN (eBook): 9783640324187
ISBN (Book): 9783640791583
Language: English
Tags: multimedia Web of Data Semantic Web linked data RDF URI ramm.x
Product Safety: GRIN Publishing GmbH

Quote paper: Dr. Michael Hausenblas (Author), 2009, Building Scalable and Smart Multimedia Applications on the Semantic Web, Munich, GRIN Verlag, https://www.grin.com/document/126801

Building Scalable and Smart Multimedia Applications on the Semantic Web