Grin logo
de en es fr
Shop
GRIN Website
Publicación mundial de textos académicos
Go to shop › Ciencias de la computación - Aplicada

Predicting image data with the SRS-12 algorithm. Applying random projection-based c-approximate nearest-neighbor queries to image data

Título: Predicting image data with the SRS-12 algorithm. Applying random projection-based c-approximate nearest-neighbor queries to image data

Tesis (Bachelor) , 2015 , 41 Páginas , Calificación: 1.0

Autor:in: Philipp Güth (Autor)

Ciencias de la computación - Aplicada
Extracto de texto & Detalles   Leer eBook
Resumen Extracto de texto Detalles

Determining point similarity is a cornerstone for a wide variety of applications. Whenever data is being compared, the problem can be stated as a matter of comparing vectors. Due
to this fundamental importance, research has been conducted in this area for decades. For a long time, even for relatively low dimensions such as for example d = 10, complexity was far beyond of what was considered to be feasible in practice. A common approach is to soften the requirements for the accuracy of the neighbor search and look for points within a certain proximity to the query point instead of searching for exact neighbors.

In this thesis we evaluate how well the SRS-12 algorithm works on real-world image data in order to lay the ground for future work such as image prediction. The SRS-12 algorithm is an approach to c-approximate nearest-neighbors and claims to have a tiny index and arbitrary approximation ratio while maintaining good theoretical guarantees.

We first implement and verify the algorithm and subsequently examine the quality of its outputs when it is applied to image data by performing block matching. We find that the SRS-12
algorithm is indeed very suitable for processing image data as long as the input images are cut into patches that are sufficiently large. However the parameters of the algorithm have to be tuned carefully because they significantly affect not only the quality of the results, but also the computation time which can easily exceed an exact nearest-neighbor query if the parameters are not set properly. We conclude our experiment with a recommendation for well-working parameter settings and propose approaches to enhance the quality and speed of approximate nearest-neighbor queries on image data made with the SRS-12 algorithm such that it can be used in real-time.

Extracto


Table of Contents

1 Introduction

1.1 Motivation

1.2 Problem Statement

1.3 Objectives

1.4 Structure of this Thesis

2 A brief history and overview of Approximate Nearest-Neighbor Queries

2.1 Improving Index Structures

2.2 Softening Requirements for Accuracy

2.3 Further Related Work

3 An Introduction to SRS-12

3.1 Why SRS-12 is worth a consideration

3.2 SRS-12 in a Nutshell

3.3 Variants of SRS-12

3.4 Indexing

3.5 Normal Stopping Condition of the SRS-12 Algorithm

4 Implementation & Experimental Evaluation

4.1 Implementation Details for the Computation of Parameter Values

4.2 Datasets

4.3 Verification on SIFT1M Dataset

4.4 Block Matching

4.4.1 A Brief Introduction to Block Matching

4.4.2 Methodology

4.4.3 Parameter Settings

4.4.4 Computation Time

4.4.5 Conclusions

4.4.6 Summary - Recommended Parameter Settings

4.5 Computation Time of SRS-12 vs Exact Nearest-Neighbor Search

4.5.1 Possible Optimizations

5 Future Work

5.1 Image Prediction

5.2 Reducing Data through Rapid Object Detection and Background Removal

5.3 Optical Flow

5.4 Increasing the Accuracy of the SRS-12 Algorithm

6 Discussion

Research Objectives & Key Themes

This thesis investigates the applicability of the SRS-12 algorithm for performing approximate nearest-neighbor searches on real-world, high-dimensional image data. The primary objective is to evaluate whether this approach can efficiently yield meaningful results for image prediction tasks while maintaining a favorable trade-off between computational speed and accuracy.

  • Implementation and verification of the SRS-12 algorithm.
  • Evaluation of parameter influence on matching accuracy and computational time.
  • Comparison of SRS-12 against exact nearest-neighbor search methods.
  • Application of the algorithm to block matching in image sequences.
  • Proposals for real-time optimizations and future research directions.

Excerpt from the Book

3.1 Why SRS-12 is worth a consideration

The novelty of the SRS algorithm lies in its three strengths that have not been combined in a single algorithm up until the time of its publishing:

• Theoretical guarantees

• Linear-sized index

• Arbitrary approximation ratio

In practice, the advantages of SRS-12 are straightforward: it enables the search for approximate nearest-neighbors with the same, or in some instances even better performance as LSH, but without the same, heavy memory usage. The results demonstrate that SRS-12 is actually able to outperform state-of-the-art LSH-based methods while at the same time radically decreasing memory requirements such that it is possible to perform nearest-neighbor searches on a dataset containing one billion high-dimensional points on only a single commodity PC.

Summary of Chapters

1 Introduction: Provides the motivation for efficient nearest-neighbor searches in high-dimensional spaces and outlines the thesis objectives.

2 A brief history and overview of Approximate Nearest-Neighbor Queries: Reviews existing indexing structures and clustering approaches while discussing the challenges posed by the "curse of dimensionality".

3 An Introduction to SRS-12: Details the theoretical basis of the SRS-12 algorithm, including its indexing mechanism and random projection properties.

4 Implementation & Experimental Evaluation: Documents the practical implementation, parameter optimization tests, and performance benchmarks against exact neighbor search methods.

5 Future Work: Explores potential enhancements to the procedure and discusses future application areas such as object detection and optical flow.

6 Discussion: Synthesizes the experimental findings, confirming the algorithm's effectiveness for image data and summarizing the trade-offs identified.

Keywords

Approximate Nearest-Neighbor, SRS-12 Algorithm, High-Dimensional Data, Random Projection, Image Prediction, Block Matching, Index Structures, Dimensionality Reduction, Computational Efficiency, Similarity Search, Locality-Sensitive Hashing, Real-time Processing, Euclidean Space, SIFT1M, Performance Evaluation

Frequently Asked Questions

What is the core focus of this research?

The work focuses on evaluating the SRS-12 algorithm, specifically its performance and practical utility when applied to high-dimensional image data for approximate nearest-neighbor queries.

What are the primary themes discussed?

The main themes include similarity search methods, the mathematical foundations of random projections, parameter tuning for accuracy, and practical implementation on real-world datasets.

What is the main research objective?

The objective is to determine if the SRS-12 algorithm can effectively process image patches to achieve meaningful predictions while overcoming the memory and time limitations of exact search algorithms.

Which scientific methods are utilized?

The research employs a systematic implementation of the SRS-12 algorithm, unit testing, performance benchmarking on the SIFT1M dataset, and comparative analysis of computation times versus exact nearest-neighbor search.

What content is covered in the main body?

The main body covers the theoretical background of nearest-neighbor search, a detailed guide on the SRS-12 implementation, extensive experimental evaluation using block matching, and an analysis of computational efficiency.

Which keywords characterize this thesis?

Key terms include Approximate Nearest-Neighbor, SRS-12, Random Projection, Image Prediction, Block Matching, and Computational Efficiency.

Why is the "curse of dimensionality" a central problem in this work?

High dimensions often make traditional exact search methods computationally infeasible; this thesis tests whether SRS-12 can mitigate this issue via random projections while keeping indexes small.

How does the algorithm perform compared to an exact nearest-neighbor search?

Experiments show that SRS-12 can significantly reduce memory requirements and search times, providing a viable alternative for large-scale datasets, provided the parameters (like T and m) are correctly tuned.

What is the recommended parameter setting for image data?

Based on the experiments, the thesis suggests a patch size of 21x21, an approximation ratio c=2, and m=6 for a robust trade-off between speed and accuracy.

Final del extracto de 41 páginas  - subir

Detalles

Título
Predicting image data with the SRS-12 algorithm. Applying random projection-based c-approximate nearest-neighbor queries to image data
Universidad
University of Heidelberg  (Computer Science)
Calificación
1.0
Autor
Philipp Güth (Autor)
Año de publicación
2015
Páginas
41
No. de catálogo
V305214
ISBN (Ebook)
9783668033542
ISBN (Libro)
9783668033559
Idioma
Inglés
Etiqueta
Machine Learning Big Data Image Prediction Predictive Analytics Nearest Neighbors Image Recognition SRS-12
Seguridad del producto
GRIN Publishing Ltd.
Citar trabajo
Philipp Güth (Autor), 2015, Predicting image data with the SRS-12 algorithm. Applying random projection-based c-approximate nearest-neighbor queries to image data, Múnich, GRIN Verlag, https://www.grin.com/document/305214
Leer eBook
  • Si ve este mensaje, la imagen no pudo ser cargada y visualizada.
  • Si ve este mensaje, la imagen no pudo ser cargada y visualizada.
  • Si ve este mensaje, la imagen no pudo ser cargada y visualizada.
  • Si ve este mensaje, la imagen no pudo ser cargada y visualizada.
  • Si ve este mensaje, la imagen no pudo ser cargada y visualizada.
  • Si ve este mensaje, la imagen no pudo ser cargada y visualizada.
  • Si ve este mensaje, la imagen no pudo ser cargada y visualizada.
  • Si ve este mensaje, la imagen no pudo ser cargada y visualizada.
  • Si ve este mensaje, la imagen no pudo ser cargada y visualizada.
  • Si ve este mensaje, la imagen no pudo ser cargada y visualizada.
Extracto de  41  Páginas
Grin logo
  • Grin.com
  • Envío
  • Contacto
  • Privacidad
  • Aviso legal
  • Imprint