Enhancing Image Search with the Vector Engine for Amazon OpenSearch Serverless and Amazon Rekognition

Dec 1, 2023

6 MIN READ

Share this article

Introduction

In today's fast-paced, high-tech landscape, the way businesses handle the discovery and utilization of their digital media assets can have a huge impact on their advertising, e-commerce, and content creation. The importance and demand for intelligent and accurate digital media asset searches is essential and has fueled businesses to be more innovative in how those assets are stored and searched, to meet the needs of their customers. Addressing both customers’ needs, and overall business needs of efficient asset search can be met by leveraging cloud computing and the cutting-edge prowess of artificial intelligence (AI) technologies.

Use Case Scenario

Now, let's dive right into a real-life scenario. An asset management company has an extensive library of digital image assets. Currently, their clients have no easy way to search for images based on embedded objects and content in the images. The company’s main objective is to provide an intelligent and accurate retrieval solution which will allow their clients to search based on embedded objects and content. So, to satisfy this objective, we introduce a formidable duo: the vector engine for Amazon OpenSearch Serverless, along with Amazon Rekognition. The combined strengths of Amazon Rekognition and OpenSearch Serverless will provide intelligent and accurate digital image search capabilities that will meet the company’s objective.

Architecture

Architecture Overview

The architecture for this intelligent image search system consists of several key components that work together to deliver a smooth and responsive user experience. Let's take a closer look:

Vector engine for Amazon OpenSearch Serverless:‍

The vector engine for OpenSearch Serverless serves as the core component for vector data storage and retrieval, allowing for highly efficient and scalable search operations.

Vector Data Generation:

When a user uploads a new image to the application, the image is stored in an Amazon S3 Bucket.
S3 event notifications are used to send events to an SQS Queue, which acts as a message processing system.
The SQS Queue triggers a Lambda Function, which handles further processing. This approach ensures system resilience during traffic spikes by moderating the traffic to the Lambda function.
The Lambda Function performs the following operations:

- Extracts metadata from images using Amazon Rekognition's `detect_labels` API call.

- Creates vector embeddings for the labels extracted from the image.

- Stores the vector data embeddings into the OpenSearch Vector Search Collection in a serverless manner.

- Labels are identified and marked as tags, which are then assigned to .jpeg formatted images.

Query the Search Engine:

Users search for digital images within the application by specifying query parameters.
The application queries the OpenSearch Vector Search Collection with these parameters.
The Lambda Function then performs the search operation within the OpenSearch Vector Search Collection, retrieving images based on the entities used as metadata.

‍

Advantages of Using the Vector Engine for Amazon OpenSearch Serverless

The choice to utilize the OpenSearch Vector Search Collection as a vector database for this use case offers significant advantages:

Usability: Amazon OpenSearch Service provides a user-friendly experience, making it easier to set up and manage the vector search system.
Scalability: The serverless architecture allows the system to scale automatically based on demand. This means that during high-traffic periods, the system can seamlessly handle increased loads without manual intervention.
Availability: The managed AI/ML services provided by AWS ensure high availability, reducing the risk of service interruptions.
Interoperability: OpenSearch's search features enhance the overall search experience by providing flexible query capabilities.
Security: Leveraging AWS services ensures robust security protocols, helping protect sensitive data.
Operational Efficiency: The serverless approach eliminates the need for manual provisioning, configuration, and tuning of clusters, streamlining operations.
Flexible Pricing: The pay-as-you-go pricing model is cost-effective, as you only pay for the resources you consume, making it an economical choice for businesses.

Conclusion

The combined strengths of the vector engine for Amazon OpenSearch Serverless and Amazon Rekognition mark a new era of efficiency, cost-effectiveness, and heightened user satisfaction in intelligent and accurate digital media asset searches. This solution equips businesses with the tools to explore new possibilities, establishing itself as a vital asset for industries reliant on robust image management systems.

The benefits of this solution have been measured in these key areas:

First, search efficiency has seen a remarkable 60% improvement. This translates into significantly enhanced user experiences, with clients and staff gaining swift and accurate access to the right images.
Furthermore, the automated image metadata generation feature has slashed manual tagging efforts by a staggering 75%, resulting in substantial cost savings and freeing up valuable human resources. This not only guarantees data identification accuracy but also fosters consistency in asset management.
In addition, the solution’s scalability has led to a 40% reduction in infrastructure costs. The serverless architecture permits cost-effective, on-demand scaling without the need for hefty hardware investments.

In summary, the fusion of the vector engine for Amazon OpenSearch Serverless and Amazon Rekognition for intelligent and accurate digital image search capabilities has proven to be a game-changer for businesses, especially for businesses seeking to leverage this type of solution to streamline and improve the utilization of their image repository for advertising, e-commerce, and content creation.

If you’re looking to modernize your cloud journey with AWS, and want to learn more about the serverless capabilities of Amazon OpenSearch Service, the vector engine, and other technologies, please contact us.