Page 150 - 2023-Vol19-Issue2

P. 150

146 | Mohammed, Oraibi & Hussain

two components. In real life, retrieving an exact picture from exact picture from a sizable database, a problem that persists
a sizable database is still difficult. The biggest problem is despite the various contributions of existing CBIR algorithms
the semantic mismatch between the image’s low-level visual to image representation and similarity measure. Moreover,
qualities and its high-level meaning [2]. This gap has been the while the Bag of Visual Features (BoVF) model has been
subject of countless research during the last three decades [3]. extensively employed in existing CBIR techniques, it neglects
There are several ways to translate high-level concepts in pic- spatial information and lacks semantic meanings. This lack
tures into features. The basis of CBIR is comprised of these of spatial and semantic information leads to a less accurate
elements. According to the methodologies used for feature representation of images, thereby reducing the effectiveness
extraction, global and local characteristics are two common of the retrieval process. Another model, the Object Bank (OB)
categories for features. Global characteristics of the image, model, provides a high-level picture representation but leads
including color, texture, shape, and spatial details, serve as a to a large dimensionality difficulty when applied. This high di-
depiction of the entire item. They benefit from being quicker mensionality can complicate the retrieval process and increase
at feature extraction and similarity calculations [4]. On the computational requirements. Lastly, CNN-based Deep Learn-
other hand, they fail to recognize the difference between the ing models, despite their effectiveness in scene categorization,
image’s backdrop and the item in it (different image parts). have their own limitations. The complicated training proce-
They are therefore inappropriate for object identification or dure for parameter adjustment, the requirement for enormous
retrieval in complicated settings [5]. However, they are accept- amounts of training data, and excessive training time are sig-
able for object categorization and detection [6]. There have nificant drawbacks of these models. As a result, CNN-based
been significant attempts made by academia and industry to models cannot be recommended as the best option for CBIR
close this semantic gap. As a result, CBIR has been shown to on various datasets. These problems collectively present a
make significant progress recently. For instance, well-known substantial challenge for the development of efficient and
search engines like Google and Baidu can look for similar im- accurate CBIR systems. In this paper, we contribute to the
ages for any image. Several e-commerce websites, including field of CBIR by introducing a novel method that leverages
Alibaba, Amazon, and eBay provide comparable commodi- advanced models such as Inception and Xception for feature
ties search features. The content suggestion features on social extraction from images. Our method addresses the seman-
media networks like Pinterest are comparable [1]. tic mismatch between an image’s low-level visual qualities
and its high-level semantic content, a significant challenge
Query By Image Content (QBIC) and CBIR are related in current CBIR algorithms. We provide a comprehensive
by nature [7]. Early in the 1990s, CBIR was founded [8]. analysis of our method’s performance across multiple image
This automated process uses a picture as a query to present classes, demonstrating its effectiveness and potential for im-
a collection of photos that correspond to the query. The low- provements in certain areas. The rest of the paper is divided
level picture attributes, such as texture, color, and shape, are into the following sections: Section II provides an overview
taken from the database images in order to categorize them. of the related work of the existing CBIR methods. The third
We assume that images in the same category will share similar section, will give a brief overview of what CBIR is and how
traits. Retrieval of images will therefore see an incredible it works. The fourth section will go into more detail about
increase in efficiency when similarity measurement is carried the methods used in this research, including deep learning
out based on picture attributes [9]. One of the subcategories techniques, the dataset used, and the approaches taken. The
of the soft computing phenomena known as Deep Learning fifth section will present the results of the research and com-
(DL) which allows for the retrieval of data from millions pare them to other methods. Finally, the conclusion and future
of separated pictures [10]. A content-based picture retrieval work section will summarize the findings and discuss potential
system performs optimally when the feature representation areas for future research.
and similarity evaluation, which have been extensively studied
by multimedia researchers for decades, are used. Even though II. RELATED WORK
several solutions have been proposed, it is still among the
trickiest issues in CBIR research. This challenge can be linked the cutting-edge CBIR methods are critically examined in this
to the core challenge in AI: how to build and train AI tools part. A variety of properties, including color, form, texture,
that can carry out routine human tasks [11, 12]. and spatial arrangement, have been incorporated in existing
CBIR algorithms. Similar to this, other interest points-based
The field of CBIR faces several significant challenges that features descriptors have been suggested as a method of ob-
impede the development of efficient and accurate retrieval taining the attributes for picture retrieval [13, 14] [13, 14]. In
systems. One of the primary issues is the semantic gap be- order to recover pictures, scientists in [15] suggested a Micro
tween low-level characteristics and human visual perceptions Structure Descriptor (MSD) that is generated utilizing edge
in CBIR methods. This gap makes it difficult to retrieve an

145 146 147 148 149 150 151 152 153 154 155