Object detection has become faster and more precise due to improved computer vision systems. Many successful object detections have dramatically improved owing to the introduction of machine learning methods. This study incorporated cutting- edge methods for object detection to obtain high-quality results in a competitive timeframe comparable to human perception. Object-detecting systems often face poor performance issues. Therefore, this study proposed a comprehensive method to resolve the problem faced by the object detection method using six distinct machine learning approaches: stochastic gradient descent, logistic regression, random forest, decision trees, k-nearest neighbor, and naive Bayes. The system was trained using Common Objects in Context (COCO), the most challenging publicly available dataset. Notably, a yearly object detection challenge is held using COCO. The resulting technology is quick and precise, making it ideal for applications requiring an object detection accuracy of 97%.
Self-driving cars are a fundamental research subject in recent years; the ultimate goal is to completely exchange the human driver with automated systems. On the other hand, deep learning techniques have revealed performance and effectiveness in several areas. The strength of self-driving cars has been deeply investigated in many areas including object detection, localization as well, and activity recognition. This paper provides an approach to deep learning; which combines the benefits of both convolutional neural network CNN together with Dense technique. This approach learns based on features extracted from the feature extraction technique which is linear discriminant analysis LDA combined with feature expansion techniques namely: standard deviation, min, max, mod, variance and mean. The presented approach has proven its success in both testing and training data and achieving 100% accuracy in both terms.
The main objective of this paper project was to create a state-of-the-art face identification technique that can handle the various difficulties caused by changes in illumination, occlusions, and facial emotions. Face detection is a cornerstone of computer vision, facilitating diverse applications ranging from surveillance systems to human-computer interaction. Throughout this paper, the comprehensive exploration of advancing face detection methodologies has been undertaken, culminating in developing and evaluating a novel approach. The challenges posed by variations in facial expressions, lighting conditions, and occlusions necessitated a multifaceted solution. Our proposed method, which consists of interconnected steps, works quite well to overcome these challenges. Using deep learning architectures to increase feature extraction and discrimination was beneficial in the initial stage of fine-tuning Residual Networks (ResNet-50) to serve as the Region-based Convolutional Neural Network (Faster R-CNN) framework classifier. The process of gradually optimizing thresholds, such as batch size, learning rate, and detection threshold, involved using the Gray Wolf optimization technique (GWO). The conversion process was accelerated and improved overall detection process efficiency and accuracy using a clever fusion of machine learning and metaheuristic optimization techniques. A key component of our methodology is the careful data processing, which was necessary to ensure. The suggested method was carefully examined on a particular dataset, and the 94% training accuracy that was attained together with an identical test dataset accuracy highlights the method’s resilience. These findings support the effectiveness of our approach in reducing false positives and negatives, resulting in unmatched recall and precision in the detection system. The discovery has significant significance as it can potentially improve face detection systems’ performance and reliability in various real-world applications, such as human-computer interaction and surveillance. Convolutional neural networks, deep learning architectures, and metaheuristic optimization approaches were synergized to produce a new and reliable solution
Image segmentation is a wide research topic; a huge amount of research has been performed in this context. Image segmentation is a crucial procedure for most object detection, image recognition, feature extraction, and classification tasks depend on the quality of the segmentation process. Image segmentation is the dividing of a specific image into a numeral of homogeneous segments; therefore, the representation of an image into simple and easy forms increases the effectiveness of pattern recognition. The effectiveness of approaches varies according to the conditions of objects arrangement, lighting, shadow and other factors. However, there is no generic approach for successfully segmenting all images, where some approaches have been proven to be more effective than others. The major goal of this study is to provide summarize of the disadvantages and the advantages of each of the reviewed approaches of image segmentation.
Aircraft detection is a vital and significant field within object detection that has garnered considerable attention from academics, particularly following the advancement of deep learning methods. Aircraft detection has recently become widely utilized in several civil and military fields. This comprehensive survey meticulously categorizes and evaluates diverse deep learning methodologies in airplane detection research. Encompassing radar-based, image-based, and multimodal approaches, the paper presents a structured framework to enhance understanding of the evolving research landscape within this domain. The survey critically identifies gaps and discerns emerging trends, offering valuable insights into standard datasets of aircraft images, performance metrics, real-world applications, and challenges and limitations encountered by aircraft detection systems. Its potential contributions are underscored as pivotal for advancing the safety and security of air travel. This research paper is the inaugural publication of its kind in the domain of aircraft detection review papers, establishing itself as an all-encompassing reference for subsequent scholars.
The segmentation methods for image processing are studied in the presented work. Image segmentation can be defined as a vital step in digital image processing. Also, it is used in various applications including object co-segmentation, recognition tasks, medical imaging, content based image retrieval, object detection, machine vision and video surveillance. A lot of approaches were created for image segmentation. In addition, the main goal of segmentation is to facilitate and alter the image representation into something which is more important and simply to be analyzed. The approaches of image segmentation are splitting the images into a few parts on the basis of image’s features including texture, color, pixel intensity value and so on. With regard to the presented study, many approaches of image segmentation are reviewed and discussed. The techniques of segmentation might be categorized into six classes: First, thresholding segmentation techniques such as global thresholding (iterative thresholding, minimum error thresholding, otsu's, optimal thresholding, histogram concave analysis and entropy based thresholding), local thresholding (Sauvola’s approach, T.R Singh’s approach, Niblack’s approaches, Bernsen’s approach Bruckstein’s and Yanowitz method and Local Adaptive Automatic Binarization) and dynamic thresholding. Second, edge-based segmentation techniques such as gray-histogram technique, gradient based approach (laplacian of gaussian, differential coefficient approach, canny approach, prewitt approach, Roberts approach and sobel approach). Thirdly, region based segmentation approaches including Region growing techniques (seeded region growing (SRG), statistical region growing, unseeded region growing (UsRG)), also merging and region splitting approaches. Fourthly, clustering approaches, including soft clustering (fuzzy C-means clustering (FCM)) and hard clustering (K-means clustering). Fifth, deep neural network techniques such as convolution neural network, recurrent neural networks (RNNs), encoder-decoder and Auto encoder models and support vector machine. Finally, hybrid techniques such as evolutionary approaches, fuzzy logic and swarm intelligent (PSO and ABC techniques) and discusses the pros and cons of each method.
In recent years, self-driving cars and reducing the number of accident casualties have drawn a lot of attention. Although it is crucial to increase driver awareness on the road, autonomous vehicles can emulate human driving and guarantee improved levels of road safety. Artificial intelligence (AI) technologies are often employed for this purpose. However, deep learning, a subset of AI, is prone to numerous errors, a wide range of threats, and needs to handle vast amounts of data, which imposes high-performance hardware requirements. This study suggests a deep learning model for object recognition that employs characteristics to describe data rather than images. Our model employs the COCO dataset as the training foundation, and it was suggested that the features be retrieved using the principal component analysis PCA extraction method. The current results demonstrate the efficacy and precision of our model, with an accuracy of 99.96 %. Furthermore, the performance indices, i.e., recall, precision, and F1-score, achieved about 1 for most of the COCO classes in training phase and promising results in testing phase.
License plate recognition is an essential part of contemporary surveillance systems since it is helpful in many applications, including parking management, vehicle access control, traffic control, and law enforcement. This project aims to provide a robust and dependable method for detecting license plates that will outperform existing approaches in accuracy and dependability. This observation method uses contemporary technology to address challenging troubles related to license plate recognition. Our methodology is primarily based on the Faster R-CNN structure, an established model for picture item detection. The novel thing, even though, is how Gray Wolf Optimization—which draws notion from the searching conduct of gray wolves—is mixed with the Faster R-CNN network. The accuracy is greatly improved by this synergistic combination, which also improves detection abilities. Moreover, an improved ResNet-50 model is blanketed to improve the classification system similarly, ensuring accurate license plate detection in several situations. The extensively utilized ”car license plate detection” dataset is used to assess the recommended technology very well, confirming its efficacy in practical settings. The empirical outcomes show exceptional performance, with a median precision of 98.21%, demonstrating how nicely the hybrid method works to attain the very best stage of license plate detecting accuracy. This painting establishes a new benchmark in license plate identity using cutting-edge technology and innovative techniques, starting the door for enhanced safety and surveillance.