Assistant Professor

Update: 2026-06-21

Mohsen Ramezani

Engineering / Department of Computer and IT Engineering

Master Theses

Development and implementation of a high-speed sorting system for raisin product using machine vision and artificial intelligence
2026
The agricultural sector, as a driver of sustainable development and a strategic component of the national economy, plays a pivotal role in enhancing value-added and developing non-oil exports. In this context, raisins as one of the most significant grape derivatives and an export-oriented commodity hold a special position in global markets due to their high nutritional value and foreign exchange earning potential. Maintaining this competitiveness requires adherence to strict quality standards and the elimination of errors inherent in traditional methods based on human inspection. To achieve this objective, the utilization of post-harvest technology focusing on machine vision systems is considered an unavoidable necessity in the product's supply chain. Meanwhile, the implementation of machine vision-based grading systems, relying on deep learning algorithms and their deployability on low-cost embedded systems, enables high-speed operations while eliminating human error. This research was conducted with the aim of designing and implementing a real-time machine vision system based on Edge AI for the detection of stemmed raisins on a conveyor belt. In this study, the process of data acquisition and image recording at various linear conveyor speeds was executed by an edge-processing-based machine vision system. Initially, a comprehensive database was developed under controlled lighting conditions, and following the labeling operations, Transfer Learning on pre-trained versions of YOLOv8n, YOLOv10n, and YOLOv11n was utilized to develop customized models. In the next stage, the performance of advanced object detection models was evaluated, and the Fine-tuning process was carried out on these three architectures; during this process, the models were optimized across three input scales of 160×160, 256×256, and 320×320 pixels to overcome computational power constraints. To deploy on the limited hardware of the Raspberry Pi 4, the base models were redesigned in three input dimensions using optimized formats such as MNN, TFLite, and NCNN at three levels of computational precision (FP32, FP16, and INT8) to complete the conversion and quantization process for embedded deployment. Finally, the derived models were deployed on the Raspberry Pi 4 embedded hardware, and the operational performance of the system was evaluated across three linear conveyor speeds (21.37 cm/s, 47.71 cm/s, and 56.21 cm/s) based on indices of recall rate, inference time, and energy consumption per frame. Experimental results showed that although the YOLOv11n model at 320×320 pixels recorded the highest recall rate (97.8%), it is unsuitable for Raspberry Pi-based grading systems due to high latency. In contrast, the YOLOv10n-MNN model with 160×160 pixel input dimensions due to the use of an NMS-free architecture successfully recorded an average inference time of 32 ms (equivalent to 31 FPS) and an average recall rate of 93.5% across all speeds. Additionally, this model ensured the thermal stability of the system during long-term operations with an energy consumption of only 0.165 Joules per frame. The findings of this research confirm that the YOLOv10n-MNN model is the optimal trade-off between detection accuracy and processing speed. With a six-fold improvement in frame rate compared to the baseline model, this system enables product processing at the maximum linear speed (56.21 cm/s) and is proposed as an operational and low-cost solution for the intelligent automation of raisin sorting lines.
Feature selection from data stream using decision information and discrimination score
2026
With the development of data generation technologies, a huge amount of information is continuously generated, known as data streams. These data, with characteristics such as unlimited volume, high generation speed, and dynamic changes, pose serious challenges to traditional feature selection algorithms. Existing methods mainly take a greedy approach and focus only on selecting the features with the highest scores, which prevents the consideration of hidden interactions between features. In this study, a novel method for feature selection in data streams is proposed, which is based on the intelligent combination of Decision Information (DI) and Discernibility Score (DS). Decision Information quantifies the informativeness of a feature in relation to the target variable, while the Discernibility Score evaluates the ability of features to distinguish between instances belonging to different classes. The proposed approach, using DI to assess the relationship of features and DS to measure redundancy and discriminative power, is able to dynamically and incrementally select the best subset of features. This mechanism automatically updates the feature selection process with new data, overcoming the problems caused by the greedy nature of previous methods. The main advantage of this framework is its ability to identify synergies between features by combining DI and DS criteria, while its incremental design provides high scalability and the possibility of implementation in real-time environments. The evaluation of the proposed method was carried out on various medical, genetic, handwritten and synthetic datasets with accuracy, F1 score and ROC-AUC criteria. Experimental results showed that the proposed method has a significant advantage compared to twelve previous methods with an average accuracy of 89.1%. This method not only effectively reduces the data dimensionality, but also continuously improves the accuracy and efficiency of machine learning models in the data flow environment. The achievements of this research pave the way for effective applications in areas such as social networks, Internet of Things, and fraud detection systems.
Using deep neural networks based on UAV images to generate water spray dispersion maps in sprinkler irrigation and predict water consumption in agricultural fields
2025
Given the critical importance of optimizing water resource utilization in agriculture and the necessity of minimizing water losses, this study focuses on two primary objectives: (1) designing and introducing a new dataset comprising high-resolution aerial videos alongside simultaneous field measurements of water consumption in carrot, potato, and alfalfa farms; and (2) developing a novel deep learning–based framework for accurately estimating the water usage of irrigation risers in these agricultural fields. The constructed dataset captures a diverse range of environmental and operational conditions, including various riser configurations (3, 6 and 9 risers), multiple UAV flight altitudes, and different times of day. To ensure data quality, all videos and corresponding irrigation maps were carefully annotated and validated. The proposed approach, leveraging advanced deep learning architectures, particularly convolutional neural networks (CNNs) combined with attention mechanisms and a Transformer, achieved highly accurate and reliable estimations of water consumption for each riser. Comparative analyses with pre-trained models including VGG16, ResNet, GoogleNet, and MobileNet showed strong consistency with field measurements. The proposed method achieved an accuracy of 93% in estimating water usage and irrigation coverage, surpassing the highest accuracy of 88% obtained from existing pre-trained models. This capability also enabled the identification of both under-irrigated and over-irrigated zones across the studied fields. Overall, this research not only introduces a valuable and validated dataset along with an effective deep learning–based estimation framework, but also provides a foundation for the development and deployment of intelligent irrigation management systems at various scales. These findings can assist farmers and water resource managers in leveraging modern technologies to reduce water waste, enhance efficiency, and improve the quality and yield of agricultural production.
Feature Selection in Multi-Label Data Using a Hybrid Filter-Wrapper Algorithm Based on Constraint Strategies and Adaptive Optimization
2025
Multi-label feature selection presents unique challenges due to the intricate label dependencies and high dimensionality inherent in modern datasets. Traditional hybrid methods that combine filter and wrapper strategies often fall short due to static integration schemes and limited interaction, leading to suboptimal performance and convergence to local optima. To overcome these limitations, this paper introduces a novel adaptive multi-objective hybrid framework called MoMOA (Multi-objective Magnetic Optimization Algorithm), specifically designed for multi-label feature selection. The proposed method initiates with a dual-score filter mechanism that integrates linear statistical relevance and randomized scoring to generate a diverse and informative feature subset. This subset seeds the wrapper phase, where an evolutionary optimization process inspired by magnetic force dynamics iteratively refines solutions. MoMOA employs a time-adaptive weighting strategy to balance multiple conflicting objectives, including classification accuracy, feature compactness, and generalization error. Unlike traditional approaches, the framework maintains a dynamic feedback link between filtering and wrapper stages, facilitating cohesive convergence and enhanced search efficiency. Experiments on eleven multi-label benchmark datasets demonstrate the proposed algorithm’s superiority in improving feature selection performance, striking a robust balance between computational efficiency and predictive effectiveness in complex, high-dimensional scenarios.
Design and implementation of a high-speed online system based on deep learning for grading of oleaster (Elaeagnus angustifolia) fruit
2024
The agricultural sector and its related industries are among the main foundations of the economy of different countries, especially developing societies. One of the subgroups of the agricultural sector that is developing significantly in the world is the Post-harvest technology. Meanwhile, Oleaster is a fruit with high nutritional and medicinal properties, which is mostly graded using traditional and manual methods. The color and appearance of Oleaster is one of the most important factors related to its quality. Due to the increasing consumption of this product in the world and with the development of new technologies, there is a need for a new method with high efficiency and without human intervention for its grading. For this purpose, machine vision techniques based on deep learning can be used to determine the quality grade of Oleaster with acceptable accuracy. Therefore, the purpose of this research is to develope a Computer Vision based system for Oleaster grading at different grading speed levels. The images acquired at different levels of conveyouv belt were ancdysed using the machine vision system utilysing the Python programming environment. Then the frames containing the Oleaster samples were extracted. Mask R-CNN algorithm was used to segment the output frames to obtain the the boundary of Oleaster in images. The highest average segmentation accuracy at the first speed level (4.82 cm/s) was obtained as 104.11. Considering the importance of speed and accuracy for grading, YOLO network was used to train the classification model. For comparison of classification accuracy results at different speeds, YOLOv8x and YOLOv8n networks were used. The classification evaluation was carried out using statistical indicators such as sensitivity, specificity, accuracy and overall accuracy. The overall classification accuracy of the YOLOv8x network's best model was 91.25% in the evaluation phase and it was 88.88% for the YOLOv8n network at grading speed of 15/18 cm/s. The results of this study showed that the proposed system has the necessary potential as a fast and highly accurate method to determine the quality class of Oleaster.
A Filter-Wrapper Many-Objective Multi-label Feature Selection
2023
As a data preprocessing method, feature selection is a significant step before classification and other machine learning algorithms. Feature selection approaches are mainly divided into filter and wrapper methods, and each one has its own advantages. To leverage the strengths of both approaches, filter-wrapper methods have been introduced. However, the integration and collaboration of filter and wrapper methods can be a challenging process, because there is a need to coordinate different criteria and algorithms, as well as designing a method that ensures compatible cooperation between these methods. On the other hand, multi-objective optimization algorithms are efficient for dealing with multi-label feature selection problems. Nevertheless, there have been relatively few studies conducted on filter-wrapper multi-objective feature selection for multi-label data. In this research, we first present a new method for collaboration between filter and wrapper components using an innovative technique called Importance Probability Models (IPMs). And then using this technique, we introduce a multi-objective filter-wrapper feature selection approach for multi-label data called FWMMFS-IMPs. This approach uses a modified fast filter component to adjust the initial IPMs, which are then used to generate the optimal initial population in the wrapper algorithm. IPMs are continuously updated using the optimal solutions resulting from the evolution of the wrapper method, and by taking advantage of this situation, a new approach for the mutation process in this algorithm is also presented. This process improves the convergence rate of the algorithm, prevents it from getting stuck in the local optimum, and balances the exploration and exploitation. The final IPMs, which are the result of a consistent collaboration between the filter and wrapper methods, are converged, and their sorting in descending order creates the final feature ranking. To demonstrate the effectiveness of the proposed method, we used several well-known criteria for multi-label classification evaluation, and performed various experiments on different multi-label datasets. FWMMFS-IMPs has been compared to similar multi-label feature selection algorithms, and the experimental results demonstrate its effectiveness across several evaluation aspects, including stability and performance metrics. FWMMFS-IMPs effectively reduces the dimensionality of multi-label data across various domains, including text, music, biology, and images, and consistently outperforms other approaches across different evaluation criteria.
The effect of content marketing on audience attraction in Instagram social network
2022
This research aims to find the effect of content marketing on audience attraction on the Instagram network by examining different parts of the content response to the audience's needs, the attractiveness of the content formats, content building, online support and the distribution coefficient in the process of building trust and audience attraction on the Instagram network. There have been. The research method is descriptive and correlational. The statistical population of the research includes 30 Instagram sales networks in Iran, which is obtain using Cochran's formula, a sample of 384 people. In order to collect the desired research data and respond to the research hypotheses, the required information was collected by researcher-made a questionnaire and it was analyzed using the structural equation modeling approach by Smart PLC and SPSS software. In the qualitative part, satisfaction or dissatisfaction with these pages was analyzed and checked by means of text mining, then by means of artificial intelligence and Python software, and by coding the words and sentences used in the comments, the results It was found that it was divided into 4 categories: satisfied, dissatisfied, doubtful and indifferent. An in-depth interview conducted with ten Instagram network consulting experts to check the performance of these networks. Then, using the ant colony algorithm, the factors affecting the level of satisfaction or dissatisfaction with each Instagram network were calculated. As a result, the hypotheses of effective factors in content marketing on audience attraction confirmed. In the qualitative part, it concluded that for the satisfaction of the selected store network, firstly, trust building, then the distribution coefficient and finally the audience attraction had the highest percentage and the reason for their dissatisfaction was weak in online support features, content format and need is the audience.
presenting a two-stream method based on traditional and deep complementary features to detect human activity in video
2022
Today, human action recognition as an important research field is used in different applications and many computer-vision researches have focused on this area to improve recognition accuracy. In this paper, a two-stream method is introduced incorporating a new structure including two spatial features to cover their defects. Utilizing this structure leads to better performance finally. In the first stream, wavelet coefficients of key-frames with proper multi-resolution are extracted, and deep features of these key-frames are also extracted to be used in the other stream. Utilizing these features results in using proper spatial information of both local and global changes for this application. The features in each stream are gathered in a spatial feature map. The map in each stream is then given to a deep network with convolutional layers to learn the temporal relations between spatial features of sequential key-frames, and then the network classifies the action in each stream independently. Finally, the classification results of both streams are combined to achieve an accurate and confident prediction for the action label. The proposed method is examined on three challenging datasets as UCFYT, UCF-sport, and JHMDB with real videos which its accuracy on these datasets is 98.7, 99.83, and 92.86, respectively. The proposed method has about 4.6 percent better performance rather than the best previously introduced method on average.
بازسازی تصویر مبتنی بر شبکه متخاصم مولد هدایت شده توسط اطلاعات اضافه تعریف شده توسط کاربر
2022
بازگرداندن ناحیه آسیب دیده در تصاویر دیجیتال (بازسازی تصویر) می تواند به عنوان یک مشکل دشوار در نظر گرفته شود که بر اساس شدت آسیب به نسبت سخت تر می شود. در چند سال اخیر پیشرفت هایی در مقابله با این موضوع از طریق استفاده از مدل های یادگیری عمیق صورت گرفته است. در این مطالعه با توجه به کاربردهای موفق GAN ها در زمینه های مختلف، رویکرد جدیدی برای بازسازی تصویر ارائه شده است. الگوریتم پیشنهادی شامل یک مولد و یک تمایز عمومی است. ژنراتور مسئول بازیابی ناحیه از دست رفته است و تمایزکننده عمومی به تشخیص درست بودن یا نبودن ناحیه تعمیر مربوط می شود. معماری ژنراتور از دو انکودر خودکار تشکیل شده است. علاوه بر این، Wasserstein GAN برای اطمینان از ثبات تمرین استفاده می شود. به عنوان تصویر ورودی، یک تصویر نماد 32 در 32 نیز برای هدایت معنایی ژنراتور استفاده می شود و سپس با تصویر خراب برای پر کردن قسمت یا مناطق از دست رفته بدون از دست دادن برخی اشیاء موجود یا پیش بینی اشیاء یا اشکال ناخواسته، الحاق می شود. تصویر راهنما می تواند توسط کاربر برنامه پیشنهاد شود یا موارد دیگری مانند واترمارک در نظر گرفته شود. این روش از نظر کمی و کیفی با مدل های پیشرفته ای که از شبکه متخاصم مولد استفاده می کنند مقایسه می شود. این رویکردها می توانند ساختارها و بافت های بصری قانع کننده ای ایجاد کنند، اما اغلب ساختارهای تغییر شکل یافته، بافت های تار یا اشیایی از دست می دهند که با مناطق اطراف همگام نیستند. نتایج ارائه شده روی مجموعه داده CelebA-HQ نشان می دهد که مدل پیشنهادی می تواند با پیکسل های از دست رفته در مقیاس بزرگ مقابله کند و نتایج واقعی ایجاد کند.
بازسازی تصویر مبتنی بر شبکه متخاصم مولد هدایت شده توسط اطلاعات اضافه تعریف شده توسط کاربر
2022
بازگرداندن ناحیه آسیب دیده در تصاویر دیجیتال (بازسازی تصویر) می تواند به عنوان یک مشکل دشوار در نظر گرفته شود که بر اساس شدت آسیب به نسبت سخت تر می شود. در چند سال اخیر پیشرفت هایی در مقابله با این موضوع از طریق استفاده از مدل های یادگیری عمیق صورت گرفته است. در این مطالعه با توجه به کاربردهای موفق GAN ها در زمینه های مختلف، رویکرد جدیدی برای بازسازی تصویر ارائه شده است. الگوریتم پیشنهادی شامل یک مولد و یک تمایز عمومی است. ژنراتور مسئول بازیابی ناحیه از دست رفته است و تمایزکننده عمومی به تشخیص درست بودن یا نبودن ناحیه تعمیر مربوط می شود. معماری ژنراتور از دو انکودر خودکار تشکیل شده است. علاوه بر این، Wasserstein GAN برای اطمینان از ثبات تمرین استفاده می شود. به عنوان تصویر ورودی، یک تصویر نماد 32 در 32 نیز برای هدایت معنایی ژنراتور استفاده می شود و سپس با تصویر خراب برای پر کردن قسمت یا مناطق از دست رفته بدون از دست دادن برخی اشیاء موجود یا پیش بینی اشیاء یا اشکال ناخواسته، الحاق می شود. تصویر راهنما می تواند توسط کاربر برنامه پیشنهاد شود یا موارد دیگری مانند واترمارک در نظر گرفته شود. این روش از نظر کمی و کیفی با مدل های پیشرفته ای که از شبکه متخاصم مولد استفاده می کنند مقایسه می شود. این رویکردها می توانند ساختارها و بافت های بصری قانع کننده ای ایجاد کنند، اما اغلب ساختارهای تغییر شکل یافته، بافت های تار یا اشیایی از دست می دهند که با مناطق اطراف همگام نیستند. نتایج ارائه شده روی مجموعه داده CelebA-HQ نشان می دهد که مدل پیشنهادی می تواند با پیکسل های از دست رفته در مقیاس بزرگ مقابله کند و نتایج واقعی ایجاد کند.
استفاده از ویژگیهای معنایی برای بهبود تشخیص رفتارهای حرکتی انسان
2021
امروزه آنالیز رفتار انسان در سیستم های برخط ویدئویی مانند جستجو، دوربین نظارتی و تعامل انسان و ماشین بسیار مورد توجه قرار گرفته است. با ورود هوش مصنوعی به این حوزه سعی شده تا ماشین در تحلیل و بررسی رفتارها جایگزین عامل انسانی گردد. مهمترین گام برای این کار بازنمایی رفتار است به طوری که حاوی جزئیات رفتار برای انجام عملیات طبقه بندی باشد. بازنمایی رفتار می تواند بر اساس مدل یا داده های حاصل از رفتار باشد که منتج به معرفی روش های مدل محور و داده محور گشته است. روش های مدل محور با ایجاد مدل هایی دو یا سه بعدی از بدن انسان بازنمایی را انجام می دهد در حالی که روش های داده محور با پیچیدگی زمانی و حافظه مورد نیاز کمتر تغییرات در قاب ها را بررسی می کنند. روش های داده محور با استفاده از ویژگی های سراسری و محلی به بازنمایی رفتار انسان می پردازند. ویژگی های سراسری با بررسی تغییرات کلی قاب ها و ویژگی های محلی با بررسی حرکت های اجرا شده در یک یا چند ناحیه محلی از ویدئو به بازنمایی رفتار انسان می پردازد. در کارهای صورت گرفته تاکنون معنای رفتار که به یک توالی از زیررفتارها تعبیر شود مورد توجه قرار نگرفته است. لذا در این پایان نامه برای تشخیص و بازیابی رفتار انسان در ویدئو، بازنمایی رفتار انسان با استفاده از معنای حرکات بوسیله شبکه های عمیق انجام می شود. برای تعریف معنا در این کار، هر رفتار به زیررفتارهایی از حرکت های تشکیل دهنده آن تبدیل شده و بعد از مدل کردن حرکت ها، رفتار انسان بر اساس ترکیب این مدل ها بازنمایی می شود. بدین منظور ابتدا حرکت های تشکیل دهنده رفتار با استفاده از ویژگی های محلی استخراج شده از روش Dollar و انتقال بردارهای این ویژگی ها به فضای فرکانس با استفاده از روش CWT، برای پردازش های آتی به منظور تفکیک ساده حرکت ها، بدست می آیند. سپس با استفاده از شبکه های عصبی عمیق هر کدام حرکت های بدست آمده مدل شده و نهایتا با ترکیب مدل های حرکت های تشکیل دهنده هر رفتار، بازنمایی نهایی آن انجام می شود که برای تفکیک رفتارها مورد استفاده قرار گرفته اند. به منظور ارزیابی مدل پیشنهادی از مجموعه داده های ویدئویی HMDB، UCF Sport، UCFYT و UCF50 استفاده شده است که همه آنها شامل ویدئوهای واقعی ضبط شده در شرایط گوناگون هستند. روش معرفی شده با استفاده از زبان برنامه نویسی پایتون، کتابخانه های krase و Tensorfllow پیاده سازی شده تا به بخش بندی ویدئو، بازنمایی رفتار و طبقه بندی بپردازد. عملکرد روش پیشنهادی با سایر روش های معرفی شده در حوزه تشخیص و بازیابی رفتار مقایسه شده است. روش های مورد بررسی از شبکه های عصبی عمیق و یا ترکیبی از روش های داده محور، مدل محور و شبکه های عصبی عمیق برای بازنمایی و طبقه بندی رفتار استفاده کرده اند. ارزیابی نتایج بدست آمده نشان می دهد که روش پیشنهادی در کاربردهای شناسایی و بازیابی در مقایسه با سایر روش ها به دلیل توجه به حرکت های تشکیل دهنده رفتار و توالی آنها و بازنمایی مناسب، به ترتیب از 02/0 درصد تا 5/1 درصد و از 7/0 درصد تا 4 درصد دقت بالاتری داشته است.
بهبود کارایی سیستم های پالایش گروهی با استفاده از تکنیک های خوشه بندی
2013
سیستم های توصیه گر، سیستم هایی هستند که برای پیشنهاد کردن آیتم هایی بکار برده می شوند که انتظار می رود این آیتم ها مورد علاقه کاربر قرار گیرند. در سیستم های توصیه گر یک تکنیک پر کاربرد به نام سیستم های پالایش گروهی وجود دارد. این سیستم ها بر این فرضیه استوارند که می-توان اولویت های تعریف شده برای آیتم ها از سوی تعدادی از کاربران را برای سایر کاربران هم به اشتراک گذاشت. بنابراین، در این سیستم ها، هدف اصلی، یافتن یک گروه از شبیه ترین کاربران به کاربر فعال است. معیارهای شباهت، یکی از رایج ترین روش ها برای یافتن کاربران مشابه به کاربر فعال است. با استفاده از این روش، میزان شباهت کاربران نسبت به هم اندازه گیری شده و نهایتاً بر اساس این معیار ها، شبیه ترین کاربران به کاربر فعال به عنوان کاربران همسایه انتخاب می شوند. در این روش ها، برای انتخاب کاربران همسایه، محاسبات بر اساس ماتریس امتیازات کاربران به آیتم ها انجام می شود. اما این روش با چالش هایی مانند خلوت بودن ماتریس امتیازات، مقیاس پذیری و همچنین مشخص نبودن تعداد بهینه کاربران همسایه روبرو است. بعلاوه، در توصیه کردن آیتم ها روش هایی وجود دارند که خیلی از اطلاعات آماری امتیازات استفاده نمی کنند. یک راهکار دیگر برای یافتن کاربران مشابه به کاربر فعال، استفاده از روش های خوشه بندی است. از روش های خوشه بندی برای دسته بندی کردن کاربران در قالب خوشه هایی استفاده می شود که کاربران یک خوشه دارای بیشترین شباهت به هم هستند. در این پایان نامه دو راهکار مبتنی بر خوشه بندی، برای یافتن کاربران شبیه به هم در سیستم های پالایش گروهی پیشنهاد شده است. در راهکار اول، سعی شده است که با استفاده از روش های پیش پردازش مانند انتخاب ویژگی، تعدادی از آیتم های غیر موثر از مجموعه آیتم ها قبل از اعمال روش های خوشه بندی جهت یافتن کاربران همسایه، حذف شوند. نتایج آزمایشات نشان دهنده بهبود کارایی این روش است. در راهکار پیشنهادی دوم، یک روش جدید برای یافتن کاربران ارائه شده است که در این روش نیاز به مشخص نمودن تعداد خوشه ها برای خوشه بندی نیست. در این روش با استفاده از زیر مجموعه ای از آیتم های مورد علاقه کاربران که توسط چند کاربر امتیاز بالایی دریافت کرده اند، کاربران شبیه بدون در نظر گرفتن اینکه دارای امتیازات داده شده به آیتم های مشترکی باشند، در یک