Alireza Hosseini

I'm

About

I received a B.Sc. degree in Electrical Engineering from Iran University of Science and Technology and completed my Master of Science in Telecommunication Systems at the University of Tehran in September 2024. I currently work as a research assistant at the Computation and Communication Lab at the University of Tehran and as an AI developer at AVIR AI Center. My fields of interest include Machine Learning, Deep Learning, Signal Processing, and related areas. It is always my pleasure to discuss topics related to research. Feel free to email me! [My CV]

Alireza Hosseini

Education

Master's in Telecommunication Systems
University of Tehran

Sep. 2022 - Sep. 2024

Bachelor's in Electrical Engineering
Iran University of Science and Technology

Sep. 2017 - Mar. 2022

Experience

AI Developer, AVIR AI Center
Tehran, Iran

Jul. 2022 - Present

AI Developer, University of Tehran

Jan. 2023 - Nov. 2023

AI Developer, Irankhodro Powertrain Co
Tehran, Iran

Jul. 2021 - Jul. 2022

Publications

  • SUM: Saliency Unification through Mamba for Visual Attention Modeling

Alireza Hosseini, Amirhossein Kazerouni, Saeed Akhavan, Michael Brudno, Babak Taati

Conference IEEE/CVF WACV 2025

Project Page | Paper | GitHub

  • Brand Visibility in Packaging: A Deep Learning Approach for Logo Detection, Saliency-Map Prediction, and Logo Placement Analysis

Alireza Hosseini, Kiana Hooshanfar, Pouria Omrani, Reza Toosi, Ramin Toosi, Zahra Ebrahimian, Mohammad Ali Akhaee

Arxiv preprint

Project Page | Paper | GitHub

  • INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings

Amirhossein Kazerouni, Reza Azad, Alireza Hosseini, Dorit Merhof, Ulas Bagci

Conference IEEE/CVF WACV 2024

Project Page | Paper | Supplementary | GitHub

  • Hybrid Retrieval-Augmented Generation Approach for LLMs Query Response Enhancement

Pouria Omrani, Alireza Hosseini, Kiana Hooshanfar, Zahra Ebrahimian, Ramin Toosi, Mohammad Ali Akhaee

Conference IEEE ICWR 2024

Paper

  • Farsi CAPTCHA Recognition Using Attention-Based Convolutional Neural Network

Alireza Hosseini, Matine Hajyan, Ramin Toosi, Mohammad Ali Akhaee

Conference IEEE ICWR 2023

Paper

  • Machine vision–based measurement approach for engine accessory belt transverse vibration based on deep learning method

Ashkan Moosavian, Alireza Hosseini, Seyed Mohammad Jafari, Iman Chitsaz, Shahriar Baradaran Shokouhi

Journal Automotive Science and Engineering 2022

Paper

  • Development of Machine Vision System to Track Movement of an Engine Timing Belt Tensioner Based on Deep Neural Network

Alireza Hosseini, Moosavian Ashkan, Saeed Javan, Shahriar B Shokouhi

The Journal of Engine Research 2022

Projects

Document Query using Fine-Tuned Quantized NVILA Model

Dec 2024 - Jan 2025

Adak Vira Iranian Rahjoo (Avir)

Designed and implemented a system for precise information extraction from documents by fine-tuning and quantizing the NVILA vision-language model (using LLM-AWQ). This solution converts user queries into JSON outputs with high accuracy.

AI-Powered Natural Language to SQL Query System

Oct 2024 - Dec 2024

Adak Vira Iranian Rahjoo (Avir)

Developed an end-to-end solution for seamless database interaction using LLMs to convert natural language into SQL queries.

Logo Generation using Diffusion Models

Aug 2024 - Dec 2024

Adak Vira Iranian Rahjoo (Avir)

Utilized diffusion models with prompt engineering to generate logos from company descriptions, styles, and colors.

Smart-EYE for Ads: AI for Eye Tracking and Brand Attention

Jan 2023 - Nov 2024

Adak Vira Iranian Rahjoo (Avir)

Developed AI engines for eye tracking, saliency prediction, and brand attention detection for video advertisement analysis.

Time Series Prediction using Modified XLSTM

Sep 2024 - Oct 2024

Adak Vira Iranian Rahjoo (Avir)

Modified XLSTM models to predict data points and trends over the next 7 days.

YouTube Content Generator from PDF Slides

Mar 2024 - Oct 2024

Adak Vira Iranian Rahjoo (Avir)

Automated text-to-speech conversion, video synchronization for slides, and integrated talking avatars.

Customized Persian LLM Assistants and Chatbots

Apr 2024 - Sep 2024

Adak Vira Iranian Rahjoo (Avir)

Developed retrieval-augmented generation (RAG) and fine-tuning for Persian language assistants and chatbots.

Salary Verification Tool using LLMs

Aug 2024

Adak Vira Iranian Rahjoo (Avir)

Extracts financial data from salary documents (PDFs/images) using LLMs for verification.

Video Subtitle Generator using ASR

Jun 2024 - Jul 2024

Adak Vira Iranian Rahjoo (Avir)

Automated subtitle generation using a custom ASR model to transcribe video speech.

Face Landmark Generation from Audio (SAD-Talker Modification)

May 2024 - Jun 2024

Adak Vira Iranian Rahjoo (Avir)

Modified the SAD-Talker code (CVPR2024) to generate facial landmarks from audio.

Text Data Parsing for Retrieval-Augmented Generation (RAG)

Apr 2024 - Jun 2024

Adak Vira Iranian Rahjoo (Avir)

Processed and structured text data from books for effective RAG model input.

Masked Signal Reconstruction with Mamba Encoder

Mar 2024 - May 2024

University of Tehran

Developed a Mamba encoder-based model for reconstructing masked signals in sequence data.

Accurate Diarization using Modified Pyannote-Audio

Jan 2024 - Feb 2024

Adak Vira Iranian Rahjoo (Avir)

Implemented a diarization solution by modifying the Pyannote-Audio framework.

Whisper Persian ASR Training on Custom Dataset

Nov 2023 - Feb 2024

Adak Vira Iranian Rahjoo (Avir)

Trained and validated Whisper ASR on a custom Persian dataset for enhanced speech recognition.

Persian PDF to Word Converter with Tesseract OCR

Sep 2023 - Oct 2023

Adak Vira Iranian Rahjoo (Avir)

Converted Persian PDF documents to Word using Tesseract OCR for accurate text extraction.

Image and Video Cartoonization using Generative Adversarial Network (GAN)

Dec 2022 - May 2023

Adak Vira Iranian Rahjoo (Avir)

Applied GANs to transform images and videos into cartoon-like visuals.

Classic Lip Sync using Vowel Phonetic Detection

Feb 2023 - Mar 2023

Adak Vira Iranian Rahjoo (Avir)

Developed a method for distinguishing vowels using frequency response curves for accurate lip syncing.

Brand Campaign Analysis and Reporting

Aug 2022 - Nov 2022

Adak Vira Iranian Rahjoo (Avir)

Analyzed social media data and brand campaigns to evaluate campaign effectiveness and impact.

Persian Handwritten and Typewritten OCR

Jul 2021 - Nov 2022

University of Tehran

Developed and optimized AI engines for OCR including preprocessing, postprocessing, and dataset management.
Supervisor: Dr. Mohammad Ali Akhaee

Text vs. Audio Similarity Checking in Videos

Oct 2022 - Nov 2022

Adak Vira Iranian Rahjoo (Avir)

Implemented an ASR module and used Levenshtein distance to compare spoken and written content in videos.

Car License Plate Recognition using MATLAB

Oct 2020 - Jan 2021

Iran University of Science and Technology

Developed a system for recognizing car license plates using classic image processing techniques in MATLAB.

News

Hobbies & Leisure Time

When I’m not immersed in research or development, I dedicate my time to activities that inspire creativity, bring relaxation, and keep me energized.

📚 Books

Reading books opens up new perspectives and sparks creativity.

🎵 Music & 🎥 Movies

Listening to music and watching movies are great ways for me to relax and enjoy creative storytelling.

🎸 Playing Musical Instruments

Playing the guitar and exploring other instruments helps me connect with my artistic side.

📸 Photography & Nature Adventures

Exploring and capturing the beauty of nature through photography, mountain climbing, hiking, and outdoor adventures.

⚽ Sports & Fitness

Playing football, volleyball, and swimming, watching sports, and staying active with regular workouts keep me energized.

✍️ Writing

I enjoy expressing my thoughts and creativity through writing, exploring literature, and crafting meaningful stories as a writer.

Photography Collage
Nature Adventures Collage

Contact

For inquiries or feedback on my research, don't hesitate to reach out. I’m always happy to hear from you and exchange ideas.