Arash Rezaali

Publications

Semi-Centralized Training, Decentralized Execution Architecture for Multi-Agent Deep RL in Traffic Signal Control

A. Rezaali, P. Yazdani, M. Abdoos · Under review at Transportation Research Part C

Proposes a transferable SEMI-CTDE region-based multi-agent RL architecture for adaptive traffic signal control with two complementary model variants and extensive ablations.

↗ View preprint on arXiv ↗ View code on GitHub

Experience

LLM Armenia 2025 Summer School

Armenia · Summer 2025

Selected Participant

Completed a one-week international LLM program blending theory with implementation-focused work on six projects, including ViT-from-scratch training, SigLIP+Qwen LaTeX OCR, GRPO post-training for symbolic reasoning, inference-time compute strategies, and adversarial attacks on CLIP and vision models.

↗ View LLM Armenia projects on GitHub

Shahid Beheshti University

Tehran, Iran · 2022 – Today

Research Assistant – Intelligent Transportation Systems Lab

Working under the supervision of Dr. Monireh Abdoos on SEMI-CTDE region-based multi-agent deep reinforcement learning architectures for adaptive traffic signal control in SUMO. This line of work has resulted in a manuscript currently under review (see Publications).

Awards

Silver Medal – National Astronomy & Astrophysics Olympiad

Iran · 2020

Awarded silver medal in the highly competitive national Olympiad, demonstrating strong problem-solving skills in physics and mathematics.

Education

BSc in Computer Engineering

Shahid Beheshti University · Tehran, Iran · 2021 – Present

GPA: 17.8 / 20 (3.79/4.00) · Top 5%

Key A+ coursework (GPA 4.0 / 4.0)

Intro to Machine Learning AI & Expert Systems Fundamentals of Robotics Algorithm Design Signals & Systems Graph Theory & Algorithms Eng. Stats & Probability Linear Algebra Software Engineering

Young Scholar Club – Olympiad Summer Camp

Iran · Summer 2020

Selective Olympiad summer camp for the top 40 candidates nationwide, combining advanced astronomy and physics instruction with frequent exams and problem-solving workshops used to determine final medal rankings.

Research Interests

Deep Learning Machine Learning Reinforcement Learning Multi-Agent Systems Robotics Large Language Models Reasoning in LLMs Natural Language Processing

Languages

English IELTS 7.5 (2025)

Persian Native

Projects

LLM Armenia 2025 Summer School

Language Models · Deep Learning · Machine Learning

Participated in the LLM Armenia 2025 Summer School, blending theoretical concepts and practical projects in machine learning. Key projects included ViT-from-scratch training, SigLIP+Qwen LaTeX OCR, GRPO post-training for symbolic reasoning, inference-time compute strategies, and adversarial attacks on CLIP and vision models.

↗ View LLM Armenia projects on GitHub

Final project – Introduction to Machine Learning

Multi-Class Emotion Recognition using DistilBERT

Natural Language Processing · Emotion Recognition · Multi-Class Classification

Implemented an end-to-end emotion recognition pipeline: fine-tuned a pre-trained DistilBERT model on a custom emotion dataset, then used its embeddings as input to an MLP classifier with per-label threshold tuning to improve classification performance. The project includes dataset preprocessing, hyper-parameter search, and a simple real-time inference.

↗ View on GitHub

Fundamentals of Robotics Course – Projects

Mobile Robotics · Kinematics · Localization · Navigation

Developed a sequence of assignments and a final project for a mobile robotics course, covering kinematics and wheel-velocity control, localization and hallway mapping, local navigation with Bug algorithms, and an indoor mapping and navigation project where a Mavic platform and TurtleBot perform wall-following, door detection, and particle-filter-based localization to reach target doors.

↗ View on GitHub

Connect Four AI – minimax and Q-learning agents

Final project – Artificial Intelligence

Connect Four AI using Minimax and Q-Learning

Game AI · Minimax · Alpha-Beta Pruning · Q-Learning

Implemented two AI agents for the Connect Four board game. The first uses a minimax search with alpha–beta pruning and configurable depth to control playing strength; the second is a tabular Q-learning agent trained via large-scale self-play. The project contrasts search-based and reinforcement-learning approaches to decision making in a fully observable game.

↗ View on GitHub

Final project – Game Design

Unity 2D Platformer

C# · Unity · 2D Platformer

Built a tiny 2D platformer in Unity with a responsive run/jump controller, simple enemies with health bars, and collectible hearts. The game features multiple themed biomes (forest and snow) with a smooth transition and a clean project structure, making it a reusable template for future sidescroller ideas.

↗ View on GitHub

Technical Skills

Python Expert

Multi-Agent Systems Advanced

Machine Learning & Deep Learning Advanced

Reinforcement Learning Advanced

Natural Language Processing Advanced

Game AI & Search Intermediate

Robotics & Control Intermediate

Computer Vision Intermediate

Tools & Frameworks

PyTorch TensorFlow SUMO Git Hugging Face Transformers Docker Linux OpenAI Gym Optuna Unity Django C++ C# Jupyter NumPy Pandas scikit-learn Matplotlib Plotly LaTeX