
PhD student in Computer Science at EPFL specializing in Hierarchical Reinforcement Learning, Causality, Optimization, and LLM-based HRL. Published in top-tier venues including ICML, UAI, TMLR, and AAAI.
Developing a framework for combining preference-based feedback with traditional reward signals in reinforcement learning. This work addresses the challenge of learning from heterogeneous feedback sources, relevant for RLHF applications. The approach enables agents to utilize both explicit reward functions and implicit preference data from human evaluators.
Developing probing techniques to identify and analyze causal relationships that LLMs learn and utilize during inference. This work provides insights into how language models process causal information and make decisions based on causal understanding, with applications in improving interpretability and reliability of LLMs.
Developing methods for automatic subgoal discovery in reinforcement learning agents operating in unknown environments. This research focuses on enabling agents to identify meaningful intermediate objectives without prior knowledge of the environment structure, improving exploration and learning efficiency in complex tasks.
Investigating the integration of disentanglement learning with causal inference in reinforcement learning settings. This ongoing project aims to learn disentangled representations that capture causal factors of variation, enabling more robust and interpretable policy learning.
Developed a novel framework using causal graphs to improve subgoal discovery in hierarchical reinforcement learning, with theoretical guarantees and experimental validation. Published at ICML 2025.
École Polytechnique Fédérale de Lausanne (EPFL)
Achievements:
Sharif University of Technology
Conducting cutting-edge research in reinforcement learning optimization, causal structure learning, and LLM-based hierarchical RL. Working on variance reduction methods, counterfactual reasoning, and RLHF techniques.
Reviewing papers for top-tier machine learning conferences including NeurIPS 2025, ICLR 2025, and AISTATS 2025. Contributing to the academic community through peer review process.
Serving as Online Chair for UAI 2024 conference and CLeaR 2025, managing virtual conference logistics and ensuring smooth online presentations and interactions.
Led the Artificial Intelligence team and worked as a Back-End Developer. Responsible for implementing ML/AI solutions in healthcare technology.
Worked as a back-end developer for Telexa messenger, focusing on server-side development and system architecture.
Mohammadsadegh Khorasani, Saber Salehkaleybar, Negar Kiyavash, Matthias Grossglauser
Submitted to NeurIPS 2026
Mohammadsadegh Khorasani, Saber Salehkaleybar, Negar Kiyavash, Matthias Grossglauser
ArXiv preprint, Submitted to UAI 2026
Mohammadsadegh Khorasani, Saber Salehkaleybar, Negar Kiyavash, Matthias Grossglauser
ArXiv preprint, Submitted to ICML 2026
Mohammadsadegh Khorasani, Saber Salehkaleybar, Negar Kiyavash, Matthias Grossglauser
International Conference on Machine Learning (ICML)
Mohammadsadegh Khorasani, Saber Salehkaleybar, Negar Kiyavash, Matthias Grossglauser
Conference on Uncertainty in Artificial Intelligence (UAI)
Saber Salehkaleybar, Mohammadsadegh Khorasani, Negar Kiyavash, Niao He, Patrick Thiran
Transactions on Machine Learning Research (TMLR)
Ehsan Mokhtarian, Mohammadsadegh Khorasani, Jalal Etesami, Negar Kiyavash
AAAI Conference on Artificial Intelligence
Narges Rezaie, Masroor Bayati, Mehrab Hamidi, Maedeh Sadat Tahaei, Mohammadsadegh Khorasani, Nigel H. Lovell, James Breen, Hamid R. Rabiee, Hamid Alinejad-Rokny
Communications Biology
Mohammadsadegh Khorasani, M. Bahari, S. Ayromlou, V. Zehtab, S. Saadatnejad, Alexandre Alahi
ArXiv preprint
I'm always open to discussing new opportunities and collaborations.