  Shie Mannor  

 



  ![](/sites/default/files/person/Shie-Mannor.JPG)

  

   Main Field of Interest

[Artificial Intelligence and Machine Learning ](/index.php/research-area/machine-learning-artificial-intelligence)

 

  

 

 

 



 ### Publications

 

### 2025 

[Policy Optimized Text-to-Image Pipeline Design](/publication/2025-12_policy-optimized-text-image-pipeline-design)

Uri Gadot, Rinon Gal, Yftah Zisser, [Gal Chechik](/person/gal-chechik), [Shie Mannor](/person/shie-mannor)



[NeurIPS 2025](https://arxiv.org/abs/2505.21478)









[RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression](/publication/2025-06_rl-rc-dot-block-level-rl-agent-task-aware-video-compression)

Uri Gadot, Assaf Shocher, [Shie Mannor](/person/shie-mannor), [Gal Chechik](/person/gal-chechik), [Assaf Hallak](/person/assaf-hallak)



[CVPR 2025](https://openaccess.thecvf.com/content/CVPR2025/papers/Gadot_RL-RC-DoT_A_Block-level_RL_agent_for_Task-Aware_Video_Compression_CVPR_2025_paper.pdf)









[SoftTreeMax: Policy Gradient via tree expansion ](/index.php/publication/2025-02_softtreemax-policy-gradient-tree-expansion)

[Gal Dalal](/index.php/person/gal-dalal), [Assaf Hallak](/index.php/person/assaf-hallak), Gugan Thoppe, [Shie Mannor](/index.php/person/shie-mannor), [Gal Chechik](/index.php/person/gal-chechik)



[ICML 2025](https://icml.cc/virtual/2025/poster/43515)









### 2023 

[Learning to Initiate and Reason in Event-Driven Cascading Processes](/publication/2023-07_learning-initiate-and-reason-event-driven-cascading-processes)

[Yuval Atzmon](/person/yuval-atzmon), [Eli Meirom](/person/eli-meirom), [Shie Mannor](/person/shie-mannor), [Gal Chechik](/person/gal-chechik)



[ICML 2023](https://proceedings.mlr.press/v202/atzmon23a/atzmon23a.pdf)









[Train Hard, Fight Easy: Robust Meta Reinforcement Learning](/index.php/publication/2023-06_train-hard-fight-easy-robust-meta-reinforcement-learning)

Ido Greenberg, [Shie Mannor](/index.php/person/shie-mannor), [Gal Chechik](/index.php/person/gal-chechik), [Eli Meirom](/index.php/person/eli-meirom)



[NeuroIPS 2023](https://arxiv.org/pdf/2301.11147)









[CALM: Conditional Adversarial Latent Models for Directable Virtual Characters](/index.php/publication/2023-05_calm-conditional-adversarial-latent-models-directable-virtual-characters)

[Chen Tessler](/index.php/person/chen-tessler), [Yoni Kasten](/index.php/person/yoni-kasten), Yunrong Guo, [Shie Mannor](/index.php/person/shie-mannor), [Gal Chechik](/index.php/person/gal-chechik), Xue Bin Peng



[SIGGRAPH 2023](https://s2023.siggraph.org/)









[Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs](/publication/2023-01_implementing-reinforcement-learning-datacenter-congestion-control-nvidia-nics)

Benjamin Fuhrer, Yuval Shpigelman, [Chen Tessler](/person/chen-tessler), [Shie Mannor](/person/shie-mannor), [Gal Chechik](/person/gal-chechik), Eitan Zahavy, [Gal Dalal](/person/gal-dalal)



[CCGrid 2023](https://arxiv.org/abs/2207.02295)









[Never Worse, Mostly Better: Stable Policy Improvement in Deep Reinforcement Learning](/index.php/publication/2023-01_never-worse-mostly-better-stable-policy-improvement-deep-reinforcement-learning)

Pranav Khanna, Guy Tennenholtz, Nadav Merlis, [Shie Mannor](/index.php/person/shie-mannor), [Chen Tessler](/index.php/person/chen-tessler)



[AAMAS 2023](https://arxiv.org/abs/1910.01062)









### 2022 

[DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles](/publication/2022-12_diffstack-differentiable-and-modular-control-stack-autonomous-vehicles)

[Peter Karkus](/person/peter-karkus), [Boris Ivanovic](/person/boris-ivanovic), [Shie Mannor](/person/shie-mannor), [Marco Pavone](/person/marco-pavone)



[Conference on Robot Learning (CoRL) 2022](https://corl2022.org/)









[On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning](/publication/2022-10_covariate-shift-latent-confounders-imitation-and-reinforcement-learning)

Guy Tenneholtz, [Assaf Hallak](/person/assaf-hallak), [Gal Dalal](/person/gal-dalal), [Shie Mannor](/person/shie-mannor), [Gal Chechik](/person/gal-chechik)



[ICLR](https://iclr.cc/)









[Optimizing tensor network contraction using reinforcement learning](/index.php/publication/2022-06_optimizing-tensor-network-contraction-using-reinforcement-learning)

[Eli Meirom](/index.php/person/eli-meirom), [Haggai Maron](/index.php/person/haggai-maron), [Shie Mannor](/index.php/person/shie-mannor), [Gal Chechik](/index.php/person/gal-chechik)



[International Conference on Machine Learning, PMLR 162:15278-15292, 2022](https://proceedings.mlr.press/v162/meirom22a.html)









[Reinforcement Learning for Datacenter Congestion Control](/index.php/publication/2022-02_reinforcement-learning-datacenter-congestion-control)

[Chen Tessler](/index.php/person/chen-tessler), Yuval Shpigelman, [Gal Dalal](/index.php/person/gal-dalal), Amit Mendelbaum, Doron Kazakov, Benjamin Fuhrer, [Gal Chechik](/index.php/person/gal-chechik), [Shie Mannor](/index.php/person/shie-mannor)



IAAI 2022









### 2021 

[Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction](/publication/2021-10_improve-agents-without-retraining-parallel-tree-search-policy-correction)

[Gal Dalal](/person/gal-dalal), [Assaf Hallak](/person/assaf-hallak), [Steven Dalton](/person/steven-dalton), [Iuri Frosio](/person/iuri-frosio), [Shie Mannor](/person/shie-mannor), [Gal Chechik](/person/gal-chechik)



[Advances in Neural Information Processing Systems 34 (NeurIPS 2021)](https://nips.cc/Conferences/2021/)









[Known unknowns: Learning novel concepts using exploratory reasoning-by-elimination](/publication/2021-07_known-unknowns-learning-novel-concepts-using-exploratory-reasoning-elimination)

Harsh Agrawal, [Eli Meirom](/person/eli-meirom), [Yuval Atzmon](/person/yuval-atzmon), [Shie Mannor](/person/shie-mannor), [Gal Chechik](/person/gal-chechik)



[ Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Inte…](https://www.auai.org/uai2021/accepted_papers)



Oral





[Controlling graph dynamics with reinforcement learning and graph neural networks](/publication/2021-06_controlling-graph-dynamics-reinforcement-learning-and-graph-neural-networks-0)

[Eli Meirom](/person/eli-meirom), [Haggai Maron](/person/haggai-maron), [Shie Mannor](/person/shie-mannor), [Gal Chechik](/person/gal-chechik)



[ICML 2021](http://proceedings.mlr.press/v139/meirom21a/meirom21a.pdf)









[Planning and Learning with Adaptive Lookahead](/index.php/publication/2021-01_planning-and-learning-adaptive-lookahead)

Aviv Rosenberg, [Assaf Hallak](/index.php/person/assaf-hallak), [Shie Mannor](/index.php/person/shie-mannor), [Gal Chechik](/index.php/person/gal-chechik), [Gal Dalal](/index.php/person/gal-dalal)



[Arxiv](https://arxiv.org/abs/2201.12403)









[Acting in Delayed Environments with Non-Stationary Markov Policies](/index.php/publication/2021-01_acting-delayed-environments-non-stationary-markov-policies)

Esther Derman, [Gal Dalal](/index.php/person/gal-dalal), [Shie Mannor](/index.php/person/shie-mannor)



ICLR 2021









### 2020 

[The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems](/index.php/publication/2020-12_architectural-implications-distributed-reinforcement-learning-cpu-gpu-systems)

Ahmet Inci, Evgeny Bolotin, [Yaosheng Fu](/index.php/person/yaosheng-fu), [Gal Dalal](/index.php/person/gal-dalal), [Shie Mannor](/index.php/person/shie-mannor), [David Nellans](/index.php/person/david-nellans), Diana Marculescu



[Workshop on Energy Efficient Machine Learning and Cognitive Computing (EMC2)](https://www.emc2-ai.org/virtual-20)