Coffee Hour - softsys4ai/readingroup GitHub Wiki
Currently, we canceled the Coffee Hour until further notice. You are welcomed to participate in Friday's reading group.
Time and location: Wednesday 11:00 in Zoom
- Organizer: Pooyan Jamshidi ([email protected])
- Contact person: Shuge Lei ([email protected])
Subscribe for announcements:
- mailing list:
[email protected]
by sending an email to the organizer.
Agenda
Jan. 27, 2021
"Continuous control with deep reinforcement learning"
Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, Daan Wierstra
Moderator: Xin Zhao
Jan. 20, 2020
Evolving Reinforcement Learning Algorithms
Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, Daan Wierstra
Moderator: Jianhai
Jan. 13, 2020
Neural Adaptive Video Streaming with Pensieve
Hongzi Mao, Ravi Netravali, Mohammad Alizadeh
Moderator: Nawras
Jan. 6, 2020
"Value-Decomposition Networks For Cooperative Multi-Agent Learning"
Peter Sunehag, Guy Lever, Audrunas Gruslys, Wojciech Marian Czarnecki, Vinicius Zambaldi, Max Jaderberg, Marc Lanctot, Nicolas Sonnerat, Joel Z. Leibo, Karl Tuyls, Thore Graepel
Moderator: Nawras
Dec. 30, 2020
Happy New Year! :fireworks:
Dec. 23, 2020
Merry Christmas! :santa: :christmas_tree:
Dec. 16, 2020
"Learning Compositional Neural Programs with Recursive Tree Search and Planning"
Thomas Pierrot, Guillaume Ligner, Scott Reed, Olivier Sigaud, Nicolas Perrin, Alexandre Laterre, David Kas, Karim Beguir, Nando de Freitas
Moderator: Jianhai
Dec. 9, 2020
Learning Causal Models Online Continue
Khurram Javed, Martha White, Yoshua Bengio
Moderator: Pooyan
Dec. 2, 2020
Learning Causal Models Online
Khurram Javed, Martha White, Yoshua Bengio
Moderator: Pooyan
Nov. 25, 2020
Happy Thanksgiving! :turkey:
Nov. 18, 2020
Canceled
Canceled
Moderator: NA
Nov. 11, 2020
"Fast reinforcement learning with generalized policy updates"
André Barreto, Shaobo Hou, Diana Borsa, David Silver, and Doina Precup.
Moderator: Jianhai
Nov. 4, 2020
"Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines"
Keerthiram Murugesan , Mattia Atzeni , Pavan Kapanipathi , Pushkar Shukla , Sadhana Kumaravel , Gerald Tesauro , Kartik Talamadupula , Mrinmaya Sachan , Murray Campbell.
Moderator: Kausik and Biplav
Oct. 28, 2020
"Human-centric dialog training via offline reinforcement learning"
Natasha Jaques, Judy Hanwen Shen, Asma Ghandeharioun, Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Shane Gu, Rosalind Picard
Moderator: Rojina
Oct. 21, 2020
"HAQ: Hardware-Aware Automated Quantization with Mixed Precision"
Kuan Wang, Zhijian Liu, Yujun Lin, Ji Lin, and Song Han
Moderator: Roy
Oct. 14, 2020
"Language as an Abstraction for Hierarchical Deep Reinforcement Learning"
Yiding Jiang, Shixiang Gu, Kevin Murphy, Chelsea Finn.
Moderator: Jianhai
Oct. 7, 2020
"Variational Quantum Circuits for Deep Reinforcement Learning"
SAMUEL YEN-CHI CHEN, CHAO-HAN HUCK YANG, JUN QI, PIN-YU CHEN, XIAOLI MA, HSI-SHENG GOAN
Moderator: Rabins
Sept. 30, 2020
"On the Measure of Intelligence"
Franc¸ois Chollet.
Moderator: Pooyan
Sept. 23, 2020
"Top-K Off-Policy Correction for a REINFORCE Recommender System"
Minmin Chen, Alex Beutel, Paul Covington, Sagar Jain, Francois Belletti, Ed Chi.
Moderator: Nawras
Sept 16 2020
"Hindsight Experience Replay"
Marcin Andrychowicz, Filip Wolski, Alex Ray, Jonas Schneider, Rachel Fong,
Peter Welinder, Bob McGrew, Josh Tobin, Pieter Abbeel†, Wojciech Zaremba.
Moderator: Forest
Sept 9 2020
"Towards Interpretable Reinforcement Learning Using Attention Augmented Agents"
Alex Mott, Daniel Zoran, Mike Chrzanowski, Daan Wierstra, Danilo J. Rezende.
Moderator: Jianhai
Sept 2 2020
"Cost-Aware Bayesian Optimization via Information Directed Sampling"
Biswajit Paria, Willie Neiswanger, Ramina Ghods, Jeff Schneider and Barnab´as P´oczos.
Moderator: Shahriar
Aug 26 2020
"Curiosity-driven Exploration by Self-supervised Prediction"
Deepak Pathak, Pulkit Agrawal, Alexei A. Efros, Trevor Darrell.
Moderator: Nawras
Aug 19 2020
"Discovering Reinforcement Learning Algorithms"
Junhyuk Oh, Matteo Hessel, Wojciech M. Czarnecki, Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver.
Moderator: Jianhai
Aug 12 2020
"Retro*: Learning Retrosynthetic Planning with Neural Guided A* Search"
Binghong Chen, Chengtao Li, Hanjun Dai, Le Song.
Moderator: Forest
Aug 5 2020
"Pseudo Dyna-Q: A Reinforcement Learning Framework for Interactive Recommendation"
Lixin Zou, Long Xia, Pan Du, Zhuo Zhang, Ting Bai, Weidong Liu, Jianyun Nie, Dawei Yin.
Moderator: Nawras
July 29 2020
"Causal Discovery with Reinforcement Learning"
Shengyu Zhu, Ignavier Ng, Zhitang Chen.
Moderator: Jianhai
July 22 2020
"Fair Contextual Multi-Armed Bandits: Theory and Experiments"
Yifang Chen, Alex Cuellar, Haipeng Luo, Jignesh Modi, Heramb Nemlekar, Stefanos Nikolaidis.
Moderator: Marco
"Bayesian Online Prediction of Change Points"
Diego Agudelo-España, Sebastian Gomez-Gonzalez, Stefan Bauer, Bernhard Schölkopf, Jan Peters.
Moderator: Pooyan
"Identification and Estimation of Causal Effects Defined by Shift Interventions"
Numair Sani, Jaron Lee, Ilya Shpitser.
Moderator: Mohammad
July 15 2020
"State Abstractions for Lifelong Reinforcement Learning"
David Abel, Dilip Arumugam, Lucas Lehnert, Michael L. Littman.
Moderator: Jianhai
July 8 2020
"Search on the Replay Buffer: Bridging Planning and Reinforcement Learning"
Benjamin Eysenbach, Ruslan Salakhutdinov, Sergey Levine.
Moderator: Forest
July 1 2020
"Functional Pearl: A Program to Solve Sudoku"
"Pearls of Functional Algorithm Design"
Richard Bird.
Moderator: Marco
June 24 2020
"Debug Information Validation for Optimized Code"
Yuanbo Li, Shuo Ding, Qirun Zhang, Davide Italiano.
Moderator: Shahriar
June 17 2020
"How AI Training Scales"
Sam McCandlish, Jared Kaplan, Dario Amodei.
Moderator: Jianhai
June 10 2020
"Feature Visualization - How neural networks build up their understanding of images"
Chris Olah, Alexander Mordvintsev, Ludwig Schubert.
Moderator: Jianhai
June 3 2020
"Program Synthesis Explained"
James Bornholt.
Moderator: Pooyan
May 27 2020
"Experience of attending S&P 2020"
S&P 2020 Youtube link
Moderator: Ying
May 20 2020
"FlexiBO: Cost-Aware Multi-Objective Optimization of Deep Neural Networks"
Md Shahriar Iqbal, Jianhai Su, Lars Kotthoff, Pooyan Jamshidi.
Moderator: Shahriar
May 14 2020
"DeepURL: Deep Pose Estimation Framework for Underwater Relative Localization"
Bharat Joshi, Md Modasshir, Travis Manderson, Hunter Damron, Marios Xanthidis, Alberto Quattrini Li, Ioannis Rekleitis, Gregory Dudek.
Moderator: Bharat
May 7 2020
"Fuzzing: Hack, Art, and Science"
Patrice Godefroid.
Moderator: Pooyan