Nov 20, 2024 | New preprint on arXiv: Inverse Transition Learning: Learning Dynamics from Demonstrations, which learns dynamics from expert demonstrations while preserving the optimality of the expert policy. |
Oct 11, 2024 | New preprint on arXiv: Decision Points RL (DPRL), which identifies “diffs” to the behavior policy in batch RL settings. We achieve provably high-confidence improvement. |
Sep 01, 2024 | Finished my internship at Google Research (more details and paper soon!). |
Aug 01, 2024 | Organizing the RLC 2024 ICBINB workshop. The workshop will celebrate innovative RL research that led to counterintuitive results. |
Jun 03, 2024 | Started my internship at Google Research with Farhad Hormozdiari and Justin Cosentino. I will be focusing on foundational modeling efforts on waveform data! |