🐱
Maybe I'm a Lion
A machine for turning coffee into buggy code
- Tokyo-3, Japan
- https://takuyahiraoka.github.io
- in/takuya-hiraoka-33a62a167
Pinned Loading
-
Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning
Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning PublicSource files to replicate experiments in my ICLR 2022 paper.
-
Learning-Robust-Options-by-Conditional-Value-at-Risk-Optimization
Learning-Robust-Options-by-Conditional-Value-at-Risk-Optimization PublicSource files to replicate experiments in my NeurIPS 2019 paper.
-
Multi-Agent-Reinforcement-Learning-in-Stochastic-Games
Multi-Agent-Reinforcement-Learning-in-Stochastic-Games PublicUnofficial PyBrain extension for multi-agent reinforcement learning in general sum stochastic games.
-
Dialogue-State-Tracking-using-LSTM
Dialogue-State-Tracking-using-LSTM PublicSource files to replicate experiments in my IWSDS 2016 paper.
-
Active-Learning-for-Example-based-Dialog-Systems
Active-Learning-for-Example-based-Dialog-Systems PublicSource files to replicate experiments in my IWSDS 2016 paper.
-
Which-Experiences-Are-Influential-for-RL-Agents
Which-Experiences-Are-Influential-for-RL-Agents PublicSource files to replicate experiments in my ArXiv 2024 paper.
Python 1
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.