policy gradients
#CellStratAILab #disrupt4.0 #WeCreateAISuperstars I presented a session on Multi-Agent RL recently at the CellStrat AI Lab. Introduction :- In the normal Reinforcement Learning setup, you have one agent which interacts with the environment. It uses the Observation from the environment, performs actions and observes the rewards. In real life, many applications will involve several agents […]
This post assumes that you have a strong understanding of the basics of Reinforcement Learning, MDP, DQN and Policy Gradient Algorithms. You can go through Policy Gradients to understand the derivation for Stochastic Policies In the previous post on Actor Critic, we saw the advantage of merging Value based and Policy based methods together. The […]
#CellStratAILab #disrupt4.0 #WeCreateAISuperstars #AlwaysUpskilling Reinforcement Learning (RL) refers to training agents with help of incentive-driven environments. RL typically involves a tuple of <state, action, reward> paradigm, which means that the agent has action choices to make in various states, and each action entails a potential reward. This also means that each state has a “value” […]
#CellStratAILab #disrupt4.0 #WeCreateAISuperstars #AlwaysUpskilling Minutes from Saturday 14th March 2020 AI Lab Workshop at BLR :- Session Presenter : SHUBHA M., Deep Reinforcement Learning Researcher, CellStrat AI Lab Last Saturday, our Reinforcement Learning Team Lead Shubha M. presented a fantastic presentation and workshop on Actor-Critic method used in RL. She also demonstrated a demo of this technique for Stock Market predictions. […]
#CellStratAILab #disrupt4.0 #WeCreateAISuperstars #AlwaysUpskilling Minutes from Saturday 7th March 2020 AI Lab meetup at BLR :- Last Saturday, we had excellent sessions in the AI Lab meetup. Face Recognition with MTCNN and FaceNet :- First Amit Kumar presented a detailed overview of Face Recognition with MTCNN and FaceNet. Face Recognition involves a pipeline of Face […]
In my previous post, we discussed the simplest Policy Gradient REINFORCE. We saw, how Policy based methods are better than value based methods, a derivation of the Gradient of Score(Cost) function, and an implementation of simple Policy Gradient to train Gym’s Acrobot-v0. We then saw, how introducing a baseline reduces variance which leads to the […]
#CellStratAILab #disrupt4.0 #WeCreateAISuperstars Last Saturday (1st Feb 2020), our AI Lab researchers delivered superb presentations on advanced topics. First Shreyas S K started with detailed algorithmic discussion with GLoVe and T-SNE algos. Then he showed how these can be used for Sarcasm Detection in news articles. Detection of sarcasm is of great importance and beneficial […]
I conducted an Introductory session on Reinforcement Learning Policy Gradients (PG) at CellStrat AI Lab on 1st Feb 2020. The goal of this session was to explain the basic underlying principle of Policy Gradients. The session started off with a quick recap of Reinforcement Learning, so that the audience is well aware of the definitions […]