Data Science Workflow An important first step is to ask good questions about the data. It helps tune data into actionable information. First question can be “What happened?” And then the second might be “Why did this happen?” At this point, you can go even further and ask “What would happen next?” Finally you may…
Category: Quick Recap
Basics of Contracts
A contract is nothing more than an enforceable agreement. The person who makes an offer is offeror, whose counter-party is offeree, who has the power to form a contract. ‘Valid contract’ mean either party can enforce it, in contrast ‘Void contract’ is unenforceable by either party. Some where in the middle between valid and void…
Supervised Learning in Reinforcement Learning
Deduction to supervised learning problem In tabular method, each Q(s, a) could be seen as a parameter. There are more parameters than states, because there are as many parameters for each state as the possible number of actions for each state. There are also situations where states include continuous components. It means we need a…
Computing the Tax
Income Tax Formula This is an important formula, you’ll use throughout the course and beyond. The US tax laws use all-inclusive income concept. We need to differentiate between “realized (economic) income” and “recognized income”. You realize economic income, when you have an exchange of property rights. “Recognized income” is part of “realized income” that is…
Model-free Reinforcement Learning
Value Iteration in real world n real world, we don’t have the state transition probability distribution or the reward function. You may try sampling them, but you will never know the exact probabilities of them. As the result, you can not compute the expectation of the action values. We want a new algorithm that would…
Quadrotors: Energetic and System Design
Spinning all rotors of a quad-rotor in the same direction will cause the robot to constantly rotate. The thrust F and drag moment M are quadratic to angular velocity 蠅. You have to size the motor to product the torque to over come this drag moment. If resultant forces and moments are non-zero, you get…
Compliance and Risk Management
Basics The simple fact is that the world today operates in a highly regulated environment. So when it comes to compliance, we usually ask the question “What do I have to comply with?” But, far more importantly the next question is “how do you know what I have to comply with?” Even for a small startup company, there…
Derivative Securities: Swaps, Futures and Options
Swaps Why do companies or entities construct swaps? Because they want to change the nature of cash flows, or leverage strengths in different markets. But there is an implicit assumption that the companies / entities continue to exist. If one of them were to default, it will expose the counter party to a big risk….
Dynamic Programming in RL
Reward That all of what we mean by goals and purposes can be well thought of as maximization of the expected value of the cumulative sum of a received scalar signal (reward). R. Sutton This signal is ‘reward’, and sum of the signals is ‘return’. Each immediate reward depends on the agent action and environment…